Sample records for co-evolving genomic groups

  1. Stratification of co-evolving genomic groups using ranked phylogenetic profiles

    PubMed Central

    Freilich, Shiri; Goldovsky, Leon; Gottlieb, Assaf; Blanc, Eric; Tsoka, Sophia; Ouzounis, Christos A

    2009-01-01

    Background Previous methods of detecting the taxonomic origins of arbitrary sequence collections, with a significant impact to genome analysis and in particular metagenomics, have primarily focused on compositional features of genomes. The evolutionary patterns of phylogenetic distribution of genes or proteins, represented by phylogenetic profiles, provide an alternative approach for the detection of taxonomic origins, but typically suffer from low accuracy. Herein, we present rank-BLAST, a novel approach for the assignment of protein sequences into genomic groups of the same taxonomic origin, based on the ranking order of phylogenetic profiles of target genes or proteins across the reference database. Results The rank-BLAST approach is validated by computing the phylogenetic profiles of all sequences for five distinct microbial species of varying degrees of phylogenetic proximity, against a reference database of 243 fully sequenced genomes. The approach - a combination of sequence searches, statistical estimation and clustering - analyses the degree of sequence divergence between sets of protein sequences and allows the classification of protein sequences according to the species of origin with high accuracy, allowing taxonomic classification of 64% of the proteins studied. In most cases, a main cluster is detected, representing the corresponding species. Secondary, functionally distinct and species-specific clusters exhibit different patterns of phylogenetic distribution, thus flagging gene groups of interest. Detailed analyses of such cases are provided as examples. Conclusion Our results indicate that the rank-BLAST approach can capture the taxonomic origins of sequence collections in an accurate and efficient manner. The approach can be useful both for the analysis of genome evolution and the detection of species groups in metagenomics samples. PMID:19860884

  2. Single-cell genomics reveals co-metabolic interactions within uncultivated Marine Group A bacteria

    NASA Astrophysics Data System (ADS)

    Hawley, A. K.; Hallam, S. J.

    2016-02-01

    Marine Group A (MGA) bacteria represent a ubiquitous and abundant candidate phylum enriched in oxygen minimum zones (OMZs) and the deep ocean. Despite MGA prevalence little is known about their ecology and biogeochemistry. Here we chart the metabolic potential of 26 MGA single-cell amplified genomes sourced from different environments spanning ecothermodynamic gradients including open ocean waters, OMZs and methanogenic environments including a terephthalate-degrading bioreactor. Metagenomic contig recruitment to SAGs combined with tetra-nucleotide frequency distribution patterns resolved nine MGA population genome bins. All population genomes exhibited genomic streamlining with open ocean MGA being the most reduced. Different strategies for carbohydrate utilization, carbon fixation energy metabolism and respiratory pathways were identified between population genome bins, including various roles in the nitrogen and sulfur cycles. MGA inhabiting OMZ oxyclines encoded genes for partial denitrification with potential to feed into anammox and nitrification as well as a polysulfide reductase with a potential role in the cryptic sulfur cycle. MGA inhabiting anoxic waters, encoded NiFe hydrogenase and nitrous oxide reductase with the potential to complete partial denitrification pathways previously linked to sulfur oxidation in SUP05 bacteria. MGA from methanogenic environments encoded genes mediating cascading syntrophic interactions with fatty acid degraders and methanogens including reverse electron transport potential. The MGA phylum appears to have evolved alternative metabolic innovations adapting specific subgroups to occupy specific niches along ecothermodynamic gradients. Additionally, expression of MGA genes from different OMZ environments supports that these subgroups manifest an increasing propensity for co-metabolic interactions under energy limiting conditions that mandates a cooperative mode of existence with important implications for C, N and S cycling in

  3. Hybridization Reveals the Evolving Genomic Architecture of Speciation

    PubMed Central

    Kronforst, Marcus R.; Hansen, Matthew E.B.; Crawford, Nicholas G.; Gallant, Jason R.; Zhang, Wei; Kulathinal, Rob J.; Kapan, Durrell D.; Mullen, Sean P.

    2014-01-01

    SUMMARY The rate at which genomes diverge during speciation is unknown, as are the physical dynamics of the process. Here, we compare full genome sequences of 32 butterflies, representing five species from a hybridizing Heliconius butterfly community, to examine genome-wide patterns of introgression and infer how divergence evolves during the speciation process. Our analyses reveal that initial divergence is restricted to a small fraction of the genome, largely clustered around known wing-patterning genes. Over time, divergence evolves rapidly, due primarily to the origin of new divergent regions. Furthermore, divergent genomic regions display signatures of both selection and adaptive introgression, demonstrating the link between microevolutionary processes acting within species and the origin of species across macroevolutionary timescales. Our results provide a uniquely comprehensive portrait of the evolving species boundary due to the role that hybridization plays in reducing the background accumulation of divergence at neutral sites. PMID:24183670

  4. Modeling heterogeneous (co)variances from adjacent-SNP groups improves genomic prediction for milk protein composition traits.

    PubMed

    Gebreyesus, Grum; Lund, Mogens S; Buitenhuis, Bart; Bovenhuis, Henk; Poulsen, Nina A; Janss, Luc G

    2017-12-05

    Accurate genomic prediction requires a large reference population, which is problematic for traits that are expensive to measure. Traits related to milk protein composition are not routinely recorded due to costly procedures and are considered to be controlled by a few quantitative trait loci of large effect. The amount of variation explained may vary between regions leading to heterogeneous (co)variance patterns across the genome. Genomic prediction models that can efficiently take such heterogeneity of (co)variances into account can result in improved prediction reliability. In this study, we developed and implemented novel univariate and bivariate Bayesian prediction models, based on estimates of heterogeneous (co)variances for genome segments (BayesAS). Available data consisted of milk protein composition traits measured on cows and de-regressed proofs of total protein yield derived for bulls. Single-nucleotide polymorphisms (SNPs), from 50K SNP arrays, were grouped into non-overlapping genome segments. A segment was defined as one SNP, or a group of 50, 100, or 200 adjacent SNPs, or one chromosome, or the whole genome. Traditional univariate and bivariate genomic best linear unbiased prediction (GBLUP) models were also run for comparison. Reliabilities were calculated through a resampling strategy and using deterministic formula. BayesAS models improved prediction reliability for most of the traits compared to GBLUP models and this gain depended on segment size and genetic architecture of the traits. The gain in prediction reliability was especially marked for the protein composition traits β-CN, κ-CN and β-LG, for which prediction reliabilities were improved by 49 percentage points on average using the MT-BayesAS model with a 100-SNP segment size compared to the bivariate GBLUP. Prediction reliabilities were highest with the BayesAS model that uses a 100-SNP segment size. The bivariate versions of our BayesAS models resulted in extra gains of up to 6% in

  5. Evolutionary Genomics of Fast Evolving Tunicates

    PubMed Central

    Berná, Luisa; Alvarez-Valin, Fernando

    2014-01-01

    Tunicates have been extensively studied because of their crucial phylogenetic location (the closest living relatives of vertebrates) and particular developmental plan. Recent genome efforts have disclosed that tunicates are also remarkable in their genome organization and molecular evolutionary patterns. Here, we review these latter aspects, comparing the similarities and specificities of two model species of the group: Oikopleura dioica and Ciona intestinalis. These species exhibit great genome plasticity and Oikopleura in particular has undergone a process of extreme genome reduction and compaction that can be explained in part by gene loss, but is mostly due to other mechanisms such as shortening of intergenic distances and introns, and scarcity of mobile elements. In Ciona, genome reorganization was less severe being more similar to the other chordates in several aspects. Rates and patterns of molecular evolution are also peculiar in tunicates, being Ciona about 50% faster than vertebrates and Oikopleura three times faster. In fact, the latter species is considered as the fastest evolving metazoan recorded so far. Two processes of increase in evolutionary rates have taken place in tunicates. One of them is more extreme, and basically restricted to genes encoding regulatory proteins (transcription regulators, chromatin remodeling proteins, and metabolic regulators), and the other one is less pronounced but affects the whole genome. Very likely adaptive evolution has played a very significant role in the first, whereas the functional and/or evolutionary causes of the second are less clear and the evidence is not conclusive. The evidences supporting the incidence of increased mutation and less efficient negative selection are presented and discussed. PMID:25008364

  6. Evolving Approaches to the Ethical Management of Genomic Data

    PubMed Central

    Boyer, Joy T.; Sun, Kathie Y.

    2013-01-01

    The ethical landscape in the field of genomics is rapidly shifting. Plummeting sequencing costs, along with ongoing advances in bioinformatics, now make it possible to generate an enormous volume of genomic data about vast numbers of people. The informational richness, complexity, and frequently uncertain meaning of these data, coupled with evolving norms surrounding the sharing of data and samples and persistent privacy concerns, have generated a range of approaches to the ethical management of genomic information. As calls increase for the expanded use of broad or even open consent, and as controversy grows about how best to handle incidental genomic findings, these approaches, informed by normative analysis and empirical data, will continue to evolve alongside the science. PMID:23453621

  7. Evolving approaches to the ethical management of genomic data.

    PubMed

    McEwen, Jean E; Boyer, Joy T; Sun, Kathie Y

    2013-06-01

    The ethical landscape in the field of genomics is rapidly shifting. Plummeting sequencing costs, along with ongoing advances in bioinformatics, now make it possible to generate an enormous volume of genomic data about vast numbers of people. The informational richness, complexity, and frequently uncertain meaning of these data, coupled with evolving norms surrounding the sharing of data and samples and persistent privacy concerns, have generated a range of approaches to the ethical management of genomic information. As calls increase for the expanded use of broad or even open consent, and as controversy grows about how best to handle incidental genomic findings, these approaches, informed by normative analysis and empirical data, will continue to evolve alongside the science. Published by Elsevier Ltd.

  8. Genome-Wide Analysis in Three Fusarium Pathogens Identifies Rapidly Evolving Chromosomes and Genes Associated with Pathogenicity

    PubMed Central

    Sperschneider, Jana; Gardiner, Donald M.; Thatcher, Louise F.; Lyons, Rebecca; Singh, Karam B.; Manners, John M.; Taylor, Jennifer M.

    2015-01-01

    Pathogens and hosts are in an ongoing arms race and genes involved in host–pathogen interactions are likely to undergo diversifying selection. Fusarium plant pathogens have evolved diverse infection strategies, but how they interact with their hosts in the biotrophic infection stage remains puzzling. To address this, we analyzed the genomes of three Fusarium plant pathogens for genes that are under diversifying selection. We found a two-speed genome structure both on the chromosome and gene group level. Diversifying selection acts strongly on the dispensable chromosomes in Fusarium oxysporum f. sp. lycopersici and on distinct core chromosome regions in Fusarium graminearum, all of which have associations with virulence. Members of two gene groups evolve rapidly, namely those that encode proteins with an N-terminal [SG]-P-C-[KR]-P sequence motif and proteins that are conserved predominantly in pathogens. Specifically, 29 F. graminearum genes are rapidly evolving, in planta induced and encode secreted proteins, strongly pointing toward effector function. In summary, diversifying selection in Fusarium is strongly reflected as genomic footprints and can be used to predict a small gene set likely to be involved in host–pathogen interactions for experimental verification. PMID:25994930

  9. Sauropod dinosaurs evolved moderately sized genomes unrelated to body size.

    PubMed

    Organ, Chris L; Brusatte, Stephen L; Stein, Koen

    2009-12-22

    Sauropodomorph dinosaurs include the largest land animals to have ever lived, some reaching up to 10 times the mass of an African elephant. Despite their status defining the upper range for body size in land animals, it remains unknown whether sauropodomorphs evolved larger-sized genomes than non-avian theropods, their sister taxon, or whether a relationship exists between genome size and body size in dinosaurs, two questions critical for understanding broad patterns of genome evolution in dinosaurs. Here we report inferences of genome size for 10 sauropodomorph taxa. The estimates are derived from a Bayesian phylogenetic generalized least squares approach that generates posterior distributions of regression models relating genome size to osteocyte lacunae volume in extant tetrapods. We estimate that the average genome size of sauropodomorphs was 2.02 pg (range of species means: 1.77-2.21 pg), a value in the upper range of extant birds (mean = 1.42 pg, range: 0.97-2.16 pg) and near the average for extant non-avian reptiles (mean = 2.24 pg, range: 1.05-5.44 pg). The results suggest that the variation in size and architecture of genomes in extinct dinosaurs was lower than the variation found in mammals. A substantial difference in genome size separates the two major clades within dinosaurs, Ornithischia (large genomes) and Saurischia (moderate to small genomes). We find no relationship between body size and estimated genome size in extinct dinosaurs, which suggests that neutral forces did not dominate the evolution of genome size in this group.

  10. Sauropod dinosaurs evolved moderately sized genomes unrelated to body size

    PubMed Central

    Organ, Chris L.; Brusatte, Stephen L.; Stein, Koen

    2009-01-01

    Sauropodomorph dinosaurs include the largest land animals to have ever lived, some reaching up to 10 times the mass of an African elephant. Despite their status defining the upper range for body size in land animals, it remains unknown whether sauropodomorphs evolved larger-sized genomes than non-avian theropods, their sister taxon, or whether a relationship exists between genome size and body size in dinosaurs, two questions critical for understanding broad patterns of genome evolution in dinosaurs. Here we report inferences of genome size for 10 sauropodomorph taxa. The estimates are derived from a Bayesian phylogenetic generalized least squares approach that generates posterior distributions of regression models relating genome size to osteocyte lacunae volume in extant tetrapods. We estimate that the average genome size of sauropodomorphs was 2.02 pg (range of species means: 1.77–2.21 pg), a value in the upper range of extant birds (mean = 1.42 pg, range: 0.97–2.16 pg) and near the average for extant non-avian reptiles (mean = 2.24 pg, range: 1.05–5.44 pg). The results suggest that the variation in size and architecture of genomes in extinct dinosaurs was lower than the variation found in mammals. A substantial difference in genome size separates the two major clades within dinosaurs, Ornithischia (large genomes) and Saurischia (moderate to small genomes). We find no relationship between body size and estimated genome size in extinct dinosaurs, which suggests that neutral forces did not dominate the evolution of genome size in this group. PMID:19793755

  11. Delineating slowly and rapidly evolving fractions of the Drosophila genome.

    PubMed

    Keith, Jonathan M; Adams, Peter; Stephen, Stuart; Mattick, John S

    2008-05-01

    Evolutionary conservation is an important indicator of function and a major component of bioinformatic methods to identify non-protein-coding genes. We present a new Bayesian method for segmenting pairwise alignments of eukaryotic genomes while simultaneously classifying segments into slowly and rapidly evolving fractions. We also describe an information criterion similar to the Akaike Information Criterion (AIC) for determining the number of classes. Working with pairwise alignments enables detection of differences in conservation patterns among closely related species. We analyzed three whole-genome and three partial-genome pairwise alignments among eight Drosophila species. Three distinct classes of conservation level were detected. Sequences comprising the most slowly evolving component were consistent across a range of species pairs, and constituted approximately 62-66% of the D. melanogaster genome. Almost all (>90%) of the aligned protein-coding sequence is in this fraction, suggesting much of it (comprising the majority of the Drosophila genome, including approximately 56% of non-protein-coding sequences) is functional. The size and content of the most rapidly evolving component was species dependent, and varied from 1.6% to 4.8%. This fraction is also enriched for protein-coding sequence (while containing significant amounts of non-protein-coding sequence), suggesting it is under positive selection. We also classified segments according to conservation and GC content simultaneously. This analysis identified numerous sub-classes of those identified on the basis of conservation alone, but was nevertheless consistent with that classification. Software, data, and results available at www.maths.qut.edu.au/-keithj/. Genomic segments comprising the conservation classes available in BED format.

  12. Genomic profiles of low-grade murine gliomas evolve during progression to glioblastoma. | Office of Cancer Genomics

    Cancer.gov

    Background: Gliomas are diverse neoplasms with multiple molecular subtypes. How tumor-initiating mutations relate to molecular subtypes as these tumors evolve during malignant progression remains unclear.Methods: We used genetically engineered mouse models, histopathology, genetic lineage tracing, expression profiling, and copy number analyses to examine how genomic tumor diversity evolves during the course of malignant progression from low- to high-grade disease.

  13. CoCoNUT: an efficient system for the comparison and analysis of genomes

    PubMed Central

    2008-01-01

    Background Comparative genomics is the analysis and comparison of genomes from different species. This area of research is driven by the large number of sequenced genomes and heavily relies on efficient algorithms and software to perform pairwise and multiple genome comparisons. Results Most of the software tools available are tailored for one specific task. In contrast, we have developed a novel system CoCoNUT (Computational Comparative geNomics Utility Toolkit) that allows solving several different tasks in a unified framework: (1) finding regions of high similarity among multiple genomic sequences and aligning them, (2) comparing two draft or multi-chromosomal genomes, (3) locating large segmental duplications in large genomic sequences, and (4) mapping cDNA/EST to genomic sequences. Conclusion CoCoNUT is competitive with other software tools w.r.t. the quality of the results. The use of state of the art algorithms and data structures allows CoCoNUT to solve comparative genomics tasks more efficiently than previous tools. With the improved user interface (including an interactive visualization component), CoCoNUT provides a unified, versatile, and easy-to-use software tool for large scale studies in comparative genomics. PMID:19014477

  14. Contribution of Mobile Group II Introns to Sinorhizobium meliloti Genome Evolution.

    PubMed

    Toro, Nicolás; Martínez-Abarca, Francisco; Molina-Sánchez, María D; García-Rodríguez, Fernando M; Nisa-Martínez, Rafael

    2018-01-01

    Mobile group II introns are ribozymes and retroelements that probably originate from bacteria. Sinorhizobium meliloti , the nitrogen-fixing endosymbiont of legumes of genus Medicago , harbors a large number of these retroelements. One of these elements, RmInt1, has been particularly successful at colonizing this multipartite genome. Many studies have improved our understanding of RmInt1 and phylogenetically related group II introns, their mobility mechanisms, spread and dynamics within S. meliloti and closely related species. Although RmInt1 conserves the ancient retroelement behavior, its evolutionary history suggests that this group II intron has played a role in the short- and long-term evolution of the S. meliloti genome. We will discuss its proposed role in genome evolution by controlling the spread and coexistence of potentially harmful mobile genetic elements, by ectopic transposition to different genetic loci as a source of early genomic variation and by generating sequence variation after a very slow degradation process, through intron remnants that may have continued to evolve, contributing to bacterial speciation.

  15. Contribution of Mobile Group II Introns to Sinorhizobium meliloti Genome Evolution

    PubMed Central

    Toro, Nicolás; Martínez-Abarca, Francisco; Molina-Sánchez, María D.; García-Rodríguez, Fernando M.; Nisa-Martínez, Rafael

    2018-01-01

    Mobile group II introns are ribozymes and retroelements that probably originate from bacteria. Sinorhizobium meliloti, the nitrogen-fixing endosymbiont of legumes of genus Medicago, harbors a large number of these retroelements. One of these elements, RmInt1, has been particularly successful at colonizing this multipartite genome. Many studies have improved our understanding of RmInt1 and phylogenetically related group II introns, their mobility mechanisms, spread and dynamics within S. meliloti and closely related species. Although RmInt1 conserves the ancient retroelement behavior, its evolutionary history suggests that this group II intron has played a role in the short- and long-term evolution of the S. meliloti genome. We will discuss its proposed role in genome evolution by controlling the spread and coexistence of potentially harmful mobile genetic elements, by ectopic transposition to different genetic loci as a source of early genomic variation and by generating sequence variation after a very slow degradation process, through intron remnants that may have continued to evolve, contributing to bacterial speciation. PMID:29670598

  16. A tutorial of diverse genome analysis tools found in the CoGe web-platform using Plasmodium spp. as a model

    PubMed Central

    Castillo, Andreina I; Nelson, Andrew D L; Haug-Baltzell, Asher K; Lyons, Eric

    2018-01-01

    Abstract Integrated platforms for storage, management, analysis and sharing of large quantities of omics data have become fundamental to comparative genomics. CoGe (https://genomevolution.org/coge/) is an online platform designed to manage and study genomic data, enabling both data- and hypothesis-driven comparative genomics. CoGe’s tools and resources can be used to organize and analyse both publicly available and private genomic data from any species. Here, we demonstrate the capabilities of CoGe through three example workflows using 17 Plasmodium genomes as a model. Plasmodium genomes present unique challenges for comparative genomics due to their rapidly evolving and highly variable genomic AT/GC content. These example workflows are intended to serve as templates to help guide researchers who would like to use CoGe to examine diverse aspects of genome evolution. In the first workflow, trends in genome composition and amino acid usage are explored. In the second, changes in genome structure and the distribution of synonymous (Ks) and non-synonymous (Kn) substitution values are evaluated across species with different levels of evolutionary relatedness. In the third workflow, microsyntenic analyses of multigene families’ genomic organization are conducted using two Plasmodium-specific gene families—serine repeat antigen, and cytoadherence-linked asexual gene—as models. In general, these example workflows show how to achieve quick, reproducible and shareable results using the CoGe platform. We were able to replicate previously published results, as well as leverage CoGe’s tools and resources to gain additional insight into various aspects of Plasmodium genome evolution. Our results highlight the usefulness of the CoGe platform, particularly in understanding complex features of genome evolution. Database URL: https://genomevolution.org/coge/

  17. Evolved hexose transporter enhances xylose uptake and glucose/xylose co-utilization in Saccharomyces cerevisiae

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Reider Apel, Amanda; Ouellet, Mario; Szmidt-Middleton, Heather

    Enhancing xylose utilization has been a major focus in Saccharomyces cerevisiae strain-engineering efforts. The incentive for these studies arises from the need to use all sugars in the typical carbon mixtures that comprise standard renewable plant-biomass-based carbon sources. While major advances have been made in developing utilization pathways, the efficient import of five carbon sugars into the cell remains an important bottleneck in this endeavor. Here we use an engineered S. cerevisiae BY4742 strain, containing an established heterologous xylose utilization pathway, and imposed a laboratory evolution regime with xylose as the sole carbon source. We obtained several evolved strains withmore » improved growth phenotypes and evaluated the best candidate using genome resequencing. We observed remarkably few single nucleotide polymorphisms in the evolved strain, among which we confirmed a single amino acid change in the hexose transporter HXT7 coding sequence to be responsible for the evolved phenotype. Lastly, the mutant HXT7(F79S) shows improved xylose uptake rates (Vmax = 186.4 ± 20.1 nmol•min -1•mg -1) that allows the S. cerevisiae strain to show significant growth with xylose as the sole carbon source, as well as partial co-utilization of glucose and xylose in a mixed sugar cultivation.« less

  18. Evolved hexose transporter enhances xylose uptake and glucose/xylose co-utilization in Saccharomyces cerevisiae

    DOE PAGES

    Reider Apel, Amanda; Ouellet, Mario; Szmidt-Middleton, Heather; ...

    2016-01-19

    Enhancing xylose utilization has been a major focus in Saccharomyces cerevisiae strain-engineering efforts. The incentive for these studies arises from the need to use all sugars in the typical carbon mixtures that comprise standard renewable plant-biomass-based carbon sources. While major advances have been made in developing utilization pathways, the efficient import of five carbon sugars into the cell remains an important bottleneck in this endeavor. Here we use an engineered S. cerevisiae BY4742 strain, containing an established heterologous xylose utilization pathway, and imposed a laboratory evolution regime with xylose as the sole carbon source. We obtained several evolved strains withmore » improved growth phenotypes and evaluated the best candidate using genome resequencing. We observed remarkably few single nucleotide polymorphisms in the evolved strain, among which we confirmed a single amino acid change in the hexose transporter HXT7 coding sequence to be responsible for the evolved phenotype. Lastly, the mutant HXT7(F79S) shows improved xylose uptake rates (Vmax = 186.4 ± 20.1 nmol•min -1•mg -1) that allows the S. cerevisiae strain to show significant growth with xylose as the sole carbon source, as well as partial co-utilization of glucose and xylose in a mixed sugar cultivation.« less

  19. Assembler: Efficient Discovery of Spatial Co-evolving Patterns in Massive Geo-sensory Data.

    PubMed

    Zhang, Chao; Zheng, Yu; Ma, Xiuli; Han, Jiawei

    2015-08-01

    Recent years have witnessed the wide proliferation of geo-sensory applications wherein a bundle of sensors are deployed at different locations to cooperatively monitor the target condition. Given massive geo-sensory data, we study the problem of mining spatial co-evolving patterns (SCPs), i.e ., groups of sensors that are spatially correlated and co-evolve frequently in their readings. SCP mining is of great importance to various real-world applications, yet it is challenging because (1) the truly interesting evolutions are often flooded by numerous trivial fluctuations in the geo-sensory time series; and (2) the pattern search space is extremely large due to the spatiotemporal combinatorial nature of SCP. In this paper, we propose a two-stage method called Assembler. In the first stage, Assembler filters trivial fluctuations using wavelet transform and detects frequent evolutions for individual sensors via a segment-and-group approach. In the second stage, Assembler generates SCPs by assembling the frequent evolutions of individual sensors. Leveraging the spatial constraint, it conceptually organizes all the SCPs into a novel structure called the SCP search tree, which facilitates the effective pruning of the search space to generate SCPs efficiently. Our experiments on both real and synthetic data sets show that Assembler is effective, efficient, and scalable.

  20. CoGI: Towards Compressing Genomes as an Image.

    PubMed

    Xie, Xiaojing; Zhou, Shuigeng; Guan, Jihong

    2015-01-01

    Genomic science is now facing an explosive increase of data thanks to the fast development of sequencing technology. This situation poses serious challenges to genomic data storage and transferring. It is desirable to compress data to reduce storage and transferring cost, and thus to boost data distribution and utilization efficiency. Up to now, a number of algorithms / tools have been developed for compressing genomic sequences. Unlike the existing algorithms, most of which treat genomes as one-dimensional text strings and compress them based on dictionaries or probability models, this paper proposes a novel approach called CoGI (the abbreviation of Compressing Genomes as an Image) for genome compression, which transforms the genomic sequences to a two-dimensional binary image (or bitmap), then applies a rectangular partition coding algorithm to compress the binary image. CoGI can be used as either a reference-based compressor or a reference-free compressor. For the former, we develop two entropy-based algorithms to select a proper reference genome. Performance evaluation is conducted on various genomes. Experimental results show that the reference-based CoGI significantly outperforms two state-of-the-art reference-based genome compressors GReEn and RLZ-opt in both compression ratio and compression efficiency. It also achieves comparable compression ratio but two orders of magnitude higher compression efficiency in comparison with XM--one state-of-the-art reference-free genome compressor. Furthermore, our approach performs much better than Gzip--a general-purpose and widely-used compressor, in both compression speed and compression ratio. So, CoGI can serve as an effective and practical genome compressor. The source code and other related documents of CoGI are available at: http://admis.fudan.edu.cn/projects/cogi.htm.

  1. Co-creating meaningful structures within long-term psychotherapy group culture.

    PubMed

    Gayle, Robin G

    2009-07-01

    Meaningful group structures are co-created within the long-term outpatient psychotherapy group through a hermeneutical interaction between structure and immediate experience of structure by individuals embedded in personal and collective contexts. Co-created meanings expand original group- and self-understandings and further evolve structures that are stable yet do not exist independently of the narratives and affects of the members who interact with them. Group structures do not reduce, expand, or dissolve but change in connection to the experiences and meaning attributions within the group. This intersubjective process mediates the emphasis within group theory on leader responsibility for culture building that risks overpromoting certain psychotherapeutic cultural intentions over others. Three examples of intersubjective hermeneutical interaction within long-term psychotherapy groups lend insight into global, cultural, and societal groups.

  2. Comprehensive Genome-Wide Classification Reveals That Many Plant-Specific Transcription Factors Evolved in Streptophyte Algae

    PubMed Central

    Wilhelmsson, Per K I; Mühlich, Cornelia; Ullrich, Kristian K

    2017-01-01

    Abstract Plant genomes encode many lineage-specific, unique transcription factors. Expansion of such gene families has been previously found to coincide with the evolution of morphological complexity, although comparative analyses have been hampered by severe sampling bias. Here, we make use of the recently increased availability of plant genomes. We have updated and expanded previous rule sets for domain-based classification of transcription associated proteins (TAPs), comprising transcription factors and transcriptional regulators. The genome-wide annotation of these protein families has been analyzed and made available via the novel TAPscan web interface. We find that many TAP families previously thought to be specific for land plants actually evolved in streptophyte (charophyte) algae; 26 out of 36 TAP family gains are inferred to have occurred in the common ancestor of the Streptophyta (uniting the land plants—Embryophyta—with their closest algal relatives). In contrast, expansions of TAP families were found to occur throughout streptophyte evolution. 17 out of 76 expansion events were found to be common to all land plants and thus probably evolved concomitant with the water-to-land-transition. PMID:29216360

  3. Feature co-localization landscape of the human genome

    PubMed Central

    Ng, Siu-Kin; Hu, Taobo; Long, Xi; Chan, Cheuk-Hin; Tsang, Shui-Ying; Xue, Hong

    2016-01-01

    Although feature co-localizations could serve as useful guide-posts to genome architecture, a comprehensive and quantitative feature co-localization map of the human genome has been lacking. Herein we show that, in contrast to the conventional bipartite division of genomic sequences into genic and inter-genic regions, pairwise co-localizations of forty-two genomic features in the twenty-two autosomes based on 50-kb to 2,000-kb sequence windows indicate a tripartite zonal architecture comprising Genic zones enriched with gene-related features and Alu-elements; Proximal zones enriched with MIR- and L2-elements, transcription-factor-binding-sites (TFBSs), and conserved-indels (CIDs); and Distal zones enriched with L1-elements. Co-localizations between single-nucleotide-polymorphisms (SNPs) and copy-number-variations (CNVs) reveal a fraction of sequence windows displaying steeply enhanced levels of SNPs, CNVs and recombination rates that point to active adaptive evolution in such pathways as immune response, sensory perceptions, and cognition. The strongest positive co-localization observed between TFBSs and CIDs suggests a regulatory role of CIDs in cooperation with TFBSs. The positive co-localizations of cancer somatic CNVs (CNVT) with all Proximal zone and most Genic zone features, in contrast to the distinctly more restricted co-localizations exhibited by germline CNVs (CNVG), reveal disparate distributions of CNVTs and CNVGs indicative of dissimilarity in their underlying mechanisms. PMID:26854351

  4. Whole-genome phylogeny of Escherichia coli/Shigella group by feature frequency profiles (FFPs)

    PubMed Central

    Sims, Gregory E.; Kim, Sung-Hou

    2011-01-01

    A whole-genome phylogeny of the Escherichia coli/Shigella group was constructed by using the feature frequency profile (FFP) method. This alignment-free approach uses the frequencies of l-mer features of whole genomes to infer phylogenic distances. We present two phylogenies that accentuate different aspects of E. coli/Shigella genomic evolution: (i) one based on the compositions of all possible features of length l = 24 (∼8.4 million features), which are likely to reveal the phenetic grouping and relationship among the organisms and (ii) the other based on the compositions of core features with low frequency and low variability (∼0.56 million features), which account for ∼69% of all commonly shared features among 38 taxa examined and are likely to have genome-wide lineal evolutionary signal. Shigella appears as a single clade when all possible features are used without filtering of noncore features. However, results using core features show that Shigella consists of at least two distantly related subclades, implying that the subclades evolved into a single clade because of a high degree of convergence influenced by mobile genetic elements and niche adaptation. In both FFP trees, the basal group of the E. coli/Shigella phylogeny is the B2 phylogroup, which contains primarily uropathogenic strains, suggesting that the E. coli/Shigella ancestor was likely a facultative or opportunistic pathogen. The extant commensal strains diverged relatively late and appear to be the result of reductive evolution of genomes. We also identify clade distinguishing features and their associated genomic regions within each phylogroup. Such features may provide useful information for understanding evolution of the groups and for quick diagnostic identification of each phylogroup. PMID:21536867

  5. Molecular abundances and C/O ratios in chemically evolving planet-forming disk midplanes

    NASA Astrophysics Data System (ADS)

    Eistrup, Christian; Walsh, Catherine; van Dishoeck, Ewine F.

    2018-05-01

    Context. Exoplanet atmospheres are thought be built up from accretion of gas as well as pebbles and planetesimals in the midplanes of planet-forming disks. The chemical composition of this material is usually assumed to be unchanged during the disk lifetime. However, chemistry can alter the relative abundances of molecules in this planet-building material. Aims: We aim to assess the impact of disk chemistry during the era of planet formation. This is done by investigating the chemical changes to volatile gases and ices in a protoplanetary disk midplane out to 30 AU for up to 7 Myr, considering a variety of different conditions, including a physical midplane structure that is evolving in time, and also considering two disks with different masses. Methods: An extensive kinetic chemistry gas-grain reaction network was utilised to evolve the abundances of chemical species over time. Two disk midplane ionisation levels (low and high) were explored, as well as two different makeups of the initial abundances ("inheritance" or "reset"). Results: Given a high level of ionisation, chemical evolution in protoplanetary disk midplanes becomes significant after a few times 105 yr, and is still ongoing by 7 Myr between the H2O and the O2 icelines. Inside the H2O iceline, and in the outer, colder regions of the disk midplane outside the O2 iceline, the relative abundances of the species reach (close to) steady state by 7 Myr. Importantly, the changes in the abundances of the major elemental carbon and oxygen-bearing molecules imply that the traditional "stepfunction" for the C/O ratios in gas and ice in the disk midplane (as defined by sharp changes at icelines of H2O, CO2 and CO) evolves over time, and cannot be assumed fixed, with the C/O ratio in the gas even becoming smaller than the C/O ratio in the ice. In addition, at lower temperatures (<29 K), gaseous CO colliding with the grains gets converted into CO2 and other more complex ices, lowering the CO gas abundance between

  6. Developing improved durum wheat germplasm by altering the cytoplasmic genome

    USDA-ARS?s Scientific Manuscript database

    In eukaryotic organisms, nuclear and cytoplasmic genomes interact to drive cellular functions. These genomes have co-evolved to form specific nuclear-cytoplasmic interactions that are essential to the origin, success, and evolution of diploid and polyploid species. Hundreds of genetic diseases in h...

  7. Genome duplication and mutations in ACE2 cause multicellular, fast-sedimenting phenotypes in evolved Saccharomyces cerevisiae

    PubMed Central

    Oud, Bart; Guadalupe-Medina, Victor; Nijkamp, Jurgen F.; de Ridder, Dick; Pronk, Jack T.; van Maris, Antonius J. A.; Daran, Jean-Marc

    2013-01-01

    Laboratory evolution of the yeast Saccharomyces cerevisiae in bioreactor batch cultures yielded variants that grow as multicellular, fast-sedimenting clusters. Knowledge of the molecular basis of this phenomenon may contribute to the understanding of natural evolution of multicellularity and to manipulating cell sedimentation in laboratory and industrial applications of S. cerevisiae. Multicellular, fast-sedimenting lineages obtained from a haploid S. cerevisiae strain in two independent evolution experiments were analyzed by whole genome resequencing. The two evolved cell lines showed different frameshift mutations in a stretch of eight adenosines in ACE2, which encodes a transcriptional regulator involved in cell cycle control and mother-daughter cell separation. Introduction of the two ace2 mutant alleles into the haploid parental strain led to slow-sedimenting cell clusters that consisted of just a few cells, thus representing only a partial reconstruction of the evolved phenotype. In addition to single-nucleotide mutations, a whole-genome duplication event had occurred in both evolved multicellular strains. Construction of a diploid reference strain with two mutant ace2 alleles led to complete reconstruction of the multicellular-fast sedimenting phenotype. This study shows that whole-genome duplication and a frameshift mutation in ACE2 are sufficient to generate a fast-sedimenting, multicellular phenotype in S. cerevisiae. The nature of the ace2 mutations and their occurrence in two independent evolution experiments encompassing fewer than 500 generations of selective growth suggest that switching between unicellular and multicellular phenotypes may be relevant for competitiveness of S. cerevisiae in natural environments. PMID:24145419

  8. Network Analysis of Earth's Co-Evolving Geosphere and Biosphere

    NASA Astrophysics Data System (ADS)

    Hazen, R. M.; Eleish, A.; Liu, C.; Morrison, S. M.; Meyer, M.; Consortium, K. D.

    2017-12-01

    A fundamental goal of Earth science is the deep understanding of Earth's dynamic, co-evolving geosphere and biosphere through deep time. Network analysis of geo- and bio- `big data' provides an interactive, quantitative, and predictive visualization framework to explore complex and otherwise hidden high-dimension features of diversity, distribution, and change in the evolution of Earth's geochemistry, mineralogy, paleobiology, and biochemistry [1]. Networks also facilitate quantitative comparison of different geological time periods, tectonic settings, and geographical regions, as well as different planets and moons, through network metrics, including density, centralization, diameter, and transitivity.We render networks by employing data related to geographical, paragenetic, environmental, or structural relationships among minerals, fossils, proteins, and microbial taxa. An important recent finding is that the topography of many networks reflects parameters not explicitly incorporated in constructing the network. For example, networks for minerals, fossils, and protein structures reveal embedded qualitative time axes, with additional network geometries possibly related to extinction and/or other punctuation events (see Figure). Other axes related to chemical activities and volatile fugacities, as well as pressure and/or depth of formation, may also emerge from network analysis. These patterns provide new insights into the way planets evolve, especially Earth's co-evolving geosphere and biosphere. 1. Morrison, S.M. et al. (2017) Network analysis of mineralogical systems. American Mineralogist 102, in press. Figure Caption: A network of Phanerozoic Era fossil animals from the past 540 million years includes blue, red, and black circles (nodes) representing family-level taxa and grey lines (links) between coexisting families. Age information was not used in the construction of this network; nevertheless an intrinsic timeline is embedded in the network topology. In

  9. Correlations and analytical approaches to co-evolving voter models

    NASA Astrophysics Data System (ADS)

    Ji, M.; Xu, C.; Choi, C. W.; Hui, P. M.

    2013-11-01

    The difficulty in formulating analytical treatments in co-evolving networks is studied in light of the Vazquez-Eguíluz-San Miguel voter model (VM) and a modified VM (MVM) that introduces a random mutation of the opinion as a noise in the VM. The density of active links, which are links that connect the nodes of opposite opinions, is shown to be highly sensitive to both the degree k of a node and the active links n among the neighbors of a node. We test the validity in the formalism of analytical approaches and show explicitly that the assumptions behind the commonly used homogeneous pair approximation scheme in formulating a mean-field theory are the source of the theory's failure due to the strong correlations between k, n and n2. An improved approach that incorporates spatial correlation to the nearest-neighbors explicitly and a random approximation for the next-nearest neighbors is formulated for the VM and the MVM, and it gives better agreement with the simulation results. We introduce an empirical approach that quantifies the correlations more accurately and gives results in good agreement with the simulation results. The work clarifies why simply mean-field theory fails and sheds light on how to analyze the correlations in the dynamic equations that are often generated in co-evolving processes.

  10. COGNAT: a web server for comparative analysis of genomic neighborhoods.

    PubMed

    Klimchuk, Olesya I; Konovalov, Kirill A; Perekhvatov, Vadim V; Skulachev, Konstantin V; Dibrova, Daria V; Mulkidjanian, Armen Y

    2017-11-22

    In prokaryotic genomes, functionally coupled genes can be organized in conserved gene clusters enabling their coordinated regulation. Such clusters could contain one or several operons, which are groups of co-transcribed genes. Those genes that evolved from a common ancestral gene by speciation (i.e. orthologs) are expected to have similar genomic neighborhoods in different organisms, whereas those copies of the gene that are responsible for dissimilar functions (i.e. paralogs) could be found in dissimilar genomic contexts. Comparative analysis of genomic neighborhoods facilitates the prediction of co-regulated genes and helps to discern different functions in large protein families. We intended, building on the attribution of gene sequences to the clusters of orthologous groups of proteins (COGs), to provide a method for visualization and comparative analysis of genomic neighborhoods of evolutionary related genes, as well as a respective web server. Here we introduce the COmparative Gene Neighborhoods Analysis Tool (COGNAT), a web server for comparative analysis of genomic neighborhoods. The tool is based on the COG database, as well as the Pfam protein families database. As an example, we show the utility of COGNAT in identifying a new type of membrane protein complex that is formed by paralog(s) of one of the membrane subunits of the NADH:quinone oxidoreductase of type 1 (COG1009) and a cytoplasmic protein of unknown function (COG3002). This article was reviewed by Drs. Igor Zhulin, Uri Gophna and Igor Rogozin.

  11. Variability among the Most Rapidly Evolving Plastid Genomic Regions is Lineage-Specific: Implications of Pairwise Genome Comparisons in Pyrus (Rosaceae) and Other Angiosperms for Marker Choice

    PubMed Central

    Ter-Voskanyan, Hasmik; Allgaier, Martin; Borsch, Thomas

    2014-01-01

    Plastid genomes exhibit different levels of variability in their sequences, depending on the respective kinds of genomic regions. Genes are usually more conserved while noncoding introns and spacers evolve at a faster pace. While a set of about thirty maximum variable noncoding genomic regions has been suggested to provide universally promising phylogenetic markers throughout angiosperms, applications often require several regions to be sequenced for many individuals. Our project aims to illuminate evolutionary relationships and species-limits in the genus Pyrus (Rosaceae)—a typical case with very low genetic distances between taxa. In this study, we have sequenced the plastid genome of Pyrus spinosa and aligned it to the already available P. pyrifolia sequence. The overall p-distance of the two Pyrus genomes was 0.00145. The intergenic spacers between ndhC–trnV, trnR–atpA, ndhF–rpl32, psbM–trnD, and trnQ–rps16 were the most variable regions, also comprising the highest total numbers of substitutions, indels and inversions (potentially informative characters). Our comparative analysis of further plastid genome pairs with similar low p-distances from Oenothera (representing another rosid), Olea (asterids) and Cymbidium (monocots) showed in each case a different ranking of genomic regions in terms of variability and potentially informative characters. Only two intergenic spacers (ndhF–rpl32 and trnK–rps16) were consistently found among the 30 top-ranked regions. We have mapped the occurrence of substitutions and microstructural mutations in the four genome pairs. High AT content in specific sequence elements seems to foster frequent mutations. We conclude that the variability among the fastest evolving plastid genomic regions is lineage-specific and thus cannot be precisely predicted across angiosperms. The often lineage-specific occurrence of stem-loop elements in the sequences of introns and spacers also governs lineage-specific mutations

  12. Recombinant transfer in the basic genome of E. coli

    DOE PAGES

    Dixit, Purushottam; Studier, F. William; Pang, Tin Yau; ...

    2015-07-07

    An approximation to the ~4-Mbp basic genome shared by 32 strains of E. coli representing six evolutionary groups has been derived and analyzed computationally. A multiple-alignment of the 32 complete genome sequences was filtered to remove mobile elements and identify the most reliable ~90% of the aligned length of each of the resulting 496 basic-genome pairs. Patterns of single bp mutations (SNPs) in aligned pairs distinguish clonally inherited regions from regions where either genome has acquired DNA fragments from diverged genomes by homologous recombination since their last common ancestor. Such recombinant transfer is pervasive across the basic genome, mostly betweenmore » genomes in the same evolutionary group, and generates many unique mosaic patterns. The six least-diverged genome-pairs have one or two recombinant transfers of length ~40–115 kbp (and few if any other transfers), each containing one or more gene clusters known to confer strong selective advantage in some environments. Moderately diverged genome pairs (0.4–1% SNPs) show mosaic patterns of interspersed clonal and recombinant regions of varying lengths throughout the basic genome, whereas more highly diverged pairs within an evolutionary group or pairs between evolutionary groups having >1.3% SNPs have few clonal matches longer than a few kbp. Many recombinant transfers appear to incorporate fragments of the entering DNA produced by restriction systems of the recipient cell. A simple computational model can closely fit the data. As a result, most recombinant transfers seem likely to be due to generalized transduction by co-evolving populations of phages, which could efficiently distribute variability throughout bacterial genomes.« less

  13. Recombinant transfer in the basic genome of E. coli

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Dixit, Purushottam; Studier, F. William; Pang, Tin Yau

    An approximation to the ~4-Mbp basic genome shared by 32 strains of E. coli representing six evolutionary groups has been derived and analyzed computationally. A multiple-alignment of the 32 complete genome sequences was filtered to remove mobile elements and identify the most reliable ~90% of the aligned length of each of the resulting 496 basic-genome pairs. Patterns of single bp mutations (SNPs) in aligned pairs distinguish clonally inherited regions from regions where either genome has acquired DNA fragments from diverged genomes by homologous recombination since their last common ancestor. Such recombinant transfer is pervasive across the basic genome, mostly betweenmore » genomes in the same evolutionary group, and generates many unique mosaic patterns. The six least-diverged genome-pairs have one or two recombinant transfers of length ~40–115 kbp (and few if any other transfers), each containing one or more gene clusters known to confer strong selective advantage in some environments. Moderately diverged genome pairs (0.4–1% SNPs) show mosaic patterns of interspersed clonal and recombinant regions of varying lengths throughout the basic genome, whereas more highly diverged pairs within an evolutionary group or pairs between evolutionary groups having >1.3% SNPs have few clonal matches longer than a few kbp. Many recombinant transfers appear to incorporate fragments of the entering DNA produced by restriction systems of the recipient cell. A simple computational model can closely fit the data. As a result, most recombinant transfers seem likely to be due to generalized transduction by co-evolving populations of phages, which could efficiently distribute variability throughout bacterial genomes.« less

  14. Group normalization for genomic data.

    PubMed

    Ghandi, Mahmoud; Beer, Michael A

    2012-01-01

    Data normalization is a crucial preliminary step in analyzing genomic datasets. The goal of normalization is to remove global variation to make readings across different experiments comparable. In addition, most genomic loci have non-uniform sensitivity to any given assay because of variation in local sequence properties. In microarray experiments, this non-uniform sensitivity is due to different DNA hybridization and cross-hybridization efficiencies, known as the probe effect. In this paper we introduce a new scheme, called Group Normalization (GN), to remove both global and local biases in one integrated step, whereby we determine the normalized probe signal by finding a set of reference probes with similar responses. Compared to conventional normalization methods such as Quantile normalization and physically motivated probe effect models, our proposed method is general in the sense that it does not require the assumption that the underlying signal distribution be identical for the treatment and control, and is flexible enough to correct for nonlinear and higher order probe effects. The Group Normalization algorithm is computationally efficient and easy to implement. We also describe a variant of the Group Normalization algorithm, called Cross Normalization, which efficiently amplifies biologically relevant differences between any two genomic datasets.

  15. Cooperative behavior and phase transitions in co-evolving stag hunt game

    NASA Astrophysics Data System (ADS)

    Zhang, W.; Li, Y. S.; Xu, C.; Hui, P. M.

    2016-02-01

    Cooperative behavior and different phases in a co-evolving network dynamics based on the stag hunt game is studied. The dynamical processes are parameterized by a payoff r that tends to promote non-cooperative behavior and a probability q for a rewiring attempt that could isolate the non-cooperators. The interplay between the parameters leads to different phases. Detailed simulations and a mean field theory are employed to reveal the properties of different phases. For small r, the cooperators are the majority and form a connected cluster while the non-cooperators increase with q but remain isolated over the whole range of q, and it is a static phase. For sufficiently large r, cooperators disappear in an intermediate range qL ≤ q ≤qU and a dynamical all-non-cooperators phase results. For q >qU, a static phase results again. A mean field theory based on how the link densities change in time by the co-evolving dynamics is constructed. The theory gives a phase diagram in the q- r parameter space that is qualitatively in agreement with simulation results. The sources of discrepancies between theory and simulations are discussed.

  16. MicroScope in 2017: an expanding and evolving integrated resource for community expertise of microbial genomes

    PubMed Central

    Vallenet, David; Calteau, Alexandra; Cruveiller, Stéphane; Gachet, Mathieu; Lajus, Aurélie; Josso, Adrien; Mercier, Jonathan; Renaux, Alexandre; Rollin, Johan; Rouy, Zoe; Roche, David; Scarpelli, Claude; Médigue, Claudine

    2017-01-01

    The annotation of genomes from NGS platforms needs to be automated and fully integrated. However, maintaining consistency and accuracy in genome annotation is a challenging problem because millions of protein database entries are not assigned reliable functions. This shortcoming limits the knowledge that can be extracted from genomes and metabolic models. Launched in 2005, the MicroScope platform (http://www.genoscope.cns.fr/agc/microscope) is an integrative resource that supports systematic and efficient revision of microbial genome annotation, data management and comparative analysis. Effective comparative analysis requires a consistent and complete view of biological data, and therefore, support for reviewing the quality of functional annotation is critical. MicroScope allows users to analyze microbial (meta)genomes together with post-genomic experiment results if any (i.e. transcriptomics, re-sequencing of evolved strains, mutant collections, phenotype data). It combines tools and graphical interfaces to analyze genomes and to perform the expert curation of gene functions in a comparative context. Starting with a short overview of the MicroScope system, this paper focuses on some major improvements of the Web interface, mainly for the submission of genomic data and on original tools and pipelines that have been developed and integrated in the platform: computation of pan-genomes and prediction of biosynthetic gene clusters. Today the resource contains data for more than 6000 microbial genomes, and among the 2700 personal accounts (65% of which are now from foreign countries), 14% of the users are performing expert annotations, on at least a weekly basis, contributing to improve the quality of microbial genome annotations. PMID:27899624

  17. Interrogation of Mammalian Protein Complex Structure, Function, and Membership Using Genome-Scale Fitness Screens. | Office of Cancer Genomics

    Cancer.gov

    Protein complexes are assemblies of subunits that have co-evolved to execute one or many coordinated functions in the cellular environment. Functional annotation of mammalian protein complexes is critical to understanding biological processes, as well as disease mechanisms. Here, we used genetic co-essentiality derived from genome-scale RNAi- and CRISPR-Cas9-based fitness screens performed across hundreds of human cancer cell lines to assign measures of functional similarity.

  18. Group Normalization for Genomic Data

    PubMed Central

    Ghandi, Mahmoud; Beer, Michael A.

    2012-01-01

    Data normalization is a crucial preliminary step in analyzing genomic datasets. The goal of normalization is to remove global variation to make readings across different experiments comparable. In addition, most genomic loci have non-uniform sensitivity to any given assay because of variation in local sequence properties. In microarray experiments, this non-uniform sensitivity is due to different DNA hybridization and cross-hybridization efficiencies, known as the probe effect. In this paper we introduce a new scheme, called Group Normalization (GN), to remove both global and local biases in one integrated step, whereby we determine the normalized probe signal by finding a set of reference probes with similar responses. Compared to conventional normalization methods such as Quantile normalization and physically motivated probe effect models, our proposed method is general in the sense that it does not require the assumption that the underlying signal distribution be identical for the treatment and control, and is flexible enough to correct for nonlinear and higher order probe effects. The Group Normalization algorithm is computationally efficient and easy to implement. We also describe a variant of the Group Normalization algorithm, called Cross Normalization, which efficiently amplifies biologically relevant differences between any two genomic datasets. PMID:22912661

  19. MicroScope in 2017: an expanding and evolving integrated resource for community expertise of microbial genomes.

    PubMed

    Vallenet, David; Calteau, Alexandra; Cruveiller, Stéphane; Gachet, Mathieu; Lajus, Aurélie; Josso, Adrien; Mercier, Jonathan; Renaux, Alexandre; Rollin, Johan; Rouy, Zoe; Roche, David; Scarpelli, Claude; Médigue, Claudine

    2017-01-04

    The annotation of genomes from NGS platforms needs to be automated and fully integrated. However, maintaining consistency and accuracy in genome annotation is a challenging problem because millions of protein database entries are not assigned reliable functions. This shortcoming limits the knowledge that can be extracted from genomes and metabolic models. Launched in 2005, the MicroScope platform (http://www.genoscope.cns.fr/agc/microscope) is an integrative resource that supports systematic and efficient revision of microbial genome annotation, data management and comparative analysis. Effective comparative analysis requires a consistent and complete view of biological data, and therefore, support for reviewing the quality of functional annotation is critical. MicroScope allows users to analyze microbial (meta)genomes together with post-genomic experiment results if any (i.e. transcriptomics, re-sequencing of evolved strains, mutant collections, phenotype data). It combines tools and graphical interfaces to analyze genomes and to perform the expert curation of gene functions in a comparative context. Starting with a short overview of the MicroScope system, this paper focuses on some major improvements of the Web interface, mainly for the submission of genomic data and on original tools and pipelines that have been developed and integrated in the platform: computation of pan-genomes and prediction of biosynthetic gene clusters. Today the resource contains data for more than 6000 microbial genomes, and among the 2700 personal accounts (65% of which are now from foreign countries), 14% of the users are performing expert annotations, on at least a weekly basis, contributing to improve the quality of microbial genome annotations. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  20. OrthoMCL: Identification of Ortholog Groups for Eukaryotic Genomes

    PubMed Central

    Li, Li; Stoeckert, Christian J.; Roos, David S.

    2003-01-01

    The identification of orthologous groups is useful for genome annotation, studies on gene/protein evolution, comparative genomics, and the identification of taxonomically restricted sequences. Methods successfully exploited for prokaryotic genome analysis have proved difficult to apply to eukaryotes, however, as larger genomes may contain multiple paralogous genes, and sequence information is often incomplete. OrthoMCL provides a scalable method for constructing orthologous groups across multiple eukaryotic taxa, using a Markov Cluster algorithm to group (putative) orthologs and paralogs. This method performs similarly to the INPARANOID algorithm when applied to two genomes, but can be extended to cluster orthologs from multiple species. OrthoMCL clusters are coherent with groups identified by EGO, but improved recognition of “recent” paralogs permits overlapping EGO groups representing the same gene to be merged. Comparison with previously assigned EC annotations suggests a high degree of reliability, implying utility for automated eukaryotic genome annotation. OrthoMCL has been applied to the proteome data set from seven publicly available genomes (human, fly, worm, yeast, Arabidopsis, the malaria parasite Plasmodium falciparum, and Escherichia coli). A Web interface allows queries based on individual genes or user-defined phylogenetic patterns (http://www.cbil.upenn.edu/gene-family). Analysis of clusters incorporating P. falciparum genes identifies numerous enzymes that were incompletely annotated in first-pass annotation of the parasite genome. PMID:12952885

  1. How Life and Rocks Have Co-Evolved

    NASA Astrophysics Data System (ADS)

    Hazen, R.

    2014-04-01

    The near-surface environment of terrestrial planets and moons evolves as a consequence of selective physical, chemical, and biological processes - an evolution that is preserved in the mineralogical record. Mineral evolution begins with approximately 12 different refractory minerals that form in the cooling envelopes of exploding stars. Subsequent aqueous and thermal alteration of planetessimals results in the approximately 250 minerals now found in unweathered lunar and meteorite samples. Following Earth's accretion and differentiation, mineral evolution resulted from a sequence of geochemical and petrologic processes, which led to perhaps 1500 mineral species. According to some origin-of-life scenarios, a planet must progress through at least some of these stages of chemical processing as a prerequisite for life. Once life emerged, mineralogy and biology co-evolved and dramatically increased Earth's mineral diversity to >4000 species. Sequential stages of a planet's near-surface evolution arise from three primary mechanisms: (1) the progressive separation and concentration of the elements from their original relatively uniform distribution in the presolar nebula; (2) the increase in range of intensive variables such as pressure, temperature, and volatile activities; and (3) the generation of far-from-equilibrium conditions by living systems. Remote observations of the mineralogy of other terrestrial bodies may thus provide evidence for biological influences beyond Earth. Recent studies of mineral diversification through time reveal striking correlations with major geochemical, tectonic, and biological events, including large-changes in ocean chemistry, the supercontinent cycle, the increase of atmospheric oxygen, and the rise of the terrestrial biosphere.

  2. Co-evolving prisoner's dilemma: Performance indicators and analytic approaches

    NASA Astrophysics Data System (ADS)

    Zhang, W.; Choi, C. W.; Li, Y. S.; Xu, C.; Hui, P. M.

    2017-02-01

    Understanding the intrinsic relation between the dynamical processes in a co-evolving network and the necessary ingredients in formulating a reliable theory is an important question and a challenging task. Using two slightly different definitions of performance indicator in the context of a co-evolving prisoner's dilemma game, it is shown that very different cooperative levels result and theories of different complexity are required to understand the key features. When the payoff per opponent is used as the indicator (Case A), non-cooperative strategy has an edge and dominates in a large part of the parameter space formed by the cutting-and-rewiring probability and the strategy imitation probability. When the payoff from all opponents is used (Case B), cooperative strategy has an edge and dominates the parameter space. Two distinct phases, one homogeneous and dynamical and another inhomogeneous and static, emerge and the phase boundary in the parameter space is studied in detail. A simple theory assuming an average competing environment for cooperative agents and another for non-cooperative agents is shown to perform well in Case A. The same theory, however, fails badly for Case B. It is necessary to include more spatial correlation into a theory for Case B. We show that the local configuration approximation, which takes into account of the different competing environments for agents with different strategies and degrees, is needed to give reliable results for Case B. The results illustrate that formulating a proper theory requires both a conceptual understanding of the effects of the adaptive processes in the problem and a delicate balance between simplicity and accuracy.

  3. Compact configurations within small evolving groups of galaxies

    NASA Astrophysics Data System (ADS)

    Mamon, G. A.

    Small virialized groups of galaxies are evolved with a gravitational N-body code, where the galaxies and a diffuse background are treated as single particles, but with mass and luminosity profiles attached, which enbles the estimation of parameters such as internal energies, half-mass radii, and the softened potential energies of interaction. The numerical treatment includes mergers, collisional stripping, tidal limitation by the mean-field of the background (evaluated using a combination of instantaneous and impulsive formulations), galaxy heating from collisons, and background heating from dynamical friction. The groups start out either as dense as appear the groups in Hickson's (1982) catalog, or as loose as appear those in Turner and Gott's (1976a) catalog, and they are simulated many times (usually 20) with different initial positions and velocities. Dense groups of galaxies with massive dark haloes coalesce into a single galaxy and lose their compact group appearance in approximately 3 group half-mass crossing times, while dense groups of galaxies without massive haloes survive the merger instability for 15 half-mass crossing times (in a more massive background to keep the same total group mass).

  4. Evolved Populations of Shigella flexneri Phage Sf6 Acquire Large Deletions, Altered Genomic Architecture, and Faster Life Cycles.

    PubMed

    Dover, John A; Burmeister, Alita R; Molineux, Ian J; Parent, Kristin N

    2016-09-19

    Genomic architecture is the framework within which genes and regulatory elements evolve and where specific constructs may constrain or potentiate particular adaptations. One such construct is evident in phages that use a headful packaging strategy that results in progeny phage heads packaged with DNA until full rather than encapsidating a simple unit-length genome. Here, we investigate the evolution of the headful packaging phage Sf6 in response to barriers that impede efficient phage adsorption to the host cell. Ten replicate populations evolved faster Sf6 life cycles by parallel mutations found in a phage lysis gene and/or by large, 1.2- to 4.0-kb deletions that remove a mobile genetic IS911 element present in the ancestral phage genome. The fastest life cycles were found in phages that acquired both mutations. No mutations were found in genes encoding phage structural proteins, which were a priori expected from the experimental design that imposed a challenge for phage adsorption by using a Shigella flexneri host lacking receptors preferred by Sf6. We used DNA sequencing, molecular approaches, and physiological experiments on 82 clonal isolates taken from all 10 populations to reveal the genetic basis of the faster Sf6 life cycle. The majority of our isolates acquired deletions in the phage genome. Our results suggest that deletions are adaptive and can influence the duration of the phage life cycle while acting in conjunction with other lysis time-determining point mutations. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  5. A group evolving-based framework with perturbations for link prediction

    NASA Astrophysics Data System (ADS)

    Si, Cuiqi; Jiao, Licheng; Wu, Jianshe; Zhao, Jin

    2017-06-01

    Link prediction is a ubiquitous application in many fields which uses partially observed information to predict absence or presence of links between node pairs. The group evolving study provides reasonable explanations on the behaviors of nodes, relations between nodes and community formation in a network. Possible events in group evolution include continuing, growing, splitting, forming and so on. The changes discovered in networks are to some extent the result of these events. In this work, we present a group evolving-based characterization of node's behavioral patterns, and via which we can estimate the probability they tend to interact. In general, the primary aim of this paper is to offer a minimal toy model to detect missing links based on evolution of groups and give a simpler explanation on the rationality of the model. We first introduce perturbations into networks to obtain stable cluster structures, and the stable clusters determine the stability of each node. Then fluctuations, another node behavior, are assumed by the participation of each node to its own belonging group. Finally, we demonstrate that such characteristics allow us to predict link existence and propose a model for link prediction which outperforms many classical methods with a decreasing computational time in large scales. Encouraging experimental results obtained on real networks show that our approach can effectively predict missing links in network, and even when nearly 40% of the edges are missing, it also retains stationary performance.

  6. What helminth genomes have taught us about parasite evolution.

    PubMed

    Zarowiecki, Magdalena; Berriman, Matt

    2015-02-01

    The genomes of more than 20 helminths have now been sequenced. Here we perform a meta-analysis of all sequenced genomes of nematodes and Platyhelminthes, and attempt to address the question of what are the defining characteristics of helminth genomes. We find that parasitic worms lack systems for surface antigenic variation, instead maintaining infections using their surfaces as the first line of defence against the host immune system, with several expanded gene families of genes associated with the surface and tegument. Parasite excretory/secretory products evolve rapidly, and proteases even more so, with each parasite exhibiting unique modifications of its protease repertoire. Endoparasitic flatworms show striking losses of metabolic capabilities, not matched by nematodes. All helminths do however exhibit an overall reduction in auxiliary metabolism (biogenesis of co-factors and vitamins). Overall, the prevailing pattern is that there are few commonalities between the genomes of independently evolved parasitic worms, with each parasite having undergone specific adaptations for their particular niche.

  7. Genomic instability--an evolving hallmark of cancer.

    PubMed

    Negrini, Simona; Gorgoulis, Vassilis G; Halazonetis, Thanos D

    2010-03-01

    Genomic instability is a characteristic of most cancers. In hereditary cancers, genomic instability results from mutations in DNA repair genes and drives cancer development, as predicted by the mutator hypothesis. In sporadic (non-hereditary) cancers the molecular basis of genomic instability remains unclear, but recent high-throughput sequencing studies suggest that mutations in DNA repair genes are infrequent before therapy, arguing against the mutator hypothesis for these cancers. Instead, the mutation patterns of the tumour suppressor TP53 (which encodes p53), ataxia telangiectasia mutated (ATM) and cyclin-dependent kinase inhibitor 2A (CDKN2A; which encodes p16INK4A and p14ARF) support the oncogene-induced DNA replication stress model, which attributes genomic instability and TP53 and ATM mutations to oncogene-induced DNA damage.

  8. Big Role for a Tiny Genome.

    PubMed

    Douglas, Angela E

    2017-12-14

    In this issue of Cell, Salem et al. demonstrate a remarkable instance of herbivory dependent on a co-evolved mutualism with specialized bacteria. Despite having a tiny genome and limited metabolic repertoire, the bacteria in Cassida beetles produce pectinases predicted to mediate degradation of plant cell walls in the insect diet. Copyright © 2017 Elsevier Inc. All rights reserved.

  9. Functional gene groups are concentrated within chromosomes, among chromosomes and in the nuclear space of the human genome.

    PubMed

    Thévenin, Annelyse; Ein-Dor, Liat; Ozery-Flato, Michal; Shamir, Ron

    2014-09-01

    Genomes undergo changes in organization as a result of gene duplications, chromosomal rearrangements and local mutations, among other mechanisms. In contrast to prokaryotes, in which genes of a common function are often organized in operons and reside contiguously along the genome, most eukaryotes show much weaker clustering of genes by function, except for few concrete functional groups. We set out to check systematically if there is a relation between gene function and gene organization in the human genome. We test this question for three types of functional groups: pairs of interacting proteins, complexes and pathways. We find a significant concentration of functional groups both in terms of their distance within the same chromosome and in terms of their dispersal over several chromosomes. Moreover, using Hi-C contact map of the tendency of chromosomal segments to appear close in the 3D space of the nucleus, we show that members of the same functional group that reside on distinct chromosomes tend to co-localize in space. The result holds for all three types of functional groups that we tested. Hence, the human genome shows substantial concentration of functional groups within chromosomes and across chromosomes in space. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  10. The Legionella pneumophila genome evolved to accommodate multiple regulatory mechanisms controlled by the CsrA-system

    PubMed Central

    Sahr, Tobias; Rusniok, Christophe; Impens, Francis; Oliva, Giulia; Sismeiro, Odile; Coppée, Jean-Yves

    2017-01-01

    The carbon storage regulator protein CsrA regulates cellular processes post-transcriptionally by binding to target-RNAs altering translation efficiency and/or their stability. Here we identified and analyzed the direct targets of CsrA in the human pathogen Legionella pneumophila. Genome wide transcriptome, proteome and RNA co-immunoprecipitation followed by deep sequencing of a wild type and a csrA mutant strain identified 479 RNAs with potential CsrA interaction sites located in the untranslated and/or coding regions of mRNAs or of known non-coding sRNAs. Further analyses revealed that CsrA exhibits a dual regulatory role in virulence as it affects the expression of the regulators FleQ, LqsR, LetE and RpoS but it also directly regulates the timely expression of over 40 Dot/Icm substrates. CsrA controls its own expression and the stringent response through a regulatory feedback loop as evidenced by its binding to RelA-mRNA and links it to quorum sensing and motility. CsrA is a central player in the carbon, amino acid, fatty acid metabolism and energy transfer and directly affects the biosynthesis of cofactors, vitamins and secondary metabolites. We describe the first L. pneumophila riboswitch, a thiamine pyrophosphate riboswitch whose regulatory impact is fine-tuned by CsrA, and identified a unique regulatory mode of CsrA, the active stabilization of RNA anti-terminator conformations inside a coding sequence preventing Rho-dependent termination of the gap operon through transcriptional polarity effects. This allows L. pneumophila to regulate the pentose phosphate pathway and the glycolysis combined or individually although they share genes in a single operon. Thus the L. pneumophila genome has evolved to acclimate at least five different modes of regulation by CsrA giving it a truly unique position in its life cycle. PMID:28212376

  11. Phylogenetic investigation of human FGFR-bearing paralogons favors piecemeal duplication theory of vertebrate genome evolution.

    PubMed

    Ajmal, Wajya; Khan, Hiba; Abbasi, Amir Ali

    2014-12-01

    Understanding the genetic mechanisms underlying the organismal complexity and origin of novelties during vertebrate history is one of the central goals of evolutionary biology. Ohno (1970) was the first to postulate that whole genome duplications (WGD) have played a vital role in the evolution of new gene functions: permitting an increase in morphological, physiological and anatomical complexity during early vertebrate history. Here, we analyze the evolutionary history of human FGFR-bearing paralogon (human autosome 4/5/8/10) by the phylogenetic analysis of multigene families with triplicate and quadruplicate distribution on these chromosomes. Our results categorized the histories of 21 families into discrete co-duplicated groups. Genes of a particular co-duplicated group exhibit identical evolutionary history and have duplicated in concert with each other, whereas genes belonging to different groups have dissimilar histories and have not duplicated concurrently. Taken together with our previously published data, we submit that there is sufficient empirical evidence to disprove the 1R/2R hypothesis and to support the general prediction that vertebrate genome evolved by relatively small-scale, regional duplication events that spread across the history of life. Copyright © 2014 Elsevier Inc. All rights reserved.

  12. Global DNA cytosine methylation as an evolving trait: phylogenetic signal and correlated evolution with genome size in angiosperms

    PubMed Central

    Alonso, Conchita; Pérez, Ricardo; Bazaga, Pilar; Herrera, Carlos M.

    2015-01-01

    DNA cytosine methylation is a widespread epigenetic mechanism in eukaryotes, and plant genomes commonly are densely methylated. Genomic methylation can be associated with functional consequences such as mutational events, genomic instability or altered gene expression, but little is known on interspecific variation in global cytosine methylation in plants. In this paper, we compare global cytosine methylation estimates obtained by HPLC and use a phylogenetically-informed analytical approach to test for significance of evolutionary signatures of this trait across 54 angiosperm species in 25 families. We evaluate whether interspecific variation in global cytosine methylation is statistically related to phylogenetic distance and also whether it is evolutionarily correlated with genome size (C-value). Global cytosine methylation varied widely between species, ranging between 5.3% (Arabidopsis) and 39.2% (Narcissus). Differences between species were related to their evolutionary trajectories, as denoted by the strong phylogenetic signal underlying interspecific variation. Global cytosine methylation and genome size were evolutionarily correlated, as revealed by the significant relationship between the corresponding phylogenetically independent contrasts. On average, a ten-fold increase in genome size entailed an increase of about 10% in global cytosine methylation. Results show that global cytosine methylation is an evolving trait in angiosperms whose evolutionary trajectory is significantly linked to changes in genome size, and suggest that the evolutionary implications of epigenetic mechanisms are likely to vary between plant lineages. PMID:25688257

  13. Recently evolved human-specific methylated regions are enriched in schizophrenia signals.

    PubMed

    Banerjee, Niladri; Polushina, Tatiana; Bettella, Francesco; Giddaluru, Sudheer; Steen, Vidar M; Andreassen, Ole A; Le Hellard, Stephanie

    2018-05-11

    One explanation for the persistence of schizophrenia despite the reduced fertility of patients is that it is a by-product of recent human evolution. This hypothesis is supported by evidence suggesting that recently-evolved genomic regions in humans are involved in the genetic risk for schizophrenia. Using summary statistics from genome-wide association studies (GWAS) of schizophrenia and 11 other phenotypes, we tested for enrichment of association with GWAS traits in regions that have undergone methylation changes in the human lineage compared to Neanderthals and Denisovans, i.e. human-specific differentially methylated regions (DMRs). We used analytical tools that evaluate polygenic enrichment of a subset of genomic variants against all variants. Schizophrenia was the only trait in which DMR SNPs showed clear enrichment of association that passed the genome-wide significance threshold. The enrichment was not observed for Neanderthal or Denisovan DMRs. The enrichment seen in human DMRs is comparable to that for genomic regions tagged by Neanderthal Selective Sweep markers, and stronger than that for Human Accelerated Regions. The enrichment survives multiple testing performed through permutation (n = 10,000) and bootstrapping (n = 5000) in INRICH (p < 0.01). Some enrichment of association with height was observed at the gene level. Regions where DNA methylation modifications have changed during recent human evolution show enrichment of association with schizophrenia and possibly with height. Our study further supports the hypothesis that genetic variants conferring risk of schizophrenia co-occur in genomic regions that have changed as the human species evolved. Since methylation is an epigenetic mark, potentially mediated by environmental changes, our results also suggest that interaction with the environment might have contributed to that association.

  14. Directed evolution to re-adapt a co-evolved network within an enzyme.

    PubMed

    Strafford, John; Payongsri, Panwajee; Hibbert, Edward G; Morris, Phattaraporn; Batth, Sukhjeet S; Steadman, David; Smith, Mark E B; Ward, John M; Hailes, Helen C; Dalby, Paul A

    2012-01-01

    We have previously used targeted active-site saturation mutagenesis to identify a number of transketolase single mutants that improved activity towards either glycolaldehyde (GA), or the non-natural substrate propionaldehyde (PA). Here, all attempts to recombine the singles into double mutants led to unexpected losses of specific activity towards both substrates. A typical trade-off occurred between soluble expression levels and specific activity for all single mutants, but many double mutants decreased both properties more severely suggesting a critical loss of protein stability or native folding. Statistical coupling analysis (SCA) of a large multiple sequence alignment revealed a network of nine co-evolved residues that affected all but one double mutant. Such networks maintain important functional properties such as activity, specificity, folding, stability, and solubility and may be rapidly disrupted by introducing one or more non-naturally occurring mutations. To identify variants of this network that would accept and improve upon our best D469 mutants for activity towards PA, we created a library of random single, double and triple mutants across seven of the co-evolved residues, combining our D469 variants with only naturally occurring mutations at the remaining sites. A triple mutant cluster at D469, E498 and R520 was found to behave synergistically for the specific activity towards PA. Protein expression was severely reduced by E498D and improved by R520Q, yet variants containing both mutations led to improved specific activity and enzyme expression, but with loss of solubility and the formation of inclusion bodies. D469S and R520Q combined synergistically to improve k(cat) 20-fold for PA, more than for any previous transketolase mutant. R520Q also doubled the specific activity of the previously identified D469T to create our most active transketolase mutant to date. Our results show that recombining active-site mutants obtained by saturation mutagenesis

  15. Genome-wide analysis of allele frequency change in sunflower crop-wild hybrid populations evolving under natural conditions.

    PubMed

    Corbi, Jonathan; Baack, Eric J; Dechaine, Jennifer M; Seiler, Gerald; Burke, John M

    2018-01-01

    Crop-wild hybridization occurs in numerous plant species and could alter the genetic structure and evolutionary dynamics of wild populations. Studying crop-derived alleles in wild populations is also relevant to assessing/mitigating the risks associated with transgene escape. To date, crop-wild hybridization has generally been examined via short-term studies, typically within a single generation, focusing on few traits or genetic markers. Little is known about patterns of selection on crop-derived alleles over multiple generations, particularly at a genome-wide scale. Here, we documented patterns of natural selection in an experimental crop × wild sunflower population that was allowed to evolve under natural conditions for two generations at two locations. Allele frequencies at a genome-wide collection of SNPs were tracked across generations, and a common garden experiment was conducted to compare trait means between generations. These data allowed us to identify instances of selection on crop-derived alleles/traits and, in concert with QTL mapping results, test for congruence between our genotypic and phenotypic results. We found that natural selection overwhelmingly favours wild alleles and phenotypes. However, crop alleles in certain genomic regions can be favoured, and these changes often occurred in parallel across locations. We did not, however, consistently observe close agreement between our genotypic and phenotypic results. For example, when a trait evolved towards the wild phenotype, wild QTL alleles associated with that trait did not consistently increase in frequency. We discuss these results in the context of crop allele introgression into wild populations and implications for the management of GM crops. © 2017 John Wiley & Sons Ltd.

  16. Genome composition and phylogeny of microbes predict their co-occurrence in the environment

    PubMed Central

    2017-01-01

    The genomic information of microbes is a major determinant of their phenotypic properties, yet it is largely unknown to what extent ecological associations between different species can be explained by their genome composition. To bridge this gap, this study introduces two new genome-wide pairwise measures of microbe-microbe interaction. The first (genome content similarity index) quantifies similarity in genome composition between two microbes, while the second (microbe-microbe functional association index) summarizes the topology of a protein functional association network built for a given pair of microbes and quantifies the fraction of network edges crossing organismal boundaries. These new indices are then used to predict co-occurrence between reference genomes from two 16S-based ecological datasets, accounting for phylogenetic relatedness of the taxa. Phylogenetic relatedness was found to be a strong predictor of ecological associations between microbes which explains about 10% of variance in co-occurrence data, but genome composition was found to be a strong predictor as well, it explains up to 4% the variance in co-occurrence when all genomic-based indices are used in combination, even after accounting for evolutionary relationships between the species. On their own, the metrics proposed here explain a larger proportion of variance than previously reported more complex methods that rely on metabolic network comparisons. In summary, results of this study indicate that microbial genomes do indeed contain detectable signal of organismal ecology, and the methods described in the paper can be used to improve mechanistic understanding of microbe-microbe interactions. PMID:28152007

  17. Unraveling the mystery of music: music as an evolved group process.

    PubMed

    Loersch, Chris; Arbuckle, Nathan L

    2013-11-01

    As prominently highlighted by Charles Darwin, music is one of the most mysterious aspects of human nature. Despite its ubiquitous presence across cultures and throughout recorded history, the reason humans respond emotionally to music remains unknown. Although many scientists and philosophers have offered hypotheses, there is little direct empirical evidence for any perspective. Here we address this issue, providing data which support the idea that music evolved in service of group living. Using 7 studies, we demonstrate that people's emotional responses to music are intricately tied to the other core social phenomena that bind us together into groups. In sum, this work establishes human musicality as a special form of social cognition and provides the first direct support for the hypothesis that music evolved as a tool of social living. In addition, the findings provide a reason for the intense psychological pull of music in modern life, suggesting that the pleasure we derive from listening to music results from its innate connection to the basic social drives that create our interconnected world. PsycINFO Database Record (c) 2013 APA, all rights reserved.

  18. Genomes in turmoil: quantification of genome dynamics in prokaryote supergenomes.

    PubMed

    Puigbò, Pere; Lobkovsky, Alexander E; Kristensen, David M; Wolf, Yuri I; Koonin, Eugene V

    2014-08-21

    Genomes of bacteria and archaea (collectively, prokaryotes) appear to exist in incessant flux, expanding via horizontal gene transfer and gene duplication, and contracting via gene loss. However, the actual rates of genome dynamics and relative contributions of different types of event across the diversity of prokaryotes are largely unknown, as are the sizes of microbial supergenomes, i.e. pools of genes that are accessible to the given microbial species. We performed a comprehensive analysis of the genome dynamics in 35 groups (34 bacterial and one archaeal) of closely related microbial genomes using a phylogenetic birth-and-death maximum likelihood model to quantify the rates of gene family gain and loss, as well as expansion and reduction. The results show that loss of gene families dominates the evolution of prokaryotes, occurring at approximately three times the rate of gain. The rates of gene family expansion and reduction are typically seven and twenty times less than the gain and loss rates, respectively. Thus, the prevailing mode of evolution in bacteria and archaea is genome contraction, which is partially compensated by the gain of new gene families via horizontal gene transfer. However, the rates of gene family gain, loss, expansion and reduction vary within wide ranges, with the most stable genomes showing rates about 25 times lower than the most dynamic genomes. For many groups, the supergenome estimated from the fraction of repetitive gene family gains includes about tenfold more gene families than the typical genome in the group although some groups appear to have vast, 'open' supergenomes. Reconstruction of evolution for groups of closely related bacteria and archaea reveals an extremely rapid and highly variable flux of genes in evolving microbial genomes, demonstrates that extensive gene loss and horizontal gene transfer leading to innovation are the two dominant evolutionary processes, and yields robust estimates of the supergenome size.

  19. Analysis of IAV Replication and Co-infection Dynamics by a Versatile RNA Viral Genome Labeling Method.

    PubMed

    Dou, Dan; Hernández-Neuta, Iván; Wang, Hao; Östbye, Henrik; Qian, Xiaoyan; Thiele, Swantje; Resa-Infante, Patricia; Kouassi, Nancy Mounogou; Sender, Vicky; Hentrich, Karina; Mellroth, Peter; Henriques-Normark, Birgitta; Gabriel, Gülsah; Nilsson, Mats; Daniels, Robert

    2017-07-05

    Genome delivery to the proper cellular compartment for transcription and replication is a primary goal of viruses. However, methods for analyzing viral genome localization and differentiating genomes with high identity are lacking, making it difficult to investigate entry-related processes and co-examine heterogeneous RNA viral populations. Here, we present an RNA labeling approach for single-cell analysis of RNA viral replication and co-infection dynamics in situ, which uses the versatility of padlock probes. We applied this method to identify influenza A virus (IAV) infections in cells and lung tissue with single-nucleotide specificity and to classify entry and replication stages by gene segment localization. Extending the classification strategy to co-infections of IAVs with single-nucleotide variations, we found that the dependence on intracellular trafficking places a time restriction on secondary co-infections necessary for genome reassortment. Altogether, these data demonstrate how RNA viral genome labeling can help dissect entry and co-infections. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  20. Genome size differentiates co-occurring populations of the planktonic diatom Ditylum brightwellii (Bacillariophyta)

    PubMed Central

    2010-01-01

    Background Diatoms are one of the most species-rich groups of eukaryotic microbes known. Diatoms are also the only group of eukaryotic micro-algae with a diplontic life history, suggesting that the ancestral diatom switched to a life history dominated by a duplicated genome. A key mechanism of speciation among diatoms could be a propensity for additional stable genome duplications. Across eukaryotic taxa, genome size is directly correlated to cell size and inversely correlated to physiological rates. Differences in relative genome size, cell size, and acclimated growth rates were analyzed in isolates of the diatom Ditylum brightwellii. Ditylum brightwellii consists of two main populations with identical 18s rDNA sequences; one population is distributed globally at temperate latitudes and the second appears to be localized to the Pacific Northwest coast of the USA. These two populations co-occur within the Puget Sound estuary of WA, USA, although their peak abundances differ depending on local conditions. Results All isolates from the more regionally-localized population (population 2) possessed 1.94 ± 0.74 times the amount of DNA, grew more slowly, and were generally larger than isolates from the more globally distributed population (population 1). The ITS1 sequences, cell sizes, and genome sizes of isolates from New Zealand were the same as population 1 isolates from Puget Sound, but their growth rates were within the range of the slower-growing population 2 isolates. Importantly, the observed genome size difference between isolates from the two populations was stable regardless of time in culture or the changes in cell size that accompany the diatom life history. Conclusions The observed two-fold difference in genome size between the D. brightwellii populations suggests that whole genome duplication occurred within cells of population 1 ultimately giving rise to population 2 cells. The apparent regional localization of population 2 is consistent with a recent

  1. DeCoSTAR: Reconstructing the Ancestral Organization of Genes or Genomes Using Reconciled Phylogenies

    PubMed Central

    Anselmetti, Yoann; Patterson, Murray; Ponty, Yann; B�rard, S�verine; Chauve, Cedric; Scornavacca, Celine; Daubin, Vincent; Tannier, Eric

    2017-01-01

    DeCoSTAR is a software that aims at reconstructing the organization of ancestral genes or genomes in the form of sets of neighborhood relations (adjacencies) between pairs of ancestral genes or gene domains. It can also improve the assembly of fragmented genomes by proposing evolutionary-induced adjacencies between scaffolding fragments. Ancestral genes or domains are deduced from reconciled phylogenetic trees under an evolutionary model that considers gains, losses, speciations, duplications, and transfers as possible events for gene evolution. Reconciliations are either given as input or computed with the ecceTERA package, into which DeCoSTAR is integrated. DeCoSTAR computes adjacency evolutionary scenarios using a scoring scheme based on a weighted sum of adjacency gains and breakages. Solutions, both optimal and near-optimal, are sampled according to the Boltzmann–Gibbs distribution centered around parsimonious solutions, and statistical supports on ancestral and extant adjacencies are provided. DeCoSTAR supports the features of previously contributed tools that reconstruct ancestral adjacencies, namely DeCo, DeCoLT, ART-DeCo, and DeClone. In a few minutes, DeCoSTAR can reconstruct the evolutionary history of domains inside genes, of gene fusion and fission events, or of gene order along chromosomes, for large data sets including dozens of whole genomes from all kingdoms of life. We illustrate the potential of DeCoSTAR with several applications: ancestral reconstruction of gene orders for Anopheles mosquito genomes, multidomain proteins in Drosophila, and gene fusion and fission detection in Actinobacteria. Availability: http://pbil.univ-lyon1.fr/software/DeCoSTAR (Last accessed April 24, 2017). PMID:28402423

  2. Mitochondrial genome evolution in the Saccharomyces sensu stricto complex.

    PubMed

    Ruan, Jiangxing; Cheng, Jian; Zhang, Tongcun; Jiang, Huifeng

    2017-01-01

    Exploring the evolutionary patterns of mitochondrial genomes is important for our understanding of the Saccharomyces sensu stricto (SSS) group, which is a model system for genomic evolution and ecological analysis. In this study, we first obtained the complete mitochondrial sequences of two important species, Saccharomyces mikatae and Saccharomyces kudriavzevii. We then compared the mitochondrial genomes in the SSS group with those of close relatives, and found that the non-coding regions evolved rapidly, including dramatic expansion of intergenic regions, fast evolution of introns and almost 20-fold higher rearrangement rates than those of the nuclear genomes. However, the coding regions, and especially the protein-coding genes, are more conserved than those in the nuclear genomes of the SSS group. The different evolutionary patterns of coding and non-coding regions in the mitochondrial and nuclear genomes may be related to the origin of the aerobic fermentation lifestyle in this group. Our analysis thus provides novel insights into the evolution of mitochondrial genomes.

  3. Detection and full genome characterization of two beta CoV viruses related to Middle East respiratory syndrome from bats in Italy.

    PubMed

    Moreno, Ana; Lelli, Davide; de Sabato, Luca; Zaccaria, Guendalina; Boni, Arianna; Sozzi, Enrica; Prosperi, Alice; Lavazza, Antonio; Cella, Eleonora; Castrucci, Maria Rita; Ciccozzi, Massimo; Vaccari, Gabriele

    2017-12-19

    Middle East respiratory syndrome coronavirus (MERS-CoV), which belongs to beta group of coronavirus, can infect multiple host species and causes severe diseases in humans. Multiple surveillance and phylogenetic studies suggest a bat origin. In this study, we describe the detection and full genome characterization of two CoVs closely related to MERS-CoV from two Italian bats, Pipistrellus kuhlii and Hypsugo savii. Pool of viscera were tested by a pan-coronavirus RT-PCR. Virus isolation was attempted by inoculation in different cell lines. Full genome sequencing was performed using the Ion Torrent platform and phylogenetic trees were performed using IQtree software. Similarity plots of CoV clade c genomes were generated by using SSE v1.2. The three dimensional macromolecular structure (3DMMS) of the receptor binding domain (RBD) in the S protein was predicted by sequence-homology method using the protein data bank (PDB). Both samples resulted positive to the pan-coronavirus RT-PCR (IT-batCoVs) and their genome organization showed identical pattern of MERS CoV. Phylogenetic analysis showed a monophyletic group placed in the Beta2c clade formed by MERS-CoV sequences originating from humans and camels and bat-related sequences from Africa, Italy and China. The comparison of the secondary and 3DMMS of the RBD of IT-batCoVs with MERS, HKU4 and HKU5 bat sequences showed two aa deletions located in a region corresponding to the external subdomain of MERS-RBD in IT-batCoV and HKU5 RBDs. This study reported two beta CoVs closely related to MERS that were obtained from two bats belonging to two commonly recorded species in Italy (P. kuhlii and H. savii). The analysis of the RBD showed similar structure in IT-batCoVs and HKU5 respect to HKU4 sequences. Since the RBD domain of HKU4 but not HKU5 can bind to the human DPP4 receptor for MERS-CoV, it is possible to suggest also for IT-batCoVs the absence of DPP4-binding potential. More surveillance studies are needed to better

  4. Comparative genomics of a plant-parasitic nematode endosymbiont suggest a role in nutritional symbiosis

    USDA-ARS?s Scientific Manuscript database

    Bacterial mutualists can increase the biochemical capacity of animals. Highly co-evolved nutritional mutualists do this by synthesizing nutrients missing from the host's diet. Genomics tools have recently advanced the study of these partnerships. Here we examined the endosymbiont Xiphinematobacter (...

  5. Alignment-free genome tree inference by learning group-specific distance metrics.

    PubMed

    Patil, Kaustubh R; McHardy, Alice C

    2013-01-01

    Understanding the evolutionary relationships between organisms is vital for their in-depth study. Gene-based methods are often used to infer such relationships, which are not without drawbacks. One can now attempt to use genome-scale information, because of the ever increasing number of genomes available. This opportunity also presents a challenge in terms of computational efficiency. Two fundamentally different methods are often employed for sequence comparisons, namely alignment-based and alignment-free methods. Alignment-free methods rely on the genome signature concept and provide a computationally efficient way that is also applicable to nonhomologous sequences. The genome signature contains evolutionary signal as it is more similar for closely related organisms than for distantly related ones. We used genome-scale sequence information to infer taxonomic distances between organisms without additional information such as gene annotations. We propose a method to improve genome tree inference by learning specific distance metrics over the genome signature for groups of organisms with similar phylogenetic, genomic, or ecological properties. Specifically, our method learns a Mahalanobis metric for a set of genomes and a reference taxonomy to guide the learning process. By applying this method to more than a thousand prokaryotic genomes, we showed that, indeed, better distance metrics could be learned for most of the 18 groups of organisms tested here. Once a group-specific metric is available, it can be used to estimate the taxonomic distances for other sequenced organisms from the group. This study also presents a large scale comparison between 10 methods--9 alignment-free and 1 alignment-based.

  6. Genomics and the making of yeast biodiversity.

    PubMed

    Hittinger, Chris Todd; Rokas, Antonis; Bai, Feng-Yan; Boekhout, Teun; Gonçalves, Paula; Jeffries, Thomas W; Kominek, Jacek; Lachance, Marc-André; Libkind, Diego; Rosa, Carlos A; Sampaio, José Paulo; Kurtzman, Cletus P

    2015-12-01

    Yeasts are unicellular fungi that do not form fruiting bodies. Although the yeast lifestyle has evolved multiple times, most known species belong to the subphylum Saccharomycotina (syn. Hemiascomycota, hereafter yeasts). This diverse group includes the premier eukaryotic model system, Saccharomyces cerevisiae; the common human commensal and opportunistic pathogen, Candida albicans; and over 1000 other known species (with more continuing to be discovered). Yeasts are found in every biome and continent and are more genetically diverse than angiosperms or chordates. Ease of culture, simple life cycles, and small genomes (∼10-20Mbp) have made yeasts exceptional models for molecular genetics, biotechnology, and evolutionary genomics. Here we discuss recent developments in understanding the genomic underpinnings of the making of yeast biodiversity, comparing and contrasting natural and human-associated evolutionary processes. Only a tiny fraction of yeast biodiversity and metabolic capabilities has been tapped by industry and science. Expanding the taxonomic breadth of deep genomic investigations will further illuminate how genome function evolves to encode their diverse metabolisms and ecologies. Copyright © 2015 Elsevier Ltd. All rights reserved.

  7. Histone variant innovation in a rapidly evolving chordate lineage.

    PubMed

    Moosmann, Alexandra; Campsteijn, Coen; Jansen, Pascal Wtc; Nasrallah, Carole; Raasholm, Martina; Stunnenberg, Henk G; Thompson, Eric M

    2011-07-15

    Histone variants alter the composition of nucleosomes and play crucial roles in transcription, chromosome segregation, DNA repair, and sperm compaction. Modification of metazoan histone variant lineages occurs on a background of genome architecture that shows global similarities from sponges to vertebrates, but the urochordate, Oikopleura dioica, a member of the sister group to vertebrates, exhibits profound modification of this ancestral architecture. We show that a histone complement of 47 gene loci encodes 31 histone variants, grouped in distinct sets of developmental expression profiles throughout the life cycle. A particularly diverse array of 15 male-specific histone variants was uncovered, including a testes-specific H4t, the first metazoan H4 sequence variant reported. Universal histone variants H3.3, CenH3, and H2A.Z are present but O. dioica lacks homologs of macroH2A and H2AX. The genome encodes many H2A and H2B variants and the repertoire of H2A.Z isoforms is expanded through alternative splicing, incrementally regulating the number of acetylatable lysine residues in the functionally important N-terminal "charge patch". Mass spectrometry identified 40 acetylation, methylation and ubiquitylation posttranslational modifications (PTMs) and showed that hallmark PTMs of "active" and "repressive" chromatin were present in O. dioica. No obvious reduction in silent heterochromatic marks was observed despite high gene density in this extraordinarily compacted chordate genome. These results show that histone gene complements and their organization differ considerably even over modest phylogenetic distances. Substantial innovation among all core and linker histone variants has evolved in concert with adaptation of specific life history traits in this rapidly evolving chordate lineage.

  8. A systems approach defining constraints of the genome architecture on lineage selection and evolvability during somatic cancer evolution

    PubMed Central

    Rübben, Albert; Nordhoff, Ole

    2013-01-01

    Summary Most clinically distinguishable malignant tumors are characterized by specific mutations, specific patterns of chromosomal rearrangements and a predominant mechanism of genetic instability but it remains unsolved whether modifications of cancer genomes can be explained solely by mutations and selection through the cancer microenvironment. It has been suggested that internal dynamics of genomic modifications as opposed to the external evolutionary forces have a significant and complex impact on Darwinian species evolution. A similar situation can be expected for somatic cancer evolution as molecular key mechanisms encountered in species evolution also constitute prevalent mutation mechanisms in human cancers. This assumption is developed into a systems approach of carcinogenesis which focuses on possible inner constraints of the genome architecture on lineage selection during somatic cancer evolution. The proposed systems approach can be considered an analogy to the concept of evolvability in species evolution. The principal hypothesis is that permissive or restrictive effects of the genome architecture on lineage selection during somatic cancer evolution exist and have a measurable impact. The systems approach postulates three classes of lineage selection effects of the genome architecture on somatic cancer evolution: i) effects mediated by changes of fitness of cells of cancer lineage, ii) effects mediated by changes of mutation probabilities and iii) effects mediated by changes of gene designation and physical and functional genome redundancy. Physical genome redundancy is the copy number of identical genetic sequences. Functional genome redundancy of a gene or a regulatory element is defined as the number of different genetic elements, regardless of copy number, coding for the same specific biological function within a cancer cell. Complex interactions of the genome architecture on lineage selection may be expected when modifications of the genome

  9. Comparative Genomics Unravels the Functional Roles of Co-occurring Acidophilic Bacteria in Bioleaching Heaps

    PubMed Central

    Zhang, Xian; Liu, Xueduan; Liang, Yili; Xiao, Yunhua; Ma, Liyuan; Guo, Xue; Miao, Bo; Liu, Hongwei; Peng, Deliang; Huang, Wenkun; Yin, Huaqun

    2017-01-01

    The spatial-temporal distribution of populations in various econiches is thought to be potentially related to individual differences in the utilization of nutrients or other resources, but their functional roles in the microbial communities remain elusive. We compared differentiation in gene repertoire and metabolic profiles, with a focus on the potential functional traits of three commonly recognized members (Acidithiobacillus caldus, Leptospirillum ferriphilum, and Sulfobacillus thermosulfidooxidans) in bioleaching heaps. Comparative genomics revealed that intra-species divergence might be driven by horizontal gene transfer. These co-occurring bacteria shared a few homologous genes, which significantly suggested the genomic differences between these organisms. Notably, relatively more genes assigned to the Clusters of Orthologous Groups category [G] (carbohydrate transport and metabolism) were identified in Sulfobacillus thermosulfidooxidans compared to the two other species, which probably indicated their mixotrophic capabilities that assimilate both organic and inorganic forms of carbon. Further inspection revealed distinctive metabolic capabilities involving carbon assimilation, nitrogen uptake, and iron-sulfur cycling, providing robust evidence for functional differences with respect to nutrient utilization. Therefore, we proposed that the mutual compensation of functionalities among these co-occurring organisms might provide a selective advantage for efficiently utilizing the limited resources in their habitats. Furthermore, it might be favorable to chemoautotrophs' lifestyles to form mutualistic interactions with these heterotrophic and/or mixotrophic acidophiles, whereby the latter could degrade organic compounds to effectively detoxify the environments. Collectively, the findings shed light on the genetic traits and potential metabolic activities of these organisms, and enable us to make some inferences about genomic and functional differences that might

  10. Virtual Genomes in Flux: An Interplay of Neutrality and Adaptability Explains Genome Expansion and Streamlining

    PubMed Central

    Cuypers, Thomas D.; Hogeweg, Paulien

    2012-01-01

    The picture that emerges from phylogenetic gene content reconstructions is that genomes evolve in a dynamic pattern of rapid expansion and gradual streamlining. Ancestral organisms have been estimated to possess remarkably rich gene complements, although gene loss is a driving force in subsequent lineage adaptation and diversification. Here, we study genome dynamics in a model of virtual cells evolving to maintain homeostasis. We observe a pattern of an initial rapid expansion of the genome and a prolonged phase of mutational load reduction. Generally, load reduction is achieved by the deletion of redundant genes, generating a streamlining pattern. Load reduction can also occur as a result of the generation of highly neutral genomic regions. These regions can expand and contract in a neutral fashion. Our study suggests that genome expansion and streamlining are generic patterns of evolving systems. We propose that the complex genotype to phenotype mapping in virtual cells as well as in their biological counterparts drives genome size dynamics, due to an emerging interplay between adaptation, neutrality, and evolvability. PMID:22234601

  11. Towards understanding the first genome sequence of a crenarchaeon by genome annotation using clusters of orthologous groups of proteins (COGs).

    PubMed

    Natale, D A; Shankavaram, U T; Galperin, M Y; Wolf, Y I; Aravind, L; Koonin, E V

    2000-01-01

    Standard archival sequence databases have not been designed as tools for genome annotation and are far from being optimal for this purpose. We used the database of Clusters of Orthologous Groups of proteins (COGs) to reannotate the genomes of two archaea, Aeropyrum pernix, the first member of the Crenarchaea to be sequenced, and Pyrococcus abyssi. A. pernix and P. abyssi proteins were assigned to COGs using the COGNITOR program; the results were verified on a case-by-case basis and augmented by additional database searches using the PSI-BLAST and TBLASTN programs. Functions were predicted for over 300 proteins from A. pernix, which could not be assigned a function using conventional methods with a conservative sequence similarity threshold, an approximately 50% increase compared to the original annotation. A. pernix shares most of the conserved core of proteins that were previously identified in the Euryarchaeota. Cluster analysis or distance matrix tree construction based on the co-occurrence of genomes in COGs showed that A. pernix forms a distinct group within the archaea, although grouping with the two species of Pyrococci, indicative of similar repertoires of conserved genes, was observed. No indication of a specific relationship between Crenarchaeota and eukaryotes was obtained in these analyses. Several proteins that are conserved in Euryarchaeota and most bacteria are unexpectedly missing in A. pernix, including the entire set of de novo purine biosynthesis enzymes, the GTPase FtsZ (a key component of the bacterial and euryarchaeal cell-division machinery), and the tRNA-specific pseudouridine synthase, previously considered universal. A. pernix is represented in 48 COGs that do not contain any euryarchaeal members. Many of these proteins are TCA cycle and electron transport chain enzymes, reflecting the aerobic lifestyle of A. pernix. Special-purpose databases organized on the basis of phylogenetic analysis and carefully curated with respect to known and

  12. Towards understanding the first genome sequence of a crenarchaeon by genome annotation using clusters of orthologous groups of proteins (COGs)

    PubMed Central

    Natale, Darren A; Shankavaram, Uma T; Galperin, Michael Y; Wolf, Yuri I; Aravind, L; Koonin, Eugene V

    2000-01-01

    Background: Standard archival sequence databases have not been designed as tools for genome annotation and are far from being optimal for this purpose. We used the database of Clusters of Orthologous Groups of proteins (COGs) to reannotate the genomes of two archaea, Aeropyrum pernix, the first member of the Crenarchaea to be sequenced, and Pyrococcus abyssi. Results: A. pernix and P. abyssi proteins were assigned to COGs using the COGNITOR program; the results were verified on a case-by-case basis and augmented by additional database searches using the PSI-BLAST and TBLASTN programs. Functions were predicted for over 300 proteins from A. pernix, which could not be assigned a function using conventional methods with a conservative sequence similarity threshold, an approximately 50% increase compared to the original annotation. A. pernix shares most of the conserved core of proteins that were previously identified in the Euryarchaeota. Cluster analysis or distance matrix tree construction based on the co-occurrence of genomes in COGs showed that A. pernix forms a distinct group within the archaea, although grouping with the two species of Pyrococci, indicative of similar repertoires of conserved genes, was observed. No indication of a specific relationship between Crenarchaeota and eukaryotes was obtained in these analyses. Several proteins that are conserved in Euryarchaeota and most bacteria are unexpectedly missing in A. pernix, including the entire set of de novo purine biosynthesis enzymes, the GTPase FtsZ (a key component of the bacterial and euryarchaeal cell-division machinery), and the tRNA-specific pseudouridine synthase, previously considered universal. A. pernix is represented in 48 COGs that do not contain any euryarchaeal members. Many of these proteins are TCA cycle and electron transport chain enzymes, reflecting the aerobic lifestyle of A. pernix. Conclusions: Special-purpose databases organized on the basis of phylogenetic analysis and carefully

  13. Carbon Isotopic Composition of CO2, Evolved During Perchlorate-Induced Reactions in Mars Analog Materials: Interpreting SAM/MSL Rocknest Data

    NASA Technical Reports Server (NTRS)

    Stern, J. C.; McAdam, A. C.; Archer, P. D., Jr.; Bower, H.; Buch, A.; Eigenbrode, J.; Freissinet, C.; Franz, H. B.; Glavin, D.; Jones, J. H.; hide

    2013-01-01

    The Sample Analysis at Mars (SAM) Instrument Suite on the Mars Science Laboratory (MSL) Rover Curiosity made its first solid sample evolved gas analysis of unconsolidated material at aeolian bedform Rocknest in Gale Crater. The magnitude of O2 evolved in each run as well as the chlorinated hydrocarbons detected by SAM gas chromatograph/ mass spectrometer (GCMS) [1] suggest a chlorinated oxidant such as perchlorate in Rocknest materials [2]. Perchlorate induced combustion of organics present in the sample would contribute to the CO2 volatile inventory, possibly overlapping with CO2 from inorganic sources. The resulting carbon and oxygen isotopic composition of CO2 sent to the Tunable Laser Spectrometer (TLS) for analysis would represent mixed sources. This work was undertaken to better understand a) how well the carbon isotopic composition ( 13C) of CO2 from partially combusted products represents their source and b) how the 13C of combusted products can be deconvolved from other carbon sources such as thermal decomposition of carbonate.

  14. Reflective Practice in Group Co-Leadership

    ERIC Educational Resources Information Center

    Okech, Jane E. Atieno

    2008-01-01

    Group literature on co-leaders' experiences and perceptions while leading groups illuminate reflective practice as highly influential to co-leader relationships and performances. Using practical examples grounded by interdisciplinary literature on reflective practice, this article explores and expands dialogue on the complex interplay between…

  15. Mosquito genomics. Highly evolvable malaria vectors: the genomes of 16 Anopheles mosquitoes.

    PubMed

    Neafsey, Daniel E; Waterhouse, Robert M; Abai, Mohammad R; Aganezov, Sergey S; Alekseyev, Max A; Allen, James E; Amon, James; Arcà, Bruno; Arensburger, Peter; Artemov, Gleb; Assour, Lauren A; Basseri, Hamidreza; Berlin, Aaron; Birren, Bruce W; Blandin, Stephanie A; Brockman, Andrew I; Burkot, Thomas R; Burt, Austin; Chan, Clara S; Chauve, Cedric; Chiu, Joanna C; Christensen, Mikkel; Costantini, Carlo; Davidson, Victoria L M; Deligianni, Elena; Dottorini, Tania; Dritsou, Vicky; Gabriel, Stacey B; Guelbeogo, Wamdaogo M; Hall, Andrew B; Han, Mira V; Hlaing, Thaung; Hughes, Daniel S T; Jenkins, Adam M; Jiang, Xiaofang; Jungreis, Irwin; Kakani, Evdoxia G; Kamali, Maryam; Kemppainen, Petri; Kennedy, Ryan C; Kirmitzoglou, Ioannis K; Koekemoer, Lizette L; Laban, Njoroge; Langridge, Nicholas; Lawniczak, Mara K N; Lirakis, Manolis; Lobo, Neil F; Lowy, Ernesto; MacCallum, Robert M; Mao, Chunhong; Maslen, Gareth; Mbogo, Charles; McCarthy, Jenny; Michel, Kristin; Mitchell, Sara N; Moore, Wendy; Murphy, Katherine A; Naumenko, Anastasia N; Nolan, Tony; Novoa, Eva M; O'Loughlin, Samantha; Oringanje, Chioma; Oshaghi, Mohammad A; Pakpour, Nazzy; Papathanos, Philippos A; Peery, Ashley N; Povelones, Michael; Prakash, Anil; Price, David P; Rajaraman, Ashok; Reimer, Lisa J; Rinker, David C; Rokas, Antonis; Russell, Tanya L; Sagnon, N'Fale; Sharakhova, Maria V; Shea, Terrance; Simão, Felipe A; Simard, Frederic; Slotman, Michel A; Somboon, Pradya; Stegniy, Vladimir; Struchiner, Claudio J; Thomas, Gregg W C; Tojo, Marta; Topalis, Pantelis; Tubio, José M C; Unger, Maria F; Vontas, John; Walton, Catherine; Wilding, Craig S; Willis, Judith H; Wu, Yi-Chieh; Yan, Guiyun; Zdobnov, Evgeny M; Zhou, Xiaofan; Catteruccia, Flaminia; Christophides, George K; Collins, Frank H; Cornman, Robert S; Crisanti, Andrea; Donnelly, Martin J; Emrich, Scott J; Fontaine, Michael C; Gelbart, William; Hahn, Matthew W; Hansen, Immo A; Howell, Paul I; Kafatos, Fotis C; Kellis, Manolis; Lawson, Daniel; Louis, Christos; Luckhart, Shirley; Muskavitch, Marc A T; Ribeiro, José M; Riehle, Michael A; Sharakhov, Igor V; Tu, Zhijian; Zwiebel, Laurence J; Besansky, Nora J

    2015-01-02

    Variation in vectorial capacity for human malaria among Anopheles mosquito species is determined by many factors, including behavior, immunity, and life history. To investigate the genomic basis of vectorial capacity and explore new avenues for vector control, we sequenced the genomes of 16 anopheline mosquito species from diverse locations spanning ~100 million years of evolution. Comparative analyses show faster rates of gene gain and loss, elevated gene shuffling on the X chromosome, and more intron losses, relative to Drosophila. Some determinants of vectorial capacity, such as chemosensory genes, do not show elevated turnover but instead diversify through protein-sequence changes. This dynamism of anopheline genes and genomes may contribute to their flexible capacity to take advantage of new ecological niches, including adapting to humans as primary hosts. Copyright © 2015, American Association for the Advancement of Science.

  16. Comparative genomics of the bacterial genus Streptococcus illuminates evolutionary implications of species groups.

    PubMed

    Gao, Xiao-Yang; Zhi, Xiao-Yang; Li, Hong-Wei; Klenk, Hans-Peter; Li, Wen-Jun

    2014-01-01

    Members of the genus Streptococcus within the phylum Firmicutes are among the most diverse and significant zoonotic pathogens. This genus has gone through considerable taxonomic revision due to increasing improvements of chemotaxonomic approaches, DNA hybridization and 16S rRNA gene sequencing. It is proposed to place the majority of streptococci into "species groups". However, the evolutionary implications of species groups are not clear presently. We use comparative genomic approaches to yield a better understanding of the evolution of Streptococcus through genome dynamics, population structure, phylogenies and virulence factor distribution of species groups. Genome dynamics analyses indicate that the pan-genome size increases with the addition of newly sequenced strains, while the core genome size decreases with sequential addition at the genus level and species group level. Population structure analysis reveals two distinct lineages, one including Pyogenic, Bovis, Mutans and Salivarius groups, and the other including Mitis, Anginosus and Unknown groups. Phylogenetic dendrograms show that species within the same species group cluster together, and infer two main clades in accordance with population structure analysis. Distribution of streptococcal virulence factors has no obvious patterns among the species groups; however, the evolution of some common virulence factors is congruous with the evolution of species groups, according to phylogenetic inference. We suggest that the proposed streptococcal species groups are reasonable from the viewpoints of comparative genomics; evolution of the genus is congruent with the individual evolutionary trajectories of different species groups.

  17. Xylella fastidiosa CoDiRO strain associated with the olive quick decline syndrome in southern Italy belongs to a clonal complex of the subspecies pauca that evolved in Central America.

    PubMed

    Marcelletti, Simone; Scortichini, Marco

    2016-12-01

    Xylella fastidiosa, a xylem-limited bacterium transmitted by xylem-fluid-feeding Hemiptera insects, causes economic losses of both woody and herbaceous plant species. A Xyl. fastidiosa subsp. pauca strain, namely CoDiRO, was recently found to be associated with the 'olive quick decline syndrome' in southern Italy (i.e. Apulia region). Recently, some Xyl. fastidiosa strains intercepted in France from Coffea spp. plant cuttings imported from Central and South America were characterized. The introduction of infected plant material from Central America in Apulia was also postulated even though an ad hoc study to confirm this hypothesis is lacking. In the present study, we assessed the complete and draft genome of 27 Xyl. fastidiosa strains. Through a genome-wide approach, we confirmed the occurrence of three subspecies within Xyl. fastidiosa, namely fastidiosa, multiplex and pauca, and demonstrated the occurrence of a genetic clonal complex of four Xyl. fastidiosa strains belonging to subspecies pauca which evolved in Central America. The CoDiRO strain displayed 13 SNPs when compared with a strain isolated in Costa Rica from Coffea sp. and 32 SNPs when compared with two strains obtained from Nerium oleander in Costa Rica. These results support the close relationships of the two strains. The four strains in the clonal complex contain prophage-like genes in their genomes. This study strongly supports the possibility of the introduction of Xyl. fastidiosa in southern Italy via coffee plants grown in Central America. The data also stress how the current global circulation of agricultural commodities potentially threatens the agrosystems worldwide.

  18. Host-symbiont co-speciation and reductive genome evolution in gut symbiotic bacteria of acanthosomatid stinkbugs

    PubMed Central

    Kikuchi, Yoshitomo; Hosokawa, Takahiro; Nikoh, Naruo; Meng, Xian-Ying; Kamagata, Yoichi; Fukatsu, Takema

    2009-01-01

    Background Host-symbiont co-speciation and reductive genome evolution have been commonly observed among obligate endocellular insect symbionts, while such examples have rarely been identified among extracellular ones, the only case reported being from gut symbiotic bacteria of stinkbugs of the family Plataspidae. Considering that gut symbiotic communities are vulnerable to invasion of foreign microbes, gut symbiotic associations have been thought to be evolutionarily not stable. Stinkbugs of the family Acanthosomatidae harbor a bacterial symbiont in the midgut crypts, the lumen of which is completely sealed off from the midgut main tract, thereby retaining the symbiont in the isolated cryptic cavities. We investigated histological, ecological, phylogenetic, and genomic aspects of the unique gut symbiosis of the acanthosomatid stinkbugs. Results Phylogenetic analyses showed that the acanthosomatid symbionts constitute a distinct clade in the γ-Proteobacteria, whose sister groups are the obligate endocellular symbionts of aphids Buchnera and the obligate gut symbionts of plataspid stinkbugs Ishikawaella. In addition to the midgut crypts, the symbionts were located in a pair of peculiar lubricating organs associated with the female ovipositor, by which the symbionts are vertically transmitted via egg surface contamination. The symbionts were detected not from ovaries but from deposited eggs, and surface sterilization of eggs resulted in symbiont-free hatchlings. The symbiont-free insects suffered retarded growth, high mortality, and abnormal morphology, suggesting important biological roles of the symbiont for the host insects. The symbiont phylogeny was generally concordant with the host phylogeny, indicating host-symbiont co-speciation over evolutionary time despite the extracellular association. Meanwhile, some local host-symbiont phylogenetic discrepancies were found, suggesting occasional horizontal symbiont transfers across the host lineages. The symbionts

  19. Comparative Genomics of the Bacterial Genus Streptococcus Illuminates Evolutionary Implications of Species Groups

    PubMed Central

    Gao, Xiao-Yang; Zhi, Xiao-Yang; Li, Hong-Wei; Klenk, Hans-Peter; Li, Wen-Jun

    2014-01-01

    Members of the genus Streptococcus within the phylum Firmicutes are among the most diverse and significant zoonotic pathogens. This genus has gone through considerable taxonomic revision due to increasing improvements of chemotaxonomic approaches, DNA hybridization and 16S rRNA gene sequencing. It is proposed to place the majority of streptococci into “species groups”. However, the evolutionary implications of species groups are not clear presently. We use comparative genomic approaches to yield a better understanding of the evolution of Streptococcus through genome dynamics, population structure, phylogenies and virulence factor distribution of species groups. Genome dynamics analyses indicate that the pan-genome size increases with the addition of newly sequenced strains, while the core genome size decreases with sequential addition at the genus level and species group level. Population structure analysis reveals two distinct lineages, one including Pyogenic, Bovis, Mutans and Salivarius groups, and the other including Mitis, Anginosus and Unknown groups. Phylogenetic dendrograms show that species within the same species group cluster together, and infer two main clades in accordance with population structure analysis. Distribution of streptococcal virulence factors has no obvious patterns among the species groups; however, the evolution of some common virulence factors is congruous with the evolution of species groups, according to phylogenetic inference. We suggest that the proposed streptococcal species groups are reasonable from the viewpoints of comparative genomics; evolution of the genus is congruent with the individual evolutionary trajectories of different species groups. PMID:24977706

  20. A Possible Organic Contribution to the Low Temperature CO2 Release Seen in Mars Phoenix Thermal and Evolved Gas Analyzer Data

    NASA Technical Reports Server (NTRS)

    Archer, P. D. Jr.; Lauer, H. V., Jr.; Sutter, B.; Ming, D. W.; Niles, P. B.; Boynton, W. V.

    2012-01-01

    Two of the most important discoveries of the Phoenix Mars Lander were the discovery of approx.0.6% perchlorate [1] and 3-5% carbonate [2] in the soils at the landing site in the martian northern plains. The Thermal and Evolved Gas Analyzer (TEGA) instrument was one of the tools that made this discovery. After soil samples were delivered to TEGA and transferred into small ovens, the samples could be heated up to approx.1000 C and the gases that evolved during heating were monitored by a mass spectrometer. A CO2 signal was detected at high temperature (approx.750 C) that has been attributed to calcium carbonate decomposition. In addition to this CO2 release, a lower temperature signal was seen. This lower temperature CO2 release was postulated to be one of three things: 1) desorption of CO2, 2) decomposition of a different carbonate mineral, or 3) CO2 released due to organic combustion. Cannon et al. [3] present another novel hypothesis involving the interaction of decomposition products of a perchlorate salt and calcium carbonate.

  1. The visualCMAT: A web-server to select and interpret correlated mutations/co-evolving residues in protein families.

    PubMed

    Suplatov, Dmitry; Sharapova, Yana; Timonina, Daria; Kopylov, Kirill; Švedas, Vytas

    2018-04-01

    The visualCMAT web-server was designed to assist experimental research in the fields of protein/enzyme biochemistry, protein engineering, and drug discovery by providing an intuitive and easy-to-use interface to the analysis of correlated mutations/co-evolving residues. Sequence and structural information describing homologous proteins are used to predict correlated substitutions by the Mutual information-based CMAT approach, classify them into spatially close co-evolving pairs, which either form a direct physical contact or interact with the same ligand (e.g. a substrate or a crystallographic water molecule), and long-range correlations, annotate and rank binding sites on the protein surface by the presence of statistically significant co-evolving positions. The results of the visualCMAT are organized for a convenient visual analysis and can be downloaded to a local computer as a content-rich all-in-one PyMol session file with multiple layers of annotation corresponding to bioinformatic, statistical and structural analyses of the predicted co-evolution, or further studied online using the built-in interactive analysis tools. The online interactivity is implemented in HTML5 and therefore neither plugins nor Java are required. The visualCMAT web-server is integrated with the Mustguseal web-server capable of constructing large structure-guided sequence alignments of protein families and superfamilies using all available information about their structures and sequences in public databases. The visualCMAT web-server can be used to understand the relationship between structure and function in proteins, implemented at selecting hotspots and compensatory mutations for rational design and directed evolution experiments to produce novel enzymes with improved properties, and employed at studying the mechanism of selective ligand's binding and allosteric communication between topologically independent sites in protein structures. The web-server is freely available at https

  2. Insights into the history of a bacterial group II intron remnant from the genomes of the nitrogen-fixing symbionts Sinorhizobium meliloti and Sinorhizobium medicae.

    PubMed

    Toro, N; Martínez-Rodríguez, L; Martínez-Abarca, F

    2014-10-01

    Group II introns are self-splicing catalytic RNAs that act as mobile retroelements. In bacteria, they are thought to be tolerated to some extent because they self-splice and home preferentially to sites outside of functional genes, generally within intergenic regions or in other mobile genetic elements, by mechanisms including the divergence of DNA target specificity to prevent target site saturation. RmInt1 is a mobile group II intron that is widespread in natural populations of Sinorhizobium meliloti and was first described in the GR4 strain. Like other bacterial group II introns, RmInt1 tends to evolve toward an inactive form by fragmentation, with loss of the 3' terminus. We identified genomic evidence of a fragmented intron closely related to RmInt1 buried in the genome of the extant S. meliloti/S. medicae species. By studying this intron, we obtained evidence for the occurrence of intron insertion before the divergence of ancient rhizobial species. This fragmented group II intron has thus existed for a long time and has provided sequence variation, on which selection can act, contributing to diverse genetic rearrangements, and to generate pan-genome divergence after strain differentiation. The data presented here suggest that fragmented group II introns within intergenic regions closed to functionally important neighboring genes may have been microevolutionary forces driving adaptive evolution of these rhizobial species.

  3. Insights into the history of a bacterial group II intron remnant from the genomes of the nitrogen-fixing symbionts Sinorhizobium meliloti and Sinorhizobium medicae

    PubMed Central

    Toro, N; Martínez-Rodríguez, L; Martínez-Abarca, F

    2014-01-01

    Group II introns are self-splicing catalytic RNAs that act as mobile retroelements. In bacteria, they are thought to be tolerated to some extent because they self-splice and home preferentially to sites outside of functional genes, generally within intergenic regions or in other mobile genetic elements, by mechanisms including the divergence of DNA target specificity to prevent target site saturation. RmInt1 is a mobile group II intron that is widespread in natural populations of Sinorhizobium meliloti and was first described in the GR4 strain. Like other bacterial group II introns, RmInt1 tends to evolve toward an inactive form by fragmentation, with loss of the 3′ terminus. We identified genomic evidence of a fragmented intron closely related to RmInt1 buried in the genome of the extant S. meliloti/S. medicae species. By studying this intron, we obtained evidence for the occurrence of intron insertion before the divergence of ancient rhizobial species. This fragmented group II intron has thus existed for a long time and has provided sequence variation, on which selection can act, contributing to diverse genetic rearrangements, and to generate pan-genome divergence after strain differentiation. The data presented here suggest that fragmented group II introns within intergenic regions closed to functionally important neighboring genes may have been microevolutionary forces driving adaptive evolution of these rhizobial species. PMID:24736785

  4. Differentiation of strains from the Bacillus cereus group by RFLP-PFGE genomic fingerprinting.

    PubMed

    Otlewska, Anna; Oltuszak-Walczak, Elzbieta; Walczak, Piotr

    2013-11-01

    Bacillus mycoides, Bacillus pseudomycoides, Bacillus weihenstephanensis, Bacillus anthracis, Bacillus thuringiensis, and Bacillus cereus belong to the B. cereus group. The last three species are characterized by different phenotype features and pathogenicity spectrum, but it has been shown that these species are genetically closely related. The macrorestriction analysis of the genomic DNA with the NotI enzyme was used to generate polymorphism of restriction profiles for 39 food-borne isolates (B. cereus, B. mycoides) and seven reference strains (B. mycoides, B. thuringiensis, B. weihenstephanensis, and B. cereus). The PFGE method was applied to differentiate the examined strains of the B. cereus group. On the basis of the unweighted pair group method with the arithmetic mean method and Dice coefficient, the strains were divided into five clusters (types A-E), and the most numerous group was group A (25 strains). A total of 21 distinct pulsotypes were observed. The RFLP-PFGE analysis was successfully used for the differentiation and characterization of B. cereus and B. mycoides strains isolated from different food products. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  5. An experimental and computational evolution-based method to study a mode of co-evolution of overlapping open reading frames in the AAV2 viral genome.

    PubMed

    Kawano, Yasuhiro; Neeley, Shane; Adachi, Kei; Nakai, Hiroyuki

    2013-01-01

    Overlapping open reading frames (ORFs) in viral genomes undergo co-evolution; however, how individual amino acids coded by overlapping ORFs are structurally, functionally, and co-evolutionarily constrained remains difficult to address by conventional homologous sequence alignment approaches. We report here a new experimental and computational evolution-based methodology to address this question and report its preliminary application to elucidating a mode of co-evolution of the frame-shifted overlapping ORFs in the adeno-associated virus (AAV) serotype 2 viral genome. These ORFs encode both capsid VP protein and non-structural assembly-activating protein (AAP). To show proof of principle of the new method, we focused on the evolutionarily conserved QVKEVTQ and KSKRSRR motifs, a pair of overlapping heptapeptides in VP and AAP, respectively. In the new method, we first identified a large number of capsid-forming VP3 mutants and functionally competent AAP mutants of these motifs from mutant libraries by experimental directed evolution under no co-evolutionary constraints. We used Illumina sequencing to obtain a large dataset and then statistically assessed the viability of VP and AAP heptapeptide mutants. The obtained heptapeptide information was then integrated into an evolutionary algorithm, with which VP and AAP were co-evolved from random or native nucleotide sequences in silico. As a result, we demonstrate that these two heptapeptide motifs could exhibit high degeneracy if coded by separate nucleotide sequences, and elucidate how overlap-evoked co-evolutionary constraints play a role in making the VP and AAP heptapeptide sequences into the present shape. Specifically, we demonstrate that two valine (V) residues and β-strand propensity in QVKEVTQ are structurally important, the strongly negative and hydrophilic nature of KSKRSRR is functionally important, and overlap-evoked co-evolution imposes strong constraints on serine (S) residues in KSKRSRR, despite high

  6. Modeling growth and dissemination of lymphoma in a co-evolving lymph node: a diffuse-domain approach

    NASA Astrophysics Data System (ADS)

    Chuang, Yao-Li; Cristini, Vittorio; Chen, Ying; Li, Xiangrong; Frieboes, Hermann; Lowengrub, John

    2013-03-01

    While partial differential equation models of tumor growth have successfully described various spatiotemporal phenomena observed for in-vitro tumor spheroid experiments, one challenge towards taking these models to further study in-vivo tumors is that instead of relatively static tissue culture with regular boundary conditions, in-vivo tumors are often confined in organ tissues that co-evolve with the tumor growth. Here we adopt a recently developed diffuse-domain method to account for the co-evolving domain boundaries, adapting our previous in-vitro tumor model for the development of lymphoma encapsulated in a lymph node, which may swell or shrink due to proliferation and dissemination of lymphoma cells and treatment by chemotherapy. We use the model to study the induced spatial heterogeneity, which may arise as an emerging phenomenon in experimental observations and model analysis. Spatial heterogeneity is believed to lead to tumor infiltration patterns and reduce the efficacy of chemotherapy, leaving residuals that cause cancer relapse after the treatment. Understanding the spatiotemporal evolution of in-vivo tumors can be an essential step towards more effective strategies of curing cancer. Supported by NIH-PSOC grant 1U54CA143907-01.

  7. Molecular cytogenetic and genomic analyses reveal new insights into the origin of the wheat B genome.

    PubMed

    Zhang, Wei; Zhang, Mingyi; Zhu, Xianwen; Cao, Yaping; Sun, Qing; Ma, Guojia; Chao, Shiaoman; Yan, Changhui; Xu, Steven S; Cai, Xiwen

    2018-02-01

    This work pinpointed the goatgrass chromosomal segment in the wheat B genome using modern cytogenetic and genomic technologies, and provided novel insights into the origin of the wheat B genome. Wheat is a typical allopolyploid with three homoeologous subgenomes (A, B, and D). The donors of the subgenomes A and D had been identified, but not for the subgenome B. The goatgrass Aegilops speltoides (genome SS) has been controversially considered a possible candidate for the donor of the wheat B genome. However, the relationship of the Ae. speltoides S genome with the wheat B genome remains largely obscure. The present study assessed the homology of the B and S genomes using an integrative cytogenetic and genomic approach, and revealed the contribution of Ae. speltoides to the origin of the wheat B genome. We discovered noticeable homology between wheat chromosome 1B and Ae. speltoides chromosome 1S, but not between other chromosomes in the B and S genomes. An Ae. speltoides-originated segment spanning a genomic region of approximately 10.46 Mb was detected on the long arm of wheat chromosome 1B (1BL). The Ae. speltoides-originated segment on 1BL was found to co-evolve with the rest of the B genome. Evidently, Ae. speltoides had been involved in the origin of the wheat B genome, but should not be considered an exclusive donor of this genome. The wheat B genome might have a polyphyletic origin with multiple ancestors involved, including Ae. speltoides. These novel findings will facilitate genome studies in wheat and other polyploids.

  8. A Synergism between Adaptive Effects and Evolvability Drives Whole Genome Duplication to Fixation

    PubMed Central

    Cuypers, Thomas D.; Hogeweg, Paulien

    2014-01-01

    Whole genome duplication has shaped eukaryotic evolutionary history and has been associated with drastic environmental change and species radiation. While the most common fate of WGD duplicates is a return to single copy, retained duplicates have been found enriched for highly interacting genes. This pattern has been explained by a neutral process of subfunctionalization and more recently, dosage balance selection. However, much about the relationship between environmental change, WGD and adaptation remains unknown. Here, we study the duplicate retention pattern postWGD, by letting virtual cells adapt to environmental changes. The virtual cells have structured genomes that encode a regulatory network and simple metabolism. Populations are under selection for homeostasis and evolve by point mutations, small indels and WGD. After populations had initially adapted fully to fluctuating resource conditions re-adaptation to a broad range of novel environments was studied by tracking mutations in the line of descent. WGD was established in a minority (≈30%) of lineages, yet, these were significantly more successful at re-adaptation. Unexpectedly, WGD lineages conserved more seemingly redundant genes, yet had higher per gene mutation rates. While WGD duplicates of all functional classes were significantly over-retained compared to a model of neutral losses, duplicate retention was clearly biased towards highly connected TFs. Importantly, no subfunctionalization occurred in conserved pairs, strongly suggesting that dosage balance shaped retention. Meanwhile, singles diverged significantly. WGD, therefore, is a powerful mechanism to cope with environmental change, allowing conservation of a core machinery, while adapting the peripheral network to accommodate change. PMID:24743268

  9. A synergism between adaptive effects and evolvability drives whole genome duplication to fixation.

    PubMed

    Cuypers, Thomas D; Hogeweg, Paulien

    2014-04-01

    Whole genome duplication has shaped eukaryotic evolutionary history and has been associated with drastic environmental change and species radiation. While the most common fate of WGD duplicates is a return to single copy, retained duplicates have been found enriched for highly interacting genes. This pattern has been explained by a neutral process of subfunctionalization and more recently, dosage balance selection. However, much about the relationship between environmental change, WGD and adaptation remains unknown. Here, we study the duplicate retention pattern postWGD, by letting virtual cells adapt to environmental changes. The virtual cells have structured genomes that encode a regulatory network and simple metabolism. Populations are under selection for homeostasis and evolve by point mutations, small indels and WGD. After populations had initially adapted fully to fluctuating resource conditions re-adaptation to a broad range of novel environments was studied by tracking mutations in the line of descent. WGD was established in a minority (≈30%) of lineages, yet, these were significantly more successful at re-adaptation. Unexpectedly, WGD lineages conserved more seemingly redundant genes, yet had higher per gene mutation rates. While WGD duplicates of all functional classes were significantly over-retained compared to a model of neutral losses, duplicate retention was clearly biased towards highly connected TFs. Importantly, no subfunctionalization occurred in conserved pairs, strongly suggesting that dosage balance shaped retention. Meanwhile, singles diverged significantly. WGD, therefore, is a powerful mechanism to cope with environmental change, allowing conservation of a core machinery, while adapting the peripheral network to accommodate change.

  10. Genome-wide DNA methylation analysis in lung fibroblasts co-cultured with silica-exposed alveolar macrophages.

    PubMed

    Li, Juan; Yao, Wu; Zhang, Lin; Bao, Lei; Chen, Huiting; Wang, Di; Yue, Zhongzheng; Li, Yiping; Zhang, Miao; Hao, Changfu

    2017-05-12

    Exposure to crystalline silica is considered to increase the risk of lung fibrosis. The primary effector cell, the myofibroblast, plays an important role in the deposition of extracellular matrix (ECM). DNA methylation change is considered to have a potential effect on myofibroblast differentiation. Therefore, the present study was designed to investigate the genome-wide DNA methylation profiles of lung fibroblasts co-cultured with alveolar macrophages exposed to crystalline silica in vitro. AM/fibroblast co-culture system was established. CCK8 was used to assess the toxicity of AMs. mRNA and protein expression of collagen I, α-SMA, MAPK9 and TGF-β1 of fibroblasts after AMs exposed to 100 μg /ml SiO 2 for 0-, 24-, or 48 h were determined by means of quantitative real-time PCR, immunoblotting and immunohistochemistry. Genomic DNA of fibroblasts was isolated using MeDIP-Seq to sequence. R software, GO, KEGG and Cytoscape were used to analyze the data. SiO 2 exposure increased the expression of collagen I and α-SMA in fibroblasts in co-culture system. Analysis of fibroblast methylome identified extensive methylation changes involved in several signaling pathways, such as the MAPK signaling pathway and metabolic pathways. Several candidates, including Tgfb1 and Mapk9, are hubs who can connect the gene clusters. MAPK9 mRNA expression was significantly higher in fibroblast exposed to SiO 2 in co-culture system for 48 h. MAPK9 protein expression was increased at both 24-h and 48-h treatment groups. TGF-β1 mRNA expression of fibroblast has a time-dependent manner, but we didn't observe the TGF-β1 protein expression. Tgfb1 and Mapk9 are helpful to explore the mechanism of myofibroblast differentiation. The genome-wide DNA methylation profiles of fibroblasts in this experimental silicosis model will be useful for future studies on epigenetic gene regulation during myofibroblast differentiation.

  11. COPS: Detecting Co-Occurrence and Spatial Arrangement of Transcription Factor Binding Motifs in Genome-Wide Datasets

    PubMed Central

    Lohmann, Ingrid

    2012-01-01

    In multi-cellular organisms, spatiotemporal activity of cis-regulatory DNA elements depends on their occupancy by different transcription factors (TFs). In recent years, genome-wide ChIP-on-Chip, ChIP-Seq and DamID assays have been extensively used to unravel the combinatorial interaction of TFs with cis-regulatory modules (CRMs) in the genome. Even though genome-wide binding profiles are increasingly becoming available for different TFs, single TF binding profiles are in most cases not sufficient for dissecting complex regulatory networks. Thus, potent computational tools detecting statistically significant and biologically relevant TF-motif co-occurrences in genome-wide datasets are essential for analyzing context-dependent transcriptional regulation. We have developed COPS (Co-Occurrence Pattern Search), a new bioinformatics tool based on a combination of association rules and Markov chain models, which detects co-occurring TF binding sites (BSs) on genomic regions of interest. COPS scans DNA sequences for frequent motif patterns using a Frequent-Pattern tree based data mining approach, which allows efficient performance of the software with respect to both data structure and implementation speed, in particular when mining large datasets. Since transcriptional gene regulation very often relies on the formation of regulatory protein complexes mediated by closely adjoining TF binding sites on CRMs, COPS additionally detects preferred short distance between co-occurring TF motifs. The performance of our software with respect to biological significance was evaluated using three published datasets containing genomic regions that are independently bound by several TFs involved in a defined biological process. In sum, COPS is a fast, efficient and user-friendly tool mining statistically and biologically significant TFBS co-occurrences and therefore allows the identification of TFs that combinatorially regulate gene expression. PMID:23272209

  12. The Capsaspora genome reveals a complex unicellular prehistory of animals.

    PubMed

    Suga, Hiroshi; Chen, Zehua; de Mendoza, Alex; Sebé-Pedrós, Arnau; Brown, Matthew W; Kramer, Eric; Carr, Martin; Kerner, Pierre; Vervoort, Michel; Sánchez-Pons, Núria; Torruella, Guifré; Derelle, Romain; Manning, Gerard; Lang, B Franz; Russ, Carsten; Haas, Brian J; Roger, Andrew J; Nusbaum, Chad; Ruiz-Trillo, Iñaki

    2013-01-01

    To reconstruct the evolutionary origin of multicellular animals from their unicellular ancestors, the genome sequences of diverse unicellular relatives are essential. However, only the genome of the choanoflagellate Monosiga brevicollis has been reported to date. Here we completely sequence the genome of the filasterean Capsaspora owczarzaki, the closest known unicellular relative of metazoans besides choanoflagellates. Analyses of this genome alter our understanding of the molecular complexity of metazoans' unicellular ancestors showing that they had a richer repertoire of proteins involved in cell adhesion and transcriptional regulation than previously inferred only with the choanoflagellate genome. Some of these proteins were secondarily lost in choanoflagellates. In contrast, most intercellular signalling systems controlling development evolved later concomitant with the emergence of the first metazoans. We propose that the acquisition of these metazoan-specific developmental systems and the co-option of pre-existing genes drove the evolutionary transition from unicellular protists to metazoans.

  13. Insights into the strategies used by related group II introns to adapt successfully for the colonisation of a bacterial genome

    PubMed Central

    Martínez-Rodríguez, Laura; García-Rodríguez, Fernando M; Molina-Sánchez, María Dolores; Toro, Nicolás; Martínez-Abarca, Francisco

    2014-01-01

    Group II introns are self-splicing RNAs and site-specific mobile retroelements found in bacterial and organellar genomes. The group II intron RmInt1 is present at high copy number in Sinorhizobium meliloti species, and has a multifunctional intron-encoded protein (IEP) with reverse transcriptase/maturase activities, but lacking the DNA-binding and endonuclease domains. We characterized two RmInt1-related group II introns RmInt2 from S. meliloti strain GR4 and Sr.md.I1 from S. medicae strain WSM419 in terms of splicing and mobility activities. We used both wild-type and engineered intron-donor constructs based on ribozyme ΔORF-coding sequence derivatives, and we determined the DNA target requirements for RmInt2, the element most distantly related to RmInt1. The excision and mobility patterns of intron-donor constructs expressing different combinations of IEP and intron RNA provided experimental evidence for the co-operation of IEPs and intron RNAs from related elements in intron splicing and, in some cases, in intron homing. We were also able to identify the DNA target regions recognized by these IEPs lacking the DNA endonuclease domain. Our results provide new insight into the versatility of related group II introns and the possible co-operation between these elements to facilitate the colonization of bacterial genomes. PMID:25482895

  14. Insights into the strategies used by related group II introns to adapt successfully for the colonisation of a bacterial genome.

    PubMed

    Martínez-Rodríguez, Laura; García-Rodríguez, Fernando M; Molina-Sánchez, María Dolores; Toro, Nicolás; Martínez-Abarca, Francisco

    2014-01-01

    Group II introns are self-splicing RNAs and site-specific mobile retroelements found in bacterial and organellar genomes. The group II intron RmInt1 is present at high copy number in Sinorhizobium meliloti species, and has a multifunctional intron-encoded protein (IEP) with reverse transcriptase/maturase activities, but lacking the DNA-binding and endonuclease domains. We characterized two RmInt1-related group II introns RmInt2 from S. meliloti strain GR4 and Sr.md.I1 from S. medicae strain WSM419 in terms of splicing and mobility activities. We used both wild-type and engineered intron-donor constructs based on ribozyme ΔORF-coding sequence derivatives, and we determined the DNA target requirements for RmInt2, the element most distantly related to RmInt1. The excision and mobility patterns of intron-donor constructs expressing different combinations of IEP and intron RNA provided experimental evidence for the co-operation of IEPs and intron RNAs from related elements in intron splicing and, in some cases, in intron homing. We were also able to identify the DNA target regions recognized by these IEPs lacking the DNA endonuclease domain. Our results provide new insight into the versatility of related group II introns and the possible co-operation between these elements to facilitate the colonization of bacterial genomes.

  15. StreptoBase: An Oral Streptococcus mitis Group Genomic Resource and Analysis Platform.

    PubMed

    Zheng, Wenning; Tan, Tze King; Paterson, Ian C; Mutha, Naresh V R; Siow, Cheuk Chuen; Tan, Shi Yang; Old, Lesley A; Jakubovics, Nicholas S; Choo, Siew Woh

    2016-01-01

    The oral streptococci are spherical Gram-positive bacteria categorized under the phylum Firmicutes which are among the most common causative agents of bacterial infective endocarditis (IE) and are also important agents in septicaemia in neutropenic patients. The Streptococcus mitis group is comprised of 13 species including some of the most common human oral colonizers such as S. mitis, S. oralis, S. sanguinis and S. gordonii as well as species such as S. tigurinus, S. oligofermentans and S. australis that have only recently been classified and are poorly understood at present. We present StreptoBase, which provides a specialized free resource focusing on the genomic analyses of oral species from the mitis group. It currently hosts 104 S. mitis group genomes including 27 novel mitis group strains that we sequenced using the high throughput Illumina HiSeq technology platform, and provides a comprehensive set of genome sequences for analyses, particularly comparative analyses and visualization of both cross-species and cross-strain characteristics of S. mitis group bacteria. StreptoBase incorporates sophisticated in-house designed bioinformatics web tools such as Pairwise Genome Comparison (PGC) tool and Pathogenomic Profiling Tool (PathoProT), which facilitate comparative pathogenomics analysis of Streptococcus strains. Examples are provided to demonstrate how StreptoBase can be employed to compare genome structure of different S. mitis group bacteria and putative virulence genes profile across multiple streptococcal strains. In conclusion, StreptoBase offers access to a range of streptococci genomic resources as well as analysis tools and will be an invaluable platform to accelerate research in streptococci. Database URL: http://streptococcus.um.edu.my.

  16. Molecular Epidemiology and Genomics of Group A Streptococcus

    PubMed Central

    Bessen, Debra E.; McShan, W. Michael; Nguyen, Scott V.; Shetty, Amol; Agrawal, Sonia; Tettelin, Hervé

    2014-01-01

    Streptococcus pyogenes (group A streptococcus; GAS) is a strict human pathogen with a very high prevalence worldwide. This review highlights the genetic organization of the species and the important ecological considerations that impact its evolution. Recent advances are presented on the topics of molecular epidemiology, population biology, molecular basis for genetic change, genome structure and genetic flux, phylogenomics and closely related streptococcal species, and the long- and short-term evolution of GAS. The application of whole genome sequence data to addressing key biological questions is discussed. PMID:25460818

  17. HelmCoP: An Online Resource for Helminth Functional Genomics and Drug and Vaccine Targets Prioritization

    PubMed Central

    Taylor, Christina M.; Mitreva, Makedonka

    2011-01-01

    A vast majority of the burden from neglected tropical diseases result from helminth infections (nematodes and platyhelminthes). Parasitic helminthes infect over 2 billion, exerting a high collective burden that rivals high-mortality conditions such as AIDS or malaria, and cause devastation to crops and livestock. The challenges to improve control of parasitic helminth infections are multi-fold and no single category of approaches will meet them all. New information such as helminth genomics, functional genomics and proteomics coupled with innovative bioinformatic approaches provide fundamental molecular information about these parasites, accelerating both basic research as well as development of effective diagnostics, vaccines and new drugs. To facilitate such studies we have developed an online resource, HelmCoP (Helminth Control and Prevention), built by integrating functional, structural and comparative genomic data from plant, animal and human helminthes, to enable researchers to develop strategies for drug, vaccine and pesticide prioritization, while also providing a useful comparative genomics platform. HelmCoP encompasses genomic data from several hosts, including model organisms, along with a comprehensive suite of structural and functional annotations, to assist in comparative analyses and to study host-parasite interactions. The HelmCoP interface, with a sophisticated query engine as a backbone, allows users to search for multi-factorial combinations of properties and serves readily accessible information that will assist in the identification of various genes of interest. HelmCoP is publicly available at: http://www.nematode.net/helmcop.html. PMID:21760913

  18. Genome-wide patterns of promoter sharing and co-expression in bovine skeletal muscle.

    PubMed

    Gu, Quan; Nagaraj, Shivashankar H; Hudson, Nicholas J; Dalrymple, Brian P; Reverter, Antonio

    2011-01-12

    Gene regulation by transcription factors (TF) is species, tissue and time specific. To better understand how the genetic code controls gene expression in bovine muscle we associated gene expression data from developing Longissimus thoracis et lumborum skeletal muscle with bovine promoter sequence information. We created a highly conserved genome-wide promoter landscape comprising 87,408 interactions relating 333 TFs with their 9,242 predicted target genes (TGs). We discovered that the complete set of predicted TGs share an average of 2.75 predicted TF binding sites (TFBSs) and that the average co-expression between a TF and its predicted TGs is higher than the average co-expression between the same TF and all genes. Conversely, pairs of TFs sharing predicted TGs showed a co-expression correlation higher that pairs of TFs not sharing TGs. Finally, we exploited the co-occurrence of predicted TFBS in the context of muscle-derived functionally-coherent modules including cell cycle, mitochondria, immune system, fat metabolism, muscle/glycolysis, and ribosome. Our findings enabled us to reverse engineer a regulatory network of core processes, and correctly identified the involvement of E2F1, GATA2 and NFKB1 in the regulation of cell cycle, fat, and muscle/glycolysis, respectively. The pivotal implication of our research is two-fold: (1) there exists a robust genome-wide expression signal between TFs and their predicted TGs in cattle muscle consistent with the extent of promoter sharing; and (2) this signal can be exploited to recover the cellular mechanisms underpinning transcription regulation of muscle structure and development in bovine. Our study represents the first genome-wide report linking tissue specific co-expression to co-regulation in a non-model vertebrate.

  19. Effect of co-payment on behavioral response to consumer genomic testing.

    PubMed

    Liu, Wendy; Outlaw, Jessica J; Wineinger, Nathan; Boeldt, Debra; Bloss, Cinnamon S

    2018-01-29

    Existing research in consumer behavior suggests that perceptions and usage of a product post-purchase depends, in part, on how the product was marketed, including price paid. In the current study, we examine the effect of providing an out-of-pocket co-payment for consumer genomic testing (CGT) on consumer post-purchase behavior using both correlational field evidence and a hypothetical online experiment. Participants were enrolled in a longitudinal cohort study of the impact of CGT and completed behavioral assessments before and after receipt of CGT results. Most participants provided a co-payment for the test (N = 1668), while others (N = 369) received fully subsidized testing. The two groups were compared regarding changes in health behaviors and post-test use of health care resources. Participants who paid were more likely to share results with their physician (p = .012) and obtain follow-up health screenings (p = .005) relative to those who received fully subsidized testing. A follow-up online experiment in which participants (N = 303) were randomized to a "fully-subsidized" versus "co-payment" condition found that simulating provision of a co-payment significantly increased intentions to seek follow-up screening tests (p = .050) and perceptions of the test results as more trustworthy (p = .02). Provision of an out-of-pocket co-payment for CGT may influence consumer's post-purchase behavior consistent with a price placebo effect. Cognitive dissonance or sunk cost may help explain the increase in screening propensity among paying consumers. Such individuals may obtain follow-up screenings to validate their initial decision to expend personal resources to obtain CGT. © Society of Behavioral Medicine 2018.

  20. Unbiased whole-genome deep sequencing of human and porcine stool samples reveals circulation of multiple groups of rotaviruses and a putative zoonotic infection

    PubMed Central

    Phan, My V. T.; Anh, Pham Hong; Cuong, Nguyen Van; Munnink, Bas B. Oude; van der Hoek, Lia; My, Phuc Tran; Tri, Tue Ngo; Bryant, Juliet E.; Baker, Stephen; Thwaites, Guy; Woolhouse, Mark; Kellam, Paul; Rabaa, Maia A.

    2016-01-01

    Abstract Coordinated and synchronous surveillance for zoonotic viruses in both human clinical cases and animal reservoirs provides an opportunity to identify interspecies virus movement. Rotavirus (RV) is an important cause of viral gastroenteritis in humans and animals. In this study, we document the RV diversity within co-located humans and animals sampled from the Mekong delta region of Vietnam using a primer-independent, agnostic, deep sequencing approach. A total of 296 stool samples (146 from diarrhoeal human patients and 150 from pigs living in the same geographical region) were directly sequenced, generating the genomic sequences of sixty human rotaviruses (all group A) and thirty-one porcine rotaviruses (thirteen group A, seven group B, six group C, and five group H). Phylogenetic analyses showed the co-circulation of multiple distinct RV group A (RVA) genotypes/strains, many of which were divergent from the strain components of licensed RVA vaccines, as well as considerable virus diversity in pigs including full genomes of rotaviruses in groups B, C, and H, none of which have been previously reported in Vietnam. Furthermore, the detection of an atypical RVA genotype constellation (G4-P[6]-I1-R1-C1-M1-A8-N1-T7-E1-H1) in a human patient and a pig from the same region provides some evidence for a zoonotic event. PMID:28748110

  1. In situ characterization of cofacial Co(IV) centers in Co4O4 cubane: Modeling the high-valent active site in oxygen-evolving catalysts.

    PubMed

    Brodsky, Casey N; Hadt, Ryan G; Hayes, Dugan; Reinhart, Benjamin J; Li, Nancy; Chen, Lin X; Nocera, Daniel G

    2017-04-11

    The Co 4 O 4 cubane is a representative structural model of oxidic cobalt oxygen-evolving catalysts (Co-OECs). The Co-OECs are active when residing at two oxidation levels above an all-Co(III) resting state. This doubly oxidized Co(IV) 2 state may be captured in a Co(III) 2 (IV) 2 cubane. We demonstrate that the Co(III) 2 (IV) 2 cubane may be electrochemically generated and the electronic properties of this unique high-valent state may be probed by in situ spectroscopy. Intervalence charge-transfer (IVCT) bands in the near-IR are observed for the Co(III) 2 (IV) 2 cubane, and spectroscopic analysis together with electrochemical kinetics measurements reveal a larger reorganization energy and a smaller electron transfer rate constant for the doubly versus singly oxidized cubane. Spectroelectrochemical X-ray absorption data further reveal systematic spectral changes with successive oxidations from the cubane resting state. Electronic structure calculations correlated to experimental data suggest that this state is best represented as a localized, antiferromagnetically coupled Co(IV) 2 dimer. The exchange coupling in the cofacial Co(IV) 2 site allows for parallels to be drawn between the electronic structure of the Co 4 O 4 cubane model system and the high-valent active site of the Co-OEC, with specific emphasis on the manifestation of a doubly oxidized Co(IV) 2 center on O-O bond formation.

  2. Thermal and Evolved Gas Analysis of "Nanophase" Carbonates: Implications for Thermal and Evolved Gas Analysis on Mars Missions

    NASA Technical Reports Server (NTRS)

    Lauer, Howard V., Jr.; Archer, P. D., Jr.; Sutter, B.; Niles, P. B.; Ming, Douglas W.

    2012-01-01

    Data collected by the Mars Phoenix Lander's Thermal and Evolved Gas Analyzer (TEGA) suggested the presence of calcium-rich carbonates as indicated by a high temperature CO2 release while a low temperature (approx.400-680 C) CO2 release suggested possible Mg- and/or Fe-carbonates [1,2]. Interpretations of the data collected by Mars remote instruments is done by comparing the mission data to a database on the thermal properties of well-characterized Martian analog materials collected under reduced and Earth ambient pressures [3,4]. We are proposing that "nano-phase" carbonates may also be contributing to the low temperature CO2 release. The objectives of this paper is to (1) characterize the thermal and evolved gas proper-ties of carbonates of varying particle size, (2) evaluate the CO2 releases from CO2 treated CaO samples and (3) examine the secondary CO2 release from reheated calcite of varying particle size.

  3. Large clusters of co-expressed genes in the Drosophila genome.

    PubMed

    Boutanaev, Alexander M; Kalmykova, Alla I; Shevelyov, Yuri Y; Nurminsky, Dmitry I

    2002-12-12

    Clustering of co-expressed, non-homologous genes on chromosomes implies their co-regulation. In lower eukaryotes, co-expressed genes are often found in pairs. Clustering of genes that share aspects of transcriptional regulation has also been reported in higher eukaryotes. To advance our understanding of the mode of coordinated gene regulation in multicellular organisms, we performed a genome-wide analysis of the chromosomal distribution of co-expressed genes in Drosophila. We identified a total of 1,661 testes-specific genes, one-third of which are clustered on chromosomes. The number of clusters of three or more genes is much higher than expected by chance. We observed a similar trend for genes upregulated in the embryo and in the adult head, although the expression pattern of individual genes cannot be predicted on the basis of chromosomal position alone. Our data suggest that the prevalent mechanism of transcriptional co-regulation in higher eukaryotes operates with extensive chromatin domains that comprise multiple genes.

  4. Phylogenomics and the Dynamic Genome Evolution of the Genus Streptococcus

    PubMed Central

    Richards, Vincent P.; Palmer, Sara R.; Pavinski Bitar, Paulina D.; Qin, Xiang; Weinstock, George M.; Highlander, Sarah K.; Town, Christopher D.; Burne, Robert A.; Stanhope, Michael J.

    2014-01-01

    The genus Streptococcus comprises important pathogens that have a severe impact on human health and are responsible for substantial economic losses to agriculture. Here, we utilize 46 Streptococcus genome sequences (44 species), including eight species sequenced here, to provide the first genomic level insight into the evolutionary history and genetic basis underlying the functional diversity of all major groups of this genus. Gene gain/loss analysis revealed a dynamic pattern of genome evolution characterized by an initial period of gene gain followed by a period of loss, as the major groups within the genus diversified. This was followed by a period of genome expansion associated with the origins of the present extant species. The pattern is concordant with an emerging view that genomes evolve through a dynamic process of expansion and streamlining. A large proportion of the pan-genome has experienced lateral gene transfer (LGT) with causative factors, such as relatedness and shared environment, operating over different evolutionary scales. Multiple gene ontology terms were significantly enriched for each group, and mapping terms onto the phylogeny showed that those corresponding to genes born on branches leading to the major groups represented approximately one-fifth of those enriched. Furthermore, despite the extensive LGT, several biochemical characteristics have been retained since group formation, suggesting genomic cohesiveness through time, and that these characteristics may be fundamental to each group. For example, proteolysis: mitis group; urea metabolism: salivarius group; carbohydrate metabolism: pyogenic group; and transcription regulation: bovis group. PMID:24625962

  5. In-Class Reflective Group Discussion as a Strategy for the Development of Students as Evolving Professionals

    ERIC Educational Resources Information Center

    Tsang, Annetta Kit Lam

    2011-01-01

    The primary aim of this study was to determine perceptions of three cohorts of third year undergraduate students (n = 65) on in-class reflective group discussion as a critical reflective approach for evolving professionals. Reflective group discussions were embedded into a final year course within the University of Queensland Bachelor of Oral…

  6. Signatures of co-evolutionary host-pathogen interactions in the genome of the entomopathogenic nematode Steinernema carpocapsae.

    PubMed

    Flores-Ponce, Mitzi; Vallebueno-Estrada, Miguel; González-Orozco, Eduardo; Ramos-Aboites, Hilda E; García-Chávez, J Noé; Simões, Nelson; Montiel, Rafael

    2017-04-26

    The entomopathogenic nematode Steinernema carpocapsae has been used worldwide as a biocontrol agent for insect pests, making it an interesting model for understanding parasite-host interactions. Two models propose that these interactions are co-evolutionary processes in such a way that equilibrium is never reached. In one model, known as "arms race", new alleles in relevant genes are fixed in both host and pathogens by directional positive selection, producing recurrent and alternating selective sweeps. In the other model, known as"trench warfare", persistent dynamic fluctuations in allele frequencies are sustained by balancing selection. There are some examples of genes evolving according to both models, however, it is not clear to what extent these interactions might alter genome-level evolutionary patterns and intraspecific diversity. Here we investigate some of these aspects by studying genomic variation in S. carpocapsae and other pathogenic and free-living nematodes from phylogenetic clades IV and V. To look for signatures of an arms-race dynamic, we conducted massive scans to detect directional positive selection in interspecific data. In free-living nematodes, we detected a significantly higher proportion of genes with sites under positive selection than in parasitic nematodes. However, in these genes, we found more enriched Gene Ontology terms in parasites. To detect possible effects of dynamic polymorphisms interactions we looked for signatures of balancing selection in intraspecific genomic data. The observed distribution of Tajima's D values in S. carpocapsae was more skewed to positive values and significantly different from the observed distribution in the free-living Caenorhabditis briggsae. Also, the proportion of significant positive values of Tajima's D was elevated in genes that were differentially expressed after induction with insect tissues as compared to both non-differentially expressed genes and the global scan. Our study provides a first

  7. Chloroplast genome expansion by intron multiplication in the basal psychrophilic euglenoid Eutreptiella pomquetensis

    PubMed Central

    Bennett, Matthew S.; Triemer, Richard E.; Preisfeld, Angelika

    2017-01-01

    Background Over the last few years multiple studies have been published showing a great diversity in size of chloroplast genomes (cpGenomes), and in the arrangement of gene clusters, in the Euglenales. However, while these genomes provided important insights into the evolution of cpGenomes across the Euglenales and within their genera, only two genomes were analyzed in regard to genomic variability between and within Euglenales and Eutreptiales. To better understand the dynamics of chloroplast genome evolution in early evolving Eutreptiales, this study focused on the cpGenome of Eutreptiella pomquetensis, and the spread and peculiarities of introns. Methods The Etl. pomquetensis cpGenome was sequenced, annotated and afterwards examined in structure, size, gene order and intron content. These features were compared with other euglenoid cpGenomes as well as those of prasinophyte green algae, including Pyramimonas parkeae. Results and Discussion With about 130,561 bp the chloroplast genome of Etl. pomquetensis, a basal taxon in the phototrophic euglenoids, was considerably larger than the two other Eutreptiales cpGenomes sequenced so far. Although the detected quadripartite structure resembled most green algae and plant chloroplast genomes, the gene content of the single copy regions in Etl. pomquetensis was completely different from those observed in green algae and plants. The gene composition of Etl. pomquetensis was extensively changed and turned out to be almost identical to other Eutreptiales and Euglenales, and not to P. parkeae. Furthermore, the cpGenome of Etl. pomquetensis was unexpectedly permeated by a high number of introns, which led to a substantially larger genome. The 51 identified introns of Etl. pomquetensis showed two major unique features: (i) more than half of the introns displayed a high level of pairwise identities; (ii) no group III introns could be identified in the protein coding genes. These findings support the hypothesis that group III

  8. Sequence Search and Comparative Genomic Analysis of SUMO-Activating Enzymes Using CoGe.

    PubMed

    Carretero-Paulet, Lorenzo; Albert, Victor A

    2016-01-01

    The growing number of genome sequences completed during the last few years has made necessary the development of bioinformatics tools for the easy access and retrieval of sequence data, as well as for downstream comparative genomic analyses. Some of these are implemented as online platforms that integrate genomic data produced by different genome sequencing initiatives with data mining tools as well as various comparative genomic and evolutionary analysis possibilities.Here, we use the online comparative genomics platform CoGe ( http://www.genomevolution.org/coge/ ) (Lyons and Freeling. Plant J 53:661-673, 2008; Tang and Lyons. Front Plant Sci 3:172, 2012) (1) to retrieve the entire complement of orthologous and paralogous genes belonging to the SUMO-Activating Enzymes 1 (SAE1) gene family from a set of species representative of the Brassicaceae plant eudicot family with genomes fully sequenced, and (2) to investigate the history, timing, and molecular mechanisms of the gene duplications driving the evolutionary expansion and functional diversification of the SAE1 family in Brassicaceae.

  9. In situ characterization of cofacial Co(IV) centers in Co 4O 4 cubane: Modeling the high-valent active site in oxygen-evolving catalysts

    DOE PAGES

    Brodsky, Casey N.; Hadt, Ryan G.; Hayes, Dugan; ...

    2017-03-27

    The Co 4O 4 cubane is a representative structural model of oxidic cobalt oxygen evolving catalysts (Co-OECs). The Co-OECs are active when residing at two oxidation levels above an all Co(III) resting state. This doubly oxidized Co(IV) 2 state may be captured in a Co(III) 2(IV) 2 cubane. We demonstrate that the Co(III) 2(IV) 2 cubane may be electrochemically generated and the electronic properties of this unique high-valent state may be probed by in situ spectroscopy. Intervalence charge transfer (IVCT) bands in the near-IR are observed for the Co(III) 2(IV) 2 cubane, and spectroscopic analysis together with electrochemical kinetics measurementsmore » reveal a larger reorganization energy and a smaller electron transfer rate constant for the doubly versus singly oxidized cubane. Spectroelectrochemical X-ray absorption data further reveal systematic spectral changes with successive oxidations from the cubane resting state. Electronic structure calculations correlated to experimental data suggest that this state is best represented as a localized, antiferromagnetically coupled Co(IV) 2 dimer. The exchange coupling in the cofacial Co(IV) 2 site allows for parallels to be drawn between the electronic structure of the Co 4O 4 cubane model system and the high valent active site of the Co-OEC, with specific emphasis on the manifestation of a doubly oxidized Co(IV) 2 center on O–O bond formation.« less

  10. Three Groups of Transposable Elements with Contrasting Copy Number Dynamics and Host Responses in the Maize (Zea mays ssp. mays) Genome

    PubMed Central

    Diez, Concepcion M.; Meca, Esteban; Tenaillon, Maud I.; Gaut, Brandon S.

    2014-01-01

    Most angiosperm nuclear DNA is repetitive and derived from silenced transposable elements (TEs). TE silencing requires substantial resources from the plant host, including the production of small interfering RNAs (siRNAs). Thus, the interaction between TEs and siRNAs is a critical aspect of both the function and the evolution of plant genomes. Yet the co-evolutionary dynamics between these two entities remain poorly characterized. Here we studied the organization of TEs within the maize (Zea mays ssp mays) genome, documenting that TEs fall within three groups based on the class and copy numbers. These groups included DNA elements, low copy RNA elements and higher copy RNA elements. The three groups varied statistically in characteristics that included length, location, age, siRNA expression and 24∶22 nucleotide (nt) siRNA targeting ratios. In addition, the low copy retroelements encompassed a set of TEs that had previously been shown to decrease expression within a 24 nt siRNA biogenesis mutant (mop1). To investigate the evolutionary dynamics of the three groups, we estimated their abundance in two landraces, one with a genome similar in size to that of the maize reference and the other with a 30% larger genome. For all three accessions, we assessed TE abundance as well as 22 nt and 24 nt siRNA content within leaves. The high copy number retroelements are under targeted similarly by siRNAs among accessions, appear to be born of a rapid bust of activity, and may be currently transpositionally dead or limited. In contrast, the lower copy number group of retrolements are targeted more dynamically and have had a long and ongoing history of transposition in the maize genome. PMID:24743518

  11. [Research advances of genomic GYP coding MNS blood group antigens].

    PubMed

    Liu, Chang-Li; Zhao, Wei-Jun

    2012-02-01

    The MNS blood group system includes more than 40 antigens, and the M, N, S and s antigens are the most significant ones in the system. The antigenic determinants of M and N antigens lie on the top of GPA on the surface of red blood cells, while the antigenic determinants of S and s antigens lie on the top of GPB on the surface of red blood cells. The GYPA gene coding GPA and the GYPB gene coding GPB locate at the longarm of chromosome 4 and display 95% homologus sequence, meanwhile both genes locate closely to GYPE gene that did not express product. These three genes formed "GYPA-GYPB-GYPE" structure called GYP genome. This review focuses on the molecular basis of genomic GYP and the variety of GYP genome in the expression of diversity MNS blood group antigens. The molecular basis of Miltenberger hybrid glycophorin polymorphism is specifically expounded.

  12. Genome Sequence of Candidatus Nitrososphaera evergladensis from Group I.1b Enriched from Everglades Soil Reveals Novel Genomic Features of the Ammonia-Oxidizing Archaea

    PubMed Central

    Zhalnina, Kateryna V.; Dias, Raquel; Leonard, Michael T.; Dorr de Quadros, Patricia; Camargo, Flavio A. O.; Drew, Jennifer C.; Farmerie, William G.; Daroub, Samira H.; Triplett, Eric W.

    2014-01-01

    The activity of ammonia-oxidizing archaea (AOA) leads to the loss of nitrogen from soil, pollution of water sources and elevated emissions of greenhouse gas. To date, eight AOA genomes are available in the public databases, seven are from the group I.1a of the Thaumarchaeota and only one is from the group I.1b, isolated from hot springs. Many soils are dominated by AOA from the group I.1b, but the genomes of soil representatives of this group have not been sequenced and functionally characterized. The lack of knowledge of metabolic pathways of soil AOA presents a critical gap in understanding their role in biogeochemical cycles. Here, we describe the first complete genome of soil archaeon Candidatus Nitrososphaera evergladensis, which has been reconstructed from metagenomic sequencing of a highly enriched culture obtained from an agricultural soil. The AOA enrichment was sequenced with the high throughput next generation sequencing platforms from Pacific Biosciences and Ion Torrent. The de novo assembly of sequences resulted in one 2.95 Mb contig. Annotation of the reconstructed genome revealed many similarities of the basic metabolism with the rest of sequenced AOA. Ca. N. evergladensis belongs to the group I.1b and shares only 40% of whole-genome homology with the closest sequenced relative Ca. N. gargensis. Detailed analysis of the genome revealed coding sequences that were completely absent from the group I.1a. These unique sequences code for proteins involved in control of DNA integrity, transporters, two-component systems and versatile CRISPR defense system. Notably, genomes from the group I.1b have more gene duplications compared to the genomes from the group I.1a. We suggest that the presence of these unique genes and gene duplications may be associated with the environmental versatility of this group. PMID:24999826

  13. Estimation of (co)variances for genomic regions of flexible sizes: application to complex infectious udder diseases in dairy cattle

    PubMed Central

    2012-01-01

    Background Multi-trait genomic models in a Bayesian context can be used to estimate genomic (co)variances, either for a complete genome or for genomic regions (e.g. per chromosome) for the purpose of multi-trait genomic selection or to gain further insight into the genomic architecture of related traits such as mammary disease traits in dairy cattle. Methods Data on progeny means of six traits related to mastitis resistance in dairy cattle (general mastitis resistance and five pathogen-specific mastitis resistance traits) were analyzed using a bivariate Bayesian SNP-based genomic model with a common prior distribution for the marker allele substitution effects and estimation of the hyperparameters in this prior distribution from the progeny means data. From the Markov chain Monte Carlo samples of the allele substitution effects, genomic (co)variances were calculated on a whole-genome level, per chromosome, and in regions of 100 SNP on a chromosome. Results Genomic proportions of the total variance differed between traits. Genomic correlations were lower than pedigree-based genetic correlations and they were highest between general mastitis and pathogen-specific traits because of the part-whole relationship between these traits. The chromosome-wise genomic proportions of the total variance differed between traits, with some chromosomes explaining higher or lower values than expected in relation to chromosome size. Few chromosomes showed pleiotropic effects and only chromosome 19 had a clear effect on all traits, indicating the presence of QTL with a general effect on mastitis resistance. The region-wise patterns of genomic variances differed between traits. Peaks indicating QTL were identified but were not very distinctive because a common prior for the marker effects was used. There was a clear difference in the region-wise patterns of genomic correlation among combinations of traits, with distinctive peaks indicating the presence of pleiotropic QTL. Conclusions

  14. Group-theoretic models of the inversion process in bacterial genomes.

    PubMed

    Egri-Nagy, Attila; Gebhardt, Volker; Tanaka, Mark M; Francis, Andrew R

    2014-07-01

    The variation in genome arrangements among bacterial taxa is largely due to the process of inversion. Recent studies indicate that not all inversions are equally probable, suggesting, for instance, that shorter inversions are more frequent than longer, and those that move the terminus of replication are less probable than those that do not. Current methods for establishing the inversion distance between two bacterial genomes are unable to incorporate such information. In this paper we suggest a group-theoretic framework that in principle can take these constraints into account. In particular, we show that by lifting the problem from circular permutations to the affine symmetric group, the inversion distance can be found in polynomial time for a model in which inversions are restricted to acting on two regions. This requires the proof of new results in group theory, and suggests a vein of new combinatorial problems concerning permutation groups on which group theorists will be needed to collaborate with biologists. We apply the new method to inferring distances and phylogenies for published Yersinia pestis data.

  15. Genomic Definition of Hypervirulent and Multidrug-Resistant Klebsiella pneumoniae Clonal Groups

    PubMed Central

    Bialek-Davenet, Suzanne; Criscuolo, Alexis; Ailloud, Florent; Passet, Virginie; Jones, Louis; Delannoy-Vieillard, Anne-Sophie; Garin, Benoit; Le Hello, Simon; Arlet, Guillaume; Nicolas-Chanoine, Marie-Hélène; Decré, Dominique

    2014-01-01

    Multidrug-resistant and highly virulent Klebsiella pneumoniae isolates are emerging, but the clonal groups (CGs) corresponding to these high-risk strains have remained imprecisely defined. We aimed to identify K. pneumoniae CGs on the basis of genome-wide sequence variation and to provide a simple bioinformatics tool to extract virulence and resistance gene data from genomic data. We sequenced 48 K. pneumoniae isolates, mostly of serotypes K1 and K2, and compared the genomes with 119 publicly available genomes. A total of 694 highly conserved genes were included in a core-genome multilocus sequence typing scheme, and cluster analysis of the data enabled precise definition of globally distributed hypervirulent and multidrug-resistant CGs. In addition, we created a freely accessible database, BIGSdb-Kp, to enable rapid extraction of medically and epidemiologically relevant information from genomic sequences of K. pneumoniae. Although drug-resistant and virulent K. pneumoniae populations were largely nonoverlapping, isolates with combined virulence and resistance features were detected. PMID:25341126

  16. Dynamic Nucleotide Mutation Gradients and Control Region Usage in Squamate Reptile Mitochondrial Genomes

    PubMed Central

    Castoe, T.A.; Gu, W.; de Koning, A.P.J.; Daza, J.M.; Jiang, Z.J.; Parkinson, C.L.; Pollock, D.D.

    2010-01-01

    Gradients of nucleotide bias and substitution rates occur in vertebrate mitochondrial genomes due to the asymmetric nature of the replication process. The evolution of these gradients has previously been studied in detail in primates, but not in other vertebrate groups. From the primate study, the strengths of these gradients are known to evolve in ways that can substantially alter the substitution process, but it is unclear how rapidly they evolve over evolutionary time or how different they may be in different lineages or groups of vertebrates. Given the importance of mitochondrial genomes in phylogenetics and molecular evolutionary research, a better understanding of how asymmetric mitochondrial substitution gradients evolve would contribute key insights into how this gradient evolution may mislead evolutionary inferences, and how it may also be incorporated into new evolutionary models. Most snake mitochondrial genomes have an additional interesting feature, 2 nearly identical control regions, which vary among different species in the extent that they are used as origins of replication. Given the expanded sampling of complete snake genomes currently available, together with 2 additional snakes sequenced in this study, we reexamined gradient strength and CR usage in alethinophidian snakes as well as several lizards that possess dual CRs. Our results suggest that nucleotide substitution gradients (and corresponding nucleotide bias) and CR usage is highly labile over the ∼200 m.y. of squamate evolution, and demonstrates greater overall variability than previously shown in primates. The evidence for the existence of such gradients, and their ability to evolve rapidly and converge among unrelated species suggests that gradient dynamics could easily mislead phylogenetic and molecular evolutionary inferences, and argues strongly that these dynamics should be incorporated into phylogenetic models. PMID:20215734

  17. Genome-nutrition divergence: evolving understanding of the malnutrition spectrum.

    PubMed

    Eaton, Jacob C; Iannotti, Lora L

    2017-11-01

    Humans adapted over a period of 2.3 million years to a diet high in quality and diversity. Genome-nutrition divergence describes the misalignment between modern global diets and the genome formed through evolution. A survey of hominin diets over time shows that humans have thrived on a broad range of foods. Earlier diets were highly diverse and nutrient dense, in contrast to modern food systems in which monotonous diets of staple cereals and ultraprocessed foods play a more prominent role. Applying the lens of genome-nutrition divergence to malnutrition reveals shared risk factors for undernutrition and overnutrition at nutrient, food, and environmental levels. Mechanisms for food system shifts, such as crop-neutral agricultural policy, agroecology, and social policy, are explored as a means to realign modern diets with the nutritional patterns to which humans may be better adapted to thrive. © The Author(s) 2017. Published by Oxford University Press on behalf of the International Life Sciences Institute. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  18. Autonomous Agent-Based Systems and Their Applications in Fluid Dynamics, Particle Separation, and Co-evolving Networks

    NASA Astrophysics Data System (ADS)

    Graeser, Oliver

    This thesis comprises three parts, reporting research results in Fluid Dynamics (Part I), Particle Separation (Part II) and Co-evolving Networks (Part III). Part I deals with the simulation of fluid dynamics using the lattice-Boltzmann method. Microfluidic devices often feature two-dimensional, repetitive arrays. Flows through such devices are pressure-driven and confined by solid walls. We have defined new adaptive generalised periodic boundary conditions to represent the effects of outer solid walls, and are thus able to exploit the periodicity of the array by simulating the flow through one unit cell in lieu of the entire device. The so-calculated fully developed flow describes the flow through the entire array accurately, but with computational requirements that are reduced according to the dimensions of the array. Part II discusses the problem of separating macromolecules like proteins or DNA coils. The reliable separation of such molecules is a crucial task in molecular biology. The use of Brownian ratchets as mechanisms for the separation of such particles has been proposed and discussed during the last decade. Pressure-driven flows have so far been dismissed as possible driving forces for Brownian ratchets, as they do not generate ratchet asymmetry. We propose a microfluidic design that uses pressure-driven flows to create asymmetry and hence allows particle separation. The dependence of the asymmetry on various factors of the microfluidic geometry is discussed. We further exemplify the feasibility of our approach using Brownian dynamics simulations of particles of different sizes in such a device. The results show that ratchet-based particle separation using flows as the driving force is possible. Simulation results and ratchet theory predictions are in excellent agreement. Part III deals with the co-evolution of networks and dynamic models. A group of agents occupies the nodes of a network, which defines the relationship between these agents. The

  19. Genome scans on experimentally evolved populations reveal candidate regions for adaptation to plant resistance in the potato cyst nematode Globodera pallida.

    PubMed

    Eoche-Bosy, D; Gautier, M; Esquibet, M; Legeai, F; Bretaudeau, A; Bouchez, O; Fournet, S; Grenier, E; Montarry, J

    2017-09-01

    Improving resistance durability involves to be able to predict the adaptation speed of pathogen populations. Identifying the genetic bases of pathogen adaptation to plant resistances is a useful step to better understand and anticipate this phenomenon. Globodera pallida is a major pest of potato crop for which a resistance QTL, GpaV vrn , has been identified in Solanum vernei. However, its durability is threatened as G. pallida populations are able to adapt to the resistance in few generations. The aim of this study was to investigate the genomic regions involved in the resistance breakdown by coupling experimental evolution and high-density genome scan. We performed a whole-genome resequencing of pools of individuals (Pool-Seq) belonging to G. pallida lineages derived from two independent populations having experimentally evolved on susceptible and resistant potato cultivars. About 1.6 million SNPs were used to perform the genome scan using a recent model testing for adaptive differentiation and association to population-specific covariables. We identified 275 outliers and 31 of them, which also showed a significant reduction in diversity in adapted lineages, were investigated for their genic environment. Some candidate genomic regions contained genes putatively encoding effectors and were enriched in SPRYSECs, known in cyst nematodes to be involved in pathogenicity and in (a)virulence. Validated candidate SNPs will provide a useful molecular tool to follow frequencies of virulence alleles in natural G. pallida populations and define efficient strategies of use of potato resistances maximizing their durability. © 2017 John Wiley & Sons Ltd.

  20. Indel Group in Genomes (IGG) Molecular Genetic Markers1[OPEN

    PubMed Central

    Burkart-Waco, Diana; Kuppu, Sundaram; Britt, Anne; Chetelat, Roger

    2016-01-01

    Genetic markers are essential when developing or working with genetically variable populations. Indel Group in Genomes (IGG) markers are primer pairs that amplify single-locus sequences that differ in size for two or more alleles. They are attractive for their ease of use for rapid genotyping and their codominant nature. Here, we describe a heuristic algorithm that uses a k-mer-based approach to search two or more genome sequences to locate polymorphic regions suitable for designing candidate IGG marker primers. As input to the IGG pipeline software, the user provides genome sequences and the desired amplicon sizes and size differences. Primer sequences flanking polymorphic insertions/deletions are produced as output. IGG marker files for three sets of genomes, Solanum lycopersicum/Solanum pennellii, Arabidopsis (Arabidopsis thaliana) Columbia-0/Landsberg erecta-0 accessions, and S. lycopersicum/S. pennellii/Solanum tuberosum (three-way polymorphic) are included. PMID:27436831

  1. Identification and assembly of genomes and genetic elements in complex metagenomic samples without using reference genomes.

    PubMed

    Nielsen, H Bjørn; Almeida, Mathieu; Juncker, Agnieszka Sierakowska; Rasmussen, Simon; Li, Junhua; Sunagawa, Shinichi; Plichta, Damian R; Gautier, Laurent; Pedersen, Anders G; Le Chatelier, Emmanuelle; Pelletier, Eric; Bonde, Ida; Nielsen, Trine; Manichanh, Chaysavanh; Arumugam, Manimozhiyan; Batto, Jean-Michel; Quintanilha Dos Santos, Marcelo B; Blom, Nikolaj; Borruel, Natalia; Burgdorf, Kristoffer S; Boumezbeur, Fouad; Casellas, Francesc; Doré, Joël; Dworzynski, Piotr; Guarner, Francisco; Hansen, Torben; Hildebrand, Falk; Kaas, Rolf S; Kennedy, Sean; Kristiansen, Karsten; Kultima, Jens Roat; Léonard, Pierre; Levenez, Florence; Lund, Ole; Moumen, Bouziane; Le Paslier, Denis; Pons, Nicolas; Pedersen, Oluf; Prifti, Edi; Qin, Junjie; Raes, Jeroen; Sørensen, Søren; Tap, Julien; Tims, Sebastian; Ussery, David W; Yamada, Takuji; Renault, Pierre; Sicheritz-Ponten, Thomas; Bork, Peer; Wang, Jun; Brunak, Søren; Ehrlich, S Dusko

    2014-08-01

    Most current approaches for analyzing metagenomic data rely on comparisons to reference genomes, but the microbial diversity of many environments extends far beyond what is covered by reference databases. De novo segregation of complex metagenomic data into specific biological entities, such as particular bacterial strains or viruses, remains a largely unsolved problem. Here we present a method, based on binning co-abundant genes across a series of metagenomic samples, that enables comprehensive discovery of new microbial organisms, viruses and co-inherited genetic entities and aids assembly of microbial genomes without the need for reference sequences. We demonstrate the method on data from 396 human gut microbiome samples and identify 7,381 co-abundance gene groups (CAGs), including 741 metagenomic species (MGS). We use these to assemble 238 high-quality microbial genomes and identify affiliations between MGS and hundreds of viruses or genetic entities. Our method provides the means for comprehensive profiling of the diversity within complex metagenomic samples.

  2. Mechanisms Used for Genomic Proliferation by Thermophilic Group II Introns

    PubMed Central

    Mohr, Georg; Ghanem, Eman; Lambowitz, Alan M.

    2010-01-01

    Mobile group II introns, which are found in bacterial and organellar genomes, are site-specific retroelments hypothesized to be evolutionary ancestors of spliceosomal introns and retrotransposons in higher organisms. Most bacteria, however, contain no more than one or a few group II introns, making it unclear how introns could have proliferated to higher copy numbers in eukaryotic genomes. An exception is the thermophilic cyanobacterium Thermosynechococcus elongatus, which contains 28 closely related copies of a group II intron, constituting ∼1.3% of the genome. Here, by using a combination of bioinformatics and mobility assays at different temperatures, we identified mechanisms that contribute to the proliferation of T. elongatus group II introns. These mechanisms include divergence of DNA target specificity to avoid target site saturation; adaptation of some intron-encoded reverse transcriptases to splice and mobilize multiple degenerate introns that do not encode reverse transcriptases, leading to a common splicing apparatus; and preferential insertion within other mobile introns or insertion elements, which provide new unoccupied sites in expanding non-essential DNA regions. Additionally, unlike mesophilic group II introns, the thermophilic T. elongatus introns rely on elevated temperatures to help promote DNA strand separation, enabling access to a larger number of DNA target sites by base pairing of the intron RNA, with minimal constraint from the reverse transcriptase. Our results provide insight into group II intron proliferation mechanisms and show that higher temperatures, which are thought to have prevailed on Earth during the emergence of eukaryotes, favor intron proliferation by increasing the accessibility of DNA target sites. We also identify actively mobile thermophilic introns, which may be useful for structural studies, gene targeting in thermophiles, and as a source of thermostable reverse transcriptases. PMID:20543989

  3. The patterning center of excellence (CoE): an evolving lithographic enablement model

    NASA Astrophysics Data System (ADS)

    Montgomery, Warren; Chun, Jun Sung; Liehr, Michael; Tittnich, Michael

    2015-03-01

    As EUV lithography moves toward high-volume manufacturing (HVM), a key need for the lithography materials makers is access to EUV photons and imaging. The SEMATECH Resist Materials Development Center (RMDC) provided a solution path by enabling the Resist and Materials companies to work together (using SUNY Polytechnic Institute's Colleges of Nanoscale Science and Engineering (SUNY Poly CNSE) -based exposure systems), in a consortium fashion, in order to address the need for EUV photons. Thousands of wafers have been processed by the RMDC (leveraging the SUNY Poly CNSE/SEMATECH MET, SUNY Poly CNSE Alpha Demo Tool (ADT) and the SEMATECH Lawrence Berkeley MET) allowing many of the questions associated with EUV materials development to be answered. In this regard the activities associated with the RMDC are continuing. As the major Integrated Device Manufacturers (IDMs) have continued to purchase EUV scanners, Materials companies must now provide scanner based test data that characterizes the lithography materials they are producing. SUNY Poly CNSE and SEMATECH have partnered to evolve the RMDC into "The Patterning Center of Excellence (CoE)". The new CoE leverages the capability of the SUNY Poly CNSE-based full field ASML 3300 EUV scanner and combines that capability with EUV Microexposure (MET) systems resident in the SEMATECH RMDC to create an integrated lithography model which will allow materials companies to advance materials development in ways not previously possible.

  4. The Effects of Protostellar Disk Turbulence on CO Emission Lines: A Comparison Study of Disks with Constant CO Abundance versus Chemically Evolving Disks

    NASA Astrophysics Data System (ADS)

    Yu, Mo; Evans, Neal J., II; Dodson-Robinson, Sarah E.; Willacy, Karen; Turner, Neal J.

    2017-12-01

    Turbulence is the leading candidate for angular momentum transport in protoplanetary disks and therefore influences disk lifetimes and planet formation timescales. However, the turbulent properties of protoplanetary disks are poorly constrained observationally. Recent studies have found turbulent speeds smaller than what fully-developed MRI would produce (Flaherty et al.). However, existing studies assumed a constant CO/H2 ratio of 10-4 in locations where CO is not frozen-out or photo-dissociated. Our previous studies of evolving disk chemistry indicate that CO is depleted by incorporation into complex organic molecules well inside the freeze-out radius of CO. We consider the effects of this chemical depletion on measurements of turbulence. Simon et al. suggested that the ratio of the peak line flux to the flux at line center of the CO J = 3-2 transition is a reasonable diagnostic of turbulence, so we focus on that metric, while adding some analysis of the more complex effects on spatial distribution. We simulate the emission lines of CO based on chemical evolution models presented in Yu et al., and find that the peak-to-trough ratio changes as a function of time as CO is destroyed. Specifically, a CO-depleted disk with high turbulent velocity mimics the peak-to-trough ratios of a non-CO-depleted disk with lower turbulent velocity. We suggest that disk observers and modelers take into account the possibility of CO depletion when using line profiles or peak-to-trough ratios to constrain the degree of turbulence in disks. Assuming that {CO}/{{{H}}}2={10}-4 at all disk radii can lead to underestimates of turbulent speeds in the disk by at least 0.2 km s-1.

  5. History of genome editing in yeast.

    PubMed

    Fraczek, Marcin G; Naseeb, Samina; Delneri, Daniela

    2018-05-01

    For thousands of years humans have used the budding yeast Saccharomyces cerevisiae for the production of bread and alcohol; however, in the last 30-40 years our understanding of the yeast biology has dramatically increased, enabling us to modify its genome. Although S. cerevisiae has been the main focus of many research groups, other non-conventional yeasts have also been studied and exploited for biotechnological purposes. Our experiments and knowledge have evolved from recombination to high-throughput PCR-based transformations to highly accurate CRISPR methods in order to alter yeast traits for either research or industrial purposes. Since the release of the genome sequence of S. cerevisiae in 1996, the precise and targeted genome editing has increased significantly. In this 'Budding topic' we discuss the significant developments of genome editing in yeast, mainly focusing on Cre-loxP mediated recombination, delitto perfetto and CRISPR/Cas. © 2018 The Authors. Yeast published by John Wiley & Sons, Ltd.

  6. The 5S rDNA family evolves through concerted and birth-and-death evolution in fish genomes: an example from freshwater stingrays

    PubMed Central

    2011-01-01

    Background Ribosomal 5S genes are well known for the critical role they play in ribosome folding and functionality. These genes are thought to evolve in a concerted fashion, with high rates of homogenization of gene copies. However, the majority of previous analyses regarding the evolutionary process of rDNA repeats were conducted in invertebrates and plants. Studies have also been conducted on vertebrates, but these analyses were usually restricted to the 18S, 5.8S and 28S rRNA genes. The recent identification of divergent 5S rRNA gene paralogs in the genomes of elasmobranches and teleost fishes indicate that the eukaryotic 5S rRNA gene family has a more complex genomic organization than previously thought. The availability of new sequence data from lower vertebrates such as teleosts and elasmobranches enables an enhanced evolutionary characterization of 5S rDNA among vertebrates. Results We identified two variant classes of 5S rDNA sequences in the genomes of Potamotrygonidae stingrays, similar to the genomes of other vertebrates. One class of 5S rRNA genes was shared only by elasmobranches. A broad comparative survey among 100 vertebrate species suggests that the 5S rRNA gene variants in fishes originated from rounds of genome duplication. These variants were then maintained or eliminated by birth-and-death mechanisms, under intense purifying selection. Clustered multiple copies of 5S rDNA variants could have arisen due to unequal crossing over mechanisms. Simultaneously, the distinct genome clusters were independently homogenized, resulting in the maintenance of clusters of highly similar repeats through concerted evolution. Conclusions We believe that 5S rDNA molecular evolution in fish genomes is driven by a mixed mechanism that integrates birth-and-death and concerted evolution. PMID:21627815

  7. Different selective pressures lead to different genomic outcomes as newly-formed hybrid yeasts evolve.

    PubMed

    Piotrowski, Jeff S; Nagarajan, Saisubramanian; Kroll, Evgueny; Stanbery, Alison; Chiotti, Kami E; Kruckeberg, Arthur L; Dunn, Barbara; Sherlock, Gavin; Rosenzweig, Frank

    2012-04-02

    Interspecific hybridization occurs in every eukaryotic kingdom. While hybrid progeny are frequently at a selective disadvantage, in some instances their increased genome size and complexity may result in greater stress resistance than their ancestors, which can be adaptively advantageous at the edges of their ancestors' ranges. While this phenomenon has been repeatedly documented in the field, the response of hybrid populations to long-term selection has not often been explored in the lab. To fill this knowledge gap we crossed the two most distantly related members of the Saccharomyces sensu stricto group, S. cerevisiae and S. uvarum, and established a mixed population of homoploid and aneuploid hybrids to study how different types of selection impact hybrid genome structure. As temperature was raised incrementally from 31°C to 46.5°C over 500 generations of continuous culture, selection favored loss of the S. uvarum genome, although the kinetics of genome loss differed among independent replicates. Temperature-selected isolates exhibited greater inherent and induced thermal tolerance than parental species and founding hybrids, and also exhibited ethanol resistance. In contrast, as exogenous ethanol was increased from 0% to 14% over 500 generations of continuous culture, selection favored euploid S. cerevisiae x S. uvarum hybrids. Ethanol-selected isolates were more ethanol tolerant than S. uvarum and one of the founding hybrids, but did not exhibit resistance to temperature stress. Relative to parental and founding hybrids, temperature-selected strains showed heritable differences in cell wall structure in the forms of increased resistance to zymolyase digestion and Micafungin, which targets cell wall biosynthesis. This is the first study to show experimentally that the genomic fate of newly-formed interspecific hybrids depends on the type of selection they encounter during the course of evolution, underscoring the importance of the ecological theatre in

  8. Positive Selection in Rapidly Evolving Plastid–Nuclear Enzyme Complexes

    PubMed Central

    Rockenbach, Kate; Havird, Justin C.; Monroe, J. Grey; Triant, Deborah A.; Taylor, Douglas R.; Sloan, Daniel B.

    2016-01-01

    Rates of sequence evolution in plastid genomes are generally low, but numerous angiosperm lineages exhibit accelerated evolutionary rates in similar subsets of plastid genes. These genes include clpP1 and accD, which encode components of the caseinolytic protease (CLP) and acetyl-coA carboxylase (ACCase) complexes, respectively. Whether these extreme and repeated accelerations in rates of plastid genome evolution result from adaptive change in proteins (i.e., positive selection) or simply a loss of functional constraint (i.e., relaxed purifying selection) is a source of ongoing controversy. To address this, we have taken advantage of the multiple independent accelerations that have occurred within the genus Silene (Caryophyllaceae) by examining phylogenetic and population genetic variation in the nuclear genes that encode subunits of the CLP and ACCase complexes. We found that, in species with accelerated plastid genome evolution, the nuclear-encoded subunits in the CLP and ACCase complexes are also evolving rapidly, especially those involved in direct physical interactions with plastid-encoded proteins. A massive excess of nonsynonymous substitutions between species relative to levels of intraspecific polymorphism indicated a history of strong positive selection (particularly in CLP genes). Interestingly, however, some species are likely undergoing loss of the native (heteromeric) plastid ACCase and putative functional replacement by a duplicated cytosolic (homomeric) ACCase. Overall, the patterns of molecular evolution in these plastid–nuclear complexes are unusual for anciently conserved enzymes. They instead resemble cases of antagonistic coevolution between pathogens and host immune genes. We discuss a possible role of plastid–nuclear conflict as a novel cause of accelerated evolution. PMID:27707788

  9. CyanoClust: comparative genome resources of cyanobacteria and plastids.

    PubMed

    Sasaki, Naobumi V; Sato, Naoki

    2010-01-01

    Cyanobacteria, which perform oxygen-evolving photosynthesis as do chloroplasts of plants and algae, are one of the best-studied prokaryotic phyla and one from which many representative genomes have been sequenced. Lack of a suitable comparative genomic database has been a problem in cyanobacterial genomics because many proteins involved in physiological functions such as photosynthesis and nitrogen fixation are not catalogued in commonly used databases, such as Clusters of Orthologous Proteins (COG). CyanoClust is a database of homolog groups in cyanobacteria and plastids that are produced by the program Gclust. We have developed a web-server system for the protein homology database featuring cyanobacteria and plastids. Database URL: http://cyanoclust.c.u-tokyo.ac.jp/.

  10. Structure of p-shell nuclei using three-nucleon interactions evolved with the similarity renormalization group

    DOE PAGES

    Jurgenson, E. D.; Maris, P.; Furnstahl, R. J.; ...

    2013-05-13

    The similarity renormalization group (SRG) is used to soften interactions for ab initio nuclear structure calculations by decoupling low- and high-energy Hamiltonian matrix elements. The substantial contribution of both initial and SRG-induced three-nucleon forces requires their consistent evolution in a three-particle basis space before applying them to larger nuclei. While, in principle, the evolved Hamiltonians are unitarily equivalent, in practice the need for basis truncation introduces deviations, which must be monitored. Here we present benchmark no-core full configuration calculations with SRG-evolved interactions in p-shell nuclei over a wide range of softening. As a result, these calculations are used to assessmore » convergence properties, extrapolation techniques, and the dependence of energies, including four-body contributions, on the SRG resolution scale.« less

  11. Origin of the Y genome in Elymus and its relationship to other genomes in Triticeae based on evidence from elongation factor G (EF-G) gene sequences.

    PubMed

    Sun, Genlou; Komatsuda, Takao

    2010-08-01

    It is well known that Elymus arose through hybridization between representatives of different genera. Cytogenetic analyses show that all its members include the St genome in combination with one or more of four other genomes, the H, Y, P, and W genomes. The origins of the H, P, and W genomes are known, but not for the Y genome. We analyzed the single copy nuclear gene coding for elongation factor G (EF-G) from 28 accessions of polyploid Elymus species and 45 accessions of diploid Triticeae species in order to investigate origin of the Y genome and its relationship to other genomes in the tribe Triticeae. Sequence comparisons among the St, H, Y, P, W, and E genomes detected genome-specific polymorphisms at 66 nucleotide positions. The St and Y genomes are relatively dissimilar. The phylogeny of the Y genome sequences was investigated for the first time. They were most similar to the W genome sequences. The Y genome sequences were placed in two different groups. These two groups were included in an unresolved clade that included the W and E sequences as well as sequences from many annual species. The H genomes sequences were in a clade with the F, P, and Ns genome sequences as sister groups. These two clades were more closely related to each other and to the L and Xp genomes than they were to the St genome sequences. These data support the hypothesis that the Y genome evolved in a diploid species and has a different origin from the St genome. Copyright 2010 Elsevier Inc. All rights reserved.

  12. [Three-dimensional genome organization: a lesson from the Polycomb-Group proteins].

    PubMed

    Bantignies, Frédéric

    2013-01-01

    As more and more genomes are being explored and annotated, important features of three-dimensional (3D) genome organization are just being uncovered. In the light of what we know about Polycomb group (PcG) proteins, we will present the latest findings on this topic. The PcG proteins are well-conserved chromatin factors that repress transcription of numerous target genes. They bind the genome at specific sites, forming chromatin domains of associated histone modifications as well as higher-order chromatin structures. These 3D chromatin structures involve the interactions between PcG-bound regulatory regions at short- and long-range distances, and may significantly contribute to PcG function. Recent high throughput "Chromosome Conformation Capture" (3C) analyses have revealed many other higher order structures along the chromatin fiber, partitioning the genomes into well demarcated topological domains. This revealed an unprecedented link between linear epigenetic domains and chromosome architecture, which might be intimately connected to genome function. © Société de Biologie, 2013.

  13. Genomics, evolution and development of amphioxus and tunicates: The Goldilocks principle.

    PubMed

    Holland, Linda Z

    2015-06-01

    Morphological comparisons among extant animals have long been used to infer their long-extinct ancestors for which the fossil record is poor or non-existent. For evolution of the vertebrates, the comparison has typically involved amphioxus and vertebrates. Both groups are evolving relatively slowly, and their genomes share a high level of synteny. Both vertebrates and amphioxus have regulative development in which cell fates become fixed only gradually during embryogenesis. Thus, their development fits a modified hourglass model in which constraints are greatest at the phylotypic stage (i.e., the late neurula/early larva), but are somewhat greater on earlier development than on later development. In contrast, the third group of chordates, the tunicates, which are sister group to vertebrates, are evolving rapidly. Constraints on evolution of tunicate genomes are relaxed, and they have discarded key developmental genes and organized much of their coding sequences into operons, which are transcribed as a single mRNA that undergoes trans-splicing. This contrasts with vertebrates and amphioxus, whose genomes are not organized into operons. Concomitantly, tunicates have switched to determinant development with very early fixation of cell fates. Thus, tunicate development more closely fits a progressive divergence model (shaped more like a wine glass than an hourglass) in which the constraints on the zygote and very early development are greatest. This model can help explain why tunicate body plans are so very diverse. The relaxed constraints on development after early cleavage stages are correlated with relaxed constraints on genome evolution. The question remains: which came first? © 2014 Wiley Periodicals, Inc.

  14. Genome Evolution of Plant-Parasitic Nematodes.

    PubMed

    Kikuchi, Taisei; Eves-van den Akker, Sebastian; Jones, John T

    2017-08-04

    Plant parasitism has evolved independently on at least four separate occasions in the phylum Nematoda. The application of next-generation sequencing (NGS) to plant-parasitic nematodes has allowed a wide range of genome- or transcriptome-level comparisons, and these have identified genome adaptations that enable parasitism of plants. Current genome data suggest that horizontal gene transfer, gene family expansions, evolution of new genes that mediate interactions with the host, and parasitism-specific gene regulation are important adaptations that allow nematodes to parasitize plants. Sequencing of a larger number of nematode genomes, including plant parasites that show different modes of parasitism or that have evolved in currently unsampled clades, and using free-living taxa as comparators would allow more detailed analysis and a better understanding of the organization of key genes within the genomes. This would facilitate a more complete understanding of the way in which parasitism has shaped the genomes of plant-parasitic nematodes.

  15. Comparative Genomic Analysis MERS CoV Isolated from Humans and Camels with Special Reference to Virus Encoded Helicase.

    PubMed

    Alnazawi, Mohamed; Altaher, Abdallah; Kandeel, Mahmoud

    2017-01-01

    Middle East Respiratory Syndrome Coronavirus (MERS CoV) is a new emerging viral disease characterized by high fatality rate. Understanding MERS CoV genetic aspects and codon usage pattern is important to understand MERS CoV survival, adaptation, evolution, resistance to innate immunity, and help in finding the unique aspects of the virus for future drug discovery experiments. In this work, we provide comprehensive analysis of 238 MERS CoV full genomes comprised of human (hMERS) and camel (cMERS) isolates of the virus. MERS CoV genome shaping seems to be under compositional and mutational bias, as revealed by preference of A/T over G/C nucleotides, preferred codons, nucleotides at the third position of codons (NT3s), relative synonymous codon usage, hydropathicity (Gravy), and aromaticity (Aromo) indices. Effective number of codons (ENc) analysis reveals a general slight codon usage bias. Codon adaptation index reveals incomplete adaptation to host environment. MERS CoV showed high ability to resist the innate immune response by showing lower CpG frequencies. Neutrality evolution analysis revealed a more significant role of mutation pressure in cMERS over hMERS. Correspondence analysis revealed that MERS CoV genomes have three genetic clusters, which were distinct in their codon usage, host, and geographic distribution. Additionally, virtual screening and binding experiments were able to identify three new virus-encoded helicase binding compounds. These compounds can be used for further optimization of inhibitors.

  16. Evolving molecular era of childhood medulloblastoma: time to revisit therapy.

    PubMed

    Khatua, Soumen

    2016-01-01

    Currently medulloblastoma is treated with a uniform therapeutic approach based on histopathology and clinico-radiological risk stratification, resulting in unpredictable treatment failure and relapses. Improved understanding of the biological, molecular and genetic make-up of these tumors now clearly identifies it as a compendium of four distinct subtypes (WNT, SHH, group 3 and 4). Advances in utilization of the genomic and epigenomic machinery have now delineated genetic aberrations and epigenetic perturbations in each subgroup as potential druggable targets. This has resulted in endeavors to profile targeted therapy. The challenge and future of medulloblastoma therapeutics will be to keep pace with the evolving novel biological insights and translating them into optimal targeted treatment regimens.

  17. Telomeres and NextGen CO-FISH: Directional Genomic Hybridization (Telo-dGH™).

    PubMed

    McKenna, Miles J; Robinson, Erin; Goodwin, Edwin H; Cornforth, Michael N; Bailey, Susan M

    2017-01-01

    The cytogenomics-based methodology of Directional Genomic Hybridization (dGH™) emerged from the concept of strand-specific hybridization, first made possible by Chromosome Orientation FISH (CO-FISH), the utility of which was demonstrated in a variety of early applications, often involving telomeres. Similar to standard whole chromosome painting (FISH), dGH™ is capable of identifying inter-chromosomal rearrangements (translocations between chromosomes), but its distinctive strength stems from its ability to detect intra-chromosomal rearrangements (inversions within chromosomes), and to do so at higher resolution than previously possible. dGH™ brings together the strand specificity and directionality of CO-FISH with sophisticated bioinformatics-based oligonucleotide probe design to unique sequences. dGH™ serves not only as a powerful discovery tool-capable of interrogating the entire genome at the megabase level-it can also be used for high-resolution targeted detection of known inversions, a valuable attribute in both research and clinical settings. Detection of chromosomal inversions, particularly small ones, poses a formidable challenge for more traditional cytogenetic approaches, especially when they occur near the ends or telomeric regions. Here, we describe Telo-dGH™, a strand-specific scheme that utilizes dGH™ in combination with telomere CO-FISH to differentiate between terminal exchange events, specifically terminal inversions, and an altogether different form of genetic recombination that often occurs near the telomere, namely sister chromatid exchange (SCE).

  18. Genomic insights into the taxonomic status of the Bacillus cereus group

    PubMed Central

    Liu, Yang; Lai, Qiliang; Göker, Markus; Meier-Kolthoff, Jan P.; Wang, Meng; Sun, Yamin; Wang, Lei; Shao, Zongze

    2015-01-01

    The identification and phylogenetic relationships of bacteria within the Bacillus cereus group are controversial. This study aimed at determining the taxonomic affiliations of these strains using the whole-genome sequence-based Genome BLAST Distance Phylogeny (GBDP) approach. The GBDP analysis clearly separated 224 strains into 30 clusters, representing eleven known, partially merged species and accordingly 19–20 putative novel species. Additionally, 16S rRNA gene analysis, a novel variant of multi-locus sequence analysis (nMLSA) and screening of virulence genes were performed. The 16S rRNA gene sequence was not sufficient to differentiate the bacteria within this group due to its high conservation. The nMLSA results were consistent with GBDP. Moreover, a fast typing method was proposed using the pycA gene, and where necessary, the ccpA gene. The pXO plasmids and cry genes were widely distributed, suggesting little correlation with the phylogenetic positions of the host bacteria. This might explain why classifications based on virulence characteristics proved unsatisfactory in the past. In summary, this is the first large-scale and systematic study of the taxonomic status of the bacteria within the B. cereus group using whole-genome sequences, and is likely to contribute to further insights into their pathogenicity, phylogeny and adaptation to diverse environments. PMID:26373441

  19. Extending the Bacillus cereus group genomics to putative food-borne pathogens of different toxicity.

    PubMed

    Lapidus, Alla; Goltsman, Eugene; Auger, Sandrine; Galleron, Nathalie; Ségurens, Béatrice; Dossat, Carole; Land, Miriam L; Broussolle, Veronique; Brillard, Julien; Guinebretiere, Marie-Helene; Sanchis, Vincent; Nguen-The, Christophe; Lereclus, Didier; Richardson, Paul; Wincker, Patrick; Weissenbach, Jean; Ehrlich, S Dusko; Sorokin, Alexei

    2008-01-30

    The Bacillus cereus group represents sporulating soil bacteria containing pathogenic strains which may cause diarrheic or emetic food poisoning outbreaks. Multiple locus sequence typing revealed a presence in natural samples of these bacteria of about 30 clonal complexes. Application of genomic methods to this group was however biased due to the major interest for representatives closely related to Bacillus anthracis. Albeit the most important food-borne pathogens were not yet defined, existing data indicate that they are scattered all over the phylogenetic tree. The preliminary analysis of the sequences of three genomes discussed in this paper narrows down the gaps in our knowledge of the B. cereus group. The strain NVH391-98 is a rare but particularly severe food-borne pathogen. Sequencing revealed that the strain should be a representative of a novel bacterial species, for which the name Bacillus cytotoxis or Bacillus cytotoxicus is proposed. This strain has a reduced genome size compared to other B. cereus group strains. Genome analysis revealed absence of sigma B factor and the presence of genes encoding diarrheic Nhe toxin, not detected earlier. The strain B. cereus F837/76 represents a clonal complex close to that of B. anthracis. Including F837/76, three such B. cereus strains had been sequenced. Alignment of genomes suggests that B. anthracis is their common ancestor. Since such strains often emerge from clinical cases, they merit a special attention. The third strain, KBAB4, is a typical facultative psychrophile generally found in soil. Phylogenic studies show that in nature it is the most active group in terms of gene exchange. Genomic sequence revealed high presence of extra-chromosomal genetic material (about 530kb) that may account for this phenomenon. Genes coding Nhe-like toxin were found on a big plasmid in this strain. This may indicate a potential mechanism of toxicity spread from the psychrophile strain community. The results of this genomic

  20. Evidence that viral RNAs have evolved for efficient, two-stage packaging.

    PubMed

    Borodavka, Alexander; Tuma, Roman; Stockley, Peter G

    2012-09-25

    Genome packaging is an essential step in virus replication and a potential drug target. Single-stranded RNA viruses have been thought to encapsidate their genomes by gradual co-assembly with capsid subunits. In contrast, using a single molecule fluorescence assay to monitor RNA conformation and virus assembly in real time, with two viruses from differing structural families, we have discovered that packaging is a two-stage process. Initially, the genomic RNAs undergo rapid and dramatic (approximately 20-30%) collapse of their solution conformations upon addition of cognate coat proteins. The collapse occurs with a substoichiometric ratio of coat protein subunits and is followed by a gradual increase in particle size, consistent with the recruitment of additional subunits to complete a growing capsid. Equivalently sized nonviral RNAs, including high copy potential in vivo competitor mRNAs, do not collapse. They do support particle assembly, however, but yield many aberrant structures in contrast to viral RNAs that make only capsids of the correct size. The collapse is specific to viral RNA fragments, implying that it depends on a series of specific RNA-protein interactions. For bacteriophage MS2, we have shown that collapse is driven by subsequent protein-protein interactions, consistent with the RNA-protein contacts occurring in defined spatial locations. Conformational collapse appears to be a distinct feature of viral RNA that has evolved to facilitate assembly. Aspects of this process mimic those seen in ribosome assembly.

  1. Identification and genome organization of saponin pathway genes from a wild crucifer, and their use for transient production of saponins in Nicotiana benthamiana.

    PubMed

    Khakimov, Bekzod; Kuzina, Vera; Erthmann, Pernille Ø; Fukushima, Ery Odette; Augustin, Jörg M; Olsen, Carl Erik; Scholtalbers, Jelle; Volpin, Hanne; Andersen, Sven Bode; Hauser, Thure P; Muranaka, Toshiya; Bak, Søren

    2015-11-01

    The ability to evolve novel metabolites has been instrumental for the defence of plants against antagonists. A few species in the Barbarea genus are the only crucifers known to produce saponins, some of which make plants resistant to specialist herbivores, like Plutella xylostella, the diamondback moth. Genetic mapping in Barbarea vulgaris revealed that genes for saponin biosynthesis are not clustered but are located in different linkage groups. Using co-location with quantitative trait loci (QTLs) for resistance, transcriptome and genome sequences, we identified two 2,3-oxidosqualene cyclases that form the major triterpenoid backbones. LUP2 mainly produces lupeol, and is preferentially expressed in insect-susceptible B. vulgaris plants, whereas LUP5 produces β-amyrin and α-amyrin, and is preferentially expressed in resistant plants; β-amyrin is the backbone for the resistance-conferring saponins in Barbarea. Two loci for cytochromes P450, predicted to add functional groups to the saponin backbone, were identified: CYP72As co-localized with insect resistance, whereas CYP716As did not. When B. vulgaris sapogenin biosynthesis genes were transiently expressed by CPMV-HT technology in Nicotiana benthamiana, high levels of hydroxylated and carboxylated triterpenoid structures accumulated, including oleanolic acid, which is a precursor of the major resistance-conferring saponins. When the B. vulgaris gene for sapogenin 3-O-glucosylation was co-expressed, the insect deterrent 3-O-oleanolic acid monoglucoside accumulated, as well as triterpene structures with up to six hexoses, demonstrating that N. benthamiana further decorates the monoglucosides. We argue that saponin biosynthesis in the Barbarea genus evolved by a neofunctionalized glucosyl transferase, whereas the difference between resistant and susceptible B. vulgaris chemotypes evolved by different expression of oxidosqualene cyclases (OSCs). © 2015 The Authors The Plant Journal © 2015 John Wiley & Sons

  2. A complete mitochondrial genome of wheat (Triticum aestivum cv. Chinese Yumai), and fast evolving mitochondrial genes in higher plants.

    PubMed

    Cui, Peng; Liu, Huitao; Lin, Qiang; Ding, Feng; Zhuo, Guoyin; Hu, Songnian; Liu, Dongcheng; Yang, Wenlong; Zhan, Kehui; Zhang, Aimin; Yu, Jun

    2009-12-01

    Plant mitochondrial genomes, encoding necessary proteins involved in the system of energy production, play an important role in the development and reproduction of the plant. They occupy a specific evolutionary pattern relative to their nuclear counterparts. Here, we determined the winter wheat (Triticum aestivum cv. Chinese Yumai) mitochondrial genome in a length of 452 and 526 bp by shotgun sequencing its BAC library. It contains 202 genes, including 35 known protein-coding genes, three rRNA and 17 tRNA genes, as well as 149 open reading frames (ORFs; greater than 300 bp in length). The sequence is almost identical to the previously reported sequence of the spring wheat (T. aestivum cv. Chinese Spring); we only identified seven SNPs (three transitions and four transversions) and 10 indels (insertions and deletions) between the two independently acquired sequences, and all variations were found in non-coding regions. This result confirmed the accuracy of the previously reported mitochondrial sequence of the Chinese Spring wheat. The nucleotide frequency and codon usage of wheat are common among the lineage of higher plant with a high AT-content of 58%. Molecular evolutionary analysis demonstrated that plant mitochondrial genomes evolved at different rates, which may correlate with substantial variations in metabolic rate and generation time among plant lineages. In addition, through the estimation of the ratio of non-synonymous to synonymous substitution rates between orthologous mitochondrion-encoded genes of higher plants, we found an accelerated evolutionary rate that seems to be the result of relaxed selection.

  3. Clear: Composition of Likelihoods for Evolve and Resequence Experiments.

    PubMed

    Iranmehr, Arya; Akbari, Ali; Schlötterer, Christian; Bafna, Vineet

    2017-06-01

    The advent of next generation sequencing technologies has made whole-genome and whole-population sampling possible, even for eukaryotes with large genomes. With this development, experimental evolution studies can be designed to observe molecular evolution "in action" via evolve-and-resequence (E&R) experiments. Among other applications, E&R studies can be used to locate the genes and variants responsible for genetic adaptation. Most existing literature on time-series data analysis often assumes large population size, accurate allele frequency estimates, or wide time spans. These assumptions do not hold in many E&R studies. In this article, we propose a method-composition of likelihoods for evolve-and-resequence experiments (Clear)-to identify signatures of selection in small population E&R experiments. Clear takes whole-genome sequences of pools of individuals as input, and properly addresses heterogeneous ascertainment bias resulting from uneven coverage. Clear also provides unbiased estimates of model parameters, including population size, selection strength, and dominance, while being computationally efficient. Extensive simulations show that Clear achieves higher power in detecting and localizing selection over a wide range of parameters, and is robust to variation of coverage. We applied the Clear statistic to multiple E&R experiments, including data from a study of adaptation of Drosophila melanogaster to alternating temperatures and a study of outcrossing yeast populations, and identified multiple regions under selection with genome-wide significance. Copyright © 2017 by the Genetics Society of America.

  4. Genome-wide analyses of the bHLH superfamily in crustaceans: reappraisal of higher-order groupings and evidence for lineage-specific duplications

    PubMed Central

    2018-01-01

    The basic helix-loop-helix (bHLH) proteins represent a key group of transcription factors implicated in numerous eukaryotic developmental and signal transduction processes. Characterization of bHLHs from model species such as humans, fruit flies, nematodes and plants have yielded important information on their functions and evolutionary origin. However, relatively little is known about bHLHs in non-model organisms despite the availability of a vast number of high-throughput sequencing datasets, enabling previously intractable genome-wide and cross-species analyses to be now performed. We extensively searched for bHLHs in 126 crustacean species represented across major Crustacea taxa and identified 3777 putative bHLH orthologues. We have also included seven whole-genome datasets representative of major arthropod lineages to obtain a more accurate prediction of the full bHLH gene complement. With focus on important food crop species from Decapoda, we further defined higher-order groupings and have successfully recapitulated previous observations in other animals. Importantly, we also observed evidence for lineage-specific bHLH expansions in two basal crustaceans (branchiopod and copepod), suggesting a mode of evolution through gene duplication as an adaptation to changing environments. In-depth analysis on bHLH-PAS members confirms the phenomenon coined as ‘modular evolution’ (independently evolved domains) typically seen in multidomain proteins. With the amphipod Parhyale hawaiensis as the exception, our analyses have focused on crustacean transcriptome datasets. Hence, there is a clear requirement for future analyses on whole-genome sequences to overcome potential limitations associated with transcriptome mining. Nonetheless, the present work will serve as a key resource for future mechanistic and biochemical studies on bHLHs in economically important crustacean food crop species. PMID:29657824

  5. Deconstruction of the (Paleo)Polyploid Grapevine Genome Based on the Analysis of Transposition Events Involving NBS Resistance Genes

    PubMed Central

    Cestaro, Alessandro; Sterck, Lieven; Fontana, Paolo; Van de Peer, Yves; Viola, Roberto; Velasco, Riccardo; Salamini, Francesco

    2012-01-01

    Plants have followed a reticulate type of evolution and taxa have frequently merged via allopolyploidization. A polyploid structure of sequenced genomes has often been proposed, but the chromosomes belonging to putative component genomes are difficult to identify. The 19 grapevine chromosomes are evolutionary stable structures: their homologous triplets have strongly conserved gene order, interrupted by rare translocations. The aim of this study is to examine how the grapevine nucleotide-binding site (NBS)-encoding resistance (NBS-R) genes have evolved in the genomic context and to understand mechanisms for the genome evolution. We show that, in grapevine, i) helitrons have significantly contributed to transposition of NBS-R genes, and ii) NBS-R gene cluster similarity indicates the existence of two groups of chromosomes (named as Va and Vc) that may have evolved independently. Chromosome triplets consist of two Va and one Vc chromosomes, as expected from the tetraploid and diploid conditions of the two component genomes. The hexaploid state could have been derived from either allopolyploidy or the separation of the Va and Vc component genomes in the same nucleus before fusion, as known for Rosaceae species. Time estimation indicates that grapevine component genomes may have fused about 60 mya, having had at least 40–60 mya to evolve independently. Chromosome number variation in the Vitaceae and related families, and the gap between the time of eudicot radiation and the age of Vitaceae fossils, are accounted for by our hypothesis. PMID:22253773

  6. Iron-Rich Carbonates as the Potential Source of Evolved CO2 Detected by the Sample Analysis at Mars (SAM) Instrument in Gale Crater

    NASA Technical Reports Server (NTRS)

    Sutter, B.; Heil, E.; Rampe, E. B.; Morris, R. V.; Ming, D. W.; Archer, P. D.; Eigenbrode, J. L.; Franz, H. B.; Glavin, D. P.; McAdam, A. C.; hide

    2015-01-01

    The Sample Analysis at Mars (SAM) instrument detected at least 4 distinct CO2 release during the pyrolysis of a sample scooped from the Rocknest (RN) eolian deposit. The highest peak CO2 release temperature (478-502 C) has been attributed to either a Fe-rich carbonate or nano-phase Mg-carbonate. The objective of this experimental study was to evaluate the thermal evolved gas analysis (T/EGA) characteristics of a series of terrestrial Fe-rich carbonates under analog SAM operating conditions to compare with the RN CO2 releases. Natural Fe-rich carbonates (<53 microns) with varying Fe amounts (Fe(0.66)X(0.34)- to Fe(0.99)X(0.01)-CO3, where X refers to Mg and/or Mn) were selected for T/EGA. The carbonates were heated from 25 to 715 C (35 C/min) and evolved CO2 was measured as a function of temperature. The highest Fe containing carbonates (e.g., Fe(0.99)X(0.01)-CO3) yielded CO2 peak temperatures between 466-487 C, which is consistent with the high temperature RN CO2 release. The lower Fe-bearing carbonates (e.g., Fe(0.66)X(0.34)CO3) did not have peak CO2 release temperatures that matched the RN peak CO2 temperatures; however, their entire CO2 releases did occur within RN temperature range of the high temperature CO2 release. Results from this laboratory analog analysis demonstrate that the high temperature RN CO2 release is consistent with Fe-rich carbonate (approx.0.7 to 1 wt.% FeCO3). The similar RN geochemistry with other materials in Gale Crater and elsewhere on Mars (e.g., Gusev Crater, Meridiani) suggests that up to 1 wt. % Fe-rich carbonate may occur throughout the Gale Crater region and could be widespread on Mars. The Rocknest Fe-carbonate may have formed from the interaction of reduced Fe phases (e.g., Fe2+ bearing olivine) with atmospheric CO2 and transient water. Alternatively, the Rocknest Fe-carbonate could be derived by eolian processes that have eroded distally exposed deep crustal material that possesses Fe-carbonate that may have formed through

  7. "Is It Worth Knowing?" Focus Group Participants' Perceived Utility of Genomic Preconception Carrier Screening.

    PubMed

    Schneider, Jennifer L; Goddard, Katrina A B; Davis, James; Wilfond, Benjamin; Kauffman, Tia L; Reiss, Jacob A; Gilmore, Marian; Himes, Patricia; Lynch, Frances L; Leo, Michael C; McMullen, Carmit

    2016-02-01

    As genome sequencing technology advances, research is needed to guide decision-making about what results can or should be offered to patients in different clinical settings. We conducted three focus groups with individuals who had prior preconception genetic testing experience to explore perceived advantages and disadvantages of genome sequencing for preconception carrier screening, compared to usual care. Using a discussion guide, a trained qualitative moderator facilitated the audio-recorded focus groups. Sixteen individuals participated. Thematic analysis of transcripts started with a grounded approach and subsequently focused on participants' perceptions of the value of genetic information. Analysis uncovered two orientations toward genomic preconception carrier screening: "certain" individuals desiring all possible screening information; and "hesitant" individuals who were more cautious about its value. Participants revealed valuable information about barriers to screening: fear/anxiety about results; concerns about the method of returning results; concerns about screening necessity; and concerns about partner participation. All participants recommended offering choice to patients to enhance the value of screening and reduce barriers. Overall, two groups of likely users of genome sequencing for preconception carrier screening demonstrated different perceptions of the advantages or disadvantages of screening, suggesting tailored approaches to education, consent, and counseling may be warranted with each group.

  8. Complete genome sequence of Bacillus amyloliquefaciens strain Co1-6, a plant growth-promoting rhizobacterium of Calendula officinalis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Köberl, Martina; White, Richard A.; Erschen, Sabine

    The genome sequence of Bacillus amyloliquefaciens strain Co1-6, a plant growth-promoting rhizobacterium (PGPR) with broad-spectrum antagonistic activity against plant-pathogenic fungi, bacteria, and nematodes, consists of a single 3.9-Mb circular chromosome. The genome reveals genes putatively responsible for its promising biocontrol and PGP properties.

  9. Complete genome sequence of Bacillus amyloliquefaciens strain Co1-6, a plant growth-promoting rhizobacterium of Calendula officinalis

    DOE PAGES

    Köberl, Martina; White, Richard A.; Erschen, Sabine; ...

    2015-08-13

    The genome sequence of Bacillus amyloliquefaciens strain Co1-6, a plant growth-promoting rhizobacterium (PGPR) with broad-spectrum antagonistic activity against plant-pathogenic fungi, bacteria, and nematodes, consists of a single 3.9-Mb circular chromosome. The genome reveals genes putatively responsible for its promising biocontrol and PGP properties.

  10. Assignment of simian rotavirus SA11 temperature-sensitive mutant groups B and E to genome segments

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gombold, J.L.; Estes, M.K.; Ramig, R.F.

    1985-05-01

    Recombinant (reassortant) viruses were selected from crosses between temperature-sensitive (ts) mutants of simian rotavirus SA11 and wild-type human rotavirus Wa. The double-stranded genome RNAs of the reassortants were examined by electrophoresis in Tris-glycine-buffered polyacrylamide gels and by dot hybridization with a cloned DNA probe for genome segment 2. Analysis of replacements of genome segments in the reassortants allowed construction of a map correlating genome segments providing functions interchangeable between SA11 and Wa. The reassortants revealed a functional correspondence in order of increasing electrophoretic mobility of genome segments. Analysis of the parental origin of genome segments in ts+ SA11/Wa reassortants derivedmore » from the crosses SA11 tsB(339) X Wa and SA11 tsE(1400) X Wa revealed that the group B lesion of tsB(339) was located on genome segment 3 and the group E lesion of tsE(1400) was on segment 8.« less

  11. Australians' views on personal genomic testing: focus group findings from the Genioz study.

    PubMed

    Metcalfe, Sylvia A; Hickerton, Chriselle; Savard, Jacqueline; Terrill, Bronwyn; Turbitt, Erin; Gaff, Clara; Gray, Kathleen; Middleton, Anna; Wilson, Brenda; Newson, Ainsley J

    2018-04-30

    Personal genomic testing provides healthy individuals with access to information about their genetic makeup for purposes including ancestry, paternity, sporting ability and health. Such tests are available commercially and globally, with accessibility expected to continue to grow, including in Australia; yet little is known of the views/expectations of Australians. Focus groups were conducted within a multi-stage, cross-disciplinary project (Genioz) to explore this. In mid-2015, 56 members of the public participated in seven focus groups, allocated into three age groups: 18-24, 25-49, and ≥50 years. Three researchers coded transcripts independently and generated themes. Awareness of personal genomic testing was low, but most could deduce what "personal genomics" might entail. Very few had heard of the term "direct-to-consumer" testing, which has implications for organisations developing information to support individuals in their decision-making. Participants' understanding of genetics was varied and drawn from several sources. There were diverse perceptions of the relative influence of genetics and environment on health, mental health, behavior, talent, or personality. Views about having a personal genomic test were mixed, with greater interest in health-related tests if they believed there was a reason for doing so. However, many expressed scepticisms about the types of tests available, and how the information might be used; concerns were also raised about privacy and the potential for discrimination. These exploratory findings inform subsequent stages of the Genioz study, thereby contributing to strategies of supporting Australians to understand and make meaningful and well-considered decisions about the benefits, harms, and implications of personal genomic tests.

  12. Spider genomes provide insight into composition and evolution of venom and silk

    PubMed Central

    Sanggaard, Kristian W.; Bechsgaard, Jesper S.; Fang, Xiaodong; Duan, Jinjie; Dyrlund, Thomas F.; Gupta, Vikas; Jiang, Xuanting; Cheng, Ling; Fan, Dingding; Feng, Yue; Han, Lijuan; Huang, Zhiyong; Wu, Zongze; Liao, Li; Settepani, Virginia; Thøgersen, Ida B.; Vanthournout, Bram; Wang, Tobias; Zhu, Yabing; Funch, Peter; Enghild, Jan J.; Schauser, Leif; Andersen, Stig U.; Villesen, Palle; Schierup, Mikkel H; Bilde, Trine; Wang, Jun

    2014-01-01

    Spiders are ecologically important predators with complex venom and extraordinarily tough silk that enables capture of large prey. Here we present the assembled genome of the social velvet spider and a draft assembly of the tarantula genome that represent two major taxonomic groups of spiders. The spider genomes are large with short exons and long introns, reminiscent of mammalian genomes. Phylogenetic analyses place spiders and ticks as sister groups supporting polyphyly of the Acari. Complex sets of venom and silk genes/proteins are identified. We find that venom genes evolved by sequential duplication, and that the toxic effect of venom is most likely activated by proteases present in the venom. The set of silk genes reveals a highly dynamic gene evolution, new types of silk genes and proteins, and a novel use of aciniform silk. These insights create new opportunities for pharmacological applications of venom and biomaterial applications of silk. PMID:24801114

  13. Co-invading symbiotic mutualists of Medicago polymorpha retain high ancestral diversity and contain diverse accessory genomes.

    PubMed

    Porter, Stephanie S; Faber-Hammond, Joshua J; Friesen, Maren L

    2018-01-01

    Exotic, invasive plants and animals can wreak havoc on ecosystems by displacing natives and altering environmental conditions. However, much less is known about the identities or evolutionary dynamics of the symbiotic microbes that accompany invasive species. Most leguminous plants rely upon symbiotic rhizobium bacteria to fix nitrogen and are incapable of colonizing areas devoid of compatible rhizobia. We compare the genomes of symbiotic rhizobia in a portion of the legume's invaded range with those of the rhizobium symbionts from across the legume's native range. We show that in an area of California the legume Medicago polymorpha has invaded, its Ensifer medicae symbionts: (i) exhibit genome-wide patterns of relatedness that together with historical evidence support host-symbiont co-invasion from Europe into California, (ii) exhibit population genomic patterns consistent with the introduction of the majority of deep diversity from the native range, rather than a genetic bottleneck during colonization of California and (iii) harbor a large set of accessory genes uniquely enriched in binding functions, which could play a role in habitat invasion. Examining microbial symbiont genome dynamics during biological invasions is critical for assessing host-symbiont co-invasions whereby microbial symbiont range expansion underlies plant and animal invasions. © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  14. “Highly evolvable malaria vectors: the genomes of 16 Anopheles mosquitoes”

    PubMed Central

    Neafsey, Daniel E.; Waterhouse, Robert M.; Abai, Mohammad R.; Aganezov, Sergey S.; Alekseyev, Max A.; Allen, James E.; Amon, James; Arcà, Bruno; Arensburger, Peter; Artemov, Gleb; Assour, Lauren A.; Basseri, Hamidreza; Berlin, Aaron; Birren, Bruce W.; Blandin, Stephanie A.; Brockman, Andrew I.; Burkot, Thomas R.; Burt, Austin; Chan, Clara S.; Chauve, Cedric; Chiu, Joanna C.; Christensen, Mikkel; Costantini, Carlo; Davidson, Victoria L.M.; Deligianni, Elena; Dottorini, Tania; Dritsou, Vicky; Gabriel, Stacey B.; Guelbeogo, Wamdaogo M.; Hall, Andrew B.; Han, Mira V.; Hlaing, Thaung; Hughes, Daniel S.T.; Jenkins, Adam M.; Jiang, Xiaofang; Jungreis, Irwin; Kakani, Evdoxia G.; Kamali, Maryam; Kemppainen, Petri; Kennedy, Ryan C.; Kirmitzoglou, Ioannis K.; Koekemoer, Lizette L.; Laban, Njoroge; Langridge, Nicholas; Lawniczak, Mara K.N.; Lirakis, Manolis; Lobo, Neil F.; Lowy, Ernesto; MacCallum, Robert M.; Mao, Chunhong; Maslen, Gareth; Mbogo, Charles; McCarthy, Jenny; Michel, Kristin; Mitchell, Sara N.; Moore, Wendy; Murphy, Katherine A.; Naumenko, Anastasia N.; Nolan, Tony; Novoa, Eva M.; O'Loughlin, Samantha; Oringanje, Chioma; Oshaghi, Mohammad A.; Pakpour, Nazzy; Papathanos, Philippos A.; Peery, Ashley N.; Povelones, Michael; Prakash, Anil; Price, David P.; Rajaraman, Ashok; Reimer, Lisa J.; Rinker, David C.; Rokas, Antonis; Russell, Tanya L.; Sagnon, N'Fale; Sharakhova, Maria V.; Shea, Terrance; Simão, Felipe A.; Simard, Frederic; Slotman, Michel A.; Somboon, Pradya; Stegniy, Vladimir; Struchiner, Claudio J.; Thomas, Gregg W.C.; Tojo, Marta; Topalis, Pantelis; Tubio, José M.C.; Unger, Maria F.; Vontas, John; Walton, Catherine; Wilding, Craig S.; Willis, Judith H.; Wu, Yi-Chieh; Yan, Guiyun; Zdobnov, Evgeny M.; Zhou, Xiaofan; Catteruccia, Flaminia; Christophides, George K.; Collins, Frank H.; Cornman, Robert S.; Crisanti, Andrea; Donnelly, Martin J.; Emrich, Scott J.; Fontaine, Michael C.; Gelbart, William; Hahn, Matthew W.; Hansen, Immo A.; Howell, Paul I.; Kafatos, Fotis C.; Kellis, Manolis; Lawson, Daniel; Louis, Christos; Luckhart, Shirley; Muskavitch, Marc A.T.; Ribeiro, José M.; Riehle, Michael A.; Sharakhov, Igor V.; Tu, Zhijian; Zwiebel, Laurence J.; Besansky, Nora J.

    2015-01-01

    Variation in vectorial capacity for human malaria among Anopheles mosquito species is determined by many factors, including behavior, immunity, and life history. To investigate the genomic basis of vectorial capacity and explore new avenues for vector control, we sequenced the genomes of 16 anopheline mosquito species from diverse locations spanning ~100 million years of evolution. Comparative analyses show faster rates of gene gain and loss, elevated gene shuffling on the X chromosome, and more intron losses, relative to Drosophila. Some determinants of vectorial capacity, such as chemosensory genes, do not show elevated turnover, but instead diversify through protein-sequence changes. This dynamism of anopheline genes and genomes may contribute to their flexible capacity to take advantage of new ecological niches, including adapting to humans as primary hosts. PMID:25554792

  15. Evolved stars in the Local Group galaxies - II. AGB, RSG stars and dust production in IC10

    NASA Astrophysics Data System (ADS)

    Dell'Agli, F.; Di Criscienzo, M.; Ventura, P.; Limongi, M.; García-Hernández, D. A.; Marini, E.; Rossi, C.

    2018-06-01

    We study the evolved stellar population of the Local Group galaxy IC10, with the aim of characterizing the individual sources observed and to derive global information on the galaxy, primarily the star formation history and the dust production rate. To this aim, we use evolutionary sequences of low- and intermediate-mass (M < 8 M⊙) stars, evolved through the asymptotic giant branch phase, with the inclusion of the description of dust formation. We also use models of higher mass stars. From the analysis of the distribution of stars in the observational planes obtained with IR bands, we find that the reddening and distance of IC10 are E(B - V) = 1.85 mag and d = 0.77 Mpc, respectively. The evolved stellar population is dominated by carbon stars, that account for 40% of the sources brighter than the tip of the red giant branch. Most of these stars descend from ˜1.1 - 1.3 M⊙ progenitors, formed during the major epoch of star formation, which occurred ˜2.5 Gyr ago. The presence of a significant number of bright stars indicates that IC10 has been site of significant star formation in recent epochs and currently hosts a group of massive stars in the core helium-burning phase. Dust production in this galaxy is largely dominated by carbon stars; the overall dust production rate estimated is 7 × 10-6 M⊙/yr.

  16. Conditional Selection of Genomic Alterations Dictates Cancer Evolution and Oncogenic Dependencies.

    PubMed

    Mina, Marco; Raynaud, Franck; Tavernari, Daniele; Battistello, Elena; Sungalee, Stephanie; Saghafinia, Sadegh; Laessle, Titouan; Sanchez-Vega, Francisco; Schultz, Nikolaus; Oricchio, Elisa; Ciriello, Giovanni

    2017-08-14

    Cancer evolves through the emergence and selection of molecular alterations. Cancer genome profiling has revealed that specific events are more or less likely to be co-selected, suggesting that the selection of one event depends on the others. However, the nature of these evolutionary dependencies and their impact remain unclear. Here, we designed SELECT, an algorithmic approach to systematically identify evolutionary dependencies from alteration patterns. By analyzing 6,456 genomes from multiple tumor types, we constructed a map of oncogenic dependencies associated with cellular pathways, transcriptional readouts, and therapeutic response. Finally, modeling of cancer evolution shows that alteration dependencies emerge only under conditional selection. These results provide a framework for the design of strategies to predict cancer progression and therapeutic response. Copyright © 2017 Elsevier Inc. All rights reserved.

  17. Genomes Behave as Social Entities: Alien Chromatin Minorities Evolve Through Specificities Reduction

    USDA-ARS?s Scientific Manuscript database

    Hybridization and chromosome doubling entailed by allopolyploidization requires genetic and epigenetic modifications, resulting in the adjustment of different genomes to the same nuclear environment. Recently, the main role of retrotransposon/microsatellite-rich regions of the genome in DNA sequenc...

  18. Network analysis of genomic alteration profiles reveals co-altered functional modules and driver genes for glioblastoma.

    PubMed

    Gu, Yunyan; Wang, Hongwei; Qin, Yao; Zhang, Yujing; Zhao, Wenyuan; Qi, Lishuang; Zhang, Yuannv; Wang, Chenguang; Guo, Zheng

    2013-03-01

    The heterogeneity of genetic alterations in human cancer genomes presents a major challenge to advancing our understanding of cancer mechanisms and identifying cancer driver genes. To tackle this heterogeneity problem, many approaches have been proposed to investigate genetic alterations and predict driver genes at the individual pathway level. However, most of these approaches ignore the correlation of alteration events between pathways and miss many genes with rare alterations collectively contributing to carcinogenesis. Here, we devise a network-based approach to capture the cooperative functional modules hidden in genome-wide somatic mutation and copy number alteration profiles of glioblastoma (GBM) from The Cancer Genome Atlas (TCGA), where a module is a set of altered genes with dense interactions in the protein interaction network. We identify 7 pairs of significantly co-altered modules that involve the main pathways known to be altered in GBM (TP53, RB and RTK signaling pathways) and highlight the striking co-occurring alterations among these GBM pathways. By taking into account the non-random correlation of gene alterations, the property of co-alteration could distinguish oncogenic modules that contain driver genes involved in the progression of GBM. The collaboration among cancer pathways suggests that the redundant models and aggravating models could shed new light on the potential mechanisms during carcinogenesis and provide new indications for the design of cancer therapeutic strategies.

  19. The genome sequence of the psychrophilic archaeon, Methanococcoides burtonii: the role of genome evolution in cold adaptation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Allen, Michele A; Lauro, Federico M; Williams, Timothy J

    2009-01-01

    Psychrophilic archaea are abundant and perform critical roles throughout the Earth's expansive cold biosphere. Here we report the first complete genome sequence for a psychrophilic methanogenic archaeon, Methanococcoides burtonii. The genome sequence was manually annotated including the use of a five-tiered evidence rating (ER) system that ranked annotations from ER1 (gene product experimentally characterized from the parent organism) to ER5 (hypothetical gene product) to provide a rapid means of assessing the certainty of gene function predictions. The genome is characterized by a higher level of aberrant sequence composition (51%) than any other archaeon. In comparison to hyper/thermophilic archaea, which aremore » subject to selection of synonymous codon usage, M. burtonii has evolved cold adaptation through a genomic capacity to accommodate highly skewed amino-acid content, while retaining codon usage in common with its mesophilic Methanosarcina cousins. Polysaccharide biosynthesis genes comprise at least 3.3% of protein coding genes in the genome, and Cell wall, membrane, envelope biogenesis COG genes are overrepresented. Likewise, signal transduction (COG category T) genes are overrepresented and M. burtonii has a high 'IQ' (a measure of adaptive potential) compared to many methanogens. Numerous genes in these two overrepresented COG categories appear to have been acquired from - and -Proteobacteria, as do specific genes involved in central metabolism such as a novel B form of aconitase. Transposases also distinguish M. burtonii from other archaea, and their genomic characteristics indicate they have an important role in evolving the M. burtonii genome. Our study reveals a capacity for this model psychrophile to evolve through genome plasticity (including nucleotide skew, horizontal gene transfer and transposase activity) that enables adaptation to the cold, and to the biological and physical changes that have occurred over the last several thousand years

  20. Genome-to-Watershed Predictive Understanding of Terrestrial Environments

    NASA Astrophysics Data System (ADS)

    Hubbard, S. S.; Agarwal, D.; Banfield, J. F.; Beller, H. R.; Brodie, E.; Long, P.; Nico, P. S.; Steefel, C. I.; Tokunaga, T. K.; Williams, K. H.

    2014-12-01

    Although terrestrial environments play a critical role in cycling water, greenhouse gasses, and other life-critical elements, the complexity of interactions among component microbes, plants, minerals, migrating fluids and dissolved constituents hinders predictive understanding of system behavior. The 'Sustainable Systems 2.0' project is developing genome-to-watershed scale predictive capabilities to quantify how the microbiome affects biogeochemical watershed functioning, how watershed-scale hydro-biogeochemical processes affect microbial functioning, and how these interactions co-evolve with climate and land-use changes. Development of such predictive capabilities is critical for guiding the optimal management of water resources, contaminant remediation, carbon stabilization, and agricultural sustainability - now and with global change. Initial investigations are focused on floodplains in the Colorado River Basin, and include iterative model development, experiments and observations with an early emphasis on subsurface aspects. Field experiments include local-scale experiments at Rifle CO to quantify spatiotemporal metabolic and geochemical responses to O2and nitrate amendments as well as floodplain-scale monitoring to quantify genomic and biogeochemical response to natural hydrological perturbations. Information obtained from such experiments are represented within GEWaSC, a Genome-Enabled Watershed Simulation Capability, which is being developed to allow mechanistic interrogation of how genomic information stored in a subsurface microbiome affects biogeochemical cycling. This presentation will describe the genome-to-watershed scale approach as well as early highlights associated with the project. Highlights include: first insights into the diversity of the subsurface microbiome and metabolic roles of organisms involved in subsurface nitrogen, sulfur and hydrogen and carbon cycling; the extreme variability of subsurface DOC and hydrological controls on carbon and

  1. The contribution of co-transcriptional RNA:DNA hybrid structures to DNA damage and genome instability

    PubMed Central

    Hamperl, Stephan; Cimprich, Karlene A.

    2014-01-01

    Accurate DNA replication and DNA repair are crucial for the maintenance of genome stability, and it is generally accepted that failure of these processes is a major source of DNA damage in cells. Intriguingly, recent evidence suggests that DNA damage is more likely to occur at genomic loci with high transcriptional activity. Furthermore, loss of certain RNA processing factors in eukaryotic cells is associated with increased formation of co-transcriptional RNA:DNA hybrid structures known as R-loops, resulting in double-strand breaks (DSBs) and DNA damage. However, the molecular mechanisms by which R-loop structures ultimately lead to DNA breaks and genome instability is not well understood. In this review, we summarize the current knowledge about the formation, recognition and processing of RNA:DNA hybrids, and discuss possible mechanisms by which these structures contribute to DNA damage and genome instability in the cell. PMID:24746923

  2. Organic Combustion in the Presence of Ca-Carbonate and Mg-Perchlorate: A Possible Source for the Low Temperature CO2 Release Seen in Mars Phoenix Thermal and Evolved Gas Analyzer Data

    NASA Technical Reports Server (NTRS)

    Archer, Douglas; Ming, D.; Niles, P.; Sutter, B.; Lauer, H.

    2012-01-01

    Two of the most important discoveries of the Phoenix Lander were the detection of approx.0.6% perchlorate [1] and 3-5% carbonate [2] in landing site soils. The Thermal and Evolved Gas Analyzer (TEGA) instrument on the Phoenix lander could heat samples up to approx.1000 C and monitor evolved gases with a mass spectrometer. TEGA detected a low (approx.350 C) and high (approx.750 C) temperature CO2 release. The high temp release was attributed to the thermal decomposition of Ca-carbonate (calcite). The low temperature CO2 release could be due to desorption of CO2, decomposition of a different carbonate mineral, or the combustion of organic material. A new hypothesis has also been proposed that the low temperature CO2 release could be due to the early breakdown of calcite in the presence of the decomposition products of certain perchlorate salts [3]. We have investigated whether or not this new hypothesis is also compatible with organic combustion. Magnesium perchlorate is stable as Mg(ClO4)2-6H2O on the martian surface [4]. During thermal decomposition, this perchlorate salt releases H2O, Cl2, and O2 gases. The Cl2 can react with water to form HCl which then reacts with calcite, releasing CO2 below the standard thermal decomposition temperature of calcite. However, when using concentrations of perchlorate and calcite similar to what was detected by Phoenix, the ratio of high:low temperature CO2 evolved is much larger in the lab, indicating that although this process might contribute to the low temp CO2 release, it cannot account for all of it. While H2O and Cl2 cause calcite decomposition, the O2 evolved during perchlorate decomposition can lead to the combustion of any reduced carbon present in the sample [5]. We investigate the possible contribution of organic molecules to the low temperature CO2 release seen on Mars.

  3. Genome-wide co-localization of Polycomb orthologs and their effects on gene expression in human fibroblasts

    PubMed Central

    2014-01-01

    Background Polycomb group proteins form multicomponent complexes that are important for establishing lineage-specific patterns of gene expression. Mammalian cells encode multiple permutations of the prototypic Polycomb repressive complex 1 (PRC1) with little evidence for functional specialization. An aim of this study is to determine whether the multiple orthologs that are co-expressed in human fibroblasts act on different target genes and whether their genomic location changes during cellular senescence. Results Deep sequencing of chromatin immunoprecipitated with antibodies against CBX6, CBX7, CBX8, RING1 and RING2 reveals that the orthologs co-localize at multiple sites. PCR-based validation at representative loci suggests that a further six PRC1 proteins have similar binding patterns. Importantly, sequential chromatin immunoprecipitation with antibodies against different orthologs implies that multiple variants of PRC1 associate with the same DNA. At many loci, the binding profiles have a distinctive architecture that is preserved in two different types of fibroblast. Conversely, there are several hundred loci at which PRC1 binding is cell type-specific and, contrary to expectations, the presence of PRC1 does not necessarily equate with transcriptional silencing. Interestingly, the PRC1 binding profiles are preserved in senescent cells despite changes in gene expression. Conclusions The multiple permutations of PRC1 in human fibroblasts congregate at common rather than specific sites in the genome and with overlapping but distinctive binding profiles in different fibroblasts. The data imply that the effects of PRC1 complexes on gene expression are more subtle than simply repressing the loci at which they bind. PMID:24485159

  4. Evolutionary genomics of dog domestication.

    PubMed

    Wayne, Robert K; vonHoldt, Bridgett M

    2012-02-01

    We review the underlying principles and tools used in genomic studies of domestic dogs aimed at understanding the genetic changes that have occurred during domestication. We show that there are two principle modes of evolution within dogs. One primary mode that accounts for much of the remarkable diversity of dog breeds is the fixation of discrete mutations of large effect in individual lineages that are then crossed to various breed groupings. This transfer of mutations across the dog evolutionary tree leads to the appearance of high phenotypic diversity that in actuality reflects a small number of major genes. A second mechanism causing diversification involves the selective breeding of dogs within distinct phenotypic or functional groups, which enhances specific group attributes such as heading or tracking. Such progressive selection leads to a distinct genetic structure in evolutionary trees such that functional and phenotypic groups cluster genetically. We trace the origin of the nuclear genome in dogs based on haplotype-sharing analyses between dogs and gray wolves and show that contrary to previous mtDNA analyses, the nuclear genome of dogs derives primarily from Middle Eastern or European wolves, a result more consistent with the archeological record. Sequencing analysis of the IGF1 gene, which has been the target of size selection in small breeds, further supports this conclusion. Finally, we discuss how a black coat color mutation that evolved in dogs has transformed North American gray wolf populations, providing a first example of a mutation that appeared under domestication and selectively swept through a wild relative.

  5. Investigating the potential for ethnic group harm in collaborative genomics research in Africa: Is ethnic stigmatisation likely?

    PubMed Central

    de Vries, Jantina; Jallow, Muminatou; Williams, Thomas N.; Kwiatkowski, Dominic; Parker, Michael; Fitzpatrick, Raymond

    2013-01-01

    A common assumption in genomics research is that the use of ethnic categories has the potential to lead to ethnic stigmatisation – particularly when the research is done on minority populations. Yet few empirical studies have sought to investigate the relation between genomics and stigma, and fewer still with a focus on Africa. In this paper, we investigate the potential for genomics research to lead to harms to ethnic groups. We carried out 49 semi-structured, open-ended interviews with stakeholders in a current medical genomics research project in Africa, MalariaGEN. Interviews were conducted with MalariaGEN researchers, fieldworkers, members of three ethics committees who reviewed MalariaGEN project proposals, and with members of the two funding bodies providing support to the MalariaGEN project. Interviews were conducted in Kenya, The Gambia and the UK between June 2008 and October 2009. They covered a range of aspects relating to the use of ethnicity in the genomics project, including views on adverse effects of the inclusion of ethnicity in such research. Drawing on the empirical data, we argue that the risk of harm to ethnic groups is likely to be more acute in specific types of genomics research. We develop a typology of research questions and projects that carry a greater risk of harm to the populations included in genomics research. We conclude that the potential of generating harm to ethnic groups in genomics research is present if research includes populations that are already stigmatised or discriminated against, or where the research investigates questions with particular normative implications. We identify a clear need for genomics researchers to take account of the social context of the work they are proposing to do, including understanding the local realities and relations between ethnic groups, and whether diseases are already stigmatised. PMID:22749442

  6. Genome-wide association study for refractive astigmatism reveals genetic co-determination with spherical equivalent refractive error: the CREAM consortium.

    PubMed

    Li, Qing; Wojciechowski, Robert; Simpson, Claire L; Hysi, Pirro G; Verhoeven, Virginie J M; Ikram, Mohammad Kamran; Höhn, René; Vitart, Veronique; Hewitt, Alex W; Oexle, Konrad; Mäkelä, Kari-Matti; MacGregor, Stuart; Pirastu, Mario; Fan, Qiao; Cheng, Ching-Yu; St Pourcain, Beaté; McMahon, George; Kemp, John P; Northstone, Kate; Rahi, Jugnoo S; Cumberland, Phillippa M; Martin, Nicholas G; Sanfilippo, Paul G; Lu, Yi; Wang, Ya Xing; Hayward, Caroline; Polašek, Ozren; Campbell, Harry; Bencic, Goran; Wright, Alan F; Wedenoja, Juho; Zeller, Tanja; Schillert, Arne; Mirshahi, Alireza; Lackner, Karl; Yip, Shea Ping; Yap, Maurice K H; Ried, Janina S; Gieger, Christian; Murgia, Federico; Wilson, James F; Fleck, Brian; Yazar, Seyhan; Vingerling, Johannes R; Hofman, Albert; Uitterlinden, André; Rivadeneira, Fernando; Amin, Najaf; Karssen, Lennart; Oostra, Ben A; Zhou, Xin; Teo, Yik-Ying; Tai, E Shyong; Vithana, Eranga; Barathi, Veluchamy; Zheng, Yingfeng; Siantar, Rosalynn Grace; Neelam, Kumari; Shin, Youchan; Lam, Janice; Yonova-Doing, Ekaterina; Venturini, Cristina; Hosseini, S Mohsen; Wong, Hoi-Suen; Lehtimäki, Terho; Kähönen, Mika; Raitakari, Olli; Timpson, Nicholas J; Evans, David M; Khor, Chiea-Chuen; Aung, Tin; Young, Terri L; Mitchell, Paul; Klein, Barbara; van Duijn, Cornelia M; Meitinger, Thomas; Jonas, Jost B; Baird, Paul N; Mackey, David A; Wong, Tien Yin; Saw, Seang-Mei; Pärssinen, Olavi; Stambolian, Dwight; Hammond, Christopher J; Klaver, Caroline C W; Williams, Cathy; Paterson, Andrew D; Bailey-Wilson, Joan E; Guggenheim, Jeremy A

    2015-02-01

    To identify genetic variants associated with refractive astigmatism in the general population, meta-analyses of genome-wide association studies were performed for: White Europeans aged at least 25 years (20 cohorts, N = 31,968); Asian subjects aged at least 25 years (7 cohorts, N = 9,295); White Europeans aged <25 years (4 cohorts, N = 5,640); and all independent individuals from the above three samples combined with a sample of Chinese subjects aged <25 years (N = 45,931). Participants were classified as cases with refractive astigmatism if the average cylinder power in their two eyes was at least 1.00 diopter and as controls otherwise. Genome-wide association analysis was carried out for each cohort separately using logistic regression. Meta-analysis was conducted using a fixed effects model. In the older European group the most strongly associated marker was downstream of the neurexin-1 (NRXN1) gene (rs1401327, P = 3.92E-8). No other region reached genome-wide significance, and association signals were lower for the younger European group and Asian group. In the meta-analysis of all cohorts, no marker reached genome-wide significance: The most strongly associated regions were, NRXN1 (rs1401327, P = 2.93E-07), TOX (rs7823467, P = 3.47E-07) and LINC00340 (rs12212674, P = 1.49E-06). For 34 markers identified in prior GWAS for spherical equivalent refractive error, the beta coefficients for genotype versus spherical equivalent, and genotype versus refractive astigmatism, were highly correlated (r = -0.59, P = 2.10E-04). This work revealed no consistent or strong genetic signals for refractive astigmatism; however, the TOX gene region previously identified in GWAS for spherical equivalent refractive error was the second most strongly associated region. Analysis of additional markers provided evidence supporting widespread genetic co-susceptibility for spherical and astigmatic refractive errors.

  7. Genome Evolution Due to Allopolyploidization in Wheat

    PubMed Central

    Feldman, Moshe; Levy, Avraham A.

    2012-01-01

    The wheat group has evolved through allopolyploidization, namely, through hybridization among species from the plant genera Aegilops and Triticum followed by genome doubling. This speciation process has been associated with ecogeographical expansion and with domestication. In the past few decades, we have searched for explanations for this impressive success. Our studies attempted to probe the bases for the wide genetic variation characterizing these species, which accounts for their great adaptability and colonizing ability. Central to our work was the investigation of how allopolyploidization alters genome structure and expression. We found in wheat that allopolyploidy accelerated genome evolution in two ways: (1) it triggered rapid genome alterations through the instantaneous generation of a variety of cardinal genetic and epigenetic changes (which we termed “revolutionary” changes), and (2) it facilitated sporadic genomic changes throughout the species’ evolution (i.e., evolutionary changes), which are not attainable at the diploid level. Our major findings in natural and synthetic allopolyploid wheat indicate that these alterations have led to the cytological and genetic diploidization of the allopolyploids. These genetic and epigenetic changes reflect the dynamic structural and functional plasticity of the allopolyploid wheat genome. The significance of this plasticity for the successful establishment of wheat allopolyploids, in nature and under domestication, is discussed. PMID:23135324

  8. Evolutionary genomics of yeast pathogens in the Saccharomycotina

    PubMed Central

    Naranjo-Ortíz, Miguel A.; Marcet-Houben, Marina

    2016-01-01

    Saccharomycotina comprises a diverse group of yeasts that includes numerous species of industrial or clinical relevance. Opportunistic pathogens within this clade are often assigned to the genus Candida but belong to phylogenetically distant lineages that also comprise non-pathogenic species. This indicates that the ability to infect humans has evolved independently several times among Saccharomycotina. Although the mechanisms of infection of the main groups of Candida pathogens are starting to be unveiled, we still lack sufficient understanding of the evolutionary paths that led to a virulent phenotype in each of the pathogenic lineages. Deciphering what genomic changes underlie the evolutionary emergence of a virulence trait will not only aid the discovery of novel virulence mechanisms but it will also provide valuable information to understand how new pathogens emerge, and what clades may pose a future danger. Here we review recent comparative genomics efforts that have revealed possible evolutionary paths to pathogenesis in different lineages, focusing on the main three agents of candidiasis worldwide: Candida albicans, C. parapsilosis and C. glabrata. We will discuss what genomic traits may facilitate the emergence of virulence, and focus on two different genome evolution mechanisms able to generate drastic phenotypic changes and which have been associated to the emergence of virulence: gene family expansion and interspecies hybridization. PMID:27493146

  9. Detection of genomic rearrangements in cucumber using genomecmp software

    NASA Astrophysics Data System (ADS)

    Kulawik, Maciej; Pawełkowicz, Magdalena Ewa; Wojcieszek, Michał; PlÄ der, Wojciech; Nowak, Robert M.

    2017-08-01

    Comparative genomic by increasing information about the genomes sequences available in the databases is a rapidly evolving science. A simple comparison of the general features of genomes such as genome size, number of genes, and chromosome number presents an entry point into comparative genomic analysis. Here we present the utility of the new tool genomecmp for finding rearrangements across the compared sequences and applications in plant comparative genomics.

  10. Are there ergodic limits to evolution? Ergodic exploration of genome space and convergence.

    PubMed

    McLeish, Tom C B

    2015-12-06

    We examine the analogy between evolutionary dynamics and statistical mechanics to include the fundamental question of ergodicity-the representative exploration of the space of possible states (in the case of evolution this is genome space). Several properties of evolutionary dynamics are identified that allow a generalization of the ergodic dynamics, familiar in dynamical systems theory, to evolution. Two classes of evolved biological structure then arise, differentiated by the qualitative duration of their evolutionary time scales. The first class has an ergodicity time scale (the time required for representative genome exploration) longer than available evolutionary time, and has incompletely explored the genotypic and phenotypic space of its possibilities. This case generates no expectation of convergence to an optimal phenotype or possibility of its prediction. The second, more interesting, class exhibits an evolutionary form of ergodicity-essentially all of the structural space within the constraints of slower evolutionary variables have been sampled; the ergodicity time scale for the system evolution is less than the evolutionary time. In this case, some convergence towards similar optima may be expected for equivalent systems in different species where both possess ergodic evolutionary dynamics. When the fitness maximum is set by physical, rather than co-evolved, constraints, it is additionally possible to make predictions of some properties of the evolved structures and systems. We propose four structures that emerge from evolution within genotypes whose fitness is induced from their phenotypes. Together, these result in an exponential speeding up of evolution, when compared with complete exploration of genomic space. We illustrate a possible case of application and a prediction of convergence together with attaining a physical fitness optimum in the case of invertebrate compound eye resolution.

  11. Ancient papillomavirus-host co-speciation in Felidae

    PubMed Central

    Rector, Annabel; Lemey, Philippe; Tachezy, Ruth; Mostmans, Sara; Ghim, Shin-Je; Van Doorslaer, Koenraad; Roelke, Melody; Bush, Mitchell; Montali, Richard J; Joslin, Janis; Burk, Robert D; Jenson, Alfred B; Sundberg, John P; Shapiro, Beth; Van Ranst, Marc

    2007-01-01

    Background Estimating evolutionary rates for slowly evolving viruses such as papillomaviruses (PVs) is not possible using fossil calibrations directly or sequences sampled over a time-scale of decades. An ability to correlate their divergence with a host species, however, can provide a means to estimate evolutionary rates for these viruses accurately. To determine whether such an approach is feasible, we sequenced complete feline PV genomes, previously available only for the domestic cat (Felis domesticus, FdPV1), from four additional, globally distributed feline species: Lynx rufus PV type 1, Puma concolor PV type 1, Panthera leo persica PV type 1, and Uncia uncia PV type 1. Results The feline PVs all belong to the Lambdapapillomavirus genus, and contain an unusual second noncoding region between the early and late protein region, which is only present in members of this genus. Our maximum likelihood and Bayesian phylogenetic analyses demonstrate that the evolutionary relationships between feline PVs perfectly mirror those of their feline hosts, despite a complex and dynamic phylogeographic history. By applying host species divergence times, we provide the first precise estimates for the rate of evolution for each PV gene, with an overall evolutionary rate of 1.95 × 10-8 (95% confidence interval 1.32 × 10-8 to 2.47 × 10-8) nucleotide substitutions per site per year for the viral coding genome. Conclusion Our work provides evidence for long-term virus-host co-speciation of feline PVs, indicating that viral diversity in slowly evolving viruses can be used to investigate host species evolution. These findings, however, should not be extrapolated to other viral lineages without prior confirmation of virus-host co-divergence. PMID:17430578

  12. Genome evolution in Reptilia, the sister group of mammals.

    PubMed

    Janes, Daniel E; Organ, Christopher L; Fujita, Matthew K; Shedlock, Andrew M; Edwards, Scott V

    2010-01-01

    The genomes of birds and nonavian reptiles (Reptilia) are critical for understanding genome evolution in mammals and amniotes generally. Despite decades of study at the chromosomal and single-gene levels, and the evidence for great diversity in genome size, karyotype, and sex chromosome diversity, reptile genomes are virtually unknown in the comparative genomics era. The recent sequencing of the chicken and zebra finch genomes, in conjunction with genome scans and the online publication of the Anolis lizard genome, has begun to clarify the events leading from an ancestral amniote genome--predicted to be large and to possess a diverse repeat landscape on par with mammals and a birdlike sex chromosome system--to the small and highly streamlined genomes of birds. Reptilia exhibit a wide range of evolutionary rates of different subgenomes and, from isochores to mitochondrial DNA, provide a critical contrast to the genomic paradigms established in mammals.

  13. Role of transposon-derived small RNAs in the interplay between genomes and parasitic DNA in rice.

    PubMed

    Nosaka, Misuzu; Itoh, Jun-Ichi; Nagato, Yasuo; Ono, Akemi; Ishiwata, Aiko; Sato, Yutaka

    2012-09-01

    RNA silencing is a defense system against "genomic parasites" such as transposable elements (TE), which are potentially harmful to host genomes. In plants, transcripts from TEs induce production of double-stranded RNAs (dsRNAs) and are processed into small RNAs (small interfering RNAs, siRNAs) that suppress TEs by RNA-directed DNA methylation. Thus, the majority of TEs are epigenetically silenced. On the other hand, most of the eukaryotic genome is composed of TEs and their remnants, suggesting that TEs have evolved countermeasures against host-mediated silencing. Under some circumstances, TEs can become active and increase in copy number. Knowledge is accumulating on the mechanisms of TE silencing by the host; however, the mechanisms by which TEs counteract silencing are poorly understood. Here, we show that a class of TEs in rice produces a microRNA (miRNA) to suppress host silencing. Members of the microRNA820 (miR820) gene family are located within CACTA DNA transposons in rice and target a de novo DNA methyltransferase gene, OsDRM2, one of the components of epigenetic silencing. We confirmed that miR820 negatively regulates the expression of OsDRM2. In addition, we found that expression levels of various TEs are increased quite sensitively in response to decreased OsDRM2 expression and DNA methylation at TE loci. Furthermore, we found that the nucleotide sequence of miR820 and its recognition site within the target gene in some Oryza species have co-evolved to maintain their base-pairing ability. The co-evolution of these sequences provides evidence for the functionality of this regulation. Our results demonstrate how parasitic elements in the genome escape the host's defense machinery. Furthermore, our analysis of the regulation of OsDRM2 by miR820 sheds light on the action of transposon-derived small RNAs, not only as a defense mechanism for host genomes but also as a regulator of interactions between hosts and their parasitic elements.

  14. Navigating yeast genome maintenance with functional genomics.

    PubMed

    Measday, Vivien; Stirling, Peter C

    2016-03-01

    Maintenance of genome integrity is a fundamental requirement of all organisms. To address this, organisms have evolved extremely faithful modes of replication, DNA repair and chromosome segregation to combat the deleterious effects of an unstable genome. Nonetheless, a small amount of genome instability is the driver of evolutionary change and adaptation, and thus a low level of instability is permitted in populations. While defects in genome maintenance almost invariably reduce fitness in the short term, they can create an environment where beneficial mutations are more likely to occur. The importance of this fact is clearest in the development of human cancer, where genome instability is a well-established enabling characteristic of carcinogenesis. This raises the crucial question: what are the cellular pathways that promote genome maintenance and what are their mechanisms? Work in model organisms, in particular the yeast Saccharomyces cerevisiae, has provided the global foundations of genome maintenance mechanisms in eukaryotes. The development of pioneering genomic tools inS. cerevisiae, such as the systematic creation of mutants in all nonessential and essential genes, has enabled whole-genome approaches to identifying genes with roles in genome maintenance. Here, we review the extensive whole-genome approaches taken in yeast, with an emphasis on functional genomic screens, to understand the genetic basis of genome instability, highlighting a range of genetic and cytological screening modalities. By revealing the biological pathways and processes regulating genome integrity, these analyses contribute to the systems-level map of the yeast cell and inform studies of human disease, especially cancer. © The Author 2015. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  15. Expanding genomics of mycorrhizal symbiosis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kuo, Alan; Kohler, Annegret; Martin, Francis M.

    The mycorrhizal symbiosis between soil fungi and plant roots is a ubiquitous mutualism that plays key roles in plant nutrition, soil health, and carbon cycling. The symbiosis evolved repeatedly and independently as multiple morphotypes [e.g., arbuscular mycorrhizae (AM), ectomycorrhizal (ECM)] in multiple fungal clades (e.g., phyla Glomeromycota, Ascomycota, Basidiomycota). The accessibility and cultivability of many mycorrhizal partners make them ideal models for symbiosis studies. Alongside molecular, physiological, and ecological investigations, sequencing led to the first three mycorrhizal fungal genomes, representing two morphotypes and three phyla. The genome of the ECM basidiomycete Laccaria bicolor showed that the mycorrhizal lifestyle can evolvemore » through loss of plant cell wall-degrading enzymes (PCWDEs) and expansion of lineage-specific gene families such as short secreted protein (SSP) effectors. The genome of the ECM ascomycete Tuber melanosporum showed that the ECM type can evolve without expansion of families as in Laccaria, and thus a different set of symbiosis genes. The genome of the AM glomeromycete Rhizophagus irregularis showed that despite enormous phylogenetic distance and morphological difference from the other two fungi, symbiosis can involve similar solutions as symbiosis-induced SSPs and loss of PCWDEs. The three genomes provide a solid base for addressing fundamental questions about the nature and role of a vital mutualism.« less

  16. Expanding genomics of mycorrhizal symbiosis

    DOE PAGES

    Kuo, Alan; Kohler, Annegret; Martin, Francis M.; ...

    2014-11-04

    The mycorrhizal symbiosis between soil fungi and plant roots is a ubiquitous mutualism that plays key roles in plant nutrition, soil health, and carbon cycling. The symbiosis evolved repeatedly and independently as multiple morphotypes [e.g., arbuscular mycorrhizae (AM), ectomycorrhizal (ECM)] in multiple fungal clades (e.g., phyla Glomeromycota, Ascomycota, Basidiomycota). The accessibility and cultivability of many mycorrhizal partners make them ideal models for symbiosis studies. Alongside molecular, physiological, and ecological investigations, sequencing led to the first three mycorrhizal fungal genomes, representing two morphotypes and three phyla. The genome of the ECM basidiomycete Laccaria bicolor showed that the mycorrhizal lifestyle can evolvemore » through loss of plant cell wall-degrading enzymes (PCWDEs) and expansion of lineage-specific gene families such as short secreted protein (SSP) effectors. The genome of the ECM ascomycete Tuber melanosporum showed that the ECM type can evolve without expansion of families as in Laccaria, and thus a different set of symbiosis genes. The genome of the AM glomeromycete Rhizophagus irregularis showed that despite enormous phylogenetic distance and morphological difference from the other two fungi, symbiosis can involve similar solutions as symbiosis-induced SSPs and loss of PCWDEs. The three genomes provide a solid base for addressing fundamental questions about the nature and role of a vital mutualism.« less

  17. A screen for immunity genes evolving under positive selection in Drosophila.

    PubMed

    Jiggins, F M; Kim, K W

    2007-05-01

    Genes involved in the immune system tend to have higher rates of adaptive evolution than other genes in the genome, probably because they are coevolving with pathogens. We have screened a sample of Drosophila genes to identify those evolving under positive selection. First, we identified rapidly evolving immunity genes by comparing 140 loci in Drosophila erecta and D. yakuba. Secondly, we resequenced 23 of the fastest evolving genes from the independent species pair D. melanogaster and D. simulans, and identified those under positive selection using a McDonald-Kreitman test. There was strong evidence of adaptive evolution in two serine proteases (persephone and spirit) and a homolog of the Anopheles serpin SRPN6, and weaker evidence in another serine protease and the death domain protein dFADD. These results add to mounting evidence that immune signalling pathway molecules often evolve rapidly, possibly because they are sites of host-parasite coevolution.

  18. Genome-wide analysis of the SBP-box gene family in Chinese cabbage (Brassica rapa subsp. pekinensis).

    PubMed

    Tan, Hua-Wei; Song, Xiao-Ming; Duan, Wei-Ke; Wang, Yan; Hou, Xi-Lin

    2015-11-01

    The SQUAMOSA PROMOTER BINDING PROTEIN (SBP)-box gene family contains highly conserved plant-specific transcription factors that play an important role in plant development, especially in flowering. Chinese cabbage (Brassica rapa subsp. pekinensis) is a leafy vegetable grown worldwide and is used as a model crop for research in genome duplication. The present study aimed to characterize the SBP-box transcription factor genes in Chinese cabbage. Twenty-nine SBP-box genes were identified in the Chinese cabbage genome and classified into six groups. We identified 23 orthologous and 5 co-orthologous SBP-box gene pairs between Chinese cabbage and Arabidopsis. An interaction network among these genes was constructed. Sixteen SBP-box genes were expressed more abundantly in flowers than in other tissues, suggesting their involvement in flowering. We show that the MiR156/157 family members may regulate the coding regions or 3'-UTR regions of Chinese cabbage SBP-box genes. As SBP-box genes were found to potentially participate in some plant development pathways, quantitative real-time PCR analysis was performed and showed that Chinese cabbage SBP-box genes were also sensitive to the exogenous hormones methyl jasmonic acid and salicylic acid. The SBP-box genes have undergone gene duplication and loss, evolving a more refined regulation for diverse stimulation in plant tissues. Our comprehensive genome-wide analysis provides insights into the SBP-box gene family of Chinese cabbage.

  19. The Genome Sequence of the psychrophilic archaeon, Methanococcoides burtonii: the Role of Genome Evolution in Cold-adaptation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Allen, Michelle A.; Lauro, Federico M.; Williams, Timothy J.

    2009-04-01

    Psychrophilic archaea are abundant and perform critical roles throughout the Earth's expansive cold biosphere. Here we report the first complete genome sequence for a psychrophilic methanogenic archaeon, Methanococcoides burtonii. The genome sequence was manually annotated including the use of a five tiered Evidence Rating system that ranked annotations from Evidence Rating (ER) 1 (gene product experimentally characterized from the parent organism) to ER5 (hypothetical gene product) to provide a rapid means of assessing the certainty of gene function predictions. The genome is characterized by a higher level of aberrant sequence composition (51%) than any other archaeon. In comparison to hyper/thermophilicmore » archaea which are subject to selection of synonymous codon usage, M. burtonii has evolved cold adaptation through a genomic capacity to accommodate highly skewed amino acid content, while retaining codon usage in common with its mesophilic Methanosarcina cousins. Polysaccharide biosynthesis genes comprise at least 3.3% of protein coding genes in the genome, and Cell wall/membrane/envelope biogenesis COG genes are over-represented. Likewise, signal transduction (COG category T) genes are over-represented and M. burtonii has a high 'IQ' (a measure of adaptive potential) compared to many methanogens. Numerous genes in these two over-represented COG categories appear to have been acquired from {var_epsilon}- and {delta}-proteobacteria, as do specific genes involved in central metabolism such as a novel B form of aconitase. Transposases also distinguish M. burtonii from other archaea, and their genomic characteristics indicate they play an important role in evolving the M. burtonii genome. Our study reveals a capacity for this model psychrophile to evolve through genome plasticity (including nucleotide skew, horizontal gene transfer and transposase activity) that enables adaptation to the cold, and to the biological and physical changes that have occurred over

  20. Interrogation of Mammalian Protein Complex Structure, Function, and Membership Using Genome-Scale Fitness Screens.

    PubMed

    Pan, Joshua; Meyers, Robin M; Michel, Brittany C; Mashtalir, Nazar; Sizemore, Ann E; Wells, Jonathan N; Cassel, Seth H; Vazquez, Francisca; Weir, Barbara A; Hahn, William C; Marsh, Joseph A; Tsherniak, Aviad; Kadoch, Cigall

    2018-05-23

    Protein complexes are assemblies of subunits that have co-evolved to execute one or many coordinated functions in the cellular environment. Functional annotation of mammalian protein complexes is critical to understanding biological processes, as well as disease mechanisms. Here, we used genetic co-essentiality derived from genome-scale RNAi- and CRISPR-Cas9-based fitness screens performed across hundreds of human cancer cell lines to assign measures of functional similarity. From these measures, we systematically built and characterized functional similarity networks that recapitulate known structural and functional features of well-studied protein complexes and resolve novel functional modules within complexes lacking structural resolution, such as the mammalian SWI/SNF complex. Finally, by integrating functional networks with large protein-protein interaction networks, we discovered novel protein complexes involving recently evolved genes of unknown function. Taken together, these findings demonstrate the utility of genetic perturbation screens alone, and in combination with large-scale biophysical data, to enhance our understanding of mammalian protein complexes in normal and disease states. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  1. ConGEMs: Condensed Gene Co-Expression Module Discovery Through Rule-Based Clustering and Its Application to Carcinogenesis.

    PubMed

    Mallik, Saurav; Zhao, Zhongming

    2017-12-28

    For transcriptomic analysis, there are numerous microarray-based genomic data, especially those generated for cancer research. The typical analysis measures the difference between a cancer sample-group and a matched control group for each transcript or gene. Association rule mining is used to discover interesting item sets through rule-based methodology. Thus, it has advantages to find causal effect relationships between the transcripts. In this work, we introduce two new rule-based similarity measures-weighted rank-based Jaccard and Cosine measures-and then propose a novel computational framework to detect condensed gene co-expression modules ( C o n G E M s) through the association rule-based learning system and the weighted similarity scores. In practice, the list of evolved condensed markers that consists of both singular and complex markers in nature depends on the corresponding condensed gene sets in either antecedent or consequent of the rules of the resultant modules. In our evaluation, these markers could be supported by literature evidence, KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway and Gene Ontology annotations. Specifically, we preliminarily identified differentially expressed genes using an empirical Bayes test. A recently developed algorithm-RANWAR-was then utilized to determine the association rules from these genes. Based on that, we computed the integrated similarity scores of these rule-based similarity measures between each rule-pair, and the resultant scores were used for clustering to identify the co-expressed rule-modules. We applied our method to a gene expression dataset for lung squamous cell carcinoma and a genome methylation dataset for uterine cervical carcinogenesis. Our proposed module discovery method produced better results than the traditional gene-module discovery measures. In summary, our proposed rule-based method is useful for exploring biomarker modules from transcriptomic data.

  2. Genome characterization, antigenicity and pathogenicity of a novel infectious bronchitis virus type isolated from south China.

    PubMed

    Jiang, Lei; Zhao, Wenjun; Han, Zongxi; Chen, Yuqiu; Zhao, Yan; Sun, Junfeng; Li, Huixin; Shao, Yuhao; Liu, Liangliang; Liu, Shengwang

    2017-10-01

    In 2014, three infectious bronchitis virus (IBV) strains, designated as γCoV/ck/China/I0111/14, γCoV/ck/China/I0114/14 and γCoV/ck/China/I0118/14, were isolated and identified from chickens suspected to be infected with IBV in Guangxi province, China. Based upon data arising from S1 sequence and phylogenetic analyses, the three IBV isolates were genetically different from other known IBV types, which represented a novel genotype (GI-29). Virus cross-neutralization tests, using γCoV/ck/China/I0111/14 as a representative, showed that genotype GI-29 was antigenically different from all other known IBV types, thus representing a novel serotype. Complete genomic analysis showed that GI-29 type viruses were closely related to and might originate from a GX-YL5-like virus by accumulation of substitutions in multiple genes. These GI-29 viral genomes are still evolving and diverging, particularly in the 3' region, although we cannot rule out the possibility of recombination events occurring. For isolate γCoV/ck/China/I0114/14, we found that recombination events had occurred between nsps 2 and 3 in gene 1 which led to the introduction of a 4/91 gene fragment into the γCoV/ck/China/I0114/14 viral genome. In addition, we found that the GI-29 type γCoV/ck/China/I0111/14 isolate was a nephropathogenic strain and high pathogenic to 1-day-old specific pathogen-free (SPF) chickens although cystic oviducts were not observed in the surviving layer chickens challenged with γCoV/ck/China/I0111/14 isolate. Copyright © 2017 Elsevier B.V. All rights reserved.

  3. Whole-genome phylogenies of the family Bacillaceae and expansion of the sigma factor gene family in the Bacillus cereus species-group

    PubMed Central

    2011-01-01

    Background The Bacillus cereus sensu lato group consists of six species (B. anthracis, B. cereus, B. mycoides, B. pseudomycoides, B. thuringiensis, and B. weihenstephanensis). While classical microbial taxonomy proposed these organisms as distinct species, newer molecular phylogenies and comparative genome sequencing suggests that these organisms should be classified as a single species (thus, we will refer to these organisms collectively as the Bc species-group). How do we account for the underlying similarity of these phenotypically diverse microbes? It has been established for some time that the most rapidly evolving and evolutionarily flexible portions of the bacterial genome are regulatory sequences and transcriptional networks. Other studies have suggested that the sigma factor gene family of these organisms has diverged and expanded significantly relative to their ancestors; sigma factors are those portions of the bacterial transcriptional apparatus that control RNA polymerase recognition for promoter selection. Thus, examining sigma factor divergence in these organisms would concurrently examine both regulatory sequences and transcriptional networks important for divergence. We began this examination by comparison to the sigma factor gene set of B. subtilis. Results Phylogenetic analysis of the Bc species-group utilizing 157 single-copy genes of the family Bacillaceae suggests that several taxonomic revisions of the genus Bacillus should be considered. Within the Bc species-group there is little indication that the currently recognized species form related sub-groupings, suggesting that they are members of the same species. The sigma factor gene family encoded by the Bc species-group appears to be the result of a dynamic gene-duplication and gene-loss process that in previous analyses underestimated the true heterogeneity of the sigma factor content in the Bc species-group. Conclusions Expansion of the sigma factor gene family appears to have preferentially

  4. Genomics for paediatricians: promises and pitfalls.

    PubMed

    Hammond, Carrie Louise; Willoughby, Josh Matthew; Parker, Michael James

    2018-03-24

    In recent years, there have been significant advances in genetic technologies, evolving the field of genomics from genetics. This has huge diagnostic potential, as genomic testing increasingly becomes part of mainstream medicine. However, there are numerous potential pitfalls in the interpretation of genomic data. It is therefore essential that we educate clinicians more widely about the appropriate interpretation and utilisation of genomic testing. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  5. Complete Genome Sequences of Four Novel Escherichia coli Bacteriophages Belonging to New Phage Groups

    PubMed Central

    Kot, Witold

    2015-01-01

    Here, we describe the sequencing and genome annotations of a set of four Escherichia coli bacteriophages (phages) belonging to newly discovered groups previously consisting of only a single phage and thus expand our knowledge of these phage groups. PMID:26184932

  6. The Transcriptome of the Reference Potato Genome Solanum tuberosum Group Phureja Clone DM1-3 516R44

    PubMed Central

    Massa, Alicia N.; Childs, Kevin L.; Lin, Haining; Bryan, Glenn J.; Giuliano, Giovanni; Buell, C. Robin

    2011-01-01

    Advances in molecular breeding in potato have been limited by its complex biological system, which includes vegetative propagation, autotetraploidy, and extreme heterozygosity. The availability of the potato genome and accompanying gene complement with corresponding gene structure, location, and functional annotation are powerful resources for understanding this complex plant and advancing molecular breeding efforts. Here, we report a reference for the potato transcriptome using 32 tissues and growth conditions from the doubled monoploid Solanum tuberosum Group Phureja clone DM1-3 516R44 for which a genome sequence is available. Analysis of greater than 550 million RNA-Seq reads permitted the detection and quantification of expression levels of over 22,000 genes. Hierarchical clustering and principal component analyses captured the biological variability that accounts for gene expression differences among tissues suggesting tissue-specific gene expression, and genes with tissue or condition restricted expression. Using gene co-expression network analysis, we identified 18 gene modules that represent tissue-specific transcriptional networks of major potato organs and developmental stages. This information provides a powerful resource for potato research as well as studies on other members of the Solanaceae family. PMID:22046362

  7. Convergent evolution of the genomes of marine mammals

    USGS Publications Warehouse

    Foote, Andrew D.; Liu, Yue; Thomas, Gregg W.C.; Vinař, Tomáš; Alföldi, Jessica; Deng, Jixin; Dugan, Shannon; van Elk, Cornelis E.; Hunter, Margaret; Joshi, Vandita; Khan, Ziad; Kovar, Christie; Lee, Sandra L.; Lindblad-Toh, Kerstin; Mancia, Annalaura; Nielsen, Rasmus; Qin, Xiang; Qu, Jiaxin; Raney, Brian J.; Vijay, Nagarjun; Wolf, Jochen B. W.; Hahn, Matthew W.; Muzny, Donna M.; Worley, Kim C.; Gilbert, M. Thomas P.; Gibbs, Richard A.

    2015-01-01

    Marine mammals from different mammalian orders share several phenotypic traits adapted to the aquatic environment and therefore represent a classic example of convergent evolution. To investigate convergent evolution at the genomic level, we sequenced and performed de novo assembly of the genomes of three species of marine mammals (the killer whale, walrus and manatee) from three mammalian orders that share independently evolved phenotypic adaptations to a marine existence. Our comparative genomic analyses found that convergent amino acid substitutions were widespread throughout the genome and that a subset of these substitutions were in genes evolving under positive selection and putatively associated with a marine phenotype. However, we found higher levels of convergent amino acid substitutions in a control set of terrestrial sister taxa to the marine mammals. Our results suggest that, whereas convergent molecular evolution is relatively common, adaptive molecular convergence linked to phenotypic convergence is comparatively rare.

  8. Convergent evolution of the genomes of marine mammals

    PubMed Central

    Foote, Andrew D.; Liu, Yue; Thomas, Gregg W.C.; Vinař, Tomáš; Alföldi, Jessica; Deng, Jixin; Dugan, Shannon; van Elk, Cornelis E.; Hunter, Margaret E.; Joshi, Vandita; Khan, Ziad; Kovar, Christie; Lee, Sandra L.; Lindblad-Toh, Kerstin; Mancia, Annalaura; Nielsen, Rasmus; Qin, Xiang; Qu, Jiaxin; Raney, Brian J.; Vijay, Nagarjun; Wolf, Jochen B. W.; Hahn, Matthew W.; Muzny, Donna M.; Worley, Kim C.; Gilbert, M. Thomas P.; Gibbs, Richard A.

    2015-01-01

    Marine mammals from different mammalian orders share several phenotypic traits adapted to the aquatic environment and are therefore a classic example of convergent evolution. To investigate convergent evolution at the genomic level, we sequenced and de novo assembled the genomes of three species of marine mammals (the killer whale, walrus and manatee) from three mammalian orders that share independently evolved phenotypic adaptations to a marine existence. Our comparative genomic analyses found that convergent amino acid substitutions were widespread throughout the genome, and that a subset were in genes evolving under positive selection and putatively associated with a marine phenotype. However, we found higher levels of convergent amino acid substitutions in a control set of terrestrial sister taxa to the marine mammals. Our results suggest that while convergent molecular evolution is relatively common, adaptive molecular convergence linked to phenotypic convergence is comparatively rare. PMID:25621460

  9. Whole genome co-expression analysis of soybean cytochrome P450 genes identifies nodulation-specific P450 monooxygenases

    PubMed Central

    2010-01-01

    Background Cytochrome P450 monooxygenases (P450s) catalyze oxidation of various substrates using oxygen and NAD(P)H. Plant P450s are involved in the biosynthesis of primary and secondary metabolites performing diverse biological functions. The recent availability of the soybean genome sequence allows us to identify and analyze soybean putative P450s at a genome scale. Co-expression analysis using an available soybean microarray and Illumina sequencing data provides clues for functional annotation of these enzymes. This approach is based on the assumption that genes that have similar expression patterns across a set of conditions may have a functional relationship. Results We have identified a total number of 332 full-length P450 genes and 378 pseudogenes from the soybean genome. From the full-length sequences, 195 genes belong to A-type, which could be further divided into 20 families. The remaining 137 genes belong to non-A type P450s and are classified into 28 families. A total of 178 probe sets were found to correspond to P450 genes on the Affymetrix soybean array. Out of these probe sets, 108 represented single genes. Using the 28 publicly available microarray libraries that contain organ-specific information, some tissue-specific P450s were identified. Similarly, stress responsive soybean P450s were retrieved from 99 microarray soybean libraries. We also utilized Illumina transcriptome sequencing technology to analyze the expressions of all 332 soybean P450 genes. This dataset contains total RNAs isolated from nodules, roots, root tips, leaves, flowers, green pods, apical meristem, mock-inoculated and Bradyrhizobium japonicum-infected root hair cells. The tissue-specific expression patterns of these P450 genes were analyzed and the expression of a representative set of genes were confirmed by qRT-PCR. We performed the co-expression analysis on many of the 108 P450 genes on the Affymetrix arrays. First we confirmed that CYP93C5 (an isoflavone synthase gene) is

  10. Genomic comparison of closely related Giant Viruses supports an accordion-like model of evolution.

    PubMed

    Filée, Jonathan

    2015-01-01

    Genome gigantism occurs so far in Phycodnaviridae and Mimiviridae (order Megavirales). Origin and evolution of these Giant Viruses (GVs) remain open questions. Interestingly, availability of a collection of closely related GV genomes enabling genomic comparisons offer the opportunity to better understand the different evolutionary forces acting on these genomes. Whole genome alignment for five groups of viruses belonging to the Mimiviridae and Phycodnaviridae families show that there is no trend of genome expansion or general tendency of genome contraction. Instead, GV genomes accumulated genomic mutations over the time with gene gains compensating the different losses. In addition, each lineage displays specific patterns of genome evolution. Mimiviridae (megaviruses and mimiviruses) and Chlorella Phycodnaviruses evolved mainly by duplications and losses of genes belonging to large paralogous families (including movements of diverse mobiles genetic elements), whereas Micromonas and Ostreococcus Phycodnaviruses derive most of their genetic novelties thought lateral gene transfers. Taken together, these data support an accordion-like model of evolution in which GV genomes have undergone successive steps of gene gain and gene loss, accrediting the hypothesis that genome gigantism appears early, before the diversification of the different GV lineages.

  11. Are there ergodic limits to evolution? Ergodic exploration of genome space and convergence

    PubMed Central

    McLeish, Tom C. B.

    2015-01-01

    We examine the analogy between evolutionary dynamics and statistical mechanics to include the fundamental question of ergodicity—the representative exploration of the space of possible states (in the case of evolution this is genome space). Several properties of evolutionary dynamics are identified that allow a generalization of the ergodic dynamics, familiar in dynamical systems theory, to evolution. Two classes of evolved biological structure then arise, differentiated by the qualitative duration of their evolutionary time scales. The first class has an ergodicity time scale (the time required for representative genome exploration) longer than available evolutionary time, and has incompletely explored the genotypic and phenotypic space of its possibilities. This case generates no expectation of convergence to an optimal phenotype or possibility of its prediction. The second, more interesting, class exhibits an evolutionary form of ergodicity—essentially all of the structural space within the constraints of slower evolutionary variables have been sampled; the ergodicity time scale for the system evolution is less than the evolutionary time. In this case, some convergence towards similar optima may be expected for equivalent systems in different species where both possess ergodic evolutionary dynamics. When the fitness maximum is set by physical, rather than co-evolved, constraints, it is additionally possible to make predictions of some properties of the evolved structures and systems. We propose four structures that emerge from evolution within genotypes whose fitness is induced from their phenotypes. Together, these result in an exponential speeding up of evolution, when compared with complete exploration of genomic space. We illustrate a possible case of application and a prediction of convergence together with attaining a physical fitness optimum in the case of invertebrate compound eye resolution. PMID:26640648

  12. Homing endonucleases from mobile group I introns: discovery to genome engineering

    PubMed Central

    2014-01-01

    Homing endonucleases are highly specific DNA cleaving enzymes that are encoded within genomes of all forms of microbial life including phage and eukaryotic organelles. These proteins drive the mobility and persistence of their own reading frames. The genes that encode homing endonucleases are often embedded within self-splicing elements such as group I introns, group II introns and inteins. This combination of molecular functions is mutually advantageous: the endonuclease activity allows surrounding introns and inteins to act as invasive DNA elements, while the splicing activity allows the endonuclease gene to invade a coding sequence without disrupting its product. Crystallographic analyses of representatives from all known homing endonuclease families have illustrated both their mechanisms of action and their evolutionary relationships to a wide range of host proteins. Several homing endonucleases have been completely redesigned and used for a variety of genome engineering applications. Recent efforts to augment homing endonucleases with auxiliary DNA recognition elements and/or nucleic acid processing factors has further accelerated their use for applications that demand exceptionally high specificity and activity. PMID:24589358

  13. Arthropod phylogenetics in light of three novel millipede (myriapoda: diplopoda) mitochondrial genomes with comments on the appropriateness of mitochondrial genome sequence data for inferring deep level relationships.

    PubMed

    Brewer, Michael S; Swafford, Lynn; Spruill, Chad L; Bond, Jason E

    2013-01-01

    Arthropods are the most diverse group of eukaryotic organisms, but their phylogenetic relationships are poorly understood. Herein, we describe three mitochondrial genomes representing orders of millipedes for which complete genomes had not been characterized. Newly sequenced genomes are combined with existing data to characterize the protein coding regions of myriapods and to attempt to reconstruct the evolutionary relationships within the Myriapoda and Arthropoda. The newly sequenced genomes are similar to previously characterized millipede sequences in terms of synteny and length. Unique translocations occurred within the newly sequenced taxa, including one half of the Appalachioria falcifera genome, which is inverted with respect to other millipede genomes. Across myriapods, amino acid conservation levels are highly dependent on the gene region. Additionally, individual loci varied in the level of amino acid conservation. Overall, most gene regions showed low levels of conservation at many sites. Attempts to reconstruct the evolutionary relationships suffered from questionable relationships and low support values. Analyses of phylogenetic informativeness show the lack of signal deep in the trees (i.e., genes evolve too quickly). As a result, the myriapod tree resembles previously published results but lacks convincing support, and, within the arthropod tree, well established groups were recovered as polyphyletic. The novel genome sequences described herein provide useful genomic information concerning millipede groups that had not been investigated. Taken together with existing sequences, the variety of compositions and evolution of myriapod mitochondrial genomes are shown to be more complex than previously thought. Unfortunately, the use of mitochondrial protein-coding regions in deep arthropod phylogenetics appears problematic, a result consistent with previously published studies. Lack of phylogenetic signal renders the resulting tree topologies as suspect

  14. Evolving mobile robots able to display collective behaviors.

    PubMed

    Baldassarre, Gianluca; Nolfi, Stefano; Parisi, Domenico

    2003-01-01

    We present a set of experiments in which simulated robots are evolved for the ability to aggregate and move together toward a light target. By developing and using quantitative indexes that capture the structural properties of the emerged formations, we show that evolved individuals display interesting behavioral patterns in which groups of robots act as a single unit. Moreover, evolved groups of robots with identical controllers display primitive forms of situated specialization and play different behavioral functions within the group according to the circumstances. Overall, the results presented in the article demonstrate that evolutionary techniques, by exploiting the self-organizing behavioral properties that emerge from the interactions between the robots and between the robots and the environment, are a powerful method for synthesizing collective behavior.

  15. Multiplex Degenerate Primer Design for Targeted Whole Genome Amplification of Many Viral Genomes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gardner, Shea N.; Jaing, Crystal J.; Elsheikh, Maher M.

    Background . Targeted enrichment improves coverage of highly mutable viruses at low concentration in complex samples. Degenerate primers that anneal to conserved regions can facilitate amplification of divergent, low concentration variants, even when the strain present is unknown. Results . A tool for designing multiplex sets of degenerate sequencing primers to tile overlapping amplicons across multiple whole genomes is described. The new script, run_tiled_primers, is part of the PriMux software. Primers were designed for each segment of South American hemorrhagic fever viruses, tick-borne encephalitis, Henipaviruses, Arenaviruses, Filoviruses, Crimean-Congo hemorrhagic fever virus, Rift Valley fever virus, and Japanese encephalitis virus. Eachmore » group is highly diverse with as little as 5% genome consensus. Primer sets were computationally checked for nontarget cross reactions against the NCBI nucleotide sequence database. Primers for murine hepatitis virus were demonstrated in the lab to specifically amplify selected genes from a laboratory cultured strain that had undergone extensive passage in vitro and in vivo. Conclusions . This software should help researchers design multiplex sets of primers for targeted whole genome enrichment prior to sequencing to obtain better coverage of low titer, divergent viruses. Applications include viral discovery from a complex background and improved sensitivity and coverage of rapidly evolving strains or variants in a gene family.« less

  16. Multiplex Degenerate Primer Design for Targeted Whole Genome Amplification of Many Viral Genomes

    DOE PAGES

    Gardner, Shea N.; Jaing, Crystal J.; Elsheikh, Maher M.; ...

    2014-01-01

    Background . Targeted enrichment improves coverage of highly mutable viruses at low concentration in complex samples. Degenerate primers that anneal to conserved regions can facilitate amplification of divergent, low concentration variants, even when the strain present is unknown. Results . A tool for designing multiplex sets of degenerate sequencing primers to tile overlapping amplicons across multiple whole genomes is described. The new script, run_tiled_primers, is part of the PriMux software. Primers were designed for each segment of South American hemorrhagic fever viruses, tick-borne encephalitis, Henipaviruses, Arenaviruses, Filoviruses, Crimean-Congo hemorrhagic fever virus, Rift Valley fever virus, and Japanese encephalitis virus. Eachmore » group is highly diverse with as little as 5% genome consensus. Primer sets were computationally checked for nontarget cross reactions against the NCBI nucleotide sequence database. Primers for murine hepatitis virus were demonstrated in the lab to specifically amplify selected genes from a laboratory cultured strain that had undergone extensive passage in vitro and in vivo. Conclusions . This software should help researchers design multiplex sets of primers for targeted whole genome enrichment prior to sequencing to obtain better coverage of low titer, divergent viruses. Applications include viral discovery from a complex background and improved sensitivity and coverage of rapidly evolving strains or variants in a gene family.« less

  17. The mitochondrial genome of booklouse, Liposcelis sculptilis (Psocoptera: Liposcelididae) and the evolutionary timescale of Liposcelis

    PubMed Central

    Shi, Yan; Chu, Qing; Wei, Dan-Dan; Qiu, Yuan-Jian; Shang, Feng; Dou, Wei; Wang, Jin-Jun

    2016-01-01

    Bilateral animals are featured by an extremely compact mitochondrial (mt) genome with 37 genes on a single circular chromosome. To date, the complete mt genome has only been determined for four species of Liposcelis, a genus with economic importance, including L. entomophila, L. decolor, L. bostrychophila, and L. paeta. They belong to A, B, or D group of Liposcelis, respectively. Unlike most bilateral animals, L. bostrychophila, L. entomophila and L. paeta have a bitipartite mt genome with genes on two chromosomes. However, the mt genome of L. decolor has the typical mt chromosome of bilateral animals. Here, we sequenced the mt genome of L. sculptilis, and identified 35 genes, which were on a single chromosome. The mt genome fragmentation is not shared by the D group of Liposcelis and the single chromosome of L. sculptilis differed from those of booklice known in gene content and gene arrangement. We inferred that different evolutionary patterns and rate existed in Liposcelis. Further, we reconstructed the evolutionary history of 21 psocodean taxa with phylogenetic analyses, which suggested that Liposcelididae and Phthiraptera have evolved 134 Ma and the sucking lice diversified in the Late Cretaceous. PMID:27470659

  18. Biodegradation of Poly(butylene succinate) Powder in a Controlled Compost at 58 °C Evaluated by Naturally-Occurring Carbon 14 Amounts in Evolved CO2 Based on the ISO 14855-2 Method

    PubMed Central

    Kunioka, Masao; Ninomiya, Fumi; Funabashi, Masahiro

    2009-01-01

    The biodegradabilities of poly(butylene succinate) (PBS) powders in a controlled compost at 58 °C have been studied using a Microbial Oxidative Degradation Analyzer (MODA) based on the ISO 14855-2 method, entitled “Determination of the ultimate aerobic biodegradability of plastic materials under controlled composting conditions—Method by analysis of evolved carbon dioxide—Part 2: Gravimetric measurement of carbon dioxide evolved in a laboratory-scale test”. The evolved CO2 was trapped by an additional aqueous Ba(OH)2 solution. The trapped BaCO3 was transformed into graphite via a serial vaporization and reduction reaction using a gas-tight tube and vacuum manifold system. This graphite was analyzed by accelerated mass spectrometry (AMS) to determine the percent modern carbon [pMC (sample)] based on the 14C radiocarbon concentration. By using the theory that pMC (sample) was the sum of the pMC (compost) (109.87%) and pMC (PBS) (0%) as the respective ratio in the determined period, the CO2 (respiration) was calculated from only one reaction vessel. It was found that the biodegradabilities determined by the CO2 amount from PBS in the sample vessel were about 30% lower than those based on the ISO method. These differences between the ISO and AMS methods are caused by the fact that part of the carbons from PBS are changed into metabolites by the microorganisms in the compost, and not changed into CO2. PMID:20057944

  19. Watching MOOCs Together: Investigating Co-Located MOOC Study Groups

    ERIC Educational Resources Information Center

    Li, Nan; Verma, Himanshu; Skevi, Afroditi; Zufferey, Guillaume; Blom, Jan; Dillenbourg, Pierre

    2014-01-01

    Research suggests that massive open online course (MOOC) students prefer to study in groups, and that social facilitation within the study groups may render the learning of difficult concepts a pleasing experience. We report on a longitudinal study that investigates how co-located study groups watch and study MOOC videos together. The study was…

  20. Complete genome sequence of Lactobacillus plantarum LZ227, a potential probiotic strain producing B-group vitamins.

    PubMed

    Li, Ping; Zhou, Qingqing; Gu, Qing

    2016-09-20

    B-group vitamins play an important role in human metabolism, whose deficiencies are associated with a variety of disorders and diseases. Certain microorganisms such as Lactic acid bacteria (LAB) have been shown to have capacities for B-group vitamin production and thus could potentially replace chemically synthesized vitamins for food fortification. A potential probiotic strain named Lactobacillus plantarum LZ227, which was isolated from raw cow milk in this study, exhibits the ability to produce B-group vitamins. Complete genome sequencing of LZ227 was performed to gain insights into the genetic elements involved in B-group vitamin production. The genome of LZ227 contains a circular 3,131,750-bp chromosome, three circular plasmids and two predicted linear plasmids. LZ227 also contains gene clusters for biosynthesis of both riboflavin and folate. This genome sequence provides a basis for further elucidation of its molecular genetics and probiotic functions, and will facilitate its applications as starter cultures in food industry. Copyright © 2016 Elsevier B.V. All rights reserved.

  1. Characterizing 3-D flow velocity in evolving pore networks driven by CaCO3 precipitation and dissolution

    NASA Astrophysics Data System (ADS)

    Chojnicki, K. N.; Yoon, H.; Martinez, M. J.

    2015-12-01

    Understanding reactive flow in geomaterials is important for optimizing geologic carbon storage practices, such as using pore space efficiently. Flow paths can be complex in large degrees of geologic heterogeneities across scales. In addition, local heterogeneity can evolve as reactive transport processes alter the pore-scale morphology. For example, dissolved carbon dioxide may react with minerals in fractured rocks, confined aquifers, or faults, resulting in heterogeneous cementation (and/or dissolution) and evolving flow conditions. Both path and flow complexities are important and poorly characterized, making it difficult to determine their evolution with traditional 2-D transport models. Here we characterize the development of 3-D pore-scale flow with an evolving pore configuration due to calcium carbonate (CaCO3) precipitation and dissolution. A simple pattern of a microfluidic pore network is used initially and pore structures will become more complex due to precipitation and dissolution processes. At several stages of precipitation and dissolution, we directly visualize 3-D velocity vectors using micro particle image velocimetry and a laser scanning confocal microscope. Measured 3-D velocity vectors are then compared to 3-D simulated flow fields which will be used to simulate reactive transport. Our findings will highlight the importance of the 3-D flow dynamics and its impact on estimating reactive surface area over time. Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Corporation, for the U.S. Department of Energy's National Nuclear Security Administration under contract DE-AC04-94AL85000. This material is based upon work supported as part of the Center for Frontiers of Subsurface Energy Security, an Energy Frontier Research Center funded by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences under Award Number DE-SC0001114.

  2. Evolutionary pathway to increased virulence and epidemic group A Streptococcus disease derived from 3,615 genome sequences.

    PubMed

    Nasser, Waleed; Beres, Stephen B; Olsen, Randall J; Dean, Melissa A; Rice, Kelsey A; Long, S Wesley; Kristinsson, Karl G; Gottfredsson, Magnus; Vuopio, Jaana; Raisanen, Kati; Caugant, Dominique A; Steinbakk, Martin; Low, Donald E; McGeer, Allison; Darenberg, Jessica; Henriques-Normark, Birgitta; Van Beneden, Chris A; Hoffmann, Steen; Musser, James M

    2014-04-29

    We sequenced the genomes of 3,615 strains of serotype Emm protein 1 (M1) group A Streptococcus to unravel the nature and timing of molecular events contributing to the emergence, dissemination, and genetic diversification of an unusually virulent clone that now causes epidemic human infections worldwide. We discovered that the contemporary epidemic clone emerged in stepwise fashion from a precursor cell that first contained the phage encoding an extracellular DNase virulence factor (streptococcal DNase D2, SdaD2) and subsequently acquired the phage encoding the SpeA1 variant of the streptococcal pyrogenic exotoxin A superantigen. The SpeA2 toxin variant evolved from SpeA1 by a single-nucleotide change in the M1 progenitor strain before acquisition by horizontal gene transfer of a large chromosomal region encoding secreted toxins NAD(+)-glycohydrolase and streptolysin O. Acquisition of this 36-kb region in the early 1980s into just one cell containing the phage-encoded sdaD2 and speA2 genes was the final major molecular event preceding the emergence and rapid intercontinental spread of the contemporary epidemic clone. Thus, we resolve a decades-old controversy about the type and sequence of genomic alterations that produced this explosive epidemic. Analysis of comprehensive, population-based contemporary invasive strains from seven countries identified strong patterns of temporal population structure. Compared with a preepidemic reference strain, the contemporary clone is significantly more virulent in nonhuman primate models of pharyngitis and necrotizing fasciitis. A key finding is that the molecular evolutionary events transpiring in just one bacterial cell ultimately have produced millions of human infections worldwide.

  3. Evolutionary pathway to increased virulence and epidemic group A Streptococcus disease derived from 3,615 genome sequences

    PubMed Central

    Nasser, Waleed; Beres, Stephen B.; Olsen, Randall J.; Dean, Melissa A.; Rice, Kelsey A.; Long, S. Wesley; Kristinsson, Karl G.; Gottfredsson, Magnus; Vuopio, Jaana; Raisanen, Kati; Caugant, Dominique A.; Steinbakk, Martin; Low, Donald E.; McGeer, Allison; Darenberg, Jessica; Henriques-Normark, Birgitta; Van Beneden, Chris A.; Hoffmann, Steen; Musser, James M.

    2014-01-01

    We sequenced the genomes of 3,615 strains of serotype Emm protein 1 (M1) group A Streptococcus to unravel the nature and timing of molecular events contributing to the emergence, dissemination, and genetic diversification of an unusually virulent clone that now causes epidemic human infections worldwide. We discovered that the contemporary epidemic clone emerged in stepwise fashion from a precursor cell that first contained the phage encoding an extracellular DNase virulence factor (streptococcal DNase D2, SdaD2) and subsequently acquired the phage encoding the SpeA1 variant of the streptococcal pyrogenic exotoxin A superantigen. The SpeA2 toxin variant evolved from SpeA1 by a single-nucleotide change in the M1 progenitor strain before acquisition by horizontal gene transfer of a large chromosomal region encoding secreted toxins NAD+-glycohydrolase and streptolysin O. Acquisition of this 36-kb region in the early 1980s into just one cell containing the phage-encoded sdaD2 and speA2 genes was the final major molecular event preceding the emergence and rapid intercontinental spread of the contemporary epidemic clone. Thus, we resolve a decades-old controversy about the type and sequence of genomic alterations that produced this explosive epidemic. Analysis of comprehensive, population-based contemporary invasive strains from seven countries identified strong patterns of temporal population structure. Compared with a preepidemic reference strain, the contemporary clone is significantly more virulent in nonhuman primate models of pharyngitis and necrotizing fasciitis. A key finding is that the molecular evolutionary events transpiring in just one bacterial cell ultimately have produced millions of human infections worldwide. PMID:24733896

  4. Public preferences for communicating personal genomic risk information: a focus group study.

    PubMed

    Smit, Amelia K; Keogh, Louise A; Hersch, Jolyn; Newson, Ainsley J; Butow, Phyllis; Williams, Gabrielle; Cust, Anne E

    2016-12-01

    Personalized genomic risk information has the potential to motivate behaviour change and promote population health, but the success of this will depend upon effective risk communication strategies. To determine preferences for different graphical and written risk communication formats, and the delivery of genomic risk information including the mode of communication and the role of health professionals. Focus groups, transcribed and analysed thematically. Thirty-four participants from the public. Participants were provided with, and invited to discuss, a hypothetical scenario giving an individual's personalized genomic risk of melanoma displayed in several graphical formats. Participants preferred risk formats that were familiar and easy to understand, such as a 'double pie chart' and '100 person diagram' (pictograph). The 100 person diagram was considered persuasive because it humanized and personalized the risk information. People described the pie chart format as resembling bank data and food (such as cake and pizza). Participants thought that email, web-based platforms and postal mail were viable options for communicating genomic risk information. However, they felt that it was important that a health professional (either a genetic counsellor or 'informed' general practitioner) be available for discussion at the time of receiving the risk information, to minimize potential negative emotional responses and misunderstanding. Face-to-face or telephone delivery was preferred for delivery of high-risk results. These public preferences for communication strategies for genomic risk information will help to guide translation of genome-based knowledge into improved population health. © 2015 The Authors. Health Expectations. Published by John Wiley & Sons Ltd.

  5. Co-production in practice: how people with assisted living needs can help design and evolve technologies and services.

    PubMed

    Wherton, Joseph; Sugarhood, Paul; Procter, Rob; Hinder, Sue; Greenhalgh, Trisha

    2015-05-26

    The low uptake of telecare and telehealth services by older people may be explained by the limited involvement of users in the design. If the ambition of 'care closer to home' is to be realised, then industry, health and social care providers must evolve ways to work with older people to co-produce useful and useable solutions. We conducted 10 co-design workshops with users of telehealth and telecare, their carers, service providers and technology suppliers. Using vignettes developed from in-depth ethnographic case studies, we explored participants' perspectives on the design features of technologies and services to enable and facilitate the co-production of new care solutions. Workshop discussions were audio recorded, transcribed and analysed thematically. Analysis revealed four main themes. First, there is a need to raise awareness and provide information to potential users of assisted living technologies (ALTs). Second, technologies must be highly customisable and adaptable to accommodate the multiple and changing needs of different users. Third, the service must align closely with the individual's wider social support network. Finally, the service must support a high degree of information sharing and coordination. The case vignettes within inclusive and democratic co-design workshops provided a powerful means for ALT users and their carers to contribute, along with other stakeholders, to technology and service design. The workshops identified a need to focus attention on supporting the social processes that facilitate the collective efforts of formal and informal care networks in ALT delivery and use.

  6. Accurate multiplex polony sequencing of an evolved bacterial genome.

    PubMed

    Shendure, Jay; Porreca, Gregory J; Reppas, Nikos B; Lin, Xiaoxia; McCutcheon, John P; Rosenbaum, Abraham M; Wang, Michael D; Zhang, Kun; Mitra, Robi D; Church, George M

    2005-09-09

    We describe a DNA sequencing technology in which a commonly available, inexpensive epifluorescence microscope is converted to rapid nonelectrophoretic DNA sequencing automation. We apply this technology to resequence an evolved strain of Escherichia coli at less than one error per million consensus bases. A cell-free, mate-paired library provided single DNA molecules that were amplified in parallel to 1-micrometer beads by emulsion polymerase chain reaction. Millions of beads were immobilized in a polyacrylamide gel and subjected to automated cycles of sequencing by ligation and four-color imaging. Cost per base was roughly one-ninth as much as that of conventional sequencing. Our protocols were implemented with off-the-shelf instrumentation and reagents.

  7. Evolution of herbivore-induced early defense signaling was shaped by genome-wide duplications in Nicotiana

    PubMed Central

    Zhou, Wenwu; Brockmöller, Thomas; Ling, Zhihao; Omdahl, Ashton; Baldwin, Ian T; Xu, Shuqing

    2016-01-01

    Herbivore-induced defenses are widespread, rapidly evolving and relevant for plant fitness. Such induced defenses are often mediated by early defense signaling (EDS) rapidly activated by the perception of herbivore associated elicitors (HAE) that includes transient accumulations of jasmonic acid (JA). Analyzing 60 HAE-induced leaf transcriptomes from closely-related Nicotiana species revealed a key gene co-expression network (M4 module) which is co-activated with the HAE-induced JA accumulations but is elicited independently of JA, as revealed in plants silenced in JA signaling. Functional annotations of the M4 module were consistent with roles in EDS and a newly identified hub gene of the M4 module (NaLRRK1) mediates a negative feedback loop with JA signaling. Phylogenomic analysis revealed preferential gene retention after genome-wide duplications shaped the evolution of HAE-induced EDS in Nicotiana. These results highlight the importance of genome-wide duplications in the evolution of adaptive traits in plants. DOI: http://dx.doi.org/10.7554/eLife.19531.001 PMID:27813478

  8. 2-Oxoacid Metabolism in Methanogenic CoM and CoB Biosynthesis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Graham, David E

    Coenzyme M (CoM) and coenzyme B (CoB) are essential for methane production by the euryarchaea that employ this specialized anaerobic metabolism. Two pathways are known to produce CoM, 2-mercaptoethanesulfonate, and both converge on the 2-oxoacid sulfopyruvate. These cells have recruited the rich biochemistry of amino acid and 2-oxoacid metabolizing enzymes to produce a compound that resembles oxaloacetate, but with a more stable and acidic sulfonate group. 7-Mercaptoheptanoylthreonine phosphate, CoB, likewise owes its carbon backbone to a 2-oxoacid. Three enzymes recruited from leucine biosynthesis have evolved to catalyze the elongation of 2-oxoglutarate to 2-oxosuberate in CoB biosynthesis. This chapter describes themore » enzymology, synthesis and analytical techniques used to study 2-oxoacid metabolism in these pathways. Protein structure and mechanistic information from enzymes provides insight into the evolution of new enzymatic activity, and the evolution of substrate specificity from promiscuous enzyme scaffolds.« less

  9. Chance and necessity in the genome evolution of endosymbiotic bacteria of insects.

    PubMed

    Sabater-Muñoz, Beatriz; Toft, Christina; Alvarez-Ponce, David; Fares, Mario A

    2017-06-01

    An open question in evolutionary biology is how does the selection-drift balance determine the fates of biological interactions. We searched for signatures of selection and drift in genomes of five endosymbiotic bacterial groups known to evolve under strong genetic drift. Although most genes in endosymbiotic bacteria showed evidence of relaxed purifying selection, many genes in these bacteria exhibited stronger selective constraints than their orthologs in free-living bacterial relatives. Remarkably, most of these highly constrained genes had no role in the host-symbiont interactions but were involved in either buffering the deleterious consequences of drift or other host-unrelated functions, suggesting that they have either acquired new roles or their role became more central in endosymbiotic bacteria. Experimental evolution of Escherichia coli under strong genetic drift revealed remarkable similarities in the mutational spectrum, genome reduction patterns and gene losses to endosymbiotic bacteria of insects. Interestingly, the transcriptome of the experimentally evolved lines showed a generalized deregulation of the genome that affected genes encoding proteins involved in mutational buffering, regulation and amino acid biosynthesis, patterns identical to those found in endosymbiotic bacteria. Our results indicate that drift has shaped endosymbiotic associations through a change in the functional landscape of bacterial genes and that the host had only a small role in such a shift.

  10. Reconstruction of an Integrated Genome-Scale Co-Expression Network Reveals Key Modules Involved in Lung Adenocarcinoma

    PubMed Central

    Hosseini Ashtiani, Saman; Moeini, Ali; Nowzari-Dalini, Abbas; Masoudi-Nejad, Ali

    2013-01-01

    Our goal of this study was to reconstruct a “genome-scale co-expression network” and find important modules in lung adenocarcinoma so that we could identify the genes involved in lung adenocarcinoma. We integrated gene mutation, GWAS, CGH, array-CGH and SNP array data in order to identify important genes and loci in genome-scale. Afterwards, on the basis of the identified genes a co-expression network was reconstructed from the co-expression data. The reconstructed network was named “genome-scale co-expression network”. As the next step, 23 key modules were disclosed through clustering. In this study a number of genes have been identified for the first time to be implicated in lung adenocarcinoma by analyzing the modules. The genes EGFR, PIK3CA, TAF15, XIAP, VAPB, Appl1, Rab5a, ARF4, CLPTM1L, SP4, ZNF124, LPP, FOXP1, SOX18, MSX2, NFE2L2, SMARCC1, TRA2B, CBX3, PRPF6, ATP6V1C1, MYBBP1A, MACF1, GRM2, TBXA2R, PRKAR2A, PTK2, PGF and MYO10 are among the genes that belong to modules 1 and 22. All these genes, being implicated in at least one of the phenomena, namely cell survival, proliferation and metastasis, have an over-expression pattern similar to that of EGFR. In few modules, the genes such as CCNA2 (Cyclin A2), CCNB2 (Cyclin B2), CDK1, CDK5, CDC27, CDCA5, CDCA8, ASPM, BUB1, KIF15, KIF2C, NEK2, NUSAP1, PRC1, SMC4, SYCE2, TFDP1, CDC42 and ARHGEF9 are present that play a crucial role in cell cycle progression. In addition to the mentioned genes, there are some other genes (i.e. DLGAP5, BIRC5, PSMD2, Src, TTK, SENP2, PSMD2, DOK2, FUS and etc.) in the modules. PMID:23874428

  11. Reconstruction of an integrated genome-scale co-expression network reveals key modules involved in lung adenocarcinoma.

    PubMed

    Bidkhori, Gholamreza; Narimani, Zahra; Hosseini Ashtiani, Saman; Moeini, Ali; Nowzari-Dalini, Abbas; Masoudi-Nejad, Ali

    2013-01-01

    Our goal of this study was to reconstruct a "genome-scale co-expression network" and find important modules in lung adenocarcinoma so that we could identify the genes involved in lung adenocarcinoma. We integrated gene mutation, GWAS, CGH, array-CGH and SNP array data in order to identify important genes and loci in genome-scale. Afterwards, on the basis of the identified genes a co-expression network was reconstructed from the co-expression data. The reconstructed network was named "genome-scale co-expression network". As the next step, 23 key modules were disclosed through clustering. In this study a number of genes have been identified for the first time to be implicated in lung adenocarcinoma by analyzing the modules. The genes EGFR, PIK3CA, TAF15, XIAP, VAPB, Appl1, Rab5a, ARF4, CLPTM1L, SP4, ZNF124, LPP, FOXP1, SOX18, MSX2, NFE2L2, SMARCC1, TRA2B, CBX3, PRPF6, ATP6V1C1, MYBBP1A, MACF1, GRM2, TBXA2R, PRKAR2A, PTK2, PGF and MYO10 are among the genes that belong to modules 1 and 22. All these genes, being implicated in at least one of the phenomena, namely cell survival, proliferation and metastasis, have an over-expression pattern similar to that of EGFR. In few modules, the genes such as CCNA2 (Cyclin A2), CCNB2 (Cyclin B2), CDK1, CDK5, CDC27, CDCA5, CDCA8, ASPM, BUB1, KIF15, KIF2C, NEK2, NUSAP1, PRC1, SMC4, SYCE2, TFDP1, CDC42 and ARHGEF9 are present that play a crucial role in cell cycle progression. In addition to the mentioned genes, there are some other genes (i.e. DLGAP5, BIRC5, PSMD2, Src, TTK, SENP2, PSMD2, DOK2, FUS and etc.) in the modules.

  12. Mixed-Gender Co-Facilitation in Therapeutic Groups for Men Who Have Perpetrated Intimate Partner Violence: Group Members' Perspectives

    ERIC Educational Resources Information Center

    Roy, Valerie; Lindsay, Jocelyn; Dallaire, Louis-Francois

    2013-01-01

    This article describes a study that explored the use of mixed-gender co-facilitation in intimate partner violence groups, especially regarding its potential for gender role socialization. Using an interpretive approach, interviews with men from different mixed-gender co-facilitated groups in Canada were analyzed, with a focus on the men's…

  13. Comparative Genomics of the Dual-Obligate Symbionts from the Treehopper, Entylia carinata (Hemiptera: Membracidae), Provide Insight into the Origins and Evolution of an Ancient Symbiosis.

    PubMed

    Mao, Meng; Yang, Xiushuai; Poff, Kirsten; Bennett, Gordon

    2017-06-01

    Insect species in the Auchenorrhyncha suborder (Hemiptera) maintain ancient obligate symbioses with bacteria that provide essential amino acids (EAAs) deficient in their plant-sap diets. Molecular studies have revealed that two complementary symbiont lineages, "Candidatus Sulcia muelleri" and a betaproteobacterium ("Ca. Zinderia insecticola" in spittlebugs [Cercopoidea] and "Ca. Nasuia deltocephalinicola" in leafhoppers [Cicadellidae]) may have persisted in the suborder since its origin ∼300 Ma. However, investigation of how this pair has co-evolved on a genomic level is limited to only a few host lineages. We sequenced the complete genomes of Sulcia and a betaproteobacterium from the treehopper, Entylia carinata (Membracidae: ENCA), as the first representative from this species-rich group. It also offers the opportunity to compare symbiont evolution across a major insect group, the Membracoidea (leafhoppers + treehoppers). Genomic analyses show that the betaproteobacteria in ENCA is a member of the Nasuia lineage. Both symbionts have larger genomes (Sulcia = 218 kb and Nasuia = 144 kb) than related lineages in Deltocephalinae leafhoppers, retaining genes involved in basic cellular functions and information processing. Nasuia-ENCA further exhibits few unique gene losses, suggesting that its parent lineage in the common ancestor to the Membracoidea was already highly reduced. Sulcia-ENCA has lost the abilities to synthesize menaquinone cofactor and to complete the synthesis of the branched-chain EAAs. Both capabilities are conserved in other Sulcia lineages sequenced from across the Auchenorrhyncha. Finally, metagenomic sequencing recovered the partial genome of an Arsenophonus symbiont, although it infects only 20% of individuals indicating a facultative role. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  14. Comparative Genomics of the Dual-Obligate Symbionts from the Treehopper, Entylia carinata (Hemiptera: Membracidae), Provide Insight into the Origins and Evolution of an Ancient Symbiosis

    PubMed Central

    Yang, Xiushuai; Poff, Kirsten; Bennett, Gordon

    2017-01-01

    Abstract Insect species in the Auchenorrhyncha suborder (Hemiptera) maintain ancient obligate symbioses with bacteria that provide essential amino acids (EAAs) deficient in their plant-sap diets. Molecular studies have revealed that two complementary symbiont lineages, “Candidatus Sulcia muelleri” and a betaproteobacterium (“Ca. Zinderia insecticola” in spittlebugs [Cercopoidea] and “Ca. Nasuia deltocephalinicola” in leafhoppers [Cicadellidae]) may have persisted in the suborder since its origin ∼300 Ma. However, investigation of how this pair has co-evolved on a genomic level is limited to only a few host lineages. We sequenced the complete genomes of Sulcia and a betaproteobacterium from the treehopper, Entylia carinata (Membracidae: ENCA), as the first representative from this species-rich group. It also offers the opportunity to compare symbiont evolution across a major insect group, the Membracoidea (leafhoppers + treehoppers). Genomic analyses show that the betaproteobacteria in ENCA is a member of the Nasuia lineage. Both symbionts have larger genomes (Sulcia = 218 kb and Nasuia = 144 kb) than related lineages in Deltocephalinae leafhoppers, retaining genes involved in basic cellular functions and information processing. Nasuia-ENCA further exhibits few unique gene losses, suggesting that its parent lineage in the common ancestor to the Membracoidea was already highly reduced. Sulcia-ENCA has lost the abilities to synthesize menaquinone cofactor and to complete the synthesis of the branched-chain EAAs. Both capabilities are conserved in other Sulcia lineages sequenced from across the Auchenorrhyncha. Finally, metagenomic sequencing recovered the partial genome of an Arsenophonus symbiont, although it infects only 20% of individuals indicating a facultative role. PMID:28854637

  15. Pancreatic Cancer Genomics 2.0: Profiling Metastases.

    PubMed

    Collisson, Eric A; Maitra, Anirban

    2017-03-13

    Pancreatic ductal adenocarcinoma, even when diagnosed early, nearly always metastasizes. Recurrent mutations and genomic instability are early events in the disease. Two recent papers advance our understanding of how the cancer genome evolves as the primary tumor migrates from its origin in the pancreas to colonize distant metastatic sites. Copyright © 2017 Elsevier Inc. All rights reserved.

  16. Genomic clocks and evolutionary timescales

    NASA Technical Reports Server (NTRS)

    Blair Hedges, S.; Kumar, Sudhir

    2003-01-01

    For decades, molecular clocks have helped to illuminate the evolutionary timescale of life, but now genomic data pose a challenge for time estimation methods. It is unclear how to integrate data from many genes, each potentially evolving under a different model of substitution and at a different rate. Current methods can be grouped by the way the data are handled (genes considered separately or combined into a 'supergene') and the way gene-specific rate models are applied (global versus local clock). There are advantages and disadvantages to each of these approaches, and the optimal method has not yet emerged. Fortunately, time estimates inferred using many genes or proteins have greater precision and appear to be robust to different approaches.

  17. The Genomic Basis for Evolved Pollution Tolerance in Killifish (Fundulus heterclitus).

    EPA Science Inventory

    Uncovering the molecular mechanisms of adaptive variation is a leading challenge in evolutionary biology. Identifying genes that influence ecological traits can provide insight into the evolutionary processes behind genomic responses to environmental change. Here, we examine the...

  18. Genomic Evaluation of Thermoanaerobacter spp. for the Construction of Designer Co-Cultures to Improve Lignocellulosic Biofuel Production

    PubMed Central

    Verbeke, Tobin J.; Zhang, Xiangli; Henrissat, Bernard; Spicer, Vic; Rydzak, Thomas; Krokhin, Oleg V.; Fristensky, Brian; Levin, David B.; Sparling, Richard

    2013-01-01

    The microbial production of ethanol from lignocellulosic biomass is a multi-component process that involves biomass hydrolysis, carbohydrate transport and utilization, and finally, the production of ethanol. Strains of the genus Thermoanaerobacter have been studied for decades due to their innate abilities to produce comparatively high ethanol yields from hemicellulose constituent sugars. However, their inability to hydrolyze cellulose, limits their usefulness in lignocellulosic biofuel production. As such, co-culturing Thermoanaerobacter spp. with cellulolytic organisms is a plausible approach to improving lignocellulose conversion efficiencies and yields of biofuels. To evaluate native lignocellulosic ethanol production capacities relative to competing fermentative end-products, comparative genomic analysis of 11 sequenced Thermoanaerobacter strains, including a de novo genome, Thermoanaerobacter thermohydrosulfuricus WC1, was conducted. Analysis was specifically focused on the genomic potential for each strain to address all aspects of ethanol production mentioned through a consolidated bioprocessing approach. Whole genome functional annotation analysis identified three distinct clades within the genus. The genomes of Clade 1 strains encode the fewest extracellular carbohydrate active enzymes and also show the least diversity in terms of lignocellulose relevant carbohydrate utilization pathways. However, these same strains reportedly are capable of directing a higher proportion of their total carbon flux towards ethanol, rather than non-biofuel end-products, than other Thermoanaerobacter strains. Strains in Clade 2 show the greatest diversity in terms of lignocellulose hydrolysis and utilization, but proportionately produce more non-ethanol end-products than Clade 1 strains. Strains in Clade 3, in which T. thermohydrosulfuricus WC1 is included, show mid-range potential for lignocellulose hydrolysis and utilization, but also exhibit extensive divergence from both

  19. Clinical utilization of genomics data produced by the international Pseudomonas aeruginosa consortium

    PubMed Central

    Freschi, Luca; Jeukens, Julie; Kukavica-Ibrulj, Irena; Boyle, Brian; Dupont, Marie-Josée; Laroche, Jérôme; Larose, Stéphane; Maaroufi, Halim; Fothergill, Joanne L.; Moore, Matthew; Winsor, Geoffrey L.; Aaron, Shawn D.; Barbeau, Jean; Bell, Scott C.; Burns, Jane L.; Camara, Miguel; Cantin, André; Charette, Steve J.; Dewar, Ken; Déziel, Éric; Grimwood, Keith; Hancock, Robert E. W.; Harrison, Joe J.; Heeb, Stephan; Jelsbak, Lars; Jia, Baofeng; Kenna, Dervla T.; Kidd, Timothy J.; Klockgether, Jens; Lam, Joseph S.; Lamont, Iain L.; Lewenza, Shawn; Loman, Nick; Malouin, François; Manos, Jim; McArthur, Andrew G.; McKeown, Josie; Milot, Julie; Naghra, Hardeep; Nguyen, Dao; Pereira, Sheldon K.; Perron, Gabriel G.; Pirnay, Jean-Paul; Rainey, Paul B.; Rousseau, Simon; Santos, Pedro M.; Stephenson, Anne; Taylor, Véronique; Turton, Jane F.; Waglechner, Nicholas; Williams, Paul; Thrane, Sandra W.; Wright, Gerard D.; Brinkman, Fiona S. L.; Tucker, Nicholas P.; Tümmler, Burkhard; Winstanley, Craig; Levesque, Roger C.

    2015-01-01

    The International Pseudomonas aeruginosa Consortium is sequencing over 1000 genomes and building an analysis pipeline for the study of Pseudomonas genome evolution, antibiotic resistance and virulence genes. Metadata, including genomic and phenotypic data for each isolate of the collection, are available through the International Pseudomonas Consortium Database (http://ipcd.ibis.ulaval.ca/). Here, we present our strategy and the results that emerged from the analysis of the first 389 genomes. With as yet unmatched resolution, our results confirm that P. aeruginosa strains can be divided into three major groups that are further divided into subgroups, some not previously reported in the literature. We also provide the first snapshot of P. aeruginosa strain diversity with respect to antibiotic resistance. Our approach will allow us to draw potential links between environmental strains and those implicated in human and animal infections, understand how patients become infected and how the infection evolves over time as well as identify prognostic markers for better evidence-based decisions on patient care. PMID:26483767

  20. Flexible Unicast-Based Group Communication for CoAP-Enabled Devices †

    PubMed Central

    Ishaq, Isam; Hoebeke, Jeroen; Van den Abeele, Floris; Rossey, Jen; Moerman, Ingrid; Demeester, Piet

    2014-01-01

    Smart embedded objects will become an important part of what is called the Internet of Things. Applications often require concurrent interactions with several of these objects and their resources. Existing solutions have several limitations in terms of reliability, flexibility and manageability of such groups of objects. To overcome these limitations we propose an intermediately level of intelligence to easily manipulate a group of resources across multiple smart objects, building upon the Constrained Application Protocol (CoAP). We describe the design of our solution to create and manipulate a group of CoAP resources using a single client request. Furthermore we introduce the concept of profiles for the created groups. The use of profiles allows the client to specify in more detail how the group should behave. We have implemented our solution and demonstrate that it covers the complete group life-cycle, i.e., creation, validation, flexible usage and deletion. Finally, we quantitatively analyze the performance of our solution and compare it against multicast-based CoAP group communication. The results show that our solution improves reliability and flexibility with a trade-off in increased communication overhead. PMID:24901978

  1. Full-Genome Characterisation of Orungo, Lebombo and Changuinola Viruses Provides Evidence for Co-Evolution of Orbiviruses with Their Arthropod Vectors

    PubMed Central

    Mohd Jaafar, Fauziah; Belhouchet, Mourad; Belaganahalli, Manjunatha; Tesh, Robert B.; Mertens, Peter P. C.; Attoui, Houssam

    2014-01-01

    The complete genomes of Orungo virus (ORUV), Lebombo virus (LEBV) and Changuinola virus (CGLV) were sequenced, confirming that they each encode 11 distinct proteins (VP1-VP7 and NS1-NS4). Phylogenetic analyses of cell-attachment protein ‘outer-capsid protein 1′ (OC1), show that orbiviruses fall into three large groups, identified as: VP2(OC1), in which OC1 is the 2nd largest protein, including the Culicoides transmitted orbiviruses; VP3(OC1), which includes the mosquito transmitted orbiviruses; and VP4(OC1) which includes the tick transmitted viruses. Differences in the size of OC1 between these groups, places the T2 ‘subcore-shell protein’ as the third largest protein ‘VP3(T2)’ in the first of these groups, but the second largest protein ‘VP3(T2)’ in the other two groups. ORUV, LEBV and CGLV all group with the Culicoides-borne VP2(OC1)/VP3(T2) viruses. The G+C content of the ORUV, LEBV and CGLV genomes is also similar to that of the Culicoides-borne, rather than the mosquito-borne, or tick borne orbiviruses. These data suggest that ORUV and LEBV are Culicoides- rather than mosquito-borne. Multiple isolations of CGLV from sand flies suggest that they are its primary vector. OC1 of the insect-borne orbiviruses is approximately twice the size of the equivalent protein of the tick borne viruses. Together with internal sequence similarities, this suggests its origin by duplication (concatermerisation) of a smaller OC1 from an ancestral tick-borne orbivirus. Phylogenetic comparisons showing linear relationships between the dates of evolutionary-separation of their vector species, and genetic-distances between tick-, mosquito- or Culicoides-borne virus-groups, provide evidence for co-evolution of the orbiviruses with their arthropod vectors. PMID:24475112

  2. Group-based variant calling leveraging next-generation supercomputing for large-scale whole-genome sequencing studies.

    PubMed

    Standish, Kristopher A; Carland, Tristan M; Lockwood, Glenn K; Pfeiffer, Wayne; Tatineni, Mahidhar; Huang, C Chris; Lamberth, Sarah; Cherkas, Yauheniya; Brodmerkel, Carrie; Jaeger, Ed; Smith, Lance; Rajagopal, Gunaretnam; Curran, Mark E; Schork, Nicholas J

    2015-09-22

    Next-generation sequencing (NGS) technologies have become much more efficient, allowing whole human genomes to be sequenced faster and cheaper than ever before. However, processing the raw sequence reads associated with NGS technologies requires care and sophistication in order to draw compelling inferences about phenotypic consequences of variation in human genomes. It has been shown that different approaches to variant calling from NGS data can lead to different conclusions. Ensuring appropriate accuracy and quality in variant calling can come at a computational cost. We describe our experience implementing and evaluating a group-based approach to calling variants on large numbers of whole human genomes. We explore the influence of many factors that may impact the accuracy and efficiency of group-based variant calling, including group size, the biogeographical backgrounds of the individuals who have been sequenced, and the computing environment used. We make efficient use of the Gordon supercomputer cluster at the San Diego Supercomputer Center by incorporating job-packing and parallelization considerations into our workflow while calling variants on 437 whole human genomes generated as part of large association study. We ultimately find that our workflow resulted in high-quality variant calls in a computationally efficient manner. We argue that studies like ours should motivate further investigations combining hardware-oriented advances in computing systems with algorithmic developments to tackle emerging 'big data' problems in biomedical research brought on by the expansion of NGS technologies.

  3. Differential Scanning Calorimetry and Evolved Gas Analysis of Hydromagnesite

    NASA Technical Reports Server (NTRS)

    Lauer, H. V., Jr.; Golden, D. C.; Ming, Douglas W.; Boynton, W. V.

    1999-01-01

    Volatile-bearing minerals (e.g., Fe-oxyhydroxides, phyllosilicates, carbonates and sulfates) may be important phases on the surface of Mars. In order to characterize these phases the Thermal and Evolved Gas Analyzer (TEGA) flying on the Mars'98 lander will perform analyses on surface samples from Mars. Hydromagnesite [Mg5(CO3)4(OH)2.4H2O] is considered a good standard mineral to examine as a Mars soil analog component because it evolves both H2O and CO2 at temperatures between 0 and 600 C. Our aim here is to interpret the DSC signature of hydromagnesite under ambient pressure and 20 sccm N2 flow in the range 25 to 600 C. The DSC curve for hydromagnesite under the above conditions consists of three endothermic peaks at temperatures 296, 426, and 548 and one sharp exotherm at 511 C. X-ray analysis of the sample at different stop temperatures suggested that the exotherm corresponded with the formation of crystalline magnesite. The first endotherm was due to dehydration of hydromagnesite, and then the second one was due to the decomposition of carbonate, immediately followed by the formation of magnesite (exotherm) and its decomposition to periclase (last endotherm). Evolution of water and CO2 were consistent with the observed enthalpy changes. A library of such DSC-evolved gas curves for putative Martian minerals are currently being acquired in order to facilitate the interpretation of results obtained by a robotic lander.

  4. Complete Genome Sequence of the RmInt1 Group II Intronless Sinorhizobium meliloti Strain RMO17

    PubMed Central

    Martínez-Abarca, Francisco; Nisa-Martínez, Rafael

    2014-01-01

    We report the complete genome sequence of the RmInt1 group II intronless Sinorhizobium meliloti strain RMO17 isolated from Medicago orbicularis nodules from Spanish soil. The genome consists of 6.73 Mb distributed between a single chromosome and two megaplasmids (the chromid pSymB and pSymA). PMID:25301650

  5. Genomics of Parallel Ecological Speciation in Lake Victoria Cichlids.

    PubMed

    Meier, Joana Isabel; Marques, David Alexander; Wagner, Catherine Elise; Excoffier, Laurent; Seehausen, Ole

    2018-06-01

    The genetic basis of parallel evolution of similar species is of great interest in evolutionary biology. In the adaptive radiation of Lake Victoria cichlid fishes, sister species with either blue or red-back male nuptial coloration have evolved repeatedly, often associated with shallower and deeper water, respectively. One such case is blue and red-backed Pundamilia species, for which we recently showed that a young species pair may have evolved through "hybrid parallel speciation". Coalescent simulations suggested that the older species P. pundamilia (blue) and P. nyererei (red-back) admixed in the Mwanza Gulf and that new "nyererei-like" and "pundamilia-like" species evolved from the admixed population. Here, we use genome scans to study the genomic architecture of differentiation, and assess the influence of hybridization on the evolution of the younger species pair. For each of the two species pairs, we find over 300 genomic regions, widespread across the genome, which are highly differentiated. A subset of the most strongly differentiated regions of the older pair are also differentiated in the younger pair. These shared differentiated regions often show parallel allele frequency differences, consistent with the hypothesis that admixture-derived alleles were targeted by divergent selection in the hybrid population. However, two-thirds of the genomic regions that are highly differentiated between the younger species are not highly differentiated between the older species, suggesting independent evolutionary responses to selection pressures. Our analyses reveal how divergent selection on admixture-derived genetic variation can facilitate new speciation events.

  6. Evolving doublesex expression correlates with the origin and diversification of male sexual ornaments in the Drosophila immigrans species group.

    PubMed

    Rice, Gavin; Barmina, Olga; Hu, Kevin; Kopp, Artyom

    2018-03-01

    Male ornaments and other sex-specific traits present some of the most dramatic examples of evolutionary innovations. Comparative studies of similar but independently evolved traits are particularly important for identifying repeated patterns in the evolution of these traits. Male-specific modifications of the front legs have evolved repeatedly in Drosophilidae and other Diptera. The best understood of these novel structures is the sex comb of Drosophila melanogaster and its close relatives. Here, we examine the evolution of another male foreleg modification, the sex brush, found in the distantly related Drosophila immigrans species group. Similar to the sex comb, we find that the origin of the sex brush correlates with novel, spatially restricted expression of the doublesex (dsx) transcription factor, the primary effector of the Drosophila sex determination pathway. The diversity of Dsx expression patterns in the immigrans species group closely reflects the differences in the presence, position, and size of the sex brush. Together with previous work on sex comb evolution, these observations suggest that tissue-specific activation of dsx expression may be a common mechanism responsible for the evolution of sexual dimorphism and particularly for the origin of novel male-specific ornaments. © 2018 Wiley Periodicals, Inc.

  7. Complete Genome Sequence of the RmInt1 Group II Intronless Sinorhizobium meliloti Strain RMO17.

    PubMed

    Toro, Nicolás; Martínez-Abarca, Francisco; Nisa-Martínez, Rafael

    2014-10-09

    We report the complete genome sequence of the RmInt1 group II intronless Sinorhizobium meliloti strain RMO17 isolated from Medicago orbicularis nodules from Spanish soil. The genome consists of 6.73 Mb distributed between a single chromosome and two megaplasmids (the chromid pSymB and pSymA). Copyright © 2014 Toro et al.

  8. The mitochondrial genome of the arbuscular mycorrhizal fungus Gigaspora margarita reveals two unsuspected trans-splicing events of group I introns.

    PubMed

    Pelin, Adrian; Pombert, Jean-François; Salvioli, Alessandra; Bonen, Linda; Bonfante, Paola; Corradi, Nicolas

    2012-05-01

    • Arbuscular mycorrhizal fungi (AMF) are ubiquitous organisms that benefit ecosystems through the establishment of an association with the roots of most plants: the mycorrhizal symbiosis. Despite their ecological importance, however, these fungi have been poorly studied at the genome level. • In this study, total DNA from the AMF Gigaspora margarita was subjected to a combination of 454 and Illumina sequencing, and the resulting reads were used to assemble its mitochondrial genome de novo. This genome was annotated and compared with those of other relatives to better comprehend the evolution of the AMF lineage. • The mitochondrial genome of G. margarita is unique in many ways, exhibiting a large size (97 kbp) and elevated GC content (45%). This genome also harbors molecular events that were previously unknown to occur in fungal mitochondrial genomes, including trans-splicing of group I introns from two different genes coding for the first subunit of the cytochrome oxidase and for the small subunit of the rRNA. • This study reports the second published genome from an AMF organelle, resulting in relevant DNA sequence information from this poorly studied fungal group, and providing new insights into the frequency, origin and evolution of trans-spliced group I introns found across the mitochondrial genomes of distantly related organisms. © 2012 The Authors. New Phytologist © 2012 New Phytologist Trust.

  9. Comparative genomics of Lactobacillus

    PubMed Central

    Kant, Ravi; Blom, Jochen; Palva, Airi; Siezen, Roland J.; de Vos, Willem M.

    2011-01-01

    Summary The genus Lactobacillus includes a diverse group of bacteria consisting of many species that are associated with fermentations of plants, meat or milk. In addition, various lactobacilli are natural inhabitants of the intestinal tract of humans and other animals. Finally, several Lactobacillus strains are marketed as probiotics as their consumption can confer a health benefit to host. Presently, 154 Lactobacillus species are known and a growing fraction of these are subject to draft genome sequencing. However, complete genome sequences are needed to provide a platform for detailed genomic comparisons. Therefore, we selected a total of 20 genomes of various Lactobacillus strains for which complete genomic sequences have been reported. These genomes had sizes varying from 1.8 to 3.3 Mb and other characteristic features, such as G+C content that ranged from 33% to 51%. The Lactobacillus pan genome was found to consist of approximately 14 000 protein‐encoding genes while all 20 genomes shared a total of 383 sets of orthologous genes that defined the Lactobacillus core genome (LCG). Based on advanced phylogeny of the proteins encoded by this LCG, we grouped the 20 strains into three main groups and defined core group genes present in all genomes of a single group, signature group genes shared in all genomes of one group but absent in all other Lactobacillus genomes, and Group‐specific ORFans present in core group genes of one group and absent in all other complete genomes. The latter are of specific value in defining the different groups of genomes. The study provides a platform for present individual comparisons as well as future analysis of new Lactobacillus genomes. PMID:21375712

  10. Bacteriophage Taxonomy: An Evolving Discipline.

    PubMed

    Tolstoy, Igor; Kropinski, Andrew M; Brister, J Rodney

    2018-01-01

    While taxonomy is an often-unappreciated branch of science it serves very important roles. Bacteriophage taxonomy has evolved from a mainly morphology-based discipline, characterized by the work of David Bradley and Hans-Wolfgang Ackermann, to the holistic approach that is taken today. The Bacterial and Archaeal Viruses Subcommittee of the International Committee on Taxonomy of Viruses (ICTV) takes a comprehensive approach to classifying prokaryote viruses measuring overall DNA and protein identity and phylogeny before making decisions about the taxonomic position of a new virus. The huge number of complete genomes being deposited with NCBI and other public databases has resulted in a reassessment of the taxonomy of many viruses, and the future will see the introduction of new viral families and higher orders.

  11. Yeast "make-accumulate-consume" life strategy evolved as a multi-step process that predates the whole genome duplication.

    PubMed

    Hagman, Arne; Säll, Torbjörn; Compagno, Concetta; Piskur, Jure

    2013-01-01

    When fruits ripen, microbial communities start a fierce competition for the freely available fruit sugars. Three yeast lineages, including baker's yeast Saccharomyces cerevisiae, have independently developed the metabolic activity to convert simple sugars into ethanol even under fully aerobic conditions. This fermentation capacity, named Crabtree effect, reduces the cell-biomass production but provides in nature a tool to out-compete other microorganisms. Here, we analyzed over forty Saccharomycetaceae yeasts, covering over 200 million years of the evolutionary history, for their carbon metabolism. The experiments were done under strictly controlled and uniform conditions, which has not been done before. We show that the origin of Crabtree effect in Saccharomycetaceae predates the whole genome duplication and became a settled metabolic trait after the split of the S. cerevisiae and Kluyveromyces lineages, and coincided with the origin of modern fruit bearing plants. Our results suggest that ethanol fermentation evolved progressively, involving several successive molecular events that have gradually remodeled the yeast carbon metabolism. While some of the final evolutionary events, like gene duplications of glucose transporters and glycolytic enzymes, have been deduced, the earliest molecular events initiating Crabtree effect are still to be determined.

  12. The first myriapod genome sequence reveals conservative arthropod gene content and genome organisation in the centipede Strigamia maritima.

    PubMed

    Chipman, Ariel D; Ferrier, David E K; Brena, Carlo; Qu, Jiaxin; Hughes, Daniel S T; Schröder, Reinhard; Torres-Oliva, Montserrat; Znassi, Nadia; Jiang, Huaiyang; Almeida, Francisca C; Alonso, Claudio R; Apostolou, Zivkos; Aqrawi, Peshtewani; Arthur, Wallace; Barna, Jennifer C J; Blankenburg, Kerstin P; Brites, Daniela; Capella-Gutiérrez, Salvador; Coyle, Marcus; Dearden, Peter K; Du Pasquier, Louis; Duncan, Elizabeth J; Ebert, Dieter; Eibner, Cornelius; Erikson, Galina; Evans, Peter D; Extavour, Cassandra G; Francisco, Liezl; Gabaldón, Toni; Gillis, William J; Goodwin-Horn, Elizabeth A; Green, Jack E; Griffiths-Jones, Sam; Grimmelikhuijzen, Cornelis J P; Gubbala, Sai; Guigó, Roderic; Han, Yi; Hauser, Frank; Havlak, Paul; Hayden, Luke; Helbing, Sophie; Holder, Michael; Hui, Jerome H L; Hunn, Julia P; Hunnekuhl, Vera S; Jackson, LaRonda; Javaid, Mehwish; Jhangiani, Shalini N; Jiggins, Francis M; Jones, Tamsin E; Kaiser, Tobias S; Kalra, Divya; Kenny, Nathan J; Korchina, Viktoriya; Kovar, Christie L; Kraus, F Bernhard; Lapraz, François; Lee, Sandra L; Lv, Jie; Mandapat, Christigale; Manning, Gerard; Mariotti, Marco; Mata, Robert; Mathew, Tittu; Neumann, Tobias; Newsham, Irene; Ngo, Dinh N; Ninova, Maria; Okwuonu, Geoffrey; Ongeri, Fiona; Palmer, William J; Patil, Shobha; Patraquim, Pedro; Pham, Christopher; Pu, Ling-Ling; Putman, Nicholas H; Rabouille, Catherine; Ramos, Olivia Mendivil; Rhodes, Adelaide C; Robertson, Helen E; Robertson, Hugh M; Ronshaugen, Matthew; Rozas, Julio; Saada, Nehad; Sánchez-Gracia, Alejandro; Scherer, Steven E; Schurko, Andrew M; Siggens, Kenneth W; Simmons, DeNard; Stief, Anna; Stolle, Eckart; Telford, Maximilian J; Tessmar-Raible, Kristin; Thornton, Rebecca; van der Zee, Maurijn; von Haeseler, Arndt; Williams, James M; Willis, Judith H; Wu, Yuanqing; Zou, Xiaoyan; Lawson, Daniel; Muzny, Donna M; Worley, Kim C; Gibbs, Richard A; Akam, Michael; Richards, Stephen

    2014-11-01

    Myriapods (e.g., centipedes and millipedes) display a simple homonomous body plan relative to other arthropods. All members of the class are terrestrial, but they attained terrestriality independently of insects. Myriapoda is the only arthropod class not represented by a sequenced genome. We present an analysis of the genome of the centipede Strigamia maritima. It retains a compact genome that has undergone less gene loss and shuffling than previously sequenced arthropods, and many orthologues of genes conserved from the bilaterian ancestor that have been lost in insects. Our analysis locates many genes in conserved macro-synteny contexts, and many small-scale examples of gene clustering. We describe several examples where S. maritima shows different solutions from insects to similar problems. The insect olfactory receptor gene family is absent from S. maritima, and olfaction in air is likely effected by expansion of other receptor gene families. For some genes S. maritima has evolved paralogues to generate coding sequence diversity, where insects use alternate splicing. This is most striking for the Dscam gene, which in Drosophila generates more than 100,000 alternate splice forms, but in S. maritima is encoded by over 100 paralogues. We see an intriguing linkage between the absence of any known photosensory proteins in a blind organism and the additional absence of canonical circadian clock genes. The phylogenetic position of myriapods allows us to identify where in arthropod phylogeny several particular molecular mechanisms and traits emerged. For example, we conclude that juvenile hormone signalling evolved with the emergence of the exoskeleton in the arthropods and that RR-1 containing cuticle proteins evolved in the lineage leading to Mandibulata. We also identify when various gene expansions and losses occurred. The genome of S. maritima offers us a unique glimpse into the ancestral arthropod genome, while also displaying many adaptations to its specific

  13. The First Myriapod Genome Sequence Reveals Conservative Arthropod Gene Content and Genome Organisation in the Centipede Strigamia maritima

    PubMed Central

    Chipman, Ariel D.; Ferrier, David E. K.; Brena, Carlo; Qu, Jiaxin; Hughes, Daniel S. T.; Schröder, Reinhard; Torres-Oliva, Montserrat; Znassi, Nadia; Jiang, Huaiyang; Almeida, Francisca C.; Alonso, Claudio R.; Apostolou, Zivkos; Aqrawi, Peshtewani; Arthur, Wallace; Barna, Jennifer C. J.; Blankenburg, Kerstin P.; Brites, Daniela; Capella-Gutiérrez, Salvador; Coyle, Marcus; Dearden, Peter K.; Du Pasquier, Louis; Duncan, Elizabeth J.; Ebert, Dieter; Eibner, Cornelius; Erikson, Galina; Evans, Peter D.; Extavour, Cassandra G.; Francisco, Liezl; Gabaldón, Toni; Gillis, William J.; Goodwin-Horn, Elizabeth A.; Green, Jack E.; Griffiths-Jones, Sam; Grimmelikhuijzen, Cornelis J. P.; Gubbala, Sai; Guigó, Roderic; Han, Yi; Hauser, Frank; Havlak, Paul; Hayden, Luke; Helbing, Sophie; Holder, Michael; Hui, Jerome H. L.; Hunn, Julia P.; Hunnekuhl, Vera S.; Jackson, LaRonda; Javaid, Mehwish; Jhangiani, Shalini N.; Jiggins, Francis M.; Jones, Tamsin E.; Kaiser, Tobias S.; Kalra, Divya; Kenny, Nathan J.; Korchina, Viktoriya; Kovar, Christie L.; Kraus, F. Bernhard; Lapraz, François; Lee, Sandra L.; Lv, Jie; Mandapat, Christigale; Manning, Gerard; Mariotti, Marco; Mata, Robert; Mathew, Tittu; Neumann, Tobias; Newsham, Irene; Ngo, Dinh N.; Ninova, Maria; Okwuonu, Geoffrey; Ongeri, Fiona; Palmer, William J.; Patil, Shobha; Patraquim, Pedro; Pham, Christopher; Pu, Ling-Ling; Putman, Nicholas H.; Rabouille, Catherine; Ramos, Olivia Mendivil; Rhodes, Adelaide C.; Robertson, Helen E.; Robertson, Hugh M.; Ronshaugen, Matthew; Rozas, Julio; Saada, Nehad; Sánchez-Gracia, Alejandro; Scherer, Steven E.; Schurko, Andrew M.; Siggens, Kenneth W.; Simmons, DeNard; Stief, Anna; Stolle, Eckart; Telford, Maximilian J.; Tessmar-Raible, Kristin; Thornton, Rebecca; van der Zee, Maurijn; von Haeseler, Arndt; Williams, James M.; Willis, Judith H.; Wu, Yuanqing; Zou, Xiaoyan; Lawson, Daniel; Muzny, Donna M.; Worley, Kim C.; Gibbs, Richard A.; Akam, Michael; Richards, Stephen

    2014-01-01

    Myriapods (e.g., centipedes and millipedes) display a simple homonomous body plan relative to other arthropods. All members of the class are terrestrial, but they attained terrestriality independently of insects. Myriapoda is the only arthropod class not represented by a sequenced genome. We present an analysis of the genome of the centipede Strigamia maritima. It retains a compact genome that has undergone less gene loss and shuffling than previously sequenced arthropods, and many orthologues of genes conserved from the bilaterian ancestor that have been lost in insects. Our analysis locates many genes in conserved macro-synteny contexts, and many small-scale examples of gene clustering. We describe several examples where S. maritima shows different solutions from insects to similar problems. The insect olfactory receptor gene family is absent from S. maritima, and olfaction in air is likely effected by expansion of other receptor gene families. For some genes S. maritima has evolved paralogues to generate coding sequence diversity, where insects use alternate splicing. This is most striking for the Dscam gene, which in Drosophila generates more than 100,000 alternate splice forms, but in S. maritima is encoded by over 100 paralogues. We see an intriguing linkage between the absence of any known photosensory proteins in a blind organism and the additional absence of canonical circadian clock genes. The phylogenetic position of myriapods allows us to identify where in arthropod phylogeny several particular molecular mechanisms and traits emerged. For example, we conclude that juvenile hormone signalling evolved with the emergence of the exoskeleton in the arthropods and that RR-1 containing cuticle proteins evolved in the lineage leading to Mandibulata. We also identify when various gene expansions and losses occurred. The genome of S. maritima offers us a unique glimpse into the ancestral arthropod genome, while also displaying many adaptations to its specific

  14. A 400,000-year-old mitochondrial genome questions phylogenetic relationships amongst archaic hominins: using the latest advances in ancient genomics, the mitochondrial genome sequence of a 400,000-year-old hominin has been deciphered.

    PubMed

    Orlando, Ludovic

    2014-06-01

    By combining state-of-the-art approaches in ancient genomics, Meyer and co-workers have reconstructed the mitochondrial sequence of an archaic hominin that lived at Sierra de Atapuerca, Spain about 400,000 years ago. This achievement follows recent advances in molecular anthropology that delivered the genome sequence of younger archaic hominins, such as Neanderthals and Denisovans. Molecular phylogenetic reconstructions placed the Atapuercan as a sister group to Denisovans, although its morphology suggested closer affinities with Neanderthals. In addition to possibly challenging our interpretation of the fossil record, this study confirms that genomic information can be recovered from extremely damaged DNA molecules, even in the presence of significant levels of human contamination. Together with the recent characterization of a 700,000-year-old horse genome, this study opens the Middle Pleistocene to genomics, thereby extending the scope of ancient DNA to the last million years. © 2014 WILEY Periodicals, Inc.

  15. Convergent evolution of adenosine aptamers spanning bacterial, human, and random sequences revealed by structure-based bioinformatics and genomic SELEX

    PubMed Central

    Vu, Michael M. K.; Jameson, Nora E.; Masuda, Stuart J.; Lin, Dana; Larralde-Ridaura, Rosa; Lupták, Andrej

    2012-01-01

    SUMMARY Aptamers are structured macromolecules in vitro evolved to bind molecular targets, whereas in nature they form the ligand-binding domains of riboswitches. Adenosine aptamers of a single structural family were isolated several times from random pools but they have not been identified in genomic sequences. We used two unbiased methods, structure-based bioinformatics and human genome-based in vitro selection, to identify aptamers that form the same adenosine-binding structure in a bacterium, and several vertebrates, including humans. Two of the human aptamers map to introns of RAB3C and FGD3 genes. The RAB3C aptamer binds ATP with dissociation constants about ten times lower than physiological ATP concentration, while the minimal FGD3 aptamer binds ATP only co-transcriptionally. PMID:23102219

  16. Evolving targeted therapies for right ventricular failure.

    PubMed

    Di Salvo, Thomas G

    2015-01-01

    Although right and left ventricular embryological origins, morphology and cardiodynamics differ, the notion of selectively targeted right ventricular therapies remains controversial. This review focuses on both the currently evolving pharmacologic agents targeting right ventricular failure (metabolic modulators, phosphodiesterase type V inhibitors) and future therapeutic approaches including epigenetic modulation by miRNAs, chromatin binding complexes, long non-coding RNAs, genomic editing, adoptive gene transfer and gene therapy, cell regeneration via cell transplantation and cell reprogramming and cardiac tissue engineering. Strategies for adult right ventricular regeneration will require a more holistic approach than strategies for adult left ventricular failure. Instances of right ventricular failure requiring global reconstitution of right ventricular myocardium, attractive approaches include: i) myocardial patches seeded with cardiac fibroblasts reprogrammed into cardiomyocytes in vivo by small molecules, miRNAs or other epigenetic modifiers; and ii) administration of miRNAs, lncRNAs or small molecules by non-viral vector delivery systems targeted to fibroblasts (e.g., episomes) to stimulate in vivo reprogramming of fibroblasts into cardiomyocytes. For selected heritable genetic myocardial diseases, genomic editing affords exciting opportunities for allele-specific silencing by site-specific directed silencing, mutagenesis or gene excision. Genomic editing by adoptive gene transfer affords similarly exciting opportunities for restoration of myocardial gene expression.

  17. Insect glycerol transporters evolved by functional co-option and gene replacement

    PubMed Central

    Finn, Roderick Nigel; Chauvigné, François; Stavang, Jon Anders; Belles, Xavier; Cerdà, Joan

    2015-01-01

    Transmembrane glycerol transport is typically facilitated by aquaglyceroporins in Prokaryota and Eukaryota. In holometabolan insects however, aquaglyceroporins are absent, yet several species possess polyol permeable aquaporins. It thus remains unknown how glycerol transport evolved in the Holometabola. By combining phylogenetic and functional studies, here we show that a more efficient form of glycerol transporter related to the water-selective channel AQP4 specifically evolved and multiplied in the insect lineage, resulting in the replacement of the ancestral branch of aquaglyceroporins in holometabolan insects. To recapitulate this evolutionary process, we generate specific mutants in distantly related insect aquaporins and human AQP4 and show that a single mutation in the selectivity filter converted a water-selective channel into a glycerol transporter at the root of the crown clade of hexapod insects. Integration of phanerozoic climate models suggests that these events were associated with the emergence of complete metamorphosis and the unparalleled radiation of insects. PMID:26183829

  18. Many gene and domain families have convergent fates following independent whole-genome duplication events in Arabidopsis, Oryza, Saccharomyces and Tetraodon.

    PubMed

    Paterson, Andrew H; Chapman, Brad A; Kissinger, Jessica C; Bowers, John E; Feltus, Frank A; Estill, James C

    2006-11-01

    Genome duplication is potentially a good source of new genes, but such genes take time to evolve. We have found a group of "duplication-resistant" genes, which have undergone convergent restoration to singleton status following several independent genome duplications. Restoration of duplication-resistant genes to singleton status could be important to long-term survival of a polyploid lineage. Angiosperms show more frequent polyploidization and a higher degree of duplicate gene preservation than other paleopolyploids, making them well-suited to further study of duplication-resistant genes.

  19. Draft genome sequence of Bacillus azotoformans MEV2011, a (Co-) denitrifying strain unable to grow with oxygen.

    PubMed

    Nielsen, Maja; Schreiber, Lars; Finster, Kai; Schramm, Andreas

    2015-01-01

    Bacillus azotoformans MEV2011, isolated from soil, is a microaerotolerant obligate denitrifier, which can also produce N2 by co-denitrification. Oxygen is consumed but not growth-supportive. The draft genome has a size of 4.7 Mb and contains key genes for both denitrification and dissimilatory nitrate reduction to ammonium.

  20. Draft genome sequence of Bacillus azotoformans MEV2011, a (Co-) denitrifying strain unable to grow with oxygen.

    PubMed

    Nielsen, Maja; Schreiber, Lars; Finster, Kai; Schramm, Andreas

    2014-01-01

    Bacillus azotoformans MEV2011, isolated from soil, is a microaerotolerant obligate denitrifier, which can also produce N2 by co-denitrification. Oxygen is consumed but not growth-supportive. The draft genome has a size of 4.7 Mb and contains key genes for both denitrification and dissimilatory nitrate reduction to ammonium.

  1. The CesA Gene Family of Barley. Quantitative Analysis of Transcripts Reveals Two Groups of Co-Expressed Genes1

    PubMed Central

    Burton, Rachel A.; Shirley, Neil J.; King, Brendon J.; Harvey, Andrew J.; Fincher, Geoffrey B.

    2004-01-01

    Sequence data from cDNA and genomic clones, coupled with analyses of expressed sequence tag databases, indicate that the CesA (cellulose synthase) gene family from barley (Hordeum vulgare) has at least eight members, which are distributed across the genome. Quantitative polymerase chain reaction has been used to determine the relative abundance of mRNA transcripts for individual HvCesA genes in vegetative and floral tissues, at different stages of development. To ensure accurate expression profiling, geometric averaging of multiple internal control gene transcripts has been applied for the normalization of transcript abundance. Total HvCesA mRNA levels are highest in coleoptiles, roots, and stems and much lower in floral tissues, early developing grain, and in the elongation zone of leaves. In most tissues, HvCesA1, HvCesA2, and HvCesA6 predominate, and their relative abundance is very similar; these genes appear to be coordinately transcribed. A second group, comprising HvCesA4, HvCesA7, and HvCesA8, also appears to be coordinately transcribed, most obviously in maturing stem and root tissues. The HvCesA3 expression pattern does not fall into either of these two groups, and HvCesA5 transcript levels are extremely low in all tissues. Thus, the HvCesA genes fall into two general groups of three genes with respect to mRNA abundance, and the co-expression of the groups identifies their products as candidates for the rosettes that are involved in cellulose biosynthesis at the plasma membrane. Phylogenetic analysis allows the two groups of genes to be linked with orthologous Arabidopsis CesA genes that have been implicated in primary and secondary wall synthesis. PMID:14701917

  2. Co-Option and De Novo Gene Evolution Underlie Molluscan Shell Diversity

    PubMed Central

    Aguilera, Felipe; McDougall, Carmel

    2017-01-01

    Abstract Molluscs fabricate shells of incredible diversity and complexity by localized secretions from the dorsal epithelium of the mantle. Although distantly related molluscs express remarkably different secreted gene products, it remains unclear if the evolution of shell structure and pattern is underpinned by the differential co-option of conserved genes or the integration of lineage-specific genes into the mantle regulatory program. To address this, we compare the mantle transcriptomes of 11 bivalves and gastropods of varying relatedness. We find that each species, including four Pinctada (pearl oyster) species that diverged within the last 20 Ma, expresses a unique mantle secretome. Lineage- or species-specific genes comprise a large proportion of each species’ mantle secretome. A majority of these secreted proteins have unique domain architectures that include repetitive, low complexity domains (RLCDs), which evolve rapidly, and have a proclivity to expand, contract and rearrange in the genome. There are also a large number of secretome genes expressed in the mantle that arose before the origin of gastropods and bivalves. Each species expresses a unique set of these more ancient genes consistent with their independent co-option into these mantle gene regulatory networks. From this analysis, we infer lineage-specific secretomes underlie shell diversity, and include both rapidly evolving RLCD-containing proteins, and the continual recruitment and loss of both ancient and recently evolved genes into the periphery of the regulatory network controlling gene expression in the mantle epithelium. PMID:28053006

  3. The Evolution of Genome Structure by Natural and Sexual Selection.

    PubMed

    Kirkpatrick, Mark

    2017-01-01

    Progress on understanding how genome structure evolves is accelerating with the arrival of new genomic, comparative, and theoretical approaches. This article reviews progress in understanding how chromosome inversions and sex chromosomes evolve, and how their evolution affects species' ecology. Analyses of clines in inversion frequencies in flies and mosquitoes imply strong local adaptation, and roles for both over- and under dominant selection. Those results are consistent with the hypothesis that inversions become established when they capture locally adapted alleles. Inversions can carry alleles that are beneficial to closely related species, causing them to introgress following hybridization. Models show that this "adaptive cassette" scenario can trigger large range expansions, as recently happened in malaria mosquitoes. Sex chromosomes are the most rapidly evolving genome regions of some taxa. Sexually antagonistic selection may be the key force driving transitions of sex determination between different pairs of chromosomes and between XY and ZW systems. Fusions between sex-chromosomes and autosomes most often involve the Y chromosome, a pattern that can be explained if fusions are mildly deleterious and fix by drift. Sexually antagonistic selection is one of several hypotheses to explain the recent discovery that the sex determination system has strong effects on the adult sex ratios of tetrapods. The emerging view of how genome structure evolves invokes a much richer constellation of forces than was envisioned during the Golden Age of research on Drosophila karyotypes. © The American Genetic Association 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  4. Genome-scale reconstruction of the metabolic network in Yersinia pestis CO92

    NASA Astrophysics Data System (ADS)

    Navid, Ali; Almaas, Eivind

    2007-03-01

    The gram-negative bacterium Yersinia pestis is the causative agent of bubonic plague. Using publicly available genomic, biochemical and physiological data, we have developed a constraint-based flux balance model of metabolism in the CO92 strain (biovar Orientalis) of this organism. The metabolic reactions were appropriately compartmentalized, and the model accounts for the exchange of metabolites, as well as the import of nutrients and export of waste products. We have characterized the metabolic capabilities and phenotypes of this organism, after comparing the model predictions with available experimental observations to evaluate accuracy and completeness. We have also begun preliminary studies into how cellular metabolism affects virulence.

  5. A complex ligase ribozyme evolved in vitro from a group I ribozyme domain

    NASA Technical Reports Server (NTRS)

    Jaeger, L.; Wright, M. C.; Joyce, G. F.; Bada, J. L. (Principal Investigator)

    1999-01-01

    Like most proteins, complex RNA molecules often are modular objects made up of distinct structural and functional domains. The component domains of a protein can associate in alternative combinations to form molecules with different functions. These observations raise the possibility that complex RNAs also can be assembled from preexisting structural and functional domains. To test this hypothesis, an in vitro evolution procedure was used to isolate a previously undescribed class of complex ligase ribozymes, starting from a pool of 10(16) different RNA molecules that contained a constant region derived from a large structural domain that occurs within self-splicing group I ribozymes. Attached to this constant region were three hypervariable regions, totaling 85 nucleotides, that gave rise to the catalytic motif within the evolved catalysts. The ligase ribozymes catalyze formation of a 3',5'-phosphodiester linkage between adjacent template-bound oligonucleotides, one bearing a 3' hydroxyl and the other a 5' triphosphate. Ligation occurs in the context of a Watson-Crick duplex, with a catalytic rate of 0.26 min(-1) under optimal conditions. The constant region is essential for catalytic activity and appears to retain the tertiary structure of the group I ribozyme. This work demonstrates that complex RNA molecules, like their protein counterparts, can share common structural domains while exhibiting distinct catalytic functions.

  6. Positive Selection Driving Cytoplasmic Genome Evolution of the Medicinally Important Ginseng Plant Genus Panax.

    PubMed

    Jiang, Peng; Shi, Feng-Xue; Li, Ming-Rui; Liu, Bao; Wen, Jun; Xiao, Hong-Xing; Li, Lin-Feng

    2018-01-01

    Panax L. (the ginseng genus) is a shade-demanding group within the family Araliaceae and all of its species are of crucial significance in traditional Chinese medicine. Phylogenetic and biogeographic analyses demonstrated that two rounds of whole genome duplications accompanying with geographic and ecological isolations promoted the diversification of Panax species. However, contributions of the cytoplasmic genomes to the adaptive evolution of Panax species remained largely uninvestigated. In this study, we sequenced the chloroplast and mitochondrial genomes of 11 accessions belonging to seven Panax species. Our results show that heterogeneity in nucleotide substitution rate is abundant in both of the two cytoplasmic genomes, with the mitochondrial genome possessing more variants at the total level but the chloroplast showing higher sequence polymorphisms at the genic regions. Genome-wide scanning of positive selection identified five and 12 genes from the chloroplast and mitochondrial genomes, respectively. Functional analyses further revealed that these selected genes play important roles in plant development, cellular metabolism and adaptation. We therefore conclude that positive selection might be one of the potential evolutionary forces that shaped nucleotide variation pattern of these Panax species. In particular, the mitochondrial genes evolved under stronger selective pressure compared to the chloroplast genes.

  7. Draft genome sequence of Bacillus azotoformans MEV2011, a (Co-) denitrifying strain unable to grow with oxygen

    PubMed Central

    2014-01-01

    Bacillus azotoformans MEV2011, isolated from soil, is a microaerotolerant obligate denitrifier, which can also produce N2 by co-denitrification. Oxygen is consumed but not growth-supportive. The draft genome has a size of 4.7 Mb and contains key genes for both denitrification and dissimilatory nitrate reduction to ammonium. PMID:25685261

  8. Organisation of the plant genome in chromosomes.

    PubMed

    Heslop-Harrison, J S Pat; Schwarzacher, Trude

    2011-04-01

    The plant genome is organized into chromosomes that provide the structure for the genetic linkage groups and allow faithful replication, transcription and transmission of the hereditary information. Genome sizes in plants are remarkably diverse, with a 2350-fold range from 63 to 149,000 Mb, divided into n=2 to n= approximately 600 chromosomes. Despite this huge range, structural features of chromosomes like centromeres, telomeres and chromatin packaging are well-conserved. The smallest genomes consist of mostly coding and regulatory DNA sequences present in low copy, along with highly repeated rDNA (rRNA genes and intergenic spacers), centromeric and telomeric repetitive DNA and some transposable elements. The larger genomes have similar numbers of genes, with abundant tandemly repeated sequence motifs, and transposable elements alone represent more than half the DNA present. Chromosomes evolve by fission, fusion, duplication and insertion events, allowing evolution of chromosome size and chromosome number. A combination of sequence analysis, genetic mapping and molecular cytogenetic methods with comparative analysis, all only becoming widely available in the 21st century, is elucidating the exact nature of the chromosome evolution events at all timescales, from the base of the plant kingdom, to intraspecific or hybridization events associated with recent plant breeding. As well as being of fundamental interest, understanding and exploiting evolutionary mechanisms in plant genomes is likely to be a key to crop development for food production. © 2011 The Authors. The Plant Journal © 2011 Blackwell Publishing Ltd.

  9. The western painted turtle genome, a model for the evolution of extreme physiological adaptations in a slowly evolving lineage

    PubMed Central

    2013-01-01

    Background We describe the genome of the western painted turtle, Chrysemys picta bellii, one of the most widespread, abundant, and well-studied turtles. We place the genome into a comparative evolutionary context, and focus on genomic features associated with tooth loss, immune function, longevity, sex differentiation and determination, and the species' physiological capacities to withstand extreme anoxia and tissue freezing. Results Our phylogenetic analyses confirm that turtles are the sister group to living archosaurs, and demonstrate an extraordinarily slow rate of sequence evolution in the painted turtle. The ability of the painted turtle to withstand complete anoxia and partial freezing appears to be associated with common vertebrate gene networks, and we identify candidate genes for future functional analyses. Tooth loss shares a common pattern of pseudogenization and degradation of tooth-specific genes with birds, although the rate of accumulation of mutations is much slower in the painted turtle. Genes associated with sex differentiation generally reflect phylogeny rather than convergence in sex determination functionality. Among gene families that demonstrate exceptional expansions or show signatures of strong natural selection, immune function and musculoskeletal patterning genes are consistently over-represented. Conclusions Our comparative genomic analyses indicate that common vertebrate regulatory networks, some of which have analogs in human diseases, are often involved in the western painted turtle's extraordinary physiological capacities. As these regulatory pathways are analyzed at the functional level, the painted turtle may offer important insights into the management of a number of human health disorders. PMID:23537068

  10. Equine Clinical Genomics: A Clinician’s Primer

    PubMed Central

    Brosnahan, Margaret Mary; Brooks, Samantha A.; Antczak, Douglas F.

    2012-01-01

    Summary The objective of this review is to introduce equine clinicians to the rapidly evolving field of clinical genomics with a vision of improving the health and welfare of the domestic horse. For fifteen years a consortium of veterinary geneticists and clinicians has worked together under the umbrella of The Horse Genome Project. This group, encompassing 22 laboratories in 12 countries, has made rapid progress, developing several iterations of linkage, physical and comparative gene maps of the horse with increasing levels of detail. In early 2006, the research was greatly facilitated when the U.S. National Human Genome Research Institute of the National Institutes of Health added the horse to the list of mammalian species scheduled for whole genome sequencing. The genome of the domestic horse has now been sequenced and is available to researchers worldwide in publicly accessible databases. This achievement creates the potential for transformative change within the horse industry, particularly in the fields of internal medicine, sports medicine and reproduction. The genome sequence has enabled the development of new genome-wide tools and resources for studying inherited diseases of the horse. To date, researchers have identified eleven mutations causing ten clinical syndromes in the horse. Testing is commercially available for all but one of these diseases. Future research will probably identify the genetic bases for other equine diseases, produce new diagnostic tests and generate novel therapeutics for some of these conditions. This will enable equine clinicians to play a critical role in ensuring the thoughtful and appropriate application of this knowledge as they assist clients with breeding and clinical decision-making. PMID:20840582

  11. High-Throughput Sequencing of Six Bamboo Chloroplast Genomes: Phylogenetic Implications for Temperate Woody Bamboos (Poaceae: Bambusoideae)

    PubMed Central

    Li, De-Zhu

    2011-01-01

    Background Bambusoideae is the only subfamily that contains woody members in the grass family, Poaceae. In phylogenetic analyses, Bambusoideae, Pooideae and Ehrhartoideae formed the BEP clade, yet the internal relationships of this clade are controversial. The distinctive life history (infrequent flowering and predominance of asexual reproduction) of woody bamboos makes them an interesting but taxonomically difficult group. Phylogenetic analyses based on large DNA fragments could only provide a moderate resolution of woody bamboo relationships, although a robust phylogenetic tree is needed to elucidate their evolutionary history. Phylogenomics is an alternative choice for resolving difficult phylogenies. Methodology/Principal Findings Here we present the complete nucleotide sequences of six woody bamboo chloroplast (cp) genomes using Illumina sequencing. These genomes are similar to those of other grasses and rather conservative in evolution. We constructed a phylogeny of Poaceae from 24 complete cp genomes including 21 grass species. Within the BEP clade, we found strong support for a sister relationship between Bambusoideae and Pooideae. In a substantial improvement over prior studies, all six nodes within Bambusoideae were supported with ≥0.95 posterior probability from Bayesian inference and 5/6 nodes resolved with 100% bootstrap support in maximum parsimony and maximum likelihood analyses. We found that repeats in the cp genome could provide phylogenetic information, while caution is needed when using indels in phylogenetic analyses based on few selected genes. We also identified relatively rapidly evolving cp genome regions that have the potential to be used for further phylogenetic study in Bambusoideae. Conclusions/Significance The cp genome of Bambusoideae evolved slowly, and phylogenomics based on whole cp genome could be used to resolve major relationships within the subfamily. The difficulty in resolving the diversification among three clades of

  12. Experimental Evaluation of Unicast and Multicast CoAP Group Communication

    PubMed Central

    Ishaq, Isam; Hoebeke, Jeroen; Moerman, Ingrid; Demeester, Piet

    2016-01-01

    The Internet of Things (IoT) is expanding rapidly to new domains in which embedded devices play a key role and gradually outnumber traditionally-connected devices. These devices are often constrained in their resources and are thus unable to run standard Internet protocols. The Constrained Application Protocol (CoAP) is a new alternative standard protocol that implements the same principals as the Hypertext Transfer Protocol (HTTP), but is tailored towards constrained devices. In many IoT application domains, devices need to be addressed in groups in addition to being addressable individually. Two main approaches are currently being proposed in the IoT community for CoAP-based group communication. The main difference between the two approaches lies in the underlying communication type: multicast versus unicast. In this article, we experimentally evaluate those two approaches using two wireless sensor testbeds and under different test conditions. We highlight the pros and cons of each of them and propose combining these approaches in a hybrid solution to better suit certain use case requirements. Additionally, we provide a solution for multicast-based group membership management using CoAP. PMID:27455262

  13. The genomes and comparative genomics of Lactobacillus delbrueckii phages.

    PubMed

    Riipinen, Katja-Anneli; Forsman, Päivi; Alatossava, Tapani

    2011-07-01

    Lactobacillus delbrueckii phages are a great source of genetic diversity. Here, the genome sequences of Lb. delbrueckii phages LL-Ku, c5 and JCL1032 were analyzed in detail, and the genetic diversity of Lb. delbrueckii phages belonging to different taxonomic groups was explored. The lytic isometric group b phages LL-Ku (31,080 bp) and c5 (31,841 bp) showed a minimum nucleotide sequence identity of 90% over about three-fourths of their genomes. The genomic locations of their lysis modules were unique, and the genomes featured several putative overlapping transcription units of genes. LL-Ku and c5 virions displayed peptidoglycan hydrolytic activity associated with a ~36-kDa protein similar in size to the endolysin. Unexpectedly, the 49,433-bp genome of the prolate phage JCL1032 (temperate, group c) revealed a conserved gene order within its structural genes. Lb. delbrueckii phages representing groups a (a phage LL-H), b and c possessed only limited protein sequence homology. Genomic comparison of LL-Ku and c5 suggested that diversification of Lb. delbrueckii phages is mainly due to insertions, deletions and recombination. For the first time, the complete genome sequences of group b and c Lb. delbrueckii phages are reported.

  14. A model species for agricultural pest genomics: the genome of the Colorado potato beetle, Leptinotarsa decemlineata (Coleoptera: Chrysomelidae)

    DOE PAGES

    Schoville, Sean D.; Chen, Yolanda H.; Andersson, Martin N.; ...

    2018-01-31

    The Colorado potato beetle is one of the most challenging agricultural pests to manage. It has shown a spectacular ability to adapt to a variety of solanaceaeous plants and variable climates during its global invasion, and, notably, to rapidly evolve insecticide resistance. To examine evidence of rapid evolutionary change, and to understand the genetic basis of herbivory and insecticide resistance, we tested for structural and functional genomic changes relative to other arthropod species using genome sequencing, transcriptomics, and community annotation. Two factors that might facilitate rapid evolutionary change include transposable elements, which comprise at least 17% of the genome andmore » are rapidly evolving compared to other Coleoptera, and high levels of nucleotide diversity in rapidly growing pest populations. Adaptations to plant feeding are evident in gene expansions and differential expression of digestive enzymes in gut tissues, as well as expansions of gustatory receptors for bitter tasting. Surprisingly, the suite of genes involved in insecticide resistance is similar to other beetles. Finally, duplications in the RNAi pathway might explain why Leptinotarsa decemlineata has high sensitivity to dsRNA. In conclusion, the L. decemlineata genome provides opportunities to investigate a broad range of phenotypes and to develop sustainable methods to control this widely successful pest.« less

  15. A model species for agricultural pest genomics: the genome of the Colorado potato beetle, Leptinotarsa decemlineata (Coleoptera: Chrysomelidae)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Schoville, Sean D.; Chen, Yolanda H.; Andersson, Martin N.

    The Colorado potato beetle is one of the most challenging agricultural pests to manage. It has shown a spectacular ability to adapt to a variety of solanaceaeous plants and variable climates during its global invasion, and, notably, to rapidly evolve insecticide resistance. To examine evidence of rapid evolutionary change, and to understand the genetic basis of herbivory and insecticide resistance, we tested for structural and functional genomic changes relative to other arthropod species using genome sequencing, transcriptomics, and community annotation. Two factors that might facilitate rapid evolutionary change include transposable elements, which comprise at least 17% of the genome andmore » are rapidly evolving compared to other Coleoptera, and high levels of nucleotide diversity in rapidly growing pest populations. Adaptations to plant feeding are evident in gene expansions and differential expression of digestive enzymes in gut tissues, as well as expansions of gustatory receptors for bitter tasting. Surprisingly, the suite of genes involved in insecticide resistance is similar to other beetles. Finally, duplications in the RNAi pathway might explain why Leptinotarsa decemlineata has high sensitivity to dsRNA. In conclusion, the L. decemlineata genome provides opportunities to investigate a broad range of phenotypes and to develop sustainable methods to control this widely successful pest.« less

  16. A model species for agricultural pest genomics: the genome of the Colorado potato beetle, Leptinotarsa decemlineata (Coleoptera: Chrysomelidae).

    PubMed

    Schoville, Sean D; Chen, Yolanda H; Andersson, Martin N; Benoit, Joshua B; Bhandari, Anita; Bowsher, Julia H; Brevik, Kristian; Cappelle, Kaat; Chen, Mei-Ju M; Childers, Anna K; Childers, Christopher; Christiaens, Olivier; Clements, Justin; Didion, Elise M; Elpidina, Elena N; Engsontia, Patamarerk; Friedrich, Markus; García-Robles, Inmaculada; Gibbs, Richard A; Goswami, Chandan; Grapputo, Alessandro; Gruden, Kristina; Grynberg, Marcin; Henrissat, Bernard; Jennings, Emily C; Jones, Jeffery W; Kalsi, Megha; Khan, Sher A; Kumar, Abhishek; Li, Fei; Lombard, Vincent; Ma, Xingzhou; Martynov, Alexander; Miller, Nicholas J; Mitchell, Robert F; Munoz-Torres, Monica; Muszewska, Anna; Oppert, Brenda; Palli, Subba Reddy; Panfilio, Kristen A; Pauchet, Yannick; Perkin, Lindsey C; Petek, Marko; Poelchau, Monica F; Record, Éric; Rinehart, Joseph P; Robertson, Hugh M; Rosendale, Andrew J; Ruiz-Arroyo, Victor M; Smagghe, Guy; Szendrei, Zsofia; Thomas, Gregg W C; Torson, Alex S; Vargas Jentzsch, Iris M; Weirauch, Matthew T; Yates, Ashley D; Yocum, George D; Yoon, June-Sun; Richards, Stephen

    2018-01-31

    The Colorado potato beetle is one of the most challenging agricultural pests to manage. It has shown a spectacular ability to adapt to a variety of solanaceaeous plants and variable climates during its global invasion, and, notably, to rapidly evolve insecticide resistance. To examine evidence of rapid evolutionary change, and to understand the genetic basis of herbivory and insecticide resistance, we tested for structural and functional genomic changes relative to other arthropod species using genome sequencing, transcriptomics, and community annotation. Two factors that might facilitate rapid evolutionary change include transposable elements, which comprise at least 17% of the genome and are rapidly evolving compared to other Coleoptera, and high levels of nucleotide diversity in rapidly growing pest populations. Adaptations to plant feeding are evident in gene expansions and differential expression of digestive enzymes in gut tissues, as well as expansions of gustatory receptors for bitter tasting. Surprisingly, the suite of genes involved in insecticide resistance is similar to other beetles. Finally, duplications in the RNAi pathway might explain why Leptinotarsa decemlineata has high sensitivity to dsRNA. The L. decemlineata genome provides opportunities to investigate a broad range of phenotypes and to develop sustainable methods to control this widely successful pest.

  17. Insights from the Genome Sequence of Acidovorax citrulli M6, a Group I Strain of the Causal Agent of Bacterial Fruit Blotch of Cucurbits.

    PubMed

    Eckshtain-Levi, Noam; Shkedy, Dafna; Gershovits, Michael; Da Silva, Gustavo M; Tamir-Ariel, Dafna; Walcott, Ron; Pupko, Tal; Burdman, Saul

    2016-01-01

    Acidovorax citrulli is a seedborne bacterium that causes bacterial fruit blotch of cucurbit plants including watermelon and melon. A. citrulli strains can be divided into two major groups based on DNA fingerprint analyses and biochemical properties. Group I strains have been generally isolated from non-watermelon cucurbits, while group II strains are closely associated with watermelon. In the present study, we report the genome sequence of M6, a group I model A. citrulli strain, isolated from melon. We used comparative genome analysis to investigate differences between the genome of strain M6 and the genome of the group II model strain AAC00-1. The draft genome sequence of A. citrulli M6 harbors 139 contigs, with an overall approximate size of 4.85 Mb. The genome of M6 is ∼500 Kb shorter than that of strain AAC00-1. Comparative analysis revealed that this size difference is mainly explained by eight fragments, ranging from ∼35-120 Kb and distributed throughout the AAC00-1 genome, which are absent in the M6 genome. In agreement with this finding, while AAC00-1 was found to possess 532 open reading frames (ORFs) that are absent in strain M6, only 123 ORFs in M6 were absent in AAC00-1. Most of these M6 ORFs are hypothetical proteins and most of them were also detected in two group I strains that were recently sequenced, tw6 and pslb65. Further analyses by PCR assays and coverage analyses with other A. citrulli strains support the notion that some of these fragments or significant portions of them are discriminative between groups I and II strains of A. citrulli. Moreover, GC content, effective number of codon values and cluster of orthologs' analyses indicate that these fragments were introduced into group II strains by horizontal gene transfer events. Our study reports the genome sequence of a model group I strain of A. citrulli, one of the most important pathogens of cucurbits. It also provides the first comprehensive comparison at the genomic level between the

  18. Insights from the Genome Sequence of Acidovorax citrulli M6, a Group I Strain of the Causal Agent of Bacterial Fruit Blotch of Cucurbits

    PubMed Central

    Eckshtain-Levi, Noam; Shkedy, Dafna; Gershovits, Michael; Da Silva, Gustavo M.; Tamir-Ariel, Dafna; Walcott, Ron; Pupko, Tal; Burdman, Saul

    2016-01-01

    Acidovorax citrulli is a seedborne bacterium that causes bacterial fruit blotch of cucurbit plants including watermelon and melon. A. citrulli strains can be divided into two major groups based on DNA fingerprint analyses and biochemical properties. Group I strains have been generally isolated from non-watermelon cucurbits, while group II strains are closely associated with watermelon. In the present study, we report the genome sequence of M6, a group I model A. citrulli strain, isolated from melon. We used comparative genome analysis to investigate differences between the genome of strain M6 and the genome of the group II model strain AAC00-1. The draft genome sequence of A. citrulli M6 harbors 139 contigs, with an overall approximate size of 4.85 Mb. The genome of M6 is ∼500 Kb shorter than that of strain AAC00-1. Comparative analysis revealed that this size difference is mainly explained by eight fragments, ranging from ∼35–120 Kb and distributed throughout the AAC00-1 genome, which are absent in the M6 genome. In agreement with this finding, while AAC00-1 was found to possess 532 open reading frames (ORFs) that are absent in strain M6, only 123 ORFs in M6 were absent in AAC00-1. Most of these M6 ORFs are hypothetical proteins and most of them were also detected in two group I strains that were recently sequenced, tw6 and pslb65. Further analyses by PCR assays and coverage analyses with other A. citrulli strains support the notion that some of these fragments or significant portions of them are discriminative between groups I and II strains of A. citrulli. Moreover, GC content, effective number of codon values and cluster of orthologs’ analyses indicate that these fragments were introduced into group II strains by horizontal gene transfer events. Our study reports the genome sequence of a model group I strain of A. citrulli, one of the most important pathogens of cucurbits. It also provides the first comprehensive comparison at the genomic level between

  19. Towards Evolving Electronic Circuits for Autonomous Space Applications

    NASA Technical Reports Server (NTRS)

    Lohn, Jason D.; Haith, Gary L.; Colombano, Silvano P.; Stassinopoulos, Dimitris

    2000-01-01

    The relatively new field of Evolvable Hardware studies how simulated evolution can reconfigure, adapt, and design hardware structures in an automated manner. Space applications, especially those requiring autonomy, are potential beneficiaries of evolvable hardware. For example, robotic drilling from a mobile platform requires high-bandwidth controller circuits that are difficult to design. In this paper, we present automated design techniques based on evolutionary search that could potentially be used in such applications. First, we present a method of automatically generating analog circuit designs using evolutionary search and a circuit construction language. Our system allows circuit size (number of devices), circuit topology, and device values to be evolved. Using a parallel genetic algorithm, we present experimental results for five design tasks. Second, we investigate the use of coevolution in automated circuit design. We examine fitness evaluation by comparing the effectiveness of four fitness schedules. The results indicate that solution quality is highest with static and co-evolving fitness schedules as compared to the other two dynamic schedules. We discuss these results and offer two possible explanations for the observed behavior: retention of useful information, and alignment of problem difficulty with circuit proficiency.

  20. Co-Inheritance Analysis within the Domains of Life Substantially Improves Network Inference by Phylogenetic Profiling

    PubMed Central

    Shin, Junha; Lee, Insuk

    2015-01-01

    Phylogenetic profiling, a network inference method based on gene inheritance profiles, has been widely used to construct functional gene networks in microbes. However, its utility for network inference in higher eukaryotes has been limited. An improved algorithm with an in-depth understanding of pathway evolution may overcome this limitation. In this study, we investigated the effects of taxonomic structures on co-inheritance analysis using 2,144 reference species in four query species: Escherichia coli, Saccharomyces cerevisiae, Arabidopsis thaliana, and Homo sapiens. We observed three clusters of reference species based on a principal component analysis of the phylogenetic profiles, which correspond to the three domains of life—Archaea, Bacteria, and Eukaryota—suggesting that pathways inherit primarily within specific domains or lower-ranked taxonomic groups during speciation. Hence, the co-inheritance pattern within a taxonomic group may be eroded by confounding inheritance patterns from irrelevant taxonomic groups. We demonstrated that co-inheritance analysis within domains substantially improved network inference not only in microbe species but also in the higher eukaryotes, including humans. Although we observed two sub-domain clusters of reference species within Eukaryota, co-inheritance analysis within these sub-domain taxonomic groups only marginally improved network inference. Therefore, we conclude that co-inheritance analysis within domains is the optimal approach to network inference with the given reference species. The construction of a series of human gene networks with increasing sample sizes of the reference species for each domain revealed that the size of the high-accuracy networks increased as additional reference species genomes were included, suggesting that within-domain co-inheritance analysis will continue to expand human gene networks as genomes of additional species are sequenced. Taken together, we propose that co

  1. The resurrection genome of Boea hygrometrica: A blueprint for survival of dehydration.

    PubMed

    Xiao, Lihong; Yang, Ge; Zhang, Liechi; Yang, Xinhua; Zhao, Shuang; Ji, Zhongzhong; Zhou, Qing; Hu, Min; Wang, Yu; Chen, Ming; Xu, Yu; Jin, Haijing; Xiao, Xuan; Hu, Guipeng; Bao, Fang; Hu, Yong; Wan, Ping; Li, Legong; Deng, Xin; Kuang, Tingyun; Xiang, Chengbin; Zhu, Jian-Kang; Oliver, Melvin J; He, Yikun

    2015-05-05

    "Drying without dying" is an essential trait in land plant evolution. Unraveling how a unique group of angiosperms, the Resurrection Plants, survive desiccation of their leaves and roots has been hampered by the lack of a foundational genome perspective. Here we report the ∼1,691-Mb sequenced genome of Boea hygrometrica, an important resurrection plant model. The sequence revealed evidence for two historical genome-wide duplication events, a compliment of 49,374 protein-coding genes, 29.15% of which are unique (orphan) to Boea and 20% of which (9,888) significantly respond to desiccation at the transcript level. Expansion of early light-inducible protein (ELIP) and 5S rRNA genes highlights the importance of the protection of the photosynthetic apparatus during drying and the rapid resumption of protein synthesis in the resurrection capability of Boea. Transcriptome analysis reveals extensive alternative splicing of transcripts and a focus on cellular protection strategies. The lack of desiccation tolerance-specific genome organizational features suggests the resurrection phenotype evolved mainly by an alteration in the control of dehydration response genes.

  2. Genomic standards consortium projects.

    PubMed

    Field, Dawn; Sterk, Peter; Kottmann, Renzo; De Smet, J Wim; Amaral-Zettler, Linda; Cochrane, Guy; Cole, James R; Davies, Neil; Dawyndt, Peter; Garrity, George M; Gilbert, Jack A; Glöckner, Frank Oliver; Hirschman, Lynette; Klenk, Hans-Peter; Knight, Rob; Kyrpides, Nikos; Meyer, Folker; Karsch-Mizrachi, Ilene; Morrison, Norman; Robbins, Robert; San Gil, Inigo; Sansone, Susanna; Schriml, Lynn; Tatusova, Tatiana; Ussery, Dave; Yilmaz, Pelin; White, Owen; Wooley, John; Caporaso, Gregory

    2014-06-15

    The Genomic Standards Consortium (GSC) is an open-membership community that was founded in 2005 to work towards the development, implementation and harmonization of standards in the field of genomics. Starting with the defined task of establishing a minimal set of descriptions the GSC has evolved into an active standards-setting body that currently has 18 ongoing projects, with additional projects regularly proposed from within and outside the GSC. Here we describe our recently enacted policy for proposing new activities that are intended to be taken on by the GSC, along with the template for proposing such new activities.

  3. Yeast “Make-Accumulate-Consume” Life Strategy Evolved as a Multi-Step Process That Predates the Whole Genome Duplication

    PubMed Central

    Hagman, Arne; Säll, Torbjörn; Compagno, Concetta; Piskur, Jure

    2013-01-01

    When fruits ripen, microbial communities start a fierce competition for the freely available fruit sugars. Three yeast lineages, including baker’s yeast Saccharomyces cerevisiae, have independently developed the metabolic activity to convert simple sugars into ethanol even under fully aerobic conditions. This fermentation capacity, named Crabtree effect, reduces the cell-biomass production but provides in nature a tool to out-compete other microorganisms. Here, we analyzed over forty Saccharomycetaceae yeasts, covering over 200 million years of the evolutionary history, for their carbon metabolism. The experiments were done under strictly controlled and uniform conditions, which has not been done before. We show that the origin of Crabtree effect in Saccharomycetaceae predates the whole genome duplication and became a settled metabolic trait after the split of the S. cerevisiae and Kluyveromyces lineages, and coincided with the origin of modern fruit bearing plants. Our results suggest that ethanol fermentation evolved progressively, involving several successive molecular events that have gradually remodeled the yeast carbon metabolism. While some of the final evolutionary events, like gene duplications of glucose transporters and glycolytic enzymes, have been deduced, the earliest molecular events initiating Crabtree effect are still to be determined. PMID:23869229

  4. Microbial genomic taxonomy

    PubMed Central

    2013-01-01

    A need for a genomic species definition is emerging from several independent studies worldwide. In this commentary paper, we discuss recent studies on the genomic taxonomy of diverse microbial groups and a unified species definition based on genomics. Accordingly, strains from the same microbial species share >95% Average Amino Acid Identity (AAI) and Average Nucleotide Identity (ANI), >95% identity based on multiple alignment genes, <10 in Karlin genomic signature, and > 70% in silico Genome-to-Genome Hybridization similarity (GGDH). Species of the same genus will form monophyletic groups on the basis of 16S rRNA gene sequences, Multilocus Sequence Analysis (MLSA) and supertree analysis. In addition to the established requirements for species descriptions, we propose that new taxa descriptions should also include at least a draft genome sequence of the type strain in order to obtain a clear outlook on the genomic landscape of the novel microbe. The application of the new genomic species definition put forward here will allow researchers to use genome sequences to define simultaneously coherent phenotypic and genomic groups. PMID:24365132

  5. Whole-Genome Duplication and the Functional Diversification of Teleost Fish Hemoglobins

    PubMed Central

    Opazo, Juan C.; Butts, G. Tyler; Nery, Mariana F.; Storz, Jay F.; Hoffmann, Federico G.

    2013-01-01

    Subsequent to the two rounds of whole-genome duplication that occurred in the common ancestor of vertebrates, a third genome duplication occurred in the stem lineage of teleost fishes. This teleost-specific genome duplication (TGD) is thought to have provided genetic raw materials for the physiological, morphological, and behavioral diversification of this highly speciose group. The extreme physiological versatility of teleost fish is manifest in their diversity of blood–gas transport traits, which reflects the myriad solutions that have evolved to maintain tissue O2 delivery in the face of changing metabolic demands and environmental O2 availability during different ontogenetic stages. During the course of development, regulatory changes in blood–O2 transport are mediated by the expression of multiple, functionally distinct hemoglobin (Hb) isoforms that meet the particular O2-transport challenges encountered by the developing embryo or fetus (in viviparous or oviparous species) and in free-swimming larvae and adults. The main objective of the present study was to assess the relative contributions of whole-genome duplication, large-scale segmental duplication, and small-scale gene duplication in producing the extraordinary functional diversity of teleost Hbs. To accomplish this, we integrated phylogenetic reconstructions with analyses of conserved synteny to characterize the genomic organization and evolutionary history of the globin gene clusters of teleosts. These results were then integrated with available experimental data on functional properties and developmental patterns of stage-specific gene expression. Our results indicate that multiple α- and β-globin genes were present in the common ancestor of gars (order Lepisoteiformes) and teleosts. The comparative genomic analysis revealed that teleosts possess a dual set of TGD-derived globin gene clusters, each of which has undergone lineage-specific changes in gene content via repeated duplication and

  6. Comparative genomics and evolution of eukaryotic phospholipidbiosynthesis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lykidis, Athanasios

    2006-12-01

    Phospholipid biosynthetic enzymes produce diverse molecular structures and are often present in multiple forms encoded by different genes. This work utilizes comparative genomics and phylogenetics for exploring the distribution, structure and evolution of phospholipid biosynthetic genes and pathways in 26 eukaryotic genomes. Although the basic structure of the pathways was formed early in eukaryotic evolution, the emerging picture indicates that individual enzyme families followed unique evolutionary courses. For example, choline and ethanolamine kinases and cytidylyltransferases emerged in ancestral eukaryotes, whereas, multiple forms of the corresponding phosphatidyltransferases evolved mainly in a lineage specific manner. Furthermore, several unicellular eukaryotes maintain bacterial-type enzymesmore » and reactions for the synthesis of phosphatidylglycerol and cardiolipin. Also, base-exchange phosphatidylserine synthases are widespread and ancestral enzymes. The multiplicity of phospholipid biosynthetic enzymes has been largely generated by gene expansion in a lineage specific manner. Thus, these observations suggest that phospholipid biosynthesis has been an actively evolving system. Finally, comparative genomic analysis indicates the existence of novel phosphatidyltransferases and provides a candidate for the uncharacterized eukaryotic phosphatidylglycerol phosphate phosphatase.« less

  7. Standards of Practice: Applying Genetics and Genomics Resources to Oncology
.

    PubMed

    Kerber, Alice S; Ledbetter, Nancy J

    2017-04-01

    Knowledge about genetics and genomics and its application to oncology care is rapidly expanding and evolving. As a result, oncology nurses at all levels must develop and maintain their knowledge of genetics and genomics, as well as be aware of resources to guide practice. This article focuses on implementation of the standards described in the updated Genetics/Genomics Nursing: Scope and Standards of Practice by the basic practitioner.
.

  8. Mutation as a Stress Response and the Regulation of Evolvability

    PubMed Central

    Galhardo, Rodrigo S.; Hastings, P. J.; Rosenberg, Susan M.

    2010-01-01

    Our concept of a stable genome is evolving to one in which genomes are plastic and responsive to environmental changes. Growing evidence shows that a variety of environmental stresses induce genomic instability in bacteria, yeast, and human cancer cells, generating occasional fitter mutants and potentially accelerating adaptive evolution. The emerging molecular mechanisms of stress-induced mutagenesis vary but share telling common components that underscore two common themes. The first is the regulation of mutagenesis in time by cellular stress responses, which promote random mutations specifically when cells are poorly adapted to their environments, i.e., when they are stressed. A second theme is the possible restriction of random mutagenesis in genomic space, achieved via coupling of mutation-generating machinery to local events such as DNA-break repair or transcription. Such localization may minimize accumulation of deleterious mutations in the genomes of rare fitter mutants, and promote local concerted evolution. Although mutagenesis induced by stresses other than direct damage to DNA was previously controversial, evidence for the existence of various stress-induced mutagenesis programs is now overwhelming and widespread. Such mechanisms probably fuel evolution of microbial pathogenesis and antibiotic-resistance, and tumor progression and chemotherapy resistance, all of which occur under stress, driven by mutations. The emerging commonalities in stress-induced-mutation mechanisms provide hope for new therapeutic interventions for all of these processes. PMID:17917874

  9. The pineapple genome and the evolution of CAM photosynthesis

    PubMed Central

    Ming, Ray; VanBuren, Robert; Wai, Ching Man; Tang, Haibao; Schatz, Michael C.; Bowers, John E.; Lyons, Eric; Wang, Ming-Li; Chen, Jung; Biggers, Eric; Zhang, Jisen; Huang, Lixian; Zhang, Lingmao; Miao, Wenjing; Zhang, Jian; Ye, Zhangyao; Miao, Chenyong; Lin, Zhicong; Wang, Hao; Zhou, Hongye; Yim, Won C.; Priest, Henry D.; Zheng, Chunfang; Woodhouse, Margaret; Edger, Patrick P.; Guyot, Romain; Guo, Hao-Bo; Guo, Hong; Zheng, Guangyong; Singh, Ratnesh; Sharma, Anupma; Min, Xiangjia; Zheng, Yun; Lee, Hayan; Gurtowski, James; Sedlazeck, Fritz J.; Harkess, Alex; McKain, Michael R.; Liao, Zhenyang; Fang, Jingping; Liu, Juan; Zhang, Xiaodan; Zhang, Qing; Hu, Weichang; Qin, Yuan; Wang, Kai; Chen, Li-Yu; Shirley, Neil; Lin, Yann-Rong; Liu, Li-Yu; Hernandez, Alvaro G.; Wright, Chris L.; Bulone, Vincent; Tuskan, Gerald A.; Heath, Katy; Zee, Francis; Moore, Paul H.; Sunkar, Ramanjulu; Leebens-Mack, James H.; Mockler, Todd; Bennetzen, Jeffrey L.; Freeling, Michael; Sankoff, David; Paterson, Andrew H.; Zhu, Xinguang; Yang, Xiaohan; Smith, J. Andrew C.; Cushman, John C.; Paull, Robert E.; Yu, Qingyi

    2016-01-01

    Pineapple (Ananas comosus (L.) Merr.) is the most economically valuable crop possessing crassulacean acid metabolism (CAM), a photosynthetic carbon assimilation pathway with high water use efficiency, and the second most important tropical fruit after banana in terms of international trade. We sequenced the genomes of pineapple varieties ‘F153’ and ‘MD2’, and a wild pineapple relative A. bracteatus accession CB5. The pineapple genome has one fewer ancient whole genome duplications than sequenced grass genomes and, therefore, provides an important reference for elucidating gene content and structure in the last common ancestor of extant members of the grass family (Poaceae). Pineapple has a conserved karyotype with seven pre rho duplication chromosomes that are ancestral to extant grass karyotypes. The pineapple lineage has transitioned from C3 photosynthesis to CAM with CAM-related genes exhibiting a diel expression pattern in photosynthetic tissues using beta-carbonic anhydrase (βCA) for initial capture of CO2. Promoter regions of all three βCA genes contain a CCA1 binding site that can bind circadian core oscillators. CAM pathway genes were enriched with cis-regulatory elements including the morning (CCACAC) and evening (AAAATATC) elements associated with regulation of circadian-clock genes, providing the first link between CAM and the circadian clock regulation. Gene-interaction network analysis revealed both activation and repression of regulatory elements that control key enzymes in CAM photosynthesis, indicating that CAM evolved by reconfiguration of pathways preexisting in C3 plants. Pineapple CAM photosynthesis is the result of regulatory neofunctionalization of preexisting gene copies and not acquisition of neofunctionalized genes via whole genome or tandem gene duplication. PMID:26523774

  10. Population genetics of chronic kidney disease: the evolving story of APOL1.

    PubMed

    Wasser, Walter G; Tzur, Shay; Wolday, Dawit; Adu, Dwomoa; Baumstein, Donald; Rosset, Saharon; Skorecki, Karl

    2012-01-01

    Advances in human genome sequencing and generation of public databases of genomic diversity enable nephrologists to re-examine the genetics of common, complex kidney diseases. Non-diabetic kidney diseases prevalent in African ancestry populations and the allelic variation described in chromosome 22q12.3 is one such illustrative example. Newly available genomic database information enabled research groups to discover common functional DNA sequence risk variants in the APOL1 gene. These variants (termed G1 and G2) evolved to confer protection from a species of trypanosomal infection and thus achieved high prominence in many geographic regions of Africa and have been carried over to African diaspora communities worldwide. Since these discoveries two years ago, new insights have been gained: localization of APOL1 in normal and disease kidney tissues; influence of the APOL1 variants on the histopathology of HIV kidney disease; possible association with kidney transplant durability; onset of kidney failure at a younger age; association with blood lipid concentrations; more precise geographic localization of individuals with these variants to western and southern African ancestry; and the absence of the variants and kidney disease predisposition in Ethiopians. The definition of APOL1 nephropathy also confirms the long-held assumption by many clinicians that kidney disease attributed to hypertension in African populations represents an underlying glomerulopathy. Still awaited is the delineation of the biologic mechanisms of cellular injury related to these variants, to provide biologic proof of the APOL1 association and to provide potential targets for preventive and therapeutic intervention.

  11. Detecting non-orthology in the COGs database and other approaches grouping orthologs using genome-specific best hits.

    PubMed

    Dessimoz, Christophe; Boeckmann, Brigitte; Roth, Alexander C J; Gonnet, Gaston H

    2006-01-01

    Correct orthology assignment is a critical prerequisite of numerous comparative genomics procedures, such as function prediction, construction of phylogenetic species trees and genome rearrangement analysis. We present an algorithm for the detection of non-orthologs that arise by mistake in current orthology classification methods based on genome-specific best hits, such as the COGs database. The algorithm works with pairwise distance estimates, rather than computationally expensive and error-prone tree-building methods. The accuracy of the algorithm is evaluated through verification of the distribution of predicted cases, case-by-case phylogenetic analysis and comparisons with predictions from other projects using independent methods. Our results show that a very significant fraction of the COG groups include non-orthologs: using conservative parameters, the algorithm detects non-orthology in a third of all COG groups. Consequently, sequence analysis sensitive to correct orthology assignments will greatly benefit from these findings.

  12. Functional genomics of physiological plasticity and local adaptation in killifish.

    PubMed

    Whitehead, Andrew; Galvez, Fernando; Zhang, Shujun; Williams, Larissa M; Oleksiak, Marjorie F

    2011-01-01

    Evolutionary solutions to the physiological challenges of life in highly variable habitats can span the continuum from evolution of a cosmopolitan plastic phenotype to the evolution of locally adapted phenotypes. Killifish (Fundulus sp.) have evolved both highly plastic and locally adapted phenotypes within different selective contexts, providing a comparative system in which to explore the genomic underpinnings of physiological plasticity and adaptive variation. Importantly, extensive variation exists among populations and species for tolerance to a variety of stressors, and we exploit this variation in comparative studies to yield insights into the genomic basis of evolved phenotypic variation. Notably, species of Fundulus occupy the continuum of osmotic habitats from freshwater to marine and populations within Fundulus heteroclitus span far greater variation in pollution tolerance than across all species of fish. Here, we explore how transcriptome regulation underpins extreme physiological plasticity on osmotic shock and how genomic and transcriptomic variation is associated with locally evolved pollution tolerance. We show that F. heteroclitus quickly acclimate to extreme osmotic shock by mounting a dramatic rapid transcriptomic response including an early crisis control phase followed by a tissue remodeling phase involving many regulatory pathways. We also show that convergent evolution of locally adapted pollution tolerance involves complex patterns of gene expression and genome sequence variation, which is confounded with body-weight dependence for some genes. Similarly, exploiting the natural phenotypic variation associated with other established and emerging model organisms is likely to greatly accelerate the pace of discovery of the genomic basis of phenotypic variation.

  13. Functional Genomics of Physiological Plasticity and Local Adaptation in Killifish

    PubMed Central

    Galvez, Fernando; Zhang, Shujun; Williams, Larissa M.; Oleksiak, Marjorie F.

    2011-01-01

    Evolutionary solutions to the physiological challenges of life in highly variable habitats can span the continuum from evolution of a cosmopolitan plastic phenotype to the evolution of locally adapted phenotypes. Killifish (Fundulus sp.) have evolved both highly plastic and locally adapted phenotypes within different selective contexts, providing a comparative system in which to explore the genomic underpinnings of physiological plasticity and adaptive variation. Importantly, extensive variation exists among populations and species for tolerance to a variety of stressors, and we exploit this variation in comparative studies to yield insights into the genomic basis of evolved phenotypic variation. Notably, species of Fundulus occupy the continuum of osmotic habitats from freshwater to marine and populations within Fundulus heteroclitus span far greater variation in pollution tolerance than across all species of fish. Here, we explore how transcriptome regulation underpins extreme physiological plasticity on osmotic shock and how genomic and transcriptomic variation is associated with locally evolved pollution tolerance. We show that F. heteroclitus quickly acclimate to extreme osmotic shock by mounting a dramatic rapid transcriptomic response including an early crisis control phase followed by a tissue remodeling phase involving many regulatory pathways. We also show that convergent evolution of locally adapted pollution tolerance involves complex patterns of gene expression and genome sequence variation, which is confounded with body-weight dependence for some genes. Similarly, exploiting the natural phenotypic variation associated with other established and emerging model organisms is likely to greatly accelerate the pace of discovery of the genomic basis of phenotypic variation. PMID:20581107

  14. Genome-based approaches to develop vaccines against bacterial pathogens.

    PubMed

    Serruto, Davide; Serino, Laura; Masignani, Vega; Pizza, Mariagrazia

    2009-05-26

    Bacterial infectious diseases remain the single most important threat to health worldwide. Although conventional vaccinology approaches were successful in conferring protection against several diseases, they failed to provide efficacious solutions against many others. The advent of whole-genome sequencing changed the way to think about vaccine development, enabling the targeting of possible vaccine candidates starting from the genomic information of a single bacterial isolate, with a process named reverse vaccinology. As the genomic era progressed, reverse vaccinology has evolved with a pan-genome approach and multi-strain genome analysis became fundamental for the design of universal vaccines. This review describes the applications of genome-based approaches in the development of new vaccines against bacterial pathogens.

  15. Evidence for a high mutation rate at rapidly evolving yeast centromeres.

    PubMed

    Bensasson, Douda

    2011-07-18

    Although their role in cell division is essential, centromeres evolve rapidly in animals, plants and yeasts. Unlike the complex centromeres of plants and aminals, the point centromeres of Saccharomcyes yeasts can be readily sequenced to distinguish amongst the possible explanations for fast centromere evolution. Using DNA sequences of all 16 centromeres from 34 strains of Saccharomyces cerevisiae and population genomic data from Saccharomyces paradoxus, I show that centromeres in both species evolve 3 times more rapidly even than selectively unconstrained DNA. Exceptionally high levels of polymorphism seen in multiple yeast populations suggest that rapid centromere evolution does not result from the repeated selective sweeps expected under meiotic drive. I further show that there is little evidence for crossing-over or gene conversion within centromeres, although there is clear evidence for recombination in their immediate vicinity. Finally I show that the mutation spectrum at centromeres is consistent with the pattern of spontaneous mutation elsewhere in the genome. These results indicate that rapid centromere evolution is a common phenomenon in yeast species. Furthermore, these results suggest that rapid centromere evolution does not result from the mutagenic effect of gene conversion, but from a generalised increase in the mutation rate, perhaps arising from the unusual chromatin structure at centromeres in yeast and other eukaryotes.

  16. Evolving spiking neural networks: a novel growth algorithm exhibits unintelligent design

    NASA Astrophysics Data System (ADS)

    Schaffer, J. David

    2015-06-01

    Spiking neural networks (SNNs) have drawn considerable excitement because of their computational properties, believed to be superior to conventional von Neumann machines, and sharing properties with living brains. Yet progress building these systems has been limited because we lack a design methodology. We present a gene-driven network growth algorithm that enables a genetic algorithm (evolutionary computation) to generate and test SNNs. The genome for this algorithm grows O(n) where n is the number of neurons; n is also evolved. The genome not only specifies the network topology, but all its parameters as well. Experiments show the algorithm producing SNNs that effectively produce a robust spike bursting behavior given tonic inputs, an application suitable for central pattern generators. Even though evolution did not include perturbations of the input spike trains, the evolved networks showed remarkable robustness to such perturbations. In addition, the output spike patterns retain evidence of the specific perturbation of the inputs, a feature that could be exploited by network additions that could use this information for refined decision making if required. On a second task, a sequence detector, a discriminating design was found that might be considered an example of "unintelligent design"; extra non-functional neurons were included that, while inefficient, did not hamper its proper functioning.

  17. Regulating genomics in the 21st century: from logos to pathos?

    PubMed

    Gottweis, Herbert

    2005-03-01

    There is currently an important change in the governance of genomics. In the past, much of the regulatory discussion about genomics has focused on issues of risk. Today, a new discussion is evolving that emphasizes the uncertainties involved in the development and diffusion of genomics into society. The increasing importance of emotional language and the focus on trust in the discussion about genomics reflects the attempt to substitute for the shortcomings of logos with ethos and pathos.

  18. Evolvable Hardware for Space Applications

    NASA Technical Reports Server (NTRS)

    Lohn, Jason; Globus, Al; Hornby, Gregory; Larchev, Gregory; Kraus, William

    2004-01-01

    This article surveys the research of the Evolvable Systems Group at NASA Ames Research Center. Over the past few years, our group has developed the ability to use evolutionary algorithms in a variety of NASA applications ranging from spacecraft antenna design, fault tolerance for programmable logic chips, atomic force field parameter fitting, analog circuit design, and earth observing satellite scheduling. In some of these applications, evolutionary algorithms match or improve on human performance.

  19. Sperm should evolve to make female meiosis fair.

    PubMed

    Brandvain, Yaniv; Coop, Graham

    2015-04-01

    Genomic conflicts arise when an allele gains an evolutionary advantage at a cost to organismal fitness. Oögenesis is inherently susceptible to such conflicts because alleles compete for inclusion into the egg. Alleles that distort meiosis in their favor (i.e., meiotic drivers) often decrease organismal fitness, and therefore indirectly favor the evolution of mechanisms to suppress meiotic drive. In this light, many facets of oögenesis and gametogenesis have been interpreted as mechanisms of protection against genomic outlaws. That females of many animal species do not complete meiosis until after fertilization, appears to run counter to this interpretation, because this delay provides an opportunity for sperm-acting alleles to meddle with the outcome of female meiosis and help like alleles drive in heterozygous females. Contrary to this perceived danger, the population genetic theory presented herein suggests that, in fact, sperm nearly always evolve to increase the fairness of female meiosis in the face of genomic conflicts. These results are consistent with the apparent sperm dependence of the best characterized female meiotic driversin animals. Rather than providing an opportunity for sperm collaboration in female meiotic drive, the "fertilization requirement" indirectly protects females from meiotic drivers by providing sperm an opportunity to suppress drive. © 2015 The Author(s).

  20. Systematic genome assessment of B-vitamin biosynthesis suggests co-operation among gut microbes

    PubMed Central

    Magnúsdóttir, Stefanía; Ravcheev, Dmitry; de Crécy-Lagard, Valérie; Thiele, Ines

    2015-01-01

    The human gut microbiota supplies its host with essential nutrients, including B-vitamins. Using the PubSEED platform, we systematically assessed the genomes of 256 common human gut bacteria for the presence of biosynthesis pathways for eight B-vitamins: biotin, cobalamin, folate, niacin, pantothenate, pyridoxine, riboflavin, and thiamin. On the basis of the presence and absence of genome annotations, we predicted that each of the eight vitamins was produced by 40–65% of the 256 human gut microbes. The distribution of synthesis pathways was diverse; some genomes had all eight biosynthesis pathways, whereas others contained no de novo synthesis pathways. We compared our predictions to experimental data from 16 organisms and found 88% of our predictions to be in agreement with published data. In addition, we identified several pairs of organisms whose vitamin synthesis pathway pattern complemented those of other organisms. This analysis suggests that human gut bacteria actively exchange B-vitamins among each other, thereby enabling the survival of organisms that do not synthesize any of these essential cofactors. This result indicates the co-evolution of the gut microbes in the human gut environment. Our work presents the first comprehensive assessment of the B-vitamin synthesis capabilities of the human gut microbiota. We propose that in addition to diet, the gut microbiota is an important source of B-vitamins, and that changes in the gut microbiota composition can severely affect our dietary B-vitamin requirements. PMID:25941533

  1. Translating genomic discoveries to the clinic in pediatric oncology.

    PubMed

    Glade Bender, Julia; Verma, Anupam; Schiffman, Joshua D

    2015-02-01

    The present study describes the recent advances in the identification of targetable genomic alterations in pediatric cancers, along with the progress and associated challenges in translating these findings into therapeutic benefit. Each field within pediatric cancer has rapidly and comprehensively begun to define genomic targets in tumors that potentially can improve the clinical outcome of patients, including hematologic malignancies (leukemia and lymphoma), solid malignancies (neuroblastoma, rhabdomyosarcoma, Ewing sarcoma, and osteosarcoma), and brain tumors (gliomas, ependymomas, and medulloblastomas). Although each tumor has specific and sometimes overlapping genomic targets, the translation to the clinic of new targeted trials and precision medicine protocols is still in its infancy. The first clinical tumor profiling studies in pediatric oncology have demonstrated feasibility and patient enthusiasm for the personalized medicine paradigm, but have yet to demonstrate clinical utility. Complexities influencing implementation include rapidly evolving sequencing technologies, tumor heterogeneity, and lack of access to targeted therapies. The return of incidental findings from the germline also remains a challenge, with evolving policy statements and accepted standards. The translation of genomic discoveries to the clinic in pediatric oncology continues to move forward at a brisk pace. Early adoption of genomics for tumor classification, risk stratification, and initial trials of targeted therapeutic agents has led to powerful results. As our experience grows in the integration of genomic and clinical medicine, the outcome for children with cancer should continue to improve.

  2. Case Series Description and Genomic Characterization of Invasive Group A Streptococcal Infections in Pediatric Patients.

    PubMed

    Dien Bard, Jennifer; Mongkolrattanothai, Kanokporn; Kachroo, Priyanka; Beres, Stephen; Olsen, Randall J

    2017-06-01

    We report an unusual cluster of invasive group A Streptococcus infections in 6 pediatric patients and demonstrate that the strains were derived from diverse genetic backgrounds, confirming the occurrence of a community cluster rather than a clonal outbreak. Whole genome sequencing provided a rapid and comprehensive view of group A Streptococcus genotypes and helped guide our institutional response and public health maneuvers.

  3. Genome-wide analysis of the WRKY gene family in cotton.

    PubMed

    Dou, Lingling; Zhang, Xiaohong; Pang, Chaoyou; Song, Meizhen; Wei, Hengling; Fan, Shuli; Yu, Shuxun

    2014-12-01

    WRKY proteins are major transcription factors involved in regulating plant growth and development. Although many studies have focused on the functional identification of WRKY genes, our knowledge concerning many areas of WRKY gene biology is limited. For example, in cotton, the phylogenetic characteristics, global expression patterns, molecular mechanisms regulating expression, and target genes/pathways of WRKY genes are poorly characterized. Therefore, in this study, we present a genome-wide analysis of the WRKY gene family in cotton (Gossypium raimondii and Gossypium hirsutum). We identified 116 WRKY genes in G. raimondii from the completed genome sequence, and we cloned 102 WRKY genes in G. hirsutum. Chromosomal location analysis indicated that WRKY genes in G. raimondii evolved mainly from segmental duplication followed by tandem amplifications. Phylogenetic analysis of alga, bryophyte, lycophyta, monocot and eudicot WRKY domains revealed family member expansion with increasing complexity of the plant body. Microarray, expression profiling and qRT-PCR data revealed that WRKY genes in G. hirsutum may regulate the development of fibers, anthers, tissues (roots, stems, leaves and embryos), and are involved in the response to stresses. Expression analysis showed that most group II and III GhWRKY genes are highly expressed under diverse stresses. Group I members, representing the ancestral form, seem to be insensitive to abiotic stress, with low expression divergence. Our results indicate that cotton WRKY genes might have evolved by adaptive duplication, leading to sensitivity to diverse stresses. This study provides fundamental information to inform further analysis and understanding of WRKY gene functions in cotton species.

  4. Potential pitfalls of CRISPR/Cas9-mediated genome editing.

    PubMed

    Peng, Rongxue; Lin, Guigao; Li, Jinming

    2016-04-01

    Recently, a novel technique named the clustered regularly interspaced short palindromic repeat (CRISPR)/CRISPR-associated protein (Cas)9 system has been rapidly developed. This genome editing tool has improved our ability tremendously with respect to exploring the pathogenesis of diseases and correcting disease mutations, as well as phenotypes. With a short guide RNA, Cas9 can be precisely directed to target sites, and functions as an endonuclease to efficiently produce breaks in DNA double strands. Over the past 30 years, CRISPR has evolved from the 'curious sequences of unknown biological function' into a promising genome editing tool. As a result of the incessant development in the CRISPR/Cas9 system, Cas9 co-expressed with custom guide RNAs has been successfully used in a variety of cells and organisms. This genome editing technology can also be applied to synthetic biology, functional genomic screening, transcriptional modulation and gene therapy. However, although CRISPR/Cas9 has a broad range of action in science, there are several aspects that affect its efficiency and specificity, including Cas9 activity, target site selection and short guide RNA design, delivery methods, off-target effects and the incidence of homology-directed repair. In the present review, we highlight the factors that affect the utilization of CRISPR/Cas9, as well as possible strategies for handling any problems. Addressing these issues will allow us to take better advantage of this technique. In addition, we also review the history and rapid development of the CRISPR/Cas system from the time of its initial discovery in 2012. © 2015 FEBS.

  5. Optimists' Creed: Brave New Cyberlearning, Evolving Utopias (Circa 2041)

    ERIC Educational Resources Information Center

    Burleson, Winslow; Lewis, Armanda

    2016-01-01

    This essay imagines the role that artificial intelligence innovations play in the integrated living, learning and research environments of 2041. Here, in 2041, in the context of increasingly complex wicked challenges, whose solutions by their very nature continue to evade even the most capable experts, society and technology have co-evolved to…

  6. Novel Phage Group Infecting Lactobacillus delbrueckii subsp. lactis, as Revealed by Genomic and Proteomic Analysis of Bacteriophage Ldl1

    PubMed Central

    Casey, Eoghan; Mahony, Jennifer; Neve, Horst; Noben, Jean-Paul; Dal Bello, Fabio

    2014-01-01

    Ldl1 is a virulent phage infecting the dairy starter Lactobacillus delbrueckii subsp. lactis LdlS. Electron microscopy analysis revealed that this phage exhibits a large head and a long tail and bears little resemblance to other characterized phages infecting Lactobacillus delbrueckii. In vitro propagation of this phage revealed a latent period of 30 to 40 min and a burst size of 59.9 ± 1.9 phage particles. Comparative genomic and proteomic analyses showed remarkable similarity between the genome of Ldl1 and that of Lactobacillus plantarum phage ATCC 8014-B2. The genomic and proteomic characteristics of Ldl1 demonstrate that this phage does not belong to any of the four previously recognized L. delbrueckii phage groups, necessitating the creation of a new group, called group e, thus adding to the knowledge on the diversity of phages targeting strains of this industrially important lactic acid bacterial species. PMID:25501478

  7. Novel phage group infecting Lactobacillus delbrueckii subsp. lactis, as revealed by genomic and proteomic analysis of bacteriophage Ldl1.

    PubMed

    Casey, Eoghan; Mahony, Jennifer; Neve, Horst; Noben, Jean-Paul; Dal Bello, Fabio; van Sinderen, Douwe

    2015-02-01

    Ldl1 is a virulent phage infecting the dairy starter Lactobacillus delbrueckii subsp. lactis LdlS. Electron microscopy analysis revealed that this phage exhibits a large head and a long tail and bears little resemblance to other characterized phages infecting Lactobacillus delbrueckii. In vitro propagation of this phage revealed a latent period of 30 to 40 min and a burst size of 59.9 +/- 1.9 phage particles. Comparative genomic and proteomic analyses showed remarkable similarity between the genome of Ldl1 and that of Lactobacillus plantarum phage ATCC 8014-B2. The genomic and proteomic characteristics of Ldl1 demonstrate that this phage does not belong to any of the four previously recognized L. delbrueckii phage groups, necessitating the creation of a new group, called group e, thus adding to the knowledge on the diversity of phages targeting strains of this industrially important lactic acid bacterial species.

  8. DiRE: identifying distant regulatory elements of co-expressed genes

    PubMed Central

    Gotea, Valer; Ovcharenko, Ivan

    2008-01-01

    Regulation of gene expression in eukaryotic genomes is established through a complex cooperative activity of proximal promoters and distant regulatory elements (REs) such as enhancers, repressors and silencers. We have developed a web server named DiRE, based on the Enhancer Identification (EI) method, for predicting distant regulatory elements in higher eukaryotic genomes, namely for determining their chromosomal location and functional characteristics. The server uses gene co-expression data, comparative genomics and profiles of transcription factor binding sites (TFBSs) to determine TFBS-association signatures that can be used for discriminating specific regulatory functions. DiRE's unique feature is its ability to detect REs outside of proximal promoter regions, as it takes advantage of the full gene locus to conduct the search. DiRE can predict common REs for any set of input genes for which the user has prior knowledge of co-expression, co-function or other biologically meaningful grouping. The server predicts function-specific REs consisting of clusters of specifically-associated TFBSs and it also scores the association of individual transcription factors (TFs) with the biological function shared by the group of input genes. Its integration with the Array2BIO server allows users to start their analysis with raw microarray expression data. The DiRE web server is freely available at http://dire.dcode.org. PMID:18487623

  9. The DNA binding parvulin Par17 is targeted to the mitochondrial matrix by a recently evolved prepeptide uniquely present in Hominidae

    PubMed Central

    Kessler, Daniel; Papatheodorou, Panagiotis; Stratmann, Tina; Dian, Elke Andrea; Hartmann-Fatu, Cristina; Rassow, Joachim; Bayer, Peter; Mueller, Jonathan Wolf

    2007-01-01

    Background The parvulin-type peptidyl prolyl cis/trans isomerase Par14 is highly conserved in all metazoans. The recently identified parvulin Par17 contains an additional N-terminal domain whose occurrence and function was the focus of the present study. Results Based on the observation that the human genome encodes Par17, but bovine and rodent genomes do not, Par17 exon sequences from 10 different primate species were cloned and sequenced. Par17 is encoded in the genomes of Hominidae species including humans, but is absent from other mammalian species. In contrast to Par14, endogenous Par17 was found in mitochondrial and membrane fractions of human cell lysates. Fluorescence of EGFP fusions of Par17, but not Par14, co-localized with mitochondrial staining. Par14 and Par17 associated with isolated human, rat and yeast mitochondria at low salt concentrations, but only the Par17 mitochondrial association was resistant to higher salt concentrations. Par17 was imported into mitochondria in a time and membrane potential-dependent manner, where it reached the mitochondrial matrix. Moreover, Par17 was shown to bind to double-stranded DNA under physiological salt conditions. Conclusion Taken together, the DNA binding parvulin Par17 is targeted to the mitochondrial matrix by the most recently evolved mitochondrial prepeptide known to date, thus adding a novel protein constituent to the mitochondrial proteome of Hominidae. PMID:17875217

  10. The complete genome sequence of Lactobacillus bulgaricus reveals extensive and ongoing reductive evolution.

    PubMed

    van de Guchte, M; Penaud, S; Grimaldi, C; Barbe, V; Bryson, K; Nicolas, P; Robert, C; Oztas, S; Mangenot, S; Couloux, A; Loux, V; Dervyn, R; Bossy, R; Bolotin, A; Batto, J-M; Walunas, T; Gibrat, J-F; Bessières, P; Weissenbach, J; Ehrlich, S D; Maguin, E

    2006-06-13

    Lactobacillus delbrueckii ssp. bulgaricus (L. bulgaricus) is a representative of the group of lactic acid-producing bacteria, mainly known for its worldwide application in yogurt production. The genome sequence of this bacterium has been determined and shows the signs of ongoing specialization, with a substantial number of pseudogenes and incomplete metabolic pathways and relatively few regulatory functions. Several unique features of the L. bulgaricus genome support the hypothesis that the genome is in a phase of rapid evolution. (i) Exceptionally high numbers of rRNA and tRNA genes with regard to genome size may indicate that the L. bulgaricus genome has known a recent phase of important size reduction, in agreement with the observed high frequency of gene inactivation and elimination; (ii) a much higher GC content at codon position 3 than expected on the basis of the overall GC content suggests that the composition of the genome is evolving toward a higher GC content; and (iii) the presence of a 47.5-kbp inverted repeat in the replication termination region, an extremely rare feature in bacterial genomes, may be interpreted as a transient stage in genome evolution. The results indicate the adaptation of L. bulgaricus from a plant-associated habitat to the stable protein and lactose-rich milk environment through the loss of superfluous functions and protocooperation with Streptococcus thermophilus.

  11. Genomic diversity and evolution of the head crest in the rock pigeon.

    PubMed

    Shapiro, Michael D; Kronenberg, Zev; Li, Cai; Domyan, Eric T; Pan, Hailin; Campbell, Michael; Tan, Hao; Huff, Chad D; Hu, Haofu; Vickrey, Anna I; Nielsen, Sandra C A; Stringham, Sydney A; Hu, Hao; Willerslev, Eske; Gilbert, M Thomas P; Yandell, Mark; Zhang, Guojie; Wang, Jun

    2013-03-01

    The geographic origins of breeds and the genetic basis of variation within the widely distributed and phenotypically diverse domestic rock pigeon (Columba livia) remain largely unknown. We generated a rock pigeon reference genome and additional genome sequences representing domestic and feral populations. We found evidence for the origins of major breed groups in the Middle East and contributions from a racing breed to North American feral populations. We identified the gene EphB2 as a strong candidate for the derived head crest phenotype shared by numerous breeds, an important trait in mate selection in many avian species. We also found evidence that this trait evolved just once and spread throughout the species, and that the crest originates early in development by the localized molecular reversal of feather bud polarity.

  12. Whole Genome Sequence and Phylogenetic Analysis Show Helicobacter pylori Strains from Latin America Have Followed a Unique Evolution Pathway

    PubMed Central

    Muñoz-Ramírez, Zilia Y.; Mendez-Tenorio, Alfonso; Kato, Ikuko; Bravo, Maria M.; Rizzato, Cosmeri; Thorell, Kaisa; Torres, Roberto; Aviles-Jimenez, Francisco; Camorlinga, Margarita; Canzian, Federico; Torres, Javier

    2017-01-01

    Helicobacter pylori (HP) genetics may determine its clinical outcomes. Despite high prevalence of HP infection in Latin America (LA), there have been no phylogenetic studies in the region. We aimed to understand the structure of HP populations in LA mestizo individuals, where gastric cancer incidence remains high. The genome of 107 HP strains from Mexico, Nicaragua and Colombia were analyzed with 59 publicly available worldwide genomes. To study bacterial relationship on whole genome level we propose a virtual hybridization technique using thousands of high-entropy 13 bp DNA probes to generate fingerprints. Phylogenetic virtual genome fingerprint (VGF) was compared with Multi Locus Sequence Analysis (MLST) and with phylogenetic analyses of cagPAI virulence island sequences. With MLST some Nicaraguan and Mexican strains clustered close to Africa isolates, whereas European isolates were spread without clustering and intermingled with LA isolates. VGF analysis resulted in increased resolution of populations, separating European from LA strains. Furthermore, clusters with exclusively Colombian, Mexican, or Nicaraguan strains were observed, where the Colombian cluster separated from Europe, Asia, and Africa, while Nicaraguan and Mexican clades grouped close to Africa. In addition, a mixed large LA cluster including Mexican, Colombian, Nicaraguan, Peruvian, and Salvadorian strains was observed; all LA clusters separated from the Amerind clade. With cagPAI sequence analyses LA clades clearly separated from Europe, Asia and Amerind, and Colombian strains formed a single cluster. A NeighborNet analyses suggested frequent and recent recombination events particularly among LA strains. Results suggests that in the new world, H. pylori has evolved to fit mestizo LA populations, already 500 years after the Spanish colonization. This co-adaption may account for regional variability in gastric cancer risk. PMID:28293542

  13. Culture independent genomic comparisons reveal environmental adaptations for Altiarchaeales

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bird, Jordan T.; Baker, Brett J.; Probst, Alexander J.

    The recently proposed candidatus order Altiarchaeales remains an uncultured archaeal lineage composed of genetically diverse, globally widespread organisms frequently observed in anoxic subsurface environments. In spite of 15 years of studies on the psychrophilic biofilm-producing Candidatus Altiarchaeum hamiconexum and its close relatives, very little is known about the phylogenetic and functional diversity of the widespread free-living marine members of this taxon. From methanogenic sediments in the White Oak River Estuary, NC, USA, we sequenced a single cell amplified genome (SAG), WOR_SM1_SCG, and used it to identify and refine two high-quality genomes from metagenomes, WOR_SM1_79 and WOR_SM1_86-2, from the same site.more » These three genomic reconstructions form a monophyletic group, which also includes three previously published genomes from metagenomes from terrestrial springs and a SAG from Sakinaw Lake in a group previously designated as pMC2A384. A synapomorphic mutation in the Altiarchaeales tRNA synthetase β subunit, pheT, caused the protein to be encoded as two subunits at non-adjacent loci. Consistent with the terrestrial spring clades, our estuarine genomes contained a near-complete autotrophic metabolism, H2 or CO as potential electron donors, a reductive acetyl-CoA pathway for carbon fixation, and methylotroph-like NADP(H)-dependent dehydrogenase. Phylogenies based on 16S rRNA genes and concatenated conserved proteins identified two distinct sub-clades of Altiarchaeales, Alti-1 populated by organisms from actively flowing springs, and Alti-2 which was more widespread, diverse, and not associated with visible mats. The core Alti-1 genome suggested Alti-1 is adapted for the stream environment with lipopolysaccharide production capacity and extracellular hami structures. The core Alti-2 genome suggested members of this clade are free-living with distinct mechanisms for energy maintenance, motility, osmoregulation, and sulfur redox reactions. These

  14. Culture Independent Genomic Comparisons Reveal Environmental Adaptations for Altiarchaeales

    PubMed Central

    Baker, Brett J.; Probst, Alexander J.; Podar, Mircea; Lloyd, Karen G.

    2016-01-01

    The recently proposed candidatus order Altiarchaeales remains an uncultured archaeal lineage composed of genetically diverse, globally widespread organisms frequently observed in anoxic subsurface environments. In spite of 15 years of studies on the psychrophilic biofilm-producing Candidatus Altiarchaeum hamiconexum and its close relatives, very little is known about the phylogenetic and functional diversity of the widespread free-living marine members of this taxon. From methanogenic sediments in the White Oak River Estuary, NC, USA, we sequenced a single cell amplified genome (SAG), WOR_SM1_SCG, and used it to identify and refine two high-quality genomes from metagenomes, WOR_SM1_79 and WOR_SM1_86-2, from the same site. These three genomic reconstructions form a monophyletic group, which also includes three previously published genomes from metagenomes from terrestrial springs and a SAG from Sakinaw Lake in a group previously designated as pMC2A384. A synapomorphic mutation in the Altiarchaeales tRNA synthetase β subunit, pheT, caused the protein to be encoded as two subunits at non-adjacent loci. Consistent with the terrestrial spring clades, our estuarine genomes contained a near-complete autotrophic metabolism, H2 or CO as potential electron donors, a reductive acetyl-CoA pathway for carbon fixation, and methylotroph-like NADP(H)-dependent dehydrogenase. Phylogenies based on 16S rRNA genes and concatenated conserved proteins identified two distinct sub-clades of Altiarchaeales, Alti-1 populated by organisms from actively flowing springs, and Alti-2 which was more widespread, diverse, and not associated with visible mats. The core Alti-1 genome suggested Alti-1 is adapted for the stream environment with lipopolysaccharide production capacity and extracellular hami structures. The core Alti-2 genome suggested members of this clade are free-living with distinct mechanisms for energy maintenance, motility, osmoregulation, and sulfur redox reactions. These data

  15. Culture independent genomic comparisons reveal environmental adaptations for Altiarchaeales

    DOE PAGES

    Bird, Jordan T.; Baker, Brett J.; Probst, Alexander J.; ...

    2016-08-05

    The recently proposed candidatus order Altiarchaeales remains an uncultured archaeal lineage composed of genetically diverse, globally widespread organisms frequently observed in anoxic subsurface environments. In spite of 15 years of studies on the psychrophilic biofilm-producing Candidatus Altiarchaeum hamiconexum and its close relatives, very little is known about the phylogenetic and functional diversity of the widespread free-living marine members of this taxon. From methanogenic sediments in the White Oak River Estuary, NC, USA, we sequenced a single cell amplified genome (SAG), WOR_SM1_SCG, and used it to identify and refine two high-quality genomes from metagenomes, WOR_SM1_79 and WOR_SM1_86-2, from the same site.more » These three genomic reconstructions form a monophyletic group, which also includes three previously published genomes from metagenomes from terrestrial springs and a SAG from Sakinaw Lake in a group previously designated as pMC2A384. A synapomorphic mutation in the Altiarchaeales tRNA synthetase β subunit, pheT, caused the protein to be encoded as two subunits at non-adjacent loci. Consistent with the terrestrial spring clades, our estuarine genomes contained a near-complete autotrophic metabolism, H2 or CO as potential electron donors, a reductive acetyl-CoA pathway for carbon fixation, and methylotroph-like NADP(H)-dependent dehydrogenase. Phylogenies based on 16S rRNA genes and concatenated conserved proteins identified two distinct sub-clades of Altiarchaeales, Alti-1 populated by organisms from actively flowing springs, and Alti-2 which was more widespread, diverse, and not associated with visible mats. The core Alti-1 genome suggested Alti-1 is adapted for the stream environment with lipopolysaccharide production capacity and extracellular hami structures. The core Alti-2 genome suggested members of this clade are free-living with distinct mechanisms for energy maintenance, motility, osmoregulation, and sulfur redox reactions. These

  16. How good is our genome?

    PubMed Central

    Weill, Jean-Claude; Radman, Miroslav

    2004-01-01

    Our genome has evolved to perpetuate itself through the maintenance of the species via an uninterrupted chain of reproductive somas. Accordingly, evolution is not concerned with diseases occurring after the soma's reproductive stage. Following Richard Dawkins, we would like to reassert that we indeed live as disposable somas, slaves of our germline genome, but could soon start rebelling against such slavery. Cancer and its relation to the TP53 gene may offer a paradigmatic example. The observation that the latency period in cancer can be prolonged in mice by increasing the number of TP53 genes in their genome, suggests that sooner or later we will have to address the question of heritable disease avoidance via the manipulation of the human germline. PMID:15065661

  17. New Implications on Genomic Adaptation Derived from the Helicobacter pylori Genome Comparison

    PubMed Central

    Lara-Ramírez, Edgar Eduardo; Segura-Cabrera, Aldo; Guo, Xianwu; Yu, Gongxin; García-Pérez, Carlos Armando; Rodríguez-Pérez, Mario A.

    2011-01-01

    Background Helicobacter pylori has a reduced genome and lives in a tough environment for long-term persistence. It evolved with its particular characteristics for biological adaptation. Because several H. pylori genome sequences are available, comparative analysis could help to better understand genomic adaptation of this particular bacterium. Principal Findings We analyzed nine H. pylori genomes with emphasis on microevolution from a different perspective. Inversion was an important factor to shape the genome structure. Illegitimate recombination not only led to genomic inversion but also inverted fragment duplication, both of which contributed to the creation of new genes and gene family, and further, homological recombination contributed to events of inversion. Based on the information of genomic rearrangement, the first genome scaffold structure of H. pylori last common ancestor was produced. The core genome consists of 1186 genes, of which 22 genes could particularly adapt to human stomach niche. H. pylori contains high proportion of pseudogenes whose genesis was principally caused by homopolynucleotide (HPN) mutations. Such mutations are reversible and facilitate the control of gene expression through the change of DNA structure. The reversible mutations and a quasi-panmictic feature could allow such genes or gene fragments frequently transferred within or between populations. Hence, pseudogenes could be a reservoir of adaptation materials and the HPN mutations could be favorable to H. pylori adaptation, leading to HPN accumulation on the genomes, which corresponds to a special feature of Helicobacter species: extremely high HPN composition of genome. Conclusion Our research demonstrated that both genome content and structure of H. pylori have been highly adapted to its particular life style. PMID:21387011

  18. Positive Selection Driving Cytoplasmic Genome Evolution of the Medicinally Important Ginseng Plant Genus Panax

    PubMed Central

    Jiang, Peng; Shi, Feng-Xue; Li, Ming-Rui; Liu, Bao; Wen, Jun; Xiao, Hong-Xing; Li, Lin-Feng

    2018-01-01

    Panax L. (the ginseng genus) is a shade-demanding group within the family Araliaceae and all of its species are of crucial significance in traditional Chinese medicine. Phylogenetic and biogeographic analyses demonstrated that two rounds of whole genome duplications accompanying with geographic and ecological isolations promoted the diversification of Panax species. However, contributions of the cytoplasmic genomes to the adaptive evolution of Panax species remained largely uninvestigated. In this study, we sequenced the chloroplast and mitochondrial genomes of 11 accessions belonging to seven Panax species. Our results show that heterogeneity in nucleotide substitution rate is abundant in both of the two cytoplasmic genomes, with the mitochondrial genome possessing more variants at the total level but the chloroplast showing higher sequence polymorphisms at the genic regions. Genome-wide scanning of positive selection identified five and 12 genes from the chloroplast and mitochondrial genomes, respectively. Functional analyses further revealed that these selected genes play important roles in plant development, cellular metabolism and adaptation. We therefore conclude that positive selection might be one of the potential evolutionary forces that shaped nucleotide variation pattern of these Panax species. In particular, the mitochondrial genes evolved under stronger selective pressure compared to the chloroplast genes. PMID:29670636

  19. The king cobra genome reveals dynamic gene evolution and adaptation in the snake venom system.

    PubMed

    Vonk, Freek J; Casewell, Nicholas R; Henkel, Christiaan V; Heimberg, Alysha M; Jansen, Hans J; McCleary, Ryan J R; Kerkkamp, Harald M E; Vos, Rutger A; Guerreiro, Isabel; Calvete, Juan J; Wüster, Wolfgang; Woods, Anthony E; Logan, Jessica M; Harrison, Robert A; Castoe, Todd A; de Koning, A P Jason; Pollock, David D; Yandell, Mark; Calderon, Diego; Renjifo, Camila; Currier, Rachel B; Salgado, David; Pla, Davinia; Sanz, Libia; Hyder, Asad S; Ribeiro, José M C; Arntzen, Jan W; van den Thillart, Guido E E J M; Boetzer, Marten; Pirovano, Walter; Dirks, Ron P; Spaink, Herman P; Duboule, Denis; McGlinn, Edwina; Kini, R Manjunatha; Richardson, Michael K

    2013-12-17

    Snakes are limbless predators, and many species use venom to help overpower relatively large, agile prey. Snake venoms are complex protein mixtures encoded by several multilocus gene families that function synergistically to cause incapacitation. To examine venom evolution, we sequenced and interrogated the genome of a venomous snake, the king cobra (Ophiophagus hannah), and compared it, together with our unique transcriptome, microRNA, and proteome datasets from this species, with data from other vertebrates. In contrast to the platypus, the only other venomous vertebrate with a sequenced genome, we find that snake toxin genes evolve through several distinct co-option mechanisms and exhibit surprisingly variable levels of gene duplication and directional selection that correlate with their functional importance in prey capture. The enigmatic accessory venom gland shows a very different pattern of toxin gene expression from the main venom gland and seems to have recruited toxin-like lectin genes repeatedly for new nontoxic functions. In addition, tissue-specific microRNA analyses suggested the co-option of core genetic regulatory components of the venom secretory system from a pancreatic origin. Although the king cobra is limbless, we recovered coding sequences for all Hox genes involved in amniote limb development, with the exception of Hoxd12. Our results provide a unique view of the origin and evolution of snake venom and reveal multiple genome-level adaptive responses to natural selection in this complex biological weapon system. More generally, they provide insight into mechanisms of protein evolution under strong selection.

  20. Small Traditional Human Communities Sustain Genomic Diversity over Microgeographic Scales despite Linguistic Isolation

    PubMed Central

    Cox, Murray P.; Hudjashov, Georgi; Sim, Andre; Savina, Olga; Karafet, Tatiana M.; Sudoyo, Herawati; Lansing, J. Stephen

    2016-01-01

    At least since the Neolithic, humans have largely lived in networks of small, traditional communities. Often socially isolated, these groups evolved distinct languages and cultures over microgeographic scales of just tens of kilometers. Population genetic theory tells us that genetic drift should act quickly in such isolated groups, thus raising the question: do networks of small human communities maintain levels of genetic diversity over microgeographic scales? This question can no longer be asked in most parts of the world, which have been heavily impacted by historical events that make traditional society structures the exception. However, such studies remain possible in parts of Island Southeast Asia and Oceania, where traditional ways of life are still practiced. We captured genome-wide genetic data, together with linguistic records, for a case–study system—eight villages distributed across Sumba, a small, remote island in eastern Indonesia. More than 4,000 years after these communities were established during the Neolithic period, most speak different languages and can be distinguished genetically. Yet their nuclear diversity is not reduced, instead being comparable to other, even much larger, regional groups. Modeling reveals a separation of time scales: while languages and culture can evolve quickly, creating social barriers, sporadic migration averaged over many generations is sufficient to keep villages linked genetically. This loosely-connected network structure, once the global norm and still extant on Sumba today, provides a living proxy to explore fine-scale genome dynamics in the sort of small traditional communities within which the most recent episodes of human evolution occurred. PMID:27274003

  1. Functional role of a distal (3'-phosphate) group of CoA in the recombinant human liver medium-chain acyl-CoA dehydrogenase-catalysed reaction.

    PubMed Central

    Peterson, K L; Srivastava, D K

    1997-01-01

    The X-ray crystallographic structure of medium-chain acyl-CoA dehydrogenase (MCAD)-octenoyl-CoA complex reveals that the 3'-phosphate group of CoA is confined to the exterior of the protein structure [approx. 15 A (1.5 nm) away from the enzyme active site], and is fully exposed to the outside solvent environment. To ascertain whether such a distal (3'-phosphate) fragment of CoA plays any significant role in the enzyme catalysis, we investigated the recombinant human liver MCAD (HMCAD)-catalysed reaction by using normal (phospho) and 3'-phosphate-truncated (dephospho) forms of octanoyl-CoA and butyryl-CoA substrates. The steady-state kinetic data revealed that deletion of the 3'-phosphate group from octanoyl-CoA substrate increased the turnover rate of the enzyme to about one-quarter, whereas that from butyryl-CoA substrate decreased the turnover rate of the enzyme to about one-fifth; the Km values of both these substrates were increased by 5-10-fold on deletion of the 3'-phosphate group from the corresponding acyl-CoA substrates. The transient kinetics for the reductive half-reaction, oxidative half-reaction and the dissociation 'off-rate' (of the reaction product from the oxidized enzyme site) were all found to be affected by deletions of the 3'-phosphate group from octanoyl-CoA and butyryl-CoA substrates. A cumulative account of these results reveals that, although the 3'-phosphate group of acyl-CoA substrates might seem 'useless' on the basis of the structural data, it has an essential functional role during HMCAD catalysis. PMID:9271097

  2. Distinct developmental genetic mechanisms underlie convergently evolved tooth gain in sticklebacks

    PubMed Central

    Ellis, Nicholas A.; Glazer, Andrew M.; Donde, Nikunj N.; Cleves, Phillip A.; Agoglia, Rachel M.; Miller, Craig T.

    2015-01-01

    Teeth are a classic model system of organogenesis, as repeated and reciprocal epithelial and mesenchymal interactions pattern placode formation and outgrowth. Less is known about the developmental and genetic bases of tooth formation and replacement in polyphyodonts, which are vertebrates with continual tooth replacement. Here, we leverage natural variation in the threespine stickleback fish Gasterosteus aculeatus to investigate the genetic basis of tooth development and replacement. We find that two derived freshwater stickleback populations have both convergently evolved more ventral pharyngeal teeth through heritable genetic changes. In both populations, evolved tooth gain manifests late in development. Using pulse-chase vital dye labeling to mark newly forming teeth in adult fish, we find that both high-toothed freshwater populations have accelerated tooth replacement rates relative to low-toothed ancestral marine fish. Despite the similar evolved phenotype of more teeth and an accelerated adult replacement rate, the timing of tooth number divergence and the spatial patterns of newly formed adult teeth are different in the two populations, suggesting distinct developmental mechanisms. Using genome-wide linkage mapping in marine-freshwater F2 genetic crosses, we find that the genetic basis of evolved tooth gain in the two freshwater populations is largely distinct. Together, our results support a model whereby increased tooth number and an accelerated tooth replacement rate have evolved convergently in two independently derived freshwater stickleback populations using largely distinct developmental and genetic mechanisms. PMID:26062935

  3. Genome of the pitcher plant Cephalotus reveals genetic changes associated with carnivory.

    PubMed

    Fukushima, Kenji; Fang, Xiaodong; Alvarez-Ponce, David; Cai, Huimin; Carretero-Paulet, Lorenzo; Chen, Cui; Chang, Tien-Hao; Farr, Kimberly M; Fujita, Tomomichi; Hiwatashi, Yuji; Hoshi, Yoshikazu; Imai, Takamasa; Kasahara, Masahiro; Librado, Pablo; Mao, Likai; Mori, Hitoshi; Nishiyama, Tomoaki; Nozawa, Masafumi; Pálfalvi, Gergő; Pollard, Stephen T; Rozas, Julio; Sánchez-Gracia, Alejandro; Sankoff, David; Shibata, Tomoko F; Shigenobu, Shuji; Sumikawa, Naomi; Uzawa, Taketoshi; Xie, Meiying; Zheng, Chunfang; Pollock, David D; Albert, Victor A; Li, Shuaicheng; Hasebe, Mitsuyasu

    2017-02-06

    Carnivorous plants exploit animals as a nutritional source and have inspired long-standing questions about the origin and evolution of carnivory-related traits. To investigate the molecular bases of carnivory, we sequenced the genome of the heterophyllous pitcher plant Cephalotus follicularis, in which we succeeded in regulating the developmental switch between carnivorous and non-carnivorous leaves. Transcriptome comparison of the two leaf types and gene repertoire analysis identified genetic changes associated with prey attraction, capture, digestion and nutrient absorption. Analysis of digestive fluid proteins from C. follicularis and three other carnivorous plants with independent carnivorous origins revealed repeated co-options of stress-responsive protein lineages coupled with convergent amino acid substitutions to acquire digestive physiology. These results imply constraints on the available routes to evolve plant carnivory.

  4. Insights into the Sulfur Mineralogy of Martian Soil at Rocknest, Gale Crater, Enabled by Evolved Gas Analyses

    NASA Technical Reports Server (NTRS)

    McAdam, A.; Franz, H.; Archer, P., Jr.; Freissinet, C.; Sutter, B.; Glavin, D.; Eigenbrode, J.; Bower, H.; Stern, J.; Mchaffy, P.; hide

    2013-01-01

    The first solid samples analysed by the Chemistry and Mineralogy (CheMin) instrument and Sample Analysis at Mars (SAM) instrument suite on the Mars Science Laboratory (MSL) consisted of < 150 m fines sieved from aeolian bedform material at a site named Rocknest. All four samples of this material analyzed by SAM s evolved gas analysis mass spectrometry (EGA-MS) released H2O, CO2, O2, and SO2 (Fig. 1), as well as H2S and possibly NO. This is the first time evolved SO2 (and evolved H2S) has been detected from thermal analysis of martian materials. The identity of these evolved gases and temperature (T) of evolution can support mineral detection by CheMin and place constraints on trace volatile-bearing phases present below the CheMin detection limit or difficult to characterize with XRD (e.g., X-ray amorphous phases). Constraints on phases responsible for evolved CO2 and O2 are detailed elsewhere [1,2,3]. Here, we focus on potential constraints on phases that evolved SO2, H2S, and H2O during thermal analysis.

  5. Evolutionary Genomics of Genes Involved in Olfactory Behavior in the Drosophila melanogaster Species Group

    PubMed Central

    Lavagnino, Nicolás; Serra, François; Arbiza, Leonardo; Dopazo, Hernán; Hasson, Esteban

    2012-01-01

    Previous comparative genomic studies of genes involved in olfactory behavior in Drosophila focused only on particular gene families such as odorant receptor and/or odorant binding proteins. However, olfactory behavior has a complex genetic architecture that is orchestrated by many interacting genes. In this paper, we present a comparative genomic study of olfactory behavior in Drosophila including an extended set of genes known to affect olfactory behavior. We took advantage of the recent burst of whole genome sequences and the development of powerful statistical tools to analyze genomic data and test evolutionary and functional hypotheses of olfactory genes in the six species of the Drosophila melanogaster species group for which whole genome sequences are available. Our study reveals widespread purifying selection and limited incidence of positive selection on olfactory genes. We show that the pace of evolution of olfactory genes is mostly independent of the life cycle stage, and of the number of life cycle stages, in which they participate in olfaction. However, we detected a relationship between evolutionary rates and the position that the gene products occupy in the olfactory system, genes occupying central positions tend to be more constrained than peripheral genes. Finally, we demonstrate that specialization to one host does not seem to be associated with bursts of adaptive evolution in olfactory genes in D. sechellia and D. erecta, the two specialists species analyzed, but rather different lineages have idiosyncratic evolutionary histories in which both historical and ecological factors have been involved. PMID:22346339

  6. Microeconomic principles explain an optimal genome size in bacteria.

    PubMed

    Ranea, Juan A G; Grant, Alastair; Thornton, Janet M; Orengo, Christine A

    2005-01-01

    Bacteria can clearly enhance their survival by expanding their genetic repertoire. However, the tight packing of the bacterial genome and the fact that the most evolved species do not necessarily have the biggest genomes suggest there are other evolutionary factors limiting their genome expansion. To clarify these restrictions on size, we studied those protein families contributing most significantly to bacterial-genome complexity. We found that all bacteria apply the same basic and ancestral 'molecular technology' to optimize their reproductive efficiency. The same microeconomics principles that define the optimum size in a factory can also explain the existence of a statistical optimum in bacterial genome size. This optimum is reached when the bacterial genome obtains the maximum metabolic complexity (revenue) for minimal regulatory genes (logistic cost).

  7. Complete genome sequences of two novel autographiviruses infecting a bacterium from the Pseudomonas fluorescens group.

    PubMed

    Nowicki, Grzegorz; Walkowiak-Nowicka, Karolina; Zemleduch-Barylska, Agata; Mleczko, Anna; Frąckowiak, Patryk; Nowaczyk, Natalia; Kozdrowska, Emilia; Barylski, Jakub

    2017-09-01

    In this paper, we describe two independent isolates of a new member of the subfamily Autographivirinae, Pseudomonas phage KNP. The type strain (KNP) has a linear, 40,491-bp-long genome with GC content of 57.3%, and 50 coding DNA sequences (CDSs). The genome of the second strain (WRT) contains one CDS less, encodes a significantly different tail fiber protein and is shorter (40,214 bp; GC content, 57.4%). Phylogenetic analysis indicates that both KNP and WRT belong to the genus T7virus. Together with genetically similar Pseudomonas phages (gh-1, phiPSA2, phiPsa17, PPPL-1, shl2, phi15, PPpW-4, UNO-SLW4, phiIBB-PF7A, Pf-10, and Phi-S1), they form a divergent yet coherent group that stands apart from the T7-like viruses (sensu lato). Analysis of the diversity of this group and its relatedness to other members of the subfamily Autographivirinae led us to the conclusion that this group might be considered as a candidate for a new genus.

  8. Origin and Possible Genetic Recombination of the Middle East Respiratory Syndrome Coronavirus from the First Imported Case in China: Phylogenetics and Coalescence Analysis.

    PubMed

    Wang, Yanqun; Liu, Di; Shi, Weifeng; Lu, Roujian; Wang, Wenling; Zhao, Yanjie; Deng, Yao; Zhou, Weimin; Ren, Hongguang; Wu, Jun; Wang, Yu; Wu, Guizhen; Gao, George F; Tan, Wenjie

    2015-09-08

    The Middle East respiratory syndrome coronavirus (MERS-CoV) causes a severe acute respiratory tract infection with a high fatality rate in humans. Coronaviruses are capable of infecting multiple species and can evolve rapidly through recombination events. Here, we report the complete genomic sequence analysis of a MERS-CoV strain imported to China from South Korea. The imported virus, provisionally named ChinaGD01, belongs to group 3 in clade B in the whole-genome phylogenetic tree and also has a similar tree topology structure in the open reading frame 1a and -b (ORF1ab) gene segment but clusters with group 5 of clade B in the tree constructed using the S gene. Genetic recombination analysis and lineage-specific single-nucleotide polymorphism (SNP) comparison suggest that the imported virus is a recombinant comprising group 3 and group 5 elements. The time-resolved phylogenetic estimation indicates that the recombination event likely occurred in the second half of 2014. Genetic recombination events between group 3 and group 5 of clade B may have implications for the transmissibility of the virus. The recent outbreak of MERS-CoV in South Korea has attracted global media attention due to the speed of spread and onward transmission. Here, we present the complete genome of the first imported MERS-CoV case in China and demonstrate genetic recombination events between group 3 and group 5 of clade B that may have implications for the transmissibility of MERS-CoV. Copyright © 2015 Wang et al.

  9. Localization of a bacterial group II intron-encoded protein in eukaryotic nuclear splicing-related cell compartments.

    PubMed

    Nisa-Martínez, Rafael; Laporte, Philippe; Jiménez-Zurdo, José Ignacio; Frugier, Florian; Crespi, Martin; Toro, Nicolás

    2013-01-01

    Some bacterial group II introns are widely used for genetic engineering in bacteria, because they can be reprogrammed to insert into the desired DNA target sites. There is considerable interest in developing this group II intron gene targeting technology for use in eukaryotes, but nuclear genomes present several obstacles to the use of this approach. The nuclear genomes of eukaryotes do not contain group II introns, but these introns are thought to have been the progenitors of nuclear spliceosomal introns. We investigated the expression and subcellular localization of the bacterial RmInt1 group II intron-encoded protein (IEP) in Arabidopsis thaliana protoplasts. Following the expression of translational fusions of the wild-type protein and several mutant variants with EGFP, the full-length IEP was found exclusively in the nucleolus, whereas the maturase domain alone targeted EGFP to nuclear speckles. The distribution of the bacterial RmInt1 IEP in plant cell protoplasts suggests that the compartmentalization of eukaryotic cells into nucleus and cytoplasm does not prevent group II introns from invading the host genome. Furthermore, the trafficking of the IEP between the nucleolus and the speckles upon maturase inactivation is consistent with the hypothesis that the spliceosomal machinery evolved from group II introns.

  10. Localization of a Bacterial Group II Intron-Encoded Protein in Eukaryotic Nuclear Splicing-Related Cell Compartments

    PubMed Central

    Nisa-Martínez, Rafael; Laporte, Philippe; Jiménez-Zurdo, José Ignacio; Frugier, Florian; Crespi, Martin; Toro, Nicolás

    2013-01-01

    Some bacterial group II introns are widely used for genetic engineering in bacteria, because they can be reprogrammed to insert into the desired DNA target sites. There is considerable interest in developing this group II intron gene targeting technology for use in eukaryotes, but nuclear genomes present several obstacles to the use of this approach. The nuclear genomes of eukaryotes do not contain group II introns, but these introns are thought to have been the progenitors of nuclear spliceosomal introns. We investigated the expression and subcellular localization of the bacterial RmInt1 group II intron-encoded protein (IEP) in Arabidopsis thaliana protoplasts. Following the expression of translational fusions of the wild-type protein and several mutant variants with EGFP, the full-length IEP was found exclusively in the nucleolus, whereas the maturase domain alone targeted EGFP to nuclear speckles. The distribution of the bacterial RmInt1 IEP in plant cell protoplasts suggests that the compartmentalization of eukaryotic cells into nucleus and cytoplasm does not prevent group II introns from invading the host genome. Furthermore, the trafficking of the IEP between the nucleolus and the speckles upon maturase inactivation is consistent with the hypothesis that the spliceosomal machinery evolved from group II introns. PMID:24391881

  11. Genome tailoring powered production of isobutanol in continuous CO2/H2 blend fermentation using engineered acetogen biocatalyst.

    PubMed

    Gak, Eugene; Tyurin, Michael; Kiriukhin, Michael

    2014-05-01

    The cell energy fraction that powered maintenance and expression of genes encoding pro-phage elements, pta-ack cluster, early sporulation, sugar ABC transporter periplasmic proteins, 6-phosphofructokinase, pyruvate kinase, and fructose-1,6-disphosphatase in acetogen Clostridium sp. MT871 was re-directed to power synthetic operon encoding isobutanol biosynthesis at the expense of these genes achieved via their elimination. Genome tailoring decreased cell duplication time by 7.0 ± 0.1 min (p < 0.05) compared to the parental strain, with intact genome and cell duplication time of 68 ± 1 min (p < 0.05). Clostridium sp. MT871 with tailored genome was UVC-mutated to withstand 6.1 % isobutanol in fermentation broth to prevent product inhibition in an engineered commercial biocatalyst producing 5 % (674.5 mM) isobutanol during two-step continuous fermentation of CO2/H2 gas blend. Biocatalyst Clostridium sp. MT871RG- 11IBR6 was engineered to express six copies of synthetic operon comprising optimized synthetic format dehydrogenase, pyruvate formate lyase, acetolactate synthase, acetohydroxyacid reductoisomerase, 2,3-dihydroxy-isovalerate dehydratase, branched-chain alpha-ketoacid decarboxylase gene, aldehyde dehydrogenase, and alcohol dehydrogenase, regaining cell duplication time of 68 ± 1 min (p < 0.05) for the parental strain. This is the first report on isobutanol production by an engineered acetogen biocatalyst suitable for commercial manufacturing of this chemical/fuel using continuous fermentation of CO2/H2 blend thus contributing to the reversal of global warming.

  12. Analysis of the full genome of human group C rotaviruses reveals lineage diversification and reassortment.

    PubMed

    Medici, Maria Cristina; Tummolo, Fabio; Martella, Vito; Arcangeletti, Maria Cristina; De Conto, Flora; Chezzi, Carlo; Fehér, Enikő; Marton, Szilvia; Calderaro, Adriana; Bányai, Krisztián

    2016-08-01

    Group C rotaviruses (RVC) are enteric pathogens of humans and animals. Whole-genome sequences are available only for few RVCs, leaving gaps in our knowledge about their genetic diversity. We determined the full-length genome sequence of two human RVCs (PR2593/2004 and PR713/2012), detected in Italy from hospital-based surveillance for rotavirus infection in 2004 and 2012. In the 11 RNA genomic segments, the two Italian RVCs segregated within separate intra-genotypic lineages showed variation ranging from 1.9 % (VP6) to 15.9 % (VP3) at the nucleotide level. Comprehensive analysis of human RVC sequences available in the databases allowed us to reveal the existence of at least two major genome configurations, defined as type I and type II. Human RVCs of type I were all associated with the M3 VP3 genotype, including the Italian strain PR2593/2004. Conversely, human RVCs of type II were all associated with the M2 VP3 genotype, including the Italian strain PR713/2012. Reassortant RVC strains between these major genome configurations were identified. Although only a few full-genome sequences of human RVCs, mostly of Asian origin, are available, the analysis of human RVC sequences retrieved from the databases indicates that at least two intra-genotypic RVC lineages circulate in European countries. Gathering more sequence data is necessary to develop a standardized genotype and intra-genotypic lineage classification system useful for epidemiological investigations and avoiding confusion in the literature.

  13. Recommendations for the classification of group A rotaviruses using all 11 genomic RNA segments.

    PubMed

    Matthijnssens, Jelle; Ciarlet, Max; Rahman, Mustafizur; Attoui, Houssam; Bányai, Krisztián; Estes, Mary K; Gentsch, Jon R; Iturriza-Gómara, Miren; Kirkwood, Carl D; Martella, Vito; Mertens, Peter P C; Nakagomi, Osamu; Patton, John T; Ruggeri, Franco M; Saif, Linda J; Santos, Norma; Steyer, Andrej; Taniguchi, Koki; Desselberger, Ulrich; Van Ranst, Marc

    2008-01-01

    Recently, a classification system was proposed for rotaviruses in which all the 11 genomic RNA segments are used (Matthijnssens et al. in J Virol 82:3204-3219, 2008). Based on nucleotide identity cut-off percentages, different genotypes were defined for each genome segment. A nomenclature for the comparison of complete rotavirus genomes was considered in which the notations Gx-P[x]-Ix-Rx-Cx-Mx-Ax-Nx-Tx-Ex-Hx are used for the VP7-VP4-VP6-VP1-VP2-VP3-NSP1-NSP2-NSP3-NSP4-NSP5/6 encoding genes, respectively. This classification system is an extension of the previously applied genotype-based system which made use of the rotavirus gene segments encoding VP4, VP7, VP6, and NSP4. In order to assign rotavirus strains to one of the established genotypes or a new genotype, a standard procedure is proposed in this report. As more human and animal rotavirus genomes will be completely sequenced, new genotypes for each of the 11 gene segments may be identified. A Rotavirus Classification Working Group (RCWG) including specialists in molecular virology, infectious diseases, epidemiology, and public health was formed, which can assist in the appropriate delineation of new genotypes, thus avoiding duplications and helping minimize errors. Scientists discovering a potentially new rotavirus genotype for any of the 11 gene segments are invited to send the novel sequence to the RCWG, where the sequence will be analyzed, and a new nomenclature will be advised as appropriate. The RCWG will update the list of classified strains regularly and make this accessible on a website. Close collaboration with the Study Group Reoviridae of the International Committee on the Taxonomy of Viruses will be maintained.

  14. Yeast Sub1 and human PC4 are G-quadruplex binding proteins that suppress genome instability at co-transcriptionally formed G4 DNA.

    PubMed

    Lopez, Christopher R; Singh, Shivani; Hambarde, Shashank; Griffin, Wezley C; Gao, Jun; Chib, Shubeena; Yu, Yang; Ira, Grzegorz; Raney, Kevin D; Kim, Nayun

    2017-06-02

    G-quadruplex or G4 DNA is a non-B secondary DNA structure consisting of a stacked array of guanine-quartets that can disrupt critical cellular functions such as replication and transcription. When sequences that can adopt Non-B structures including G4 DNA are located within actively transcribed genes, the reshaping of DNA topology necessary for transcription process stimulates secondary structure-formation thereby amplifying the potential for genome instability. Using a reporter assay designed to study G4-induced recombination in the context of an actively transcribed locus in Saccharomyces cerevisiae, we tested whether co-transcriptional activator Sub1, recently identified as a G4-binding factor, contributes to genome maintenance at G4-forming sequences. Our data indicate that, upon Sub1-disruption, genome instability linked to co-transcriptionally formed G4 DNA in Top1-deficient cells is significantly augmented and that its highly conserved DNA binding domain or the human homolog PC4 is sufficient to suppress G4-associated genome instability. We also show that Sub1 interacts specifically with co-transcriptionally formed G4 DNA in vivo and that yeast cells become highly sensitivity to G4-stabilizing chemical ligands by the loss of Sub1. Finally, we demonstrate the physical and genetic interaction of Sub1 with the G4-resolving helicase Pif1, suggesting a possible mechanism by which Sub1 suppresses instability at G4 DNA. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  15. An Experimentally-Supported Genome-Scale Metabolic Network Reconstruction for Yersinia pestis CO92

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Charusanti, Pep; Chauhan, Sadhana; Mcateer, Kathleen

    2011-10-13

    Yersinia pestis is a gram-negative bacterium that causes plague, a disease linked historically to the Black Death in Europe during the Middle Ages and to several outbreaks during the modern era. Metabolism in Y. pestis displays remarkable flexibility and robustness, allowing the bacterium to proliferate in both warm-blooded mammalian hosts and cold-blooded insect vectors such as fleas. Here we report a genome-scale reconstruction and mathematical model of metabolism for Y. pestis CO92 and supporting experimental growth and metabolite measurements. The model contains 815 genes, 678 proteins, 963 unique metabolites and 1678 reactions, accurately simulates growth on a range of carbonmore » sources both qualitatively and quantitatively, and identifies gaps in several key biosynthetic pathways and suggests how those gaps might be filled. Furthermore, our model presents hypotheses to explain certain known nutritional requirements characteristic of this strain. Y. pestis continues to be a dangerous threat to human health during modern times. The Y. pestis genome-scale metabolic reconstruction presented here, which has been benchmarked against experimental data and correctly reproduces known phenotypes, thus provides an in silico platform with which to investigate the metabolism of this important human pathogen.« less

  16. Genomic basis for the convergent evolution of electric organs

    PubMed Central

    Gallant, Jason R.; Traeger, Lindsay L.; Volkening, Jeremy D.; Moffett, Howell; Chen, Po-Hao; Novina, Carl D.; Phillips, George N.; Anand, Rene; Wells, Gregg B.; Pinch, Matthew; Güth, Robert; Unguez, Graciela A.; Albert, James S.; Zakon, Harold H.; Samanta, Manoj P.; Sussman, Michael R.

    2017-01-01

    Little is known about the genetic basis of convergent traits that originate repeatedly over broad taxonomic scales. The myogenic electric organ has evolved six times in fishes to produce electric fields used in communication, navigation, predation, or defense. We have examined the genomic basis of the convergent anatomical and physiological origins of these organs by assembling the genome of the electric eel (Electrophorus electricus) and sequencing electric organ and skeletal muscle transcriptomes from three lineages that have independently evolved electric organs. Our results indicate that, despite millions of years of evolution and large differences in the morphology of electric organ cells, independent lineages have leveraged similar transcription factors and developmental and cellular pathways in the evolution of electric organs. PMID:24970089

  17. Genome Content and Phylogenomics Reveal both Ancestral and Lateral Evolutionary Pathways in Plant-Pathogenic Streptomyces Species

    PubMed Central

    Huguet-Tapia, Jose C.; Lefebure, Tristan; Badger, Jonathan H.; Guan, Dongli; Stanhope, Michael J.

    2016-01-01

    Streptomyces spp. are highly differentiated actinomycetes with large, linear chromosomes that encode an arsenal of biologically active molecules and catabolic enzymes. Members of this genus are well equipped for life in nutrient-limited environments and are common soil saprophytes. Out of the hundreds of species in the genus Streptomyces, a small group has evolved the ability to infect plants. The recent availability of Streptomyces genome sequences, including four genomes of pathogenic species, provided an opportunity to characterize the gene content specific to these pathogens and to study phylogenetic relationships among them. Genome sequencing, comparative genomics, and phylogenetic analysis enabled us to discriminate pathogenic from saprophytic Streptomyces strains; moreover, we calculated that the pathogen-specific genome contains 4,662 orthologs. Phylogenetic reconstruction suggested that Streptomyces scabies and S. ipomoeae share an ancestor but that their biosynthetic clusters encoding the required virulence factor thaxtomin have diverged. In contrast, S. turgidiscabies and S. acidiscabies, two relatively unrelated pathogens, possess highly similar thaxtomin biosynthesis clusters, which suggests that the acquisition of these genes was through lateral gene transfer. PMID:26826232

  18. Eggs, embryos and the evolution of imprinting: insights from the platypus genome.

    PubMed

    Renfree, Marilyn B; Papenfuss, Anthony T; Shaw, Geoff; Pask, Andrew J

    2009-01-01

    Genomic imprinting is widespread in eutherian and marsupial mammals. Although there have been many hypotheses to explain why genomic imprinting evolved in mammals, few have examined how it arose. The host defence hypothesis suggests that imprinting evolved from existing mechanisms within the cell that act to silence foreign DNA elements that insert into the genome. However, the changes to the mammalian genome that accompanied the evolution of imprinting have been hard to define due to the absence of large-scale genomic resources from all extant classes. The recent release of the platypus genome sequence has provided the first opportunity to make comparisons between prototherian (monotreme, which show no signs of imprinting) and therian (marsupial and eutherian, which have imprinting) mammals. We compared the distribution of repeat elements known to attract epigenetic silencing across the genome from monotremes and therian mammals, particularly focusing on the orthologous imprinted regions. Our analyses show that the platypus has significantly fewer repeats of certain classes in the regions of the genome that have become imprinted in therian mammals. The accumulation of repeats, especially long-terminal repeats and DNA elements, in therian imprinted genes and gene clusters therefore appears to be coincident with, and may have been a potential driving force in, the development of mammalian genomic imprinting. Comparative platypus genome analyses of orthologous imprinted regions have provided strong support for the host defence hypothesis to explain the origin of imprinting.

  19. Inducible CRISPR genome-editing tool: classifications and future trends.

    PubMed

    Dai, Xiaofeng; Chen, Xiao; Fang, Qiuwu; Li, Jia; Bai, Zhonghu

    2018-06-01

    The discovery of CRISPR-Cas9/dCas9 system has reinforced our ability and revolutionized our history in genome engineering. While Cas9 and dCas9 are programed to modulate gene expression by introducing DNA breaks, blocking transcription factor recruitment or dragging functional groups towards the targeted sites, sgRNAs determine the genomic loci where the modulation occurs. The off-target problem, due to limited sgRNA specificity and genome complexity of many species, has posed concerns for the wide application of this revolutionary technique. To solve this problem and, more importantly, gain power over gene functionality and cell fate control, inducible strategies have been continuously evolved to offer tailored solutions to address specific biological questions. By reviewing recent advances in inducible CRISPR system design and critical elements potentially adding values to such systems, we classify current approaches in this domain into four mechanically distinct categories, namely, "split system", "allosteric system", "combinatorial system", and "transient delivery system", discuss the pros and cons of each system, and point out the under-explored areas and future directions, with the aim of enriching our toolbox of delicate life engineering.

  20. Mammalian-specific genomic functions: Newly acquired traits generated by genomic imprinting and LTR retrotransposon-derived genes in mammals.

    PubMed

    Kaneko-Ishino, Tomoko; Ishino, Fumitoshi

    2015-01-01

    Mammals, including human beings, have evolved a unique viviparous reproductive system and a highly developed central nervous system. How did these unique characteristics emerge in mammalian evolution, and what kinds of changes did occur in the mammalian genomes as evolution proceeded? A key conceptual term in approaching these issues is "mammalian-specific genomic functions", a concept covering both mammalian-specific epigenetics and genetics. Genomic imprinting and LTR retrotransposon-derived genes are reviewed as the representative, mammalian-specific genomic functions that are essential not only for the current mammalian developmental system, but also mammalian evolution itself. First, the essential roles of genomic imprinting in mammalian development, especially related to viviparous reproduction via placental function, as well as the emergence of genomic imprinting in mammalian evolution, are discussed. Second, we introduce the novel concept of "mammalian-specific traits generated by mammalian-specific genes from LTR retrotransposons", based on the finding that LTR retrotransposons served as a critical driving force in the mammalian evolution via generating mammalian-specific genes.

  1. The Physcomitrella patens chromosome-scale assembly reveals moss genome structure and evolution

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lang, Daniel; Ullrich, Kristian K.; Murat, Florent

    Here, the draft genome of the moss model, Physcomitrella patens, comprised approximately 2000 unordered scaffolds. In order to enable analyses of genome structure and evolution we generated a chromosome–scale genome assembly using genetic linkage as well as (end) sequencing of long DNA fragments. We find that 57% of the genome comprises transposable elements (TEs), some of which may be actively transposing during the life cycle. Unlike in flowering plant genomes, gene– and TE–rich regions show an overall even distribution along the chromosomes. However, the chromosomes are mono–centric with peaks of a class of Copia elements potentially coinciding with centromeres. Genemore » body methylation is evident in 5.7% of the protein–coding genes, typically coinciding with low GC and low expression. Some giant virus insertions are transcriptionally active and might protect gametes from viral infection via siRNA mediated silencing. Structure–based detection methods show that the genome evolved via two rounds of whole genome duplications (WGDs), apparently common in mosses but not in liverworts and hornworts. Several hundred genes are present in colinear regions conserved since the last common ancestor of plants. These syntenic regions are enriched for functions related to plant–specific cell growth and tissue organization. The P. patens genome lacks the TE–rich pericentromeric and gene–rich distal regions typical for most flowering plant genomes. More non–seed plant genomes are needed to unravel how plant genomes evolve, and to understand whether the P. patens genome structure is typical for mosses or bryophytes.« less

  2. The Physcomitrella patens chromosome-scale assembly reveals moss genome structure and evolution

    DOE PAGES

    Lang, Daniel; Ullrich, Kristian K.; Murat, Florent; ...

    2017-12-13

    Here, the draft genome of the moss model, Physcomitrella patens, comprised approximately 2000 unordered scaffolds. In order to enable analyses of genome structure and evolution we generated a chromosome–scale genome assembly using genetic linkage as well as (end) sequencing of long DNA fragments. We find that 57% of the genome comprises transposable elements (TEs), some of which may be actively transposing during the life cycle. Unlike in flowering plant genomes, gene– and TE–rich regions show an overall even distribution along the chromosomes. However, the chromosomes are mono–centric with peaks of a class of Copia elements potentially coinciding with centromeres. Genemore » body methylation is evident in 5.7% of the protein–coding genes, typically coinciding with low GC and low expression. Some giant virus insertions are transcriptionally active and might protect gametes from viral infection via siRNA mediated silencing. Structure–based detection methods show that the genome evolved via two rounds of whole genome duplications (WGDs), apparently common in mosses but not in liverworts and hornworts. Several hundred genes are present in colinear regions conserved since the last common ancestor of plants. These syntenic regions are enriched for functions related to plant–specific cell growth and tissue organization. The P. patens genome lacks the TE–rich pericentromeric and gene–rich distal regions typical for most flowering plant genomes. More non–seed plant genomes are needed to unravel how plant genomes evolve, and to understand whether the P. patens genome structure is typical for mosses or bryophytes.« less

  3. Duplicated genes evolve independently in allopolyploid cotton.

    Treesearch

    Richard C. Cronn; Randall L. Small; Jonathan F. Wendel

    1999-01-01

    Of the many processes that generate gene duplications, polyploidy is unique in that entire genomes are duplicated. This process has been important in the evolution of many eukaryotic groups, and it occurs with high frequency in plants. Recent evidence suggests that polyploidization may be accompanied by rapid genomic changes, but the evolutionary fate of discrete loci...

  4. Analysis of co-evolving genes in campylobacter jejuni and C. coli

    USDA-ARS?s Scientific Manuscript database

    Background: The population structure of Campylobacter has been frequently studied by MLST, for which fragments of housekeeping genes are compared. We wished to determine if the used MLST genes are representative of the complete genome. Methods: A set of 1029 core gene families (CGF) was identifie...

  5. A rapidly evolving secretome builds and patterns a sea shell

    PubMed Central

    Jackson, Daniel J; McDougall, Carmel; Green, Kathryn; Simpson, Fiona; Wörheide, Gert; Degnan, Bernard M

    2006-01-01

    Background Instructions to fabricate mineralized structures with distinct nanoscale architectures, such as seashells and coral and vertebrate skeletons, are encoded in the genomes of a wide variety of animals. In mollusks, the mantle is responsible for the extracellular production of the shell, directing the ordered biomineralization of CaCO3 and the deposition of architectural and color patterns. The evolutionary origins of the ability to synthesize calcified structures across various metazoan taxa remain obscure, with only a small number of protein families identified from molluskan shells. The recent sequencing of a wide range of metazoan genomes coupled with the analysis of gene expression in non-model animals has allowed us to investigate the evolution and process of biomineralization in gastropod mollusks. Results Here we show that over 25% of the genes expressed in the mantle of the vetigastropod Haliotis asinina encode secreted proteins, indicating that hundreds of proteins are likely to be contributing to shell fabrication and patterning. Almost 85% of the secretome encodes novel proteins; remarkably, only 19% of these have identifiable homologues in the full genome of the patellogastropod Lottia scutum. The spatial expression profiles of mantle genes that belong to the secretome is restricted to discrete mantle zones, with each zone responsible for the fabrication of one of the structural layers of the shell. Patterned expression of a subset of genes along the length of the mantle is indicative of roles in shell ornamentation. For example, Has-sometsuke maps precisely to pigmentation patterns in the shell, providing the first case of a gene product to be involved in molluskan shell pigmentation. We also describe the expression of two novel genes involved in nacre (mother of pearl) deposition. Conclusion The unexpected complexity and evolvability of this secretome and the modular design of the molluskan mantle enables diversification of shell strength and

  6. The Small Nuclear Genomes of Selaginella Are Associated with a Low Rate of Genome Size Evolution.

    PubMed

    Baniaga, Anthony E; Arrigo, Nils; Barker, Michael S

    2016-06-03

    The haploid nuclear genome size (1C DNA) of vascular land plants varies over several orders of magnitude. Much of this observed diversity in genome size is due to the proliferation and deletion of transposable elements. To date, all vascular land plant lineages with extremely small nuclear genomes represent recently derived states, having ancestors with much larger genome sizes. The Selaginellaceae represent an ancient lineage with extremely small genomes. It is unclear how small nuclear genomes evolved in Selaginella We compared the rates of nuclear genome size evolution in Selaginella and major vascular plant clades in a comparative phylogenetic framework. For the analyses, we collected 29 new flow cytometry estimates of haploid genome size in Selaginella to augment publicly available data. Selaginella possess some of the smallest known haploid nuclear genome sizes, as well as the lowest rate of genome size evolution observed across all vascular land plants included in our analyses. Additionally, our analyses provide strong support for a history of haploid nuclear genome size stasis in Selaginella Our results indicate that Selaginella, similar to other early diverging lineages of vascular land plants, has relatively low rates of genome size evolution. Further, our analyses highlight that a rapid transition to a small genome size is only one route to an extremely small genome. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  7. Genomic evidence for plant-parasitic nematodes as the earliest Wolbachia hosts

    PubMed Central

    Brown, Amanda M. V.; Wasala, Sulochana K.; Howe, Dana K.; Peetz, Amy B.; Zasada, Inga A.; Denver, Dee R.

    2016-01-01

    Wolbachia, one of the most widespread endosymbionts, is a target for biological control of mosquito-borne diseases (malaria and dengue virus), and antibiotic elimination of infectious filarial nematodes. We sequenced and analyzed the genome of a new Wolbachia strain (wPpe) in the plant-parasitic nematode Pratylenchus penetrans. Phylogenomic analyses placed wPpe as the earliest diverging Wolbachia, suggesting two evolutionary invasions into nematodes. The next branches comprised strains in sap-feeding insects, suggesting Wolbachia may have first evolved as a nutritional mutualist. Genome size, protein content, %GC, and repetitive DNA allied wPpe with mutualistic Wolbachia, whereas gene repertoire analyses placed it between parasite (A, B) and mutualist (C, D, F) groups. Conservation of iron metabolism genes across Wolbachia suggests iron homeostasis as a potential factor in its success. This study enhances our understanding of this globally pandemic endosymbiont, highlighting genetic patterns associated with host changes. Combined with future work on this strain, these genomic data could help provide potential new targets for plant-parasitic nematode control. PMID:27734894

  8. The Genome Biology of Effector Gene Evolution in Filamentous Plant Pathogens.

    PubMed

    Sánchez-Vallet, Andrea; Fouché, Simone; Fudal, Isabelle; Hartmann, Fanny E; Soyer, Jessica L; Tellier, Aurélien; Croll, Daniel

    2018-05-16

    Filamentous pathogens, including fungi and oomycetes, pose major threats to global food security. Crop pathogens cause damage by secreting effectors that manipulate the host to the pathogen's advantage. Genes encoding such effectors are among the most rapidly evolving genes in pathogen genomes. Here, we review how the major characteristics of the emergence, function, and regulation of effector genes are tightly linked to the genomic compartments where these genes are located in pathogen genomes. The presence of repetitive elements in these compartments is associated with elevated rates of point mutations and sequence rearrangements with a major impact on effector diversification. The expression of many effectors converges on an epigenetic control mediated by the presence of repetitive elements. Population genomics analyses showed that rapidly evolving pathogens show high rates of turnover at effector loci and display a mosaic in effector presence-absence polymorphism among strains. We conclude that effective pathogen containment strategies require a thorough understanding of the effector genome biology and the pathogen's potential for rapid adaptation. Expected final online publication date for the Annual Review of Phytopathology Volume 56 is August 25, 2018. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.

  9. Evolved Gas Analyses of the Murray Formation in Gale Crater, Mars: Results of the Curiosity Rover's Sample Analysis at Mars (SAM) Instrument

    NASA Technical Reports Server (NTRS)

    Sutter, B.; McAdam, A. C.; Rampe, E. B.; Thompson, L. M.; Ming, D. W.; Mahaffy, P. R.; Navarro-Gonzalez, R.; Stern, J. C.; Eigenbrode, J. L.; Archer, P. D.

    2017-01-01

    The Sample Analysis at Mars (SAM) instrument aboard the Mars Science Laboratory rover has analyzed 13 samples from Gale Crater. All SAM-evolved gas analyses have yielded a multitude of volatiles (e.g., H2O, SO2, H2S, CO2, CO, NO, O2, HCl) [1- 6]. The objectives of this work are to 1) Characterize recent evolved SO2, CO2, O2, and NO gas traces of the Murray formation mudstone, 2) Constrain sediment mineralogy/composition based on SAM evolved gas analysis (SAM-EGA), and 3) Discuss the implications of these results relative to understanding the geological history of Gale Crater.

  10. Genomic analysis reveals hidden biodiversity within colugos, the sister group to primates

    PubMed Central

    Mason, Victor C.; Li, Gang; Minx, Patrick; Schmitz, Jürgen; Churakov, Gennady; Doronina, Liliya; Melin, Amanda D.; Dominy, Nathaniel J.; Lim, Norman T-L.; Springer, Mark S.; Wilson, Richard K.; Warren, Wesley C.; Helgen, Kristofer M.; Murphy, William J.

    2016-01-01

    Colugos are among the most poorly studied mammals despite their centrality to resolving supraordinal primate relationships. Two described species of these gliding mammals are the sole living members of the order Dermoptera, distributed throughout Southeast Asia. We generated a draft genome sequence for a Sunda colugo and a Philippine colugo reference alignment, and used these to identify colugo-specific genetic changes that were enriched in sensory and musculoskeletal-related genes that likely underlie their nocturnal and gliding adaptations. Phylogenomic analysis and catalogs of rare genomic changes overwhelmingly support the contested hypothesis that colugos are the sister group to primates (Primatomorpha), to the exclusion of treeshrews. We captured ~140 kb of orthologous sequence data from colugo museum specimens sampled across their range and identified large genetic differences between many geographically isolated populations that may result in a >300% increase in the number of recognized colugo species. Our results identify conservation units to mitigate future losses of this enigmatic mammalian order. PMID:27532052

  11. Dynamics and control of state-dependent networks for probing genomic organization

    PubMed Central

    Rajapakse, Indika; Groudine, Mark; Mesbahi, Mehran

    2011-01-01

    A state-dependent dynamic network is a collection of elements that interact through a network, whose geometry evolves as the state of the elements changes over time. The genome is an intriguing example of a state-dependent network, where chromosomal geometry directly relates to genomic activity, which in turn strongly correlates with geometry. Here we examine various aspects of a genomic state-dependent dynamic network. In particular, we elaborate on one of the important ramifications of viewing genomic networks as being state-dependent, namely, their controllability during processes of genomic reorganization such as in cell differentiation. PMID:21911407

  12. Evolution and genome architecture in fungal plant pathogens.

    PubMed

    Möller, Mareike; Stukenbrock, Eva H

    2017-12-01

    The fungal kingdom comprises some of the most devastating plant pathogens. Sequencing the genomes of fungal pathogens has shown a remarkable variability in genome size and architecture. Population genomic data enable us to understand the mechanisms and the history of changes in genome size and adaptive evolution in plant pathogens. Although transposable elements predominantly have negative effects on their host, fungal pathogens provide prominent examples of advantageous associations between rapidly evolving transposable elements and virulence genes that cause variation in virulence phenotypes. By providing homogeneous environments at large regional scales, managed ecosystems, such as modern agriculture, can be conducive for the rapid evolution and dispersal of pathogens. In this Review, we summarize key examples from fungal plant pathogen genomics and discuss evolutionary processes in pathogenic fungi in the context of molecular evolution, population genomics and agriculture.

  13. Rate of novel host invasion affects adaptability of evolving RNA virus lineages.

    PubMed

    Morley, Valerie J; Mendiola, Sandra Y; Turner, Paul E

    2015-08-22

    Although differing rates of environmental turnover should be consequential for the dynamics of adaptive change, this idea has been rarely examined outside of theory. In particular, the importance of RNA viruses in disease emergence warrants experiments testing how differing rates of novel host invasion may impact the ability of viruses to adaptively shift onto a novel host. To test whether the rate of environmental turnover influences adaptation, we experimentally evolved 144 Sindbis virus lineages in replicated tissue-culture environments, which transitioned from being dominated by a permissive host cell type to a novel host cell type. The rate at which the novel host 'invaded' the environment varied by treatment. The fitness (growth rate) of evolved virus populations was measured on each host type, and molecular substitutions were mapped via whole genome consensus sequencing. Results showed that virus populations more consistently reached high fitness levels on the novel host when the novel host 'invaded' the environment more gradually, and gradual invasion resulted in less variable genomic outcomes. Moreover, virus populations that experienced a rapid shift onto the novel host converged upon different genotypes than populations that experienced a gradual shift onto the novel host, suggesting a strong effect of historical contingency. © 2015 The Author(s).

  14. Ethical considerations in genomic testing for hematologic disorders.

    PubMed

    Marron, Jonathan M; Joffe, Steven

    2017-07-27

    As our technological capacities improve, genomic testing is increasingly integrating into patient care. The field of clinical hematology is no exception. Genomic testing carries great promise, but several ethical issues must be considered whenever such testing is performed. This review addresses these ethical considerations, including issues surrounding informed consent and the uncertainty of the results of genomic testing; the challenge of incidental findings; and possible inequities in access to and benefit from such testing. Genomic testing is likely to transform the practice of both benign and malignant hematology, but clinicians must carefully consider these core ethical issues in order to make the most of this exciting and evolving technology. © 2017 by The American Society of Hematology.

  15. Reproductive Mode and the Evolution of Genome Size and Structure in Caenorhabditis Nematodes

    PubMed Central

    Fierst, Janna L.; Willis, John H.; Thomas, Cristel G.; Wang, Wei; Reynolds, Rose M.; Ahearne, Timothy E.; Cutter, Asher D.; Phillips, Patrick C.

    2015-01-01

    The self-fertile nematode worms Caenorhabditis elegans, C. briggsae, and C. tropicalis evolved independently from outcrossing male-female ancestors and have genomes 20-40% smaller than closely related outcrossing relatives. This pattern of smaller genomes for selfing species and larger genomes for closely related outcrossing species is also seen in plants. We use comparative genomics, including the first high quality genome assembly for an outcrossing member of the genus (C. remanei) to test several hypotheses for the evolution of genome reduction under a change in mating system. Unlike plants, it does not appear that reductions in the number of repetitive elements, such as transposable elements, are an important contributor to the change in genome size. Instead, all functional genomic categories are lost in approximately equal proportions. Theory predicts that self-fertilization should equalize the effective population size, as well as the resulting effects of genetic drift, between the X chromosome and autosomes. Contrary to this, we find that the self-fertile C. briggsae and C. elegans have larger intergenic spaces and larger protein-coding genes on the X chromosome when compared to autosomes, while C. remanei actually has smaller introns on the X chromosome than either self-reproducing species. Rather than being driven by mutational biases and/or genetic drift caused by a reduction in effective population size under self reproduction, changes in genome size in this group of nematodes appear to be caused by genome-wide patterns of gene loss, most likely generated by genomic adaptation to self reproduction per se. PMID:26114425

  16. multi-dice: r package for comparative population genomic inference under hierarchical co-demographic models of independent single-population size changes.

    PubMed

    Xue, Alexander T; Hickerson, Michael J

    2017-11-01

    Population genetic data from multiple taxa can address comparative phylogeographic questions about community-scale response to environmental shifts, and a useful strategy to this end is to employ hierarchical co-demographic models that directly test multi-taxa hypotheses within a single, unified analysis. This approach has been applied to classical phylogeographic data sets such as mitochondrial barcodes as well as reduced-genome polymorphism data sets that can yield 10,000s of SNPs, produced by emergent technologies such as RAD-seq and GBS. A strategy for the latter had been accomplished by adapting the site frequency spectrum to a novel summarization of population genomic data across multiple taxa called the aggregate site frequency spectrum (aSFS), which potentially can be deployed under various inferential frameworks including approximate Bayesian computation, random forest and composite likelihood optimization. Here, we introduce the r package multi-dice, a wrapper program that exploits existing simulation software for flexible execution of hierarchical model-based inference using the aSFS, which is derived from reduced genome data, as well as mitochondrial data. We validate several novel software features such as applying alternative inferential frameworks, enforcing a minimal threshold of time surrounding co-demographic pulses and specifying flexible hyperprior distributions. In sum, multi-dice provides comparative analysis within the familiar R environment while allowing a high degree of user customization, and will thus serve as a tool for comparative phylogeography and population genomics. © 2017 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.

  17. Functional Analysis of All Salmonid Genomes (FAASG): an international initiative supporting future salmonid research, conservation and aquaculture

    USDA-ARS?s Scientific Manuscript database

    We describe an emerging initiative - the 'Functional Analysis of All Salmonid Genomes' (FAASG), which will leverage the extensive trait diversity that has evolved since a whole genome duplication event in the salmonid ancestor, to develop an integrative understanding of the functional genomic basis ...

  18. Genomic analysis of expressed sequence tags in American black bear Ursus americanus

    PubMed Central

    2010-01-01

    Background Species of the bear family (Ursidae) are important organisms for research in molecular evolution, comparative physiology and conservation biology, but relatively little genetic sequence information is available for this group. Here we report the development and analyses of the first large scale Expressed Sequence Tag (EST) resource for the American black bear (Ursus americanus). Results Comprehensive analyses of molecular functions, alternative splicing, and tissue-specific expression of 38,757 black bear EST sequences were conducted using the dog genome as a reference. We identified 18 genes, involved in functions such as lipid catabolism, cell cycle, and vesicle-mediated transport, that are showing rapid evolution in the bear lineage Three genes, Phospholamban (PLN), cysteine glycine-rich protein 3 (CSRP3) and Troponin I type 3 (TNNI3), are related to heart contraction, and defects in these genes in humans lead to heart disease. Two genes, biphenyl hydrolase-like (BPHL) and CSRP3, contain positively selected sites in bear. Global analysis of evolution rates of hibernation-related genes in bear showed that they are largely conserved and slowly evolving genes, rather than novel and fast-evolving genes. Conclusion We provide a genomic resource for an important mammalian organism and our study sheds new light on the possible functions and evolution of bear genes. PMID:20338065

  19. Genomic analysis of expressed sequence tags in American black bear Ursus americanus.

    PubMed

    Zhao, Sen; Shao, Chunxuan; Goropashnaya, Anna V; Stewart, Nathan C; Xu, Yichi; Tøien, Øivind; Barnes, Brian M; Fedorov, Vadim B; Yan, Jun

    2010-03-26

    Species of the bear family (Ursidae) are important organisms for research in molecular evolution, comparative physiology and conservation biology, but relatively little genetic sequence information is available for this group. Here we report the development and analyses of the first large scale Expressed Sequence Tag (EST) resource for the American black bear (Ursus americanus). Comprehensive analyses of molecular functions, alternative splicing, and tissue-specific expression of 38,757 black bear EST sequences were conducted using the dog genome as a reference. We identified 18 genes, involved in functions such as lipid catabolism, cell cycle, and vesicle-mediated transport, that are showing rapid evolution in the bear lineage Three genes, Phospholamban (PLN), cysteine glycine-rich protein 3 (CSRP3) and Troponin I type 3 (TNNI3), are related to heart contraction, and defects in these genes in humans lead to heart disease. Two genes, biphenyl hydrolase-like (BPHL) and CSRP3, contain positively selected sites in bear. Global analysis of evolution rates of hibernation-related genes in bear showed that they are largely conserved and slowly evolving genes, rather than novel and fast-evolving genes. We provide a genomic resource for an important mammalian organism and our study sheds new light on the possible functions and evolution of bear genes.

  20. Variability in sex-determining mechanisms influences genome complexity in reptilia.

    PubMed

    Janes, D E; Organ, C L; Edwards, S V

    2009-01-01

    In this review, we describe the history of amniote sex determination as a classic example of Darwinian evolution. We suggest that evolutionary changes in sex determination provide a foundation for understanding important aspects of chromosome and genome organization that otherwise appear haphazard in their origins and contents. Species with genotypic sex determination often possess heteromorphic sex chromosomes, whereas species with environmental sex determination lack them. Through a series of mutations followed by selection at key genes, sex-determining mechanisms have turned over many times throughout the amniote lineage. As a consequence, amniote genomes have undergone gains or losses of sex chromosomes. We review the genomic and ecological contexts in which either temperature-dependent or genotypic sex determination has evolved. Once genotypic sex determination emerges in a lineage, viviparity and heteromorphic sex chromosomes become more likely to evolve. For example, in extinct marine reptiles, genotypic sex determination apparently led to viviparity, which in turn facilitated their pelagic radiation. Sex chromosomes comprise genome regions that differ from autosomes in recombination rate, mutation rate, levels of polymorphism, and the presence of sex-determining and sexually antagonistic genes. In short, many aspects of amniote genome complexity, life history, and adaptive radiation appear contingent on evolutionary changes in sex-determining mechanisms. Copyright 2010 S. Karger AG, Basel.

  1. Variability in Sex-Determining Mechanisms Influences Genome Complexity in Reptilia

    PubMed Central

    Janes, D.E.; Organ, C.L.; Edwards, S.V.

    2010-01-01

    In this review, we describe the history of amniote sex determination as a classic example of Darwinian evolution. We suggest that evolutionary changes in sex determination provide a foundation for understanding important aspects of chromosome and genome organization that otherwise appear haphazard in their origins and contents. Species with genotypic sex determination often possess heteromorphic sex chromosomes, whereas species with environmental sex determination lack them. Through a series of mutations followed by selection at key genes, sex-determining mechanisms have turned over many times throughout the amniote lineage. As a consequence, amniote genomes have undergone gains or losses of sex chromosomes. We review the genomic and ecological contexts in which either temperature-dependent or genotypic sex determination has evolved. Once genotypic sex determination emerges in a lineage, viviparity and heteromorphic sex chromosomes become more likely to evolve. For example, in extinct marine reptiles, genotypic sex determination apparently led to viviparity, which in turn facilitated their pelagic radiation. Sex chromosomes comprise genome regions that differ from autosomes in recombination rate, mutation rate, levels of polymorphism, and the presence of sex-determining and sexually antagonistic genes. In short, many aspects of amniote genome complexity, life history, and adaptive radiation appear contingent on evolutionary changes in sex-determining mechanisms. PMID:20203474

  2. Evidence for a high mutation rate at rapidly evolving yeast centromeres

    PubMed Central

    2011-01-01

    Background Although their role in cell division is essential, centromeres evolve rapidly in animals, plants and yeasts. Unlike the complex centromeres of plants and aminals, the point centromeres of Saccharomcyes yeasts can be readily sequenced to distinguish amongst the possible explanations for fast centromere evolution. Results Using DNA sequences of all 16 centromeres from 34 strains of Saccharomyces cerevisiae and population genomic data from Saccharomyces paradoxus, I show that centromeres in both species evolve 3 times more rapidly even than selectively unconstrained DNA. Exceptionally high levels of polymorphism seen in multiple yeast populations suggest that rapid centromere evolution does not result from the repeated selective sweeps expected under meiotic drive. I further show that there is little evidence for crossing-over or gene conversion within centromeres, although there is clear evidence for recombination in their immediate vicinity. Finally I show that the mutation spectrum at centromeres is consistent with the pattern of spontaneous mutation elsewhere in the genome. Conclusions These results indicate that rapid centromere evolution is a common phenomenon in yeast species. Furthermore, these results suggest that rapid centromere evolution does not result from the mutagenic effect of gene conversion, but from a generalised increase in the mutation rate, perhaps arising from the unusual chromatin structure at centromeres in yeast and other eukaryotes. PMID:21767380

  3. Polyploidy: adaptation to the genomic environment.

    PubMed

    Hollister, Jesse D

    2015-02-01

    Genomic evidence of ancestral whole genome duplication (WGD) and polyploidy is widespread among eukaryotic species, and especially among plants. WGD is thought to provide the raw material for adaptation in the form of duplicated genes, and polyploids are thought to benefit from both physiological and genetic buffering. Comparatively little attention has focused on the genomic challenge of polyploidy, however, although much evidence exists that polyploidy severely perturbs important cellular functions. Here, I review recent progress in the study of the re-establishment of stable meiosis in recently evolved polyploids, focusing on four plant species. This work has yielded an insight into the mechanisms underlying stabilization of genome transmission in polyploids, and is revealing remarkable parallels among diverse taxa. Importantly, these studies also provide a road map for investigating how polyploids respond to the challenge of WGD.

  4. Complete genome sequence of the bioleaching bacterium Leptospirillum sp. group II strain CF-1.

    PubMed

    Ferrer, Alonso; Bunk, Boyke; Spröer, Cathrin; Biedendieck, Rebekka; Valdés, Natalia; Jahn, Martina; Jahn, Dieter; Orellana, Omar; Levicán, Gloria

    2016-03-20

    We describe the complete genome sequence of Leptospirillum sp. group II strain CF-1, an acidophilic bioleaching bacterium isolated from an acid mine drainage (AMD). This work provides data to gain insights about adaptive response of Leptospirillum spp. to the extreme conditions of bioleaching environments. Copyright © 2016 Elsevier B.V. All rights reserved.

  5. Genome comparison of three serovar 5 pathogenic strains of Haemophilus parasuis: insights into an evolving swine pathogen.

    PubMed

    Bello-Ortí, Bernardo; Aragon, Virginia; Pina-Pedrero, Sonia; Bensaid, Albert

    2014-09-01

    Haemophilus parasuis is the causative agent of Glässer's disease, a systemic disorder characterized by polyarthritis, polyserositis and meningitis in pigs. Although it is well known that H. parasuis serovar 5 is the most prevalent serovar associated with the disease, the genetic differences among strains are only now being discovered. Genomes from two serovar 5 strains, SH0165 and 29755, are already available. Here, we present the draft genome of a third H. parasuis serovar 5 strain, the formal serovar 5 reference strain Nagasaki. An in silico genome subtractive analysis with full-length predicted genes of the three H. parasuis serovar 5 strains detected 95, 127 and 95 strain-specific genes (SSGs) for Nagasaki, SH0165 and 29755, respectively. We found that the genomic diversity within these three strains was high, in part because of a high number of mobile elements. Furthermore, a detailed analysis of large sequence polymorphisms (LSPs), encompassing regions ranging from 2 to 16 kb, revealed LSPs in virulence-related elements, such as a Toll-IL receptor, the AcrA multidrug efflux protein, an ATP-binding cassette (ABC) transporter, lipopolysaccharide-synthetizing enzymes and a tripartite ATP-independent periplasmic (TRAP) transporter. The whole-genome codon adaptation index (CAI) was also calculated and revealed values similar to other well-known bacterial pathogens. In addition, whole-genome SNP analysis indicated that nucleotide changes tended to be increased in membrane-related genes. This analysis provides further evidence that the genome of H. parasuis has been subjected to multiple lateral gene transfers (LGTs) and to fine-tuning of virulence factors, and has the potential for accelerated genome evolution. © 2014 The Authors.

  6. Distinguishing friends, foes, and freeloaders in giant genomes.

    PubMed

    Bennetzen, Jeffrey L; Park, Minkyu

    2018-04-01

    Most annotations of large eukaryotic genomes initially find transposable elements (TEs) and other repeats, then mask them so that subsequent efforts can be concentrated on the annotation and study of non-TE genes. However, TEs often contribute to host biology, and their community biologies are of intrinsic interest. This review discusses the challenges, rationale and technologies for comprehensive TE annotation in the commonly giant genomes of animals and plants. Complete discovery of the TEs in a fully sequenced genome is laborious, but feasible, with current strategies in the hands of a careful researcher. These deep TE studies have begun to provide important perspectives on how genomes evolve and the degree to which genome changes do and do not affect eukaryotic biology. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.

  7. Seeing chordate evolution through the Ciona genome sequence

    PubMed Central

    Cañestro, Cristian; Bassham, Susan; Postlethwait, John H

    2003-01-01

    A draft sequence of the compact genome of the sea squirt Ciona intestinalis, a non-vertebrate chordate that diverged very early from other chordates, including vertebrates, illuminates how chordates originated and how vertebrate developmental innovations evolved. PMID:12620098

  8. Toward a Genome-Wide Systems Biology Analysis of Host-Pathogen Interactions in Group A Streptococcus

    PubMed Central

    Musser, James M.; DeLeo, Frank R.

    2005-01-01

    Genome-wide analysis of microbial pathogens and molecular pathogenesis processes has become an area of considerable activity in the last 5 years. These studies have been made possible by several advances, including completion of the human genome sequence, publication of genome sequences for many human pathogens, development of microarray technology and high-throughput proteomics, and maturation of bioinformatics. Despite these advances, relatively little effort has been expended in the bacterial pathogenesis arena to develop and use integrated research platforms in a systems biology approach to enhance our understanding of disease processes. This review discusses progress made in exploiting an integrated genome-wide research platform to gain new knowledge about how the human bacterial pathogen group A Streptococcus causes disease. Results of these studies have provided many new avenues for basic pathogenesis research and translational research focused on development of an efficacious human vaccine and novel therapeutics. One goal in summarizing this line of study is to bring exciting new findings to the attention of the investigative pathology community. In addition, we hope the review will stimulate investigators to consider using analogous approaches for analysis of the molecular pathogenesis of other microbes. PMID:16314461

  9. Analysis of the platypus genome suggests a transposon origin for mammalian imprinting.

    PubMed

    Pask, Andrew J; Papenfuss, Anthony T; Ager, Eleanor I; McColl, Kaighin A; Speed, Terence P; Renfree, Marilyn B

    2009-01-01

    Genomic imprinting is an epigenetic phenomenon that results in monoallelic gene expression. Many hypotheses have been advanced to explain why genomic imprinting evolved in mammals, but few have examined how it arose. The host defence hypothesis suggests that imprinting evolved from existing mechanisms within the cell that act to silence foreign DNA elements that insert into the genome. However, the changes to the mammalian genome that accompanied the evolution of imprinting have been hard to define due to the absence of large scale genomic resources between all extant classes. The recent release of the platypus genome has provided the first opportunity to perform comparisons between prototherian (monotreme; which appear to lack imprinting) and therian (marsupial and eutherian; which have imprinting) mammals. We compared the distribution of repeat elements known to attract epigenetic silencing across the entire genome from monotremes and therian mammals, particularly focusing on the orthologous imprinted regions. There is a significant accumulation of certain repeat elements within imprinted regions of therian mammals compared to the platypus. Our analyses show that the platypus has significantly fewer repeats of certain classes in the regions of the genome that have become imprinted in therian mammals. The accumulation of repeats, especially long terminal repeats and DNA elements, in therian imprinted genes and gene clusters is coincident with, and may have been a potential driving force in, the development of mammalian genomic imprinting. These data provide strong support for the host defence hypothesis.

  10. Analysis of the platypus genome suggests a transposon origin for mammalian imprinting

    PubMed Central

    Pask, Andrew J; Papenfuss, Anthony T; Ager, Eleanor I; McColl, Kaighin A; Speed, Terence P; Renfree, Marilyn B

    2009-01-01

    Background Genomic imprinting is an epigenetic phenomenon that results in monoallelic gene expression. Many hypotheses have been advanced to explain why genomic imprinting evolved in mammals, but few have examined how it arose. The host defence hypothesis suggests that imprinting evolved from existing mechanisms within the cell that act to silence foreign DNA elements that insert into the genome. However, the changes to the mammalian genome that accompanied the evolution of imprinting have been hard to define due to the absence of large scale genomic resources between all extant classes. The recent release of the platypus genome has provided the first opportunity to perform comparisons between prototherian (monotreme; which appear to lack imprinting) and therian (marsupial and eutherian; which have imprinting) mammals. Results We compared the distribution of repeat elements known to attract epigenetic silencing across the entire genome from monotremes and therian mammals, particularly focusing on the orthologous imprinted regions. There is a significant accumulation of certain repeat elements within imprinted regions of therian mammals compared to the platypus. Conclusions Our analyses show that the platypus has significantly fewer repeats of certain classes in the regions of the genome that have become imprinted in therian mammals. The accumulation of repeats, especially long terminal repeats and DNA elements, in therian imprinted genes and gene clusters is coincident with, and may have been a potential driving force in, the development of mammalian genomic imprinting. These data provide strong support for the host defence hypothesis. PMID:19121219

  11. The Oxytricha trifallax Macronuclear Genome: A Complex Eukaryotic Genome with 16,000 Tiny Chromosomes

    PubMed Central

    Swart, Estienne C.; Bracht, John R.; Magrini, Vincent; Minx, Patrick; Chen, Xiao; Zhou, Yi; Khurana, Jaspreet S.; Goldman, Aaron D.; Nowacki, Mariusz; Schotanus, Klaas; Jung, Seolkyoung; Fulton, Robert S.; Ly, Amy; McGrath, Sean; Haub, Kevin; Wiggins, Jessica L.; Storton, Donna; Matese, John C.; Parsons, Lance; Chang, Wei-Jen; Bowen, Michael S.; Stover, Nicholas A.; Jones, Thomas A.; Eddy, Sean R.; Herrick, Glenn A.; Doak, Thomas G.; Wilson, Richard K.; Mardis, Elaine R.; Landweber, Laura F.

    2013-01-01

    The macronuclear genome of the ciliate Oxytricha trifallax displays an extreme and unique eukaryotic genome architecture with extensive genomic variation. During sexual genome development, the expressed, somatic macronuclear genome is whittled down to the genic portion of a small fraction (∼5%) of its precursor “silent” germline micronuclear genome by a process of “unscrambling” and fragmentation. The tiny macronuclear “nanochromosomes” typically encode single, protein-coding genes (a small portion, 10%, encode 2–8 genes), have minimal noncoding regions, and are differentially amplified to an average of ∼2,000 copies. We report the high-quality genome assembly of ∼16,000 complete nanochromosomes (∼50 Mb haploid genome size) that vary from 469 bp to 66 kb long (mean ∼3.2 kb) and encode ∼18,500 genes. Alternative DNA fragmentation processes ∼10% of the nanochromosomes into multiple isoforms that usually encode complete genes. Nucleotide diversity in the macronucleus is very high (SNP heterozygosity is ∼4.0%), suggesting that Oxytricha trifallax may have one of the largest known effective population sizes of eukaryotes. Comparison to other ciliates with nonscrambled genomes and long macronuclear chromosomes (on the order of 100 kb) suggests several candidate proteins that could be involved in genome rearrangement, including domesticated MULE and IS1595-like DDE transposases. The assembly of the highly fragmented Oxytricha macronuclear genome is the first completed genome with such an unusual architecture. This genome sequence provides tantalizing glimpses into novel molecular biology and evolution. For example, Oxytricha maintains tens of millions of telomeres per cell and has also evolved an intriguing expansion of telomere end-binding proteins. In conjunction with the micronuclear genome in progress, the O. trifallax macronuclear genome will provide an invaluable resource for investigating programmed genome rearrangements, complementing

  12. In situ structures of the genome and genome-delivery apparatus in a single-stranded RNA virus.

    PubMed

    Dai, Xinghong; Li, Zhihai; Lai, Mason; Shu, Sara; Du, Yushen; Zhou, Z Hong; Sun, Ren

    2017-01-05

    Packaging of the genome into a protein capsid and its subsequent delivery into a host cell are two fundamental processes in the life cycle of a virus. Unlike double-stranded DNA viruses, which pump their genome into a preformed capsid, single-stranded RNA (ssRNA) viruses, such as bacteriophage MS2, co-assemble their capsid with the genome; however, the structural basis of this co-assembly is poorly understood. MS2 infects Escherichia coli via the host 'sex pilus' (F-pilus); it was the first fully sequenced organism and is a model system for studies of translational gene regulation, RNA-protein interactions, and RNA virus assembly. Its positive-sense ssRNA genome of 3,569 bases is enclosed in a capsid with one maturation protein monomer and 89 coat protein dimers arranged in a T = 3 icosahedral lattice. The maturation protein is responsible for attaching the virus to an F-pilus and delivering the viral genome into the host during infection, but how the genome is organized and delivered is not known. Here we describe the MS2 structure at 3.6 Å resolution, determined by electron-counting cryo-electron microscopy (cryoEM) and asymmetric reconstruction. We traced approximately 80% of the backbone of the viral genome, built atomic models for 16 RNA stem-loops, and identified three conserved motifs of RNA-coat protein interactions among 15 of these stem-loops with diverse sequences. The stem-loop at the 3' end of the genome interacts extensively with the maturation protein, which, with just a six-helix bundle and a six-stranded β-sheet, forms a genome-delivery apparatus and joins 89 coat protein dimers to form a capsid. This atomic description of genome-capsid interactions in a spherical ssRNA virus provides insight into genome delivery via the host sex pilus and mechanisms underlying ssRNA-capsid co-assembly, and inspires speculation about the links between nucleoprotein complexes and the origins of viruses.

  13. Co-evolution of plant LTR-retrotransposons and their host genomes.

    PubMed

    Zhao, Meixia; Ma, Jianxin

    2013-07-01

    Transposable elements (TEs), particularly, long terminal repeat retrotransposons (LTR-RTs), are the most abundant DNA components in all plant species that have been investigated, and are largely responsible for plant genome size variation. Although plant genomes have experienced periodic proliferation and/or recent burst of LTR-retrotransposons, the majority of LTR-RTs are inactivated by DNA methylation and small RNA-mediated silencing mechanisms, and/or were deleted/truncated by unequal homologous recombination and illegitimate recombination, as suppression mechanisms that counteract genome expansion caused by LTR-RT amplification. LTR-RT DNA is generally enriched in pericentromeric regions of the host genomes, which appears to be the outcomes of preferential insertions of LTR-RTs in these regions and low effectiveness of selection that purges LTR-RT DNA from these regions relative to chromosomal arms. Potential functions of various TEs in their host genomes remain blurry; nevertheless, LTR-RTs have been recognized to play important roles in maintaining chromatin structures and centromere functions and regulation of gene expressions in their host genomes.

  14. Decarboxylation of Carbon Compounds as a Potential Source for CO2 and CO Observed by SAM at Yellowknife Bay, Gale Crater, Mars

    NASA Technical Reports Server (NTRS)

    Eigenbrode, J. L.; Bower, H.; Archer, P. Jr.

    2014-01-01

    Martian carbon was detected in the Sheepbed mudtsone at Yellowknife Bay, Gale Crater, Mars by the Sample Analysis at Mars (SAM) instrument onboard Curiosity, the rover of the Mars Science Laboratory missio]. The carbon was detected as CO2 thermally evolved from drilled and sieved rock powder that was delivered to SAM as a <150-micron-particle- size fraction. Most of the CO2 observed in the Cumberland (CB) drill hole evolved between 150deg and 350deg C. In the John Klein (JK) drill hole, the CO2 evolved up to 500deg C. Hypotheses for the source of the the CO2 include the breakdown of carbonate minerals reacting with HCl released from oxychlorine compounds, combustion of organic matter by O2 thermally evolved from the same oxychlorine minerals, and the decarboxylation of organic molecules indigenous to the martian rock sample. Here we explore the potential for the decarboxylation hypothesis.

  15. Expansion of CORE-SINEs in the genome of the Tasmanian devil

    PubMed Central

    2012-01-01

    Background The genome of the carnivorous marsupial, the Tasmanian devil (Sarcophilus harrisii, Order: Dasyuromorphia), was sequenced in the hopes of finding a cure for or gaining a better understanding of the contagious devil facial tumor disease that is threatening the species’ survival. To better understand the Tasmanian devil genome, we screened it for transposable elements and investigated the dynamics of short interspersed element (SINE) retroposons. Results The temporal history of Tasmanian devil SINEs, elucidated using a transposition in transposition analysis, indicates that WSINE1, a CORE-SINE present in around 200,000 copies, is the most recently active element. Moreover, we discovered a new subtype of WSINE1 (WSINE1b) that comprises at least 90% of all Tasmanian devil WSINE1s. The frequencies of WSINE1 subtypes differ in the genomes of two of the other Australian marsupial orders. A co-segregation analysis indicated that at least 66 subfamilies of WSINE1 evolved during the evolution of Dasyuromorphia. Using a substitution rate derived from WSINE1 insertions, the ages of the subfamilies were estimated and correlated with a newly established phylogeny of Dasyuromorphia. Phylogenetic analyses and divergence time estimates of mitochondrial genome data indicate a rapid radiation of the Tasmanian devil and the closest relative the quolls (Dasyurus) around 14 million years ago. Conclusions The radiation and abundance of CORE-SINEs in marsupial genomes indicates that they may be a major player in the evolution of marsupials. It is evident that the early phases of evolution of the carnivorous marsupial order Dasyuromorphia was characterized by a burst of SINE activity. A correlation between a speciation event and a major burst of retroposon activity is for the first time shown in a marsupial genome. PMID:22559330

  16. Expansion of CORE-SINEs in the genome of the Tasmanian devil.

    PubMed

    Nilsson, Maria A; Janke, Axel; Murchison, Elizabeth P; Ning, Zemin; Hallström, Björn M

    2012-05-06

    The genome of the carnivorous marsupial, the Tasmanian devil (Sarcophilus harrisii, Order: Dasyuromorphia), was sequenced in the hopes of finding a cure for or gaining a better understanding of the contagious devil facial tumor disease that is threatening the species' survival. To better understand the Tasmanian devil genome, we screened it for transposable elements and investigated the dynamics of short interspersed element (SINE) retroposons. The temporal history of Tasmanian devil SINEs, elucidated using a transposition in transposition analysis, indicates that WSINE1, a CORE-SINE present in around 200,000 copies, is the most recently active element. Moreover, we discovered a new subtype of WSINE1 (WSINE1b) that comprises at least 90% of all Tasmanian devil WSINE1s. The frequencies of WSINE1 subtypes differ in the genomes of two of the other Australian marsupial orders. A co-segregation analysis indicated that at least 66 subfamilies of WSINE1 evolved during the evolution of Dasyuromorphia. Using a substitution rate derived from WSINE1 insertions, the ages of the subfamilies were estimated and correlated with a newly established phylogeny of Dasyuromorphia. Phylogenetic analyses and divergence time estimates of mitochondrial genome data indicate a rapid radiation of the Tasmanian devil and the closest relative the quolls (Dasyurus) around 14 million years ago. The radiation and abundance of CORE-SINEs in marsupial genomes indicates that they may be a major player in the evolution of marsupials. It is evident that the early phases of evolution of the carnivorous marsupial order Dasyuromorphia was characterized by a burst of SINE activity. A correlation between a speciation event and a major burst of retroposon activity is for the first time shown in a marsupial genome.

  17. Whole-genome analyses resolve early branches in the tree of life of modern birds

    PubMed Central

    Jarvis, Erich D.; Mirarab, Siavash; Aberer, Andre J.; Li, Bo; Houde, Peter; Li, Cai; Ho, Simon Y. W.; Faircloth, Brant C.; Nabholz, Benoit; Howard, Jason T.; Suh, Alexander; Weber, Claudia C.; da Fonseca, Rute R.; Li, Jianwen; Zhang, Fang; Li, Hui; Zhou, Long; Narula, Nitish; Liu, Liang; Ganapathy, Ganesh; Boussau, Bastien; Bayzid, Md. Shamsuzzoha; Zavidovych, Volodymyr; Subramanian, Sankar; Gabaldón, Toni; Capella-Gutiérrez, Salvador; Huerta-Cepas, Jaime; Rekepalli, Bhanu; Munch, Kasper; Schierup, Mikkel; Lindow, Bent; Warren, Wesley C.; Ray, David; Green, Richard E.; Bruford, Michael W.; Zhan, Xiangjiang; Dixon, Andrew; Li, Shengbin; Li, Ning; Huang, Yinhua; Derryberry, Elizabeth P.; Bertelsen, Mads Frost; Sheldon, Frederick H.; Brumfield, Robb T.; Mello, Claudio V.; Lovell, Peter V.; Wirthlin, Morgan; Schneider, Maria Paula Cruz; Prosdocimi, Francisco; Samaniego, José Alfredo; Velazquez, Amhed Missael Vargas; Alfaro-Núñez, Alonzo; Campos, Paula F.; Petersen, Bent; Sicheritz-Ponten, Thomas; Pas, An; Bailey, Tom; Scofield, Paul; Bunce, Michael; Lambert, David M.; Zhou, Qi; Perelman, Polina; Driskell, Amy C.; Shapiro, Beth; Xiong, Zijun; Zeng, Yongli; Liu, Shiping; Li, Zhenyu; Liu, Binghang; Wu, Kui; Xiao, Jin; Yinqi, Xiong; Zheng, Qiuemei; Zhang, Yong; Yang, Huanming; Wang, Jian; Smeds, Linnea; Rheindt, Frank E.; Braun, Michael; Fjeldsa, Jon; Orlando, Ludovic; Barker, F. Keith; Jønsson, Knud Andreas; Johnson, Warren; Koepfli, Klaus-Peter; O’Brien, Stephen; Haussler, David; Ryder, Oliver A.; Rahbek, Carsten; Willerslev, Eske; Graves, Gary R.; Glenn, Travis C.; McCormack, John; Burt, Dave; Ellegren, Hans; Alström, Per; Edwards, Scott V.; Stamatakis, Alexandros; Mindell, David P.; Cracraft, Joel; Braun, Edward L.; Warnow, Tandy; Jun, Wang; Gilbert, M. Thomas P.; Zhang, Guojie

    2015-01-01

    To better determine the history of modern birds, we performed a genome-scale phylogenetic analysis of 48 species representing all orders of Neoaves using phylogenomic methods created to handle genome-scale data. We recovered a highly resolved tree that confirms previously controversial sister or close relationships. We identified the first divergence in Neoaves, two groups we named Passerea and Columbea, representing independent lineages of diverse and convergently evolved land and water bird species. Among Passerea, we infer the common ancestor of core landbirds to have been an apex predator and confirm independent gains of vocal learning. Among Columbea, we identify pigeons and flamingoes as belonging to sister clades. Even with whole genomes, some of the earliest branches in Neoaves proved challenging to resolve, which was best explained by massive protein-coding sequence convergence and high levels of incomplete lineage sorting that occurred during a rapid radiation after the Cretaceous-Paleogene mass extinction event about 66 million years ago. PMID:25504713

  18. Small groups and long memories promote cooperation.

    PubMed

    Stewart, Alexander J; Plotkin, Joshua B

    2016-06-01

    Complex social behaviors lie at the heart of many of the challenges facing evolutionary biology, sociology, economics, and beyond. For evolutionary biologists the question is often how group behaviors such as collective action, or decision making that accounts for memories of past experience, can emerge and persist in an evolving system. Evolutionary game theory provides a framework for formalizing these questions and admitting them to rigorous study. Here we develop such a framework to study the evolution of sustained collective action in multi-player public-goods games, in which players have arbitrarily long memories of prior rounds of play and can react to their experience in an arbitrary way. We construct a coordinate system for memory-m strategies in iterated n-player games that permits us to characterize all cooperative strategies that resist invasion by any mutant strategy, and stabilize cooperative behavior. We show that, especially when groups are small, longer-memory strategies make cooperation easier to evolve, by increasing the number of ways to stabilize cooperation. We also explore the co-evolution of behavior and memory. We find that even when memory has a cost, longer-memory strategies often evolve, which in turn drives the evolution of cooperation, even when the benefits for cooperation are low.

  19. Characterization of CoPK02, a Ca2+/calmodulin-dependent protein kinase in mushroom Coprinopsis cinerea.

    PubMed

    Yamashita, Masashi; Sueyoshi, Noriyuki; Yamada, Hiroki; Katayama, Syouichi; Senga, Yukako; Takenaka, Yasuhiro; Ishida, Atsuhiko; Kameshita, Isamu; Shigeri, Yasushi

    2018-04-20

    We surveyed genome sequences from the basidiomycetous mushroom Coprinopsis cinerea and isolated a cDNA homologous to CMKA, a calmodulin-dependent protein kinase (CaMK) in Aspergillus nidulans. We designated this sequence, encoding 580 amino acids with a molecular weight of 63,987, as CoPK02. CoPK02 possessed twelve subdomains specific to protein kinases and exhibited 43, 35, 40% identity with rat CaMKI, CaMKII, CaMKIV, respectively, and 40% identity with CoPK12, one of the CaMK orthologs in C. cinerea. CoPK02 showed significant autophosphorylation activity and phosphorylated exogenous proteins in the presence of Ca 2+ /CaM. By the CaM-overlay assay we confirmed that the C-terminal sequence (Trp346-Arg358) was the calmodulin-binding site, and that the binding of Ca 2+ /CaM to CoPK02 was reduced by the autophosphorylation of CoPK02. Since CoPK02 evolved in a different clade from CoPK12, and showed different gene expression compared to that of CoPK32, which is homologous to mitogen-activated protein kinase-activated protein kinase, CoPK02 and CoPK12 might cooperatively regulate Ca 2+ -signaling in C. cinerea.

  20. The chromosomal distributions of Ty1-copia group retrotransposable elements in higher plants and their implications for genome evolution

    Treesearch

    J.S. (Pat) Heslop-Harrison; Andrea Brandes; Shin Taketa; Thomas Schmidt; Alexander V. Vershinin; Elena G. Alkhimova; Anette Kamm; Robert L. Doudrick; [and others

    1997-01-01

    Retrotransposons make up a major fraction - sometimes more than 40% - of all plant genomes investigated so far. We have isolated the reverse transcriptase domains of theTyl-copia group elements from several species, ranging in genome size from some 100 Mbp to 23,000 Mbp, and determined the distribution patterns of these retrotransposons on metaphase chromosomes and...

  1. Draft Genome Sequence of Grammothele lineata SDL-CO-2015-1, a Jute Endophyte with a Potential for Paclitaxel Biosynthesis.

    PubMed

    Das, Avizit; Ahmed, Oly; Baten, A K M Abdul; Bushra, Samira; Islam, M Tariqul; Ferdous, Ahlan Sabah; Islam, Mohammad Riazul; Khan, Haseena

    2017-08-17

    Grammothele lineata strain SDL-CO-2015-1, a basidiomycete fungus, was identified as an endophyte from a jute species, Corchorus olitorius var. 2015, and found to produce paclitaxel, a diterpenic polyoxygenated pseudoalkaloid with antitumor activity. Here, we report the draft genome sequence (42.8 Mb with 9,395 genes) of this strain. Copyright © 2017 Das et al.

  2. Draft Genome Sequence of Grammothele lineata SDL-CO-2015-1, a Jute Endophyte with a Potential for Paclitaxel Biosynthesis

    PubMed Central

    Das, Avizit; Ahmed, Oly; Baten, A. K. M. Abdul; Bushra, Samira; Islam, M. Tariqul; Ferdous, Ahlan Sabah; Islam, Mohammad Riazul

    2017-01-01

    ABSTRACT Grammothele lineata strain SDL-CO-2015-1, a basidiomycete fungus, was identified as an endophyte from a jute species, Corchorus olitorius var. 2015, and found to produce paclitaxel, a diterpenic polyoxygenated pseudoalkaloid with antitumor activity. Here, we report the draft genome sequence (42.8 Mb with 9,395 genes) of this strain. PMID:28818909

  3. Spectra of English evolving word co-occurrence networks

    NASA Astrophysics Data System (ADS)

    Liang, Wei

    2017-02-01

    Spectral analysis is a powerful tool that provides global measures of the network properties. In this paper, 200 English articles are collected. A word co-occurrence network is constructed from each single article (denoted by single network). Furthermore, 5 large English word co-occurrence networks are constructed (denoted by large network). Spectra of their adjacency matrices are computed. The largest eigenvalue, λ1, depends on the network size N and the number of edges E as λ1 ∝N0.66 and λ1 ∝E0.54, respectively. The number of different eigenvalues, Nλ, increase in the manner of Nλ ∝N0.58 and Nλ ∝E0.47. The middle part of the spectral distribution can be fitted by a line with slope - 0.01 in each of the large networks, whereas two segments with the same slope - 0.03 for 0 ≪ N < 260 and - 0.02 for 260 < N < 2800 are needed for the single networks. An "M"-shape distribution appears in each of the spectral densities of the large networks. These and other results can provide useful insight into the structural properties of English linguistic networks.

  4. Genomic features of bacterial adaptation to plants

    PubMed Central

    Levy, Asaf; Gonzalez, Isai Salas; Mittelviefhaus, Maximilian; Clingenpeel, Scott; Paredes, Sur Herrera; Miao, Jiamin; Wang, Kunru; Devescovi, Giulia; Stillman, Kyra; Monteiro, Freddy; Alvarez, Bryan Rangel; Lundberg, Derek S.; Lu, Tse-Yuan; Lebeis, Sarah; Jin, Zhao; McDonald, Meredith; Klein, Andrew P.; Feltcher, Meghan E.; del Rio, Tijana Glavina; Grant, Sarah R.; Doty, Sharon L.; Ley, Ruth E.; Zhao, Bingyu; Venturi, Vittorio; Pelletier, Dale A.; Vorholt, Julia A.; Tringe, Susannah G.; Woyke, Tanja; Dangl, Jeffery L.

    2017-01-01

    Plants intimately associate with diverse bacteria. Plant-associated (PA) bacteria have ostensibly evolved genes enabling adaptation to the plant environment. However, the identities of such genes are mostly unknown and their functions are poorly characterized. We sequenced 484 genomes of bacterial isolates from roots of Brassicaceae, poplar, and maize. We then compared 3837 bacterial genomes to identify thousands of PA gene clusters. Genomes of PA bacteria encode more carbohydrate metabolism functions and fewer mobile elements than related non-plant associated genomes. We experimentally validated candidates from two sets of PA genes, one involved in plant colonization, the other serving in microbe-microbe competition between PA bacteria. We also identified 64 PA protein domains that potentially mimic plant domains; some are shared with PA fungi and oomycetes. This work expands the genome-based understanding of plant-microbe interactions and provides leads for efficient and sustainable agriculture through microbiome engineering. PMID:29255260

  5. Managing the genomic revolution in cancer diagnostics.

    PubMed

    Nguyen, Doreen; Gocke, Christopher D

    2017-08-01

    Molecular tumor profiling is now a routine part of patient care, revealing targetable genomic alterations and molecularly distinct tumor subtypes with therapeutic and prognostic implications. The widespread adoption of next-generation sequencing technologies has greatly facilitated clinical implementation of genomic data and opened the door for high-throughput multigene-targeted sequencing. Herein, we discuss the variability of cancer genetic profiling currently offered by clinical laboratories, the challenges of applying rapidly evolving medical knowledge to individual patients, and the need for more standardized population-based molecular profiling.

  6. Mutational Dynamics of Aroid Chloroplast Genomes

    PubMed Central

    Ahmed, Ibrar; Biggs, Patrick J.; Matthews, Peter J.; Collins, Lesley J.; Hendy, Michael D.; Lockhart, Peter J.

    2012-01-01

    A characteristic feature of eukaryote and prokaryote genomes is the co-occurrence of nucleotide substitution and insertion/deletion (indel) mutations. Although similar observations have also been made for chloroplast DNA, genome-wide associations have not been reported. We determined the chloroplast genome sequences for two morphotypes of taro (Colocasia esculenta; family Araceae) and compared these with four publicly available aroid chloroplast genomes. Here, we report the extent of genome-wide association between direct and inverted repeats, indels, and substitutions in these aroid chloroplast genomes. We suggest that alternative but not mutually exclusive hypotheses explain the mutational dynamics of chloroplast genome evolution. PMID:23204304

  7. Vibrationally excited water emission at 658 GHz from evolved stars

    NASA Astrophysics Data System (ADS)

    Baudry, A.; Humphreys, E. M. L.; Herpin, F.; Torstensson, K.; Vlemmings, W. H. T.; Richards, A. M. S.; Gray, M. D.; De Breuck, C.; Olberg, M.

    2018-01-01

    Context. Several rotational transitions of ortho- and para-water have been identified toward evolved stars in the ground vibrational state as well as in the first excited state of the bending mode (v2 = 1 in (0, 1, 0) state). In the latter vibrational state of water, the 658 GHz J = 11,0-10,1 rotational transition is often strong and seems to be widespread in late-type stars. Aims: Our main goals are to better characterize the nature of the 658 GHz emission, compare the velocity extent of the 658 GHz emission with SiO maser emission to help locate the water layers and, more generally, investigate the physical conditions prevailing in the excited water layers of evolved stars. Another goal is to identify new 658 GHz emission sources and contribute in showing that this emission is widespread in evolved stars. Methods: We have used the J = 11,0-10,1 rotational transition of water in the (0, 1, 0) vibrational state nearly 2400 K above the ground-state to trace some of the physical conditions of evolved stars. Eleven evolved stars were extracted from our mini-catalog of existing and potential 658 GHz sources for observations with the Atacama Pathfinder EXperiment (APEX) telescope equipped with the SEPIA Band 9 receiver. The 13CO J = 6-5 line at 661 GHz was placed in the same receiver sideband for simultaneous observation with the 658 GHz line of water. We have compared the ratio of these two lines to the same ratio derived from HIFI earlier observations to check for potential time variability in the 658 GHz line. We have compared the 658 GHz line properties with our H2O radiative transfer models in stars and we have compared the velocity ranges of the 658 GHz and SiO J = 2-1, v = 1 maser lines. Results: Eleven stars have been extracted from our catalog of known or potential 658 GHz evolved stars. All of them show 658 GHz emission with a peak flux density in the range ≈50-70 Jy (RU Hya and RT Eri) to ≈2000-3000 Jy (VY CMa and W Hya). Five Asymptotic Giant Branch (AGB

  8. Complete genome sequence of Geobacillus strain Y4.1MC1, a novel CO-utilizing Geobacillus thermoglucosidasius strain isolated from Bath Hot Spring in Yellowstone National Park

    DOE PAGES

    Brumm, Phillip; Land, Miriam L.; Hauser, Loren John; ...

    2015-02-10

    Geobacillus thermoglucosidasius Y4.1MC1 was isolated from a boiling spring in the lower geyser basin of Yellowstone National Park. We present this species is of interest because of its metabolic versatility. The genome consists of one circular chromosome of 3,840,330 bp and a circular plasmid of 71,617 bp with an average GC content of 44.01%. The genome is available in the GenBank database (NC_014650.1 and NC_014651.1). In addition to the expected metabolic pathways for sugars and amino acids, the Y4.1MC1 genome codes for two separate carbon monoxide utilization pathways, an aerobic oxidation pathway and an anaerobic reductive acetyl CoA (Wood-Ljungdahl) pathway.more » This is the first report of a nonanaerobic organism with the Wood-Ljungdahl pathway. Also, this anaerobic pathway permits the strain to utilize H 2 and fix CO 2 present in the hot spring environment. Y4.1MC1 and its related species may play a significant role in carbon capture and sequestration in thermophilic ecosystems and may open up new routes to produce biofuels and chemicals from CO, H 2, and CO 2.« less

  9. Complete genome sequence of Geobacillus strain Y4.1MC1, a novel CO-utilizing Geobacillus thermoglucosidasius strain isolated from Bath Hot Spring in Yellowstone National Park

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Brumm, Phillip; Land, Miriam L.; Hauser, Loren John

    Geobacillus thermoglucosidasius Y4.1MC1 was isolated from a boiling spring in the lower geyser basin of Yellowstone National Park. We present this species is of interest because of its metabolic versatility. The genome consists of one circular chromosome of 3,840,330 bp and a circular plasmid of 71,617 bp with an average GC content of 44.01%. The genome is available in the GenBank database (NC_014650.1 and NC_014651.1). In addition to the expected metabolic pathways for sugars and amino acids, the Y4.1MC1 genome codes for two separate carbon monoxide utilization pathways, an aerobic oxidation pathway and an anaerobic reductive acetyl CoA (Wood-Ljungdahl) pathway.more » This is the first report of a nonanaerobic organism with the Wood-Ljungdahl pathway. Also, this anaerobic pathway permits the strain to utilize H 2 and fix CO 2 present in the hot spring environment. Y4.1MC1 and its related species may play a significant role in carbon capture and sequestration in thermophilic ecosystems and may open up new routes to produce biofuels and chemicals from CO, H 2, and CO 2.« less

  10. The founding charter of the Genomic Observatories Network.

    PubMed

    Davies, Neil; Field, Dawn; Amaral-Zettler, Linda; Clark, Melody S; Deck, John; Drummond, Alexei; Faith, Daniel P; Geller, Jonathan; Gilbert, Jack; Glöckner, Frank Oliver; Hirsch, Penny R; Leong, Jo-Ann; Meyer, Chris; Obst, Matthias; Planes, Serge; Scholin, Chris; Vogler, Alfried P; Gates, Ruth D; Toonen, Rob; Berteaux-Lecellier, Véronique; Barbier, Michèle; Barker, Katherine; Bertilsson, Stefan; Bicak, Mesude; Bietz, Matthew J; Bobe, Jason; Bodrossy, Levente; Borja, Angel; Coddington, Jonathan; Fuhrman, Jed; Gerdts, Gunnar; Gillespie, Rosemary; Goodwin, Kelly; Hanson, Paul C; Hero, Jean-Marc; Hoekman, David; Jansson, Janet; Jeanthon, Christian; Kao, Rebecca; Klindworth, Anna; Knight, Rob; Kottmann, Renzo; Koo, Michelle S; Kotoulas, Georgios; Lowe, Andrew J; Marteinsson, Viggó Thór; Meyer, Folker; Morrison, Norman; Myrold, David D; Pafilis, Evangelos; Parker, Stephanie; Parnell, John Jacob; Polymenakou, Paraskevi N; Ratnasingham, Sujeevan; Roderick, George K; Rodriguez-Ezpeleta, Naiara; Schonrogge, Karsten; Simon, Nathalie; Valette-Silver, Nathalie J; Springer, Yuri P; Stone, Graham N; Stones-Havas, Steve; Sansone, Susanna-Assunta; Thibault, Kate M; Wecker, Patricia; Wichels, Antje; Wooley, John C; Yahara, Tetsukazu; Zingone, Adriana

    2014-03-07

    The co-authors of this paper hereby state their intention to work together to launch the Genomic Observatories Network (GOs Network) for which this document will serve as its Founding Charter. We define a Genomic Observatory as an ecosystem and/or site subject to long-term scientific research, including (but not limited to) the sustained study of genomic biodiversity from single-celled microbes to multicellular organisms.An international group of 64 scientists first published the call for a global network of Genomic Observatories in January 2012. The vision for such a network was expanded in a subsequent paper and developed over a series of meetings in Bremen (Germany), Shenzhen (China), Moorea (French Polynesia), Oxford (UK), Pacific Grove (California, USA), Washington (DC, USA), and London (UK). While this community-building process continues, here we express our mutual intent to establish the GOs Network formally, and to describe our shared vision for its future. The views expressed here are ours alone as individual scientists, and do not necessarily represent those of the institutions with which we are affiliated.

  11. The founding charter of the Genomic Observatories Network

    PubMed Central

    2014-01-01

    The co-authors of this paper hereby state their intention to work together to launch the Genomic Observatories Network (GOs Network) for which this document will serve as its Founding Charter. We define a Genomic Observatory as an ecosystem and/or site subject to long-term scientific research, including (but not limited to) the sustained study of genomic biodiversity from single-celled microbes to multicellular organisms. An international group of 64 scientists first published the call for a global network of Genomic Observatories in January 2012. The vision for such a network was expanded in a subsequent paper and developed over a series of meetings in Bremen (Germany), Shenzhen (China), Moorea (French Polynesia), Oxford (UK), Pacific Grove (California, USA), Washington (DC, USA), and London (UK). While this community-building process continues, here we express our mutual intent to establish the GOs Network formally, and to describe our shared vision for its future. The views expressed here are ours alone as individual scientists, and do not necessarily represent those of the institutions with which we are affiliated. PMID:24606731

  12. Thermodynamic Basis for the Emergence of Genomes during Prebiotic Evolution

    DTIC Science & Technology

    2012-05-01

    Thermodynamic Basis for the Emergence of Genomes during Prebiotic Evolution Hyung-June Woo, Ravi Vijaya Satya, Jaques Reifman* DoD Biotechnology High...polymerases are above, near, and below a critical point, respectively. The prebiotic evolution therefore must have crossed this critical region. Over...among many potential oligomers capable of templated replication, RNAs may have evolved to form prebiotic genomes due to the value of their nonenzymatic

  13. Genomic Hypomethylation in the Human Germline Associates with Selective Structural Mutability in the Human Genome

    PubMed Central

    Li, Jian; Harris, R. Alan; Cheung, Sau Wai; Coarfa, Cristian; Jeong, Mira; Goodell, Margaret A.; White, Lisa D.; Patel, Ankita; Kang, Sung-Hae; Shaw, Chad; Chinault, A. Craig; Gambin, Tomasz; Gambin, Anna; Lupski, James R.; Milosavljevic, Aleksandar

    2012-01-01

    The hotspots of structural polymorphisms and structural mutability in the human genome remain to be explained mechanistically. We examine associations of structural mutability with germline DNA methylation and with non-allelic homologous recombination (NAHR) mediated by low-copy repeats (LCRs). Combined evidence from four human sperm methylome maps, human genome evolution, structural polymorphisms in the human population, and previous genomic and disease studies consistently points to a strong association of germline hypomethylation and genomic instability. Specifically, methylation deserts, the ∼1% fraction of the human genome with the lowest methylation in the germline, show a tenfold enrichment for structural rearrangements that occurred in the human genome since the branching of chimpanzee and are highly enriched for fast-evolving loci that regulate tissue-specific gene expression. Analysis of copy number variants (CNVs) from 400 human samples identified using a custom-designed array comparative genomic hybridization (aCGH) chip, combined with publicly available structural variation data, indicates that association of structural mutability with germline hypomethylation is comparable in magnitude to the association of structural mutability with LCR–mediated NAHR. Moreover, rare CNVs occurring in the genomes of individuals diagnosed with schizophrenia, bipolar disorder, and developmental delay and de novo CNVs occurring in those diagnosed with autism are significantly more concentrated within hypomethylated regions. These findings suggest a new connection between the epigenome, selective mutability, evolution, and human disease. PMID:22615578

  14. Multiple groups of endogenous epsilon-like retroviruses conserved across primates.

    PubMed

    Brown, Katherine; Emes, Richard D; Tarlinton, Rachael E

    2014-11-01

    Several types of cancer in fish are caused by retroviruses, including those responsible for major outbreaks of disease, such as walleye dermal sarcoma virus and salmon swim bladder sarcoma virus. These viruses form a phylogenetic group often described as the epsilonretrovirus genus. Epsilon-like retroviruses have become endogenous retroviruses (ERVs) on several occasions, integrating into germ line cells to become part of the host genome, and sections of fish and amphibian genomes are derived from epsilon-like retroviruses. However, epsilon-like ERVs have been identified in very few mammals. We have developed a pipeline to screen full genomes for ERVs, and using this pipeline, we have located over 800 endogenous epsilon-like ERV fragments in primate genomes. Genomes from 32 species of mammals and birds were screened, and epsilon-like ERV fragments were found in all primate and tree shrew genomes but no others. These viruses appear to have entered the genome of a common ancestor of Old and New World monkeys between 42 million and 65 million years ago. Based on these results, there is an ancient evolutionary relationship between epsilon-like retroviruses and primates. Clearly, these viruses had the potential to infect the ancestors of primates and were at some point a common pathogen in these hosts. Therefore, this result raises questions about the potential of epsilonretroviruses to infect humans and other primates and about the evolutionary history of these retroviruses. Epsilonretroviruses are a group of retroviruses that cause several important diseases in fish. Retroviruses have the ability to become a permanent part of the DNA of their host by entering the germ line as endogenous retroviruses (ERVs), where they lose their infectivity over time but can be recognized as retroviruses for millions of years. Very few mammals are known to have epsilon-like ERVs; however, we have identified over 800 fragments of endogenous epsilon-like ERVs in the genomes of all major

  15. Contingent movement and cooperation evolve under generalized reciprocity

    PubMed Central

    Hamilton, Ian M; Taborsky, Michael

    2005-01-01

    How cooperation and altruism among non-relatives can persist in the face of cheating remains a key puzzle in evolutionary biology. Although mechanisms such as direct and indirect reciprocity and limited movement have been put forward to explain such cooperation, they cannot explain cooperation among unfamiliar, highly mobile individuals. Here we show that cooperation may be evolutionarily stable if decisions taken to cooperate and to change group membership are both dependent on anonymous social experience (generalized reciprocity). We find that a win–stay, lose–shift rule (where shifting is either moving away from the group or changing tactics within the group after receiving defection) evolves in evolutionary simulations when group leaving is moderately costly (i.e. the current payoff to being alone is low, but still higher than that in a mutually defecting group, and new groups are rarely encountered). This leads to the establishment of widespread cooperation in the population. If the costs of group leaving are reduced, a similar group-leaving rule evolves in association with cooperation in pairs and exploitation of larger anonymous groups. We emphasize that mechanisms of assortment within populations are often behavioural decisions and should not be considered independently of the evolution of cooperation. PMID:16191638

  16. The chimeric nature of the genomes of marine magnetotactic coccoid-ovoid bacteria defines a novel group of Proteobacteria.

    PubMed

    Ji, Boyang; Zhang, Sheng-Da; Zhang, Wei-Jia; Rouy, Zoe; Alberto, François; Santini, Claire-Lise; Mangenot, Sophie; Gagnot, Séverine; Philippe, Nadège; Pradel, Nathalie; Zhang, Lichen; Tempel, Sébastien; Li, Ying; Médigue, Claudine; Henrissat, Bernard; Coutinho, Pedro M; Barbe, Valérie; Talla, Emmanuel; Wu, Long-Fei

    2017-03-01

    Magnetotactic bacteria (MTB) are a group of phylogenetically and physiologically diverse Gram-negative bacteria that synthesize intracellular magnetic crystals named magnetosomes. MTB are affiliated with three classes of Proteobacteria phylum, Nitrospirae phylum, Omnitrophica phylum and probably with the candidate phylum Latescibacteria. The evolutionary origin and physiological diversity of MTB compared with other bacterial taxonomic groups remain to be illustrated. Here, we analysed the genome of the marine magneto-ovoid strain MO-1 and found that it is closely related to Magnetococcus marinus MC-1. Detailed analyses of the ribosomal proteins and whole proteomes of 390 genomes reveal that, among the Proteobacteria analysed, only MO-1 and MC-1 have coding sequences (CDSs) with a similarly high proportion of origins from Alphaproteobacteria, Betaproteobacteria, Deltaproteobacteria and Gammaproteobacteria. Interestingly, a comparative metabolic network analysis with anoxic network enzymes from sequenced MTB and non-MTB successfully allows the eventual prediction of an organism with a metabolic profile compatible for magnetosome production. Altogether, our genomic analysis reveals multiple origins of MO-1 and M. marinus MC-1 genomes and suggests a metabolism-restriction model for explaining whether a bacterium could become an MTB upon acquisition of magnetosome encoding genes. © 2016 Society for Applied Microbiology and John Wiley & Sons Ltd.

  17. InCoB2012 Conference: from biological data to knowledge to technological breakthroughs

    PubMed Central

    2012-01-01

    Ten years ago when Asia-Pacific Bioinformatics Network held the first International Conference on Bioinformatics (InCoB) in Bangkok its theme was North-South Networking. At that time InCoB aimed to provide biologists and bioinformatics researchers in the Asia-Pacific region a forum to meet, interact with, and disseminate knowledge about the burgeoning field of bioinformatics. Meanwhile InCoB has evolved into a major regional bioinformatics conference that attracts not only talented and established scientists from the region but increasingly also from East Asia, North America and Europe. Since 2006 InCoB yielded 114 articles in BMC Bioinformatics supplement issues that have been cited nearly 1,000 times to date. In part, these developments reflect the success of bioinformatics education and continuous efforts to integrate and utilize bioinformatics in biotechnology and biosciences in the Asia-Pacific region. A cross-section of research leading from biological data to knowledge and to technological applications, the InCoB2012 theme, is introduced in this editorial. Other highlights included sessions organized by the Pan-Asian Pacific Genome Initiative and a Machine Learning in Immunology competition. InCoB2013 is scheduled for September 18-21, 2013 at Suzhou, China. PMID:23281929

  18. Proliferation of group II introns in the chloroplast genome of the green alga Oedocladium carolinianum (Chlorophyceae).

    PubMed

    Brouard, Jean-Simon; Turmel, Monique; Otis, Christian; Lemieux, Claude

    2016-01-01

    The chloroplast genome sustained extensive changes in architecture during the evolution of the Chlorophyceae, a morphologically and ecologically diverse class of green algae belonging to the Chlorophyta; however, the forces driving these changes are poorly understood. The five orders recognized in the Chlorophyceae form two major clades: the CS clade consisting of the Chlamydomonadales and Sphaeropleales, and the OCC clade consisting of the Oedogoniales, Chaetophorales, and Chaetopeltidales. In the OCC clade, considerable variations in chloroplast DNA (cpDNA) structure, size, gene order, and intron content have been observed. The large inverted repeat (IR), an ancestral feature characteristic of most green plants, is present in Oedogonium cardiacum (Oedogoniales) but is lacking in the examined members of the Chaetophorales and Chaetopeltidales. Remarkably, the Oedogonium 35.5-kb IR houses genes that were putatively acquired through horizontal DNA transfer. To better understand the dynamics of chloroplast genome evolution in the Oedogoniales, we analyzed the cpDNA of a second representative of this order, Oedocladium carolinianum . The Oedocladium cpDNA was sequenced and annotated. The evolutionary distances separating Oedocladium and Oedogonium cpDNAs and two other pairs of chlorophycean cpDNAs were estimated using a 61-gene data set. Phylogenetic analysis of an alignment of group IIA introns from members of the OCC clade was performed. Secondary structures and insertion sites of oedogonialean group IIA introns were analyzed. The 204,438-bp Oedocladium genome is 7.9 kb larger than the Oedogonium genome, but its repertoire of conserved genes is remarkably similar and gene order differs by only one reversal. Although the 23.7-kb IR is missing the putative foreign genes found in Oedogonium , it contains sequences coding for a putative phage or bacterial DNA primase and a hypothetical protein. Intergenic sequences are 1.5-fold longer and dispersed repeats are more

  19. Identification, characterization, and comparative genomic distribution of the HERV-K (HML-2) group of human endogenous retroviruses

    PubMed Central

    2011-01-01

    Background Integration of retroviral DNA into a germ cell may lead to a provirus that is transmitted vertically to that host's offspring as an endogenous retrovirus (ERV). In humans, ERVs (HERVs) comprise about 8% of the genome, the vast majority of which are truncated and/or highly mutated and no longer encode functional genes. The most recently active retroviruses that integrated into the human germ line are members of the Betaretrovirus-like HERV-K (HML-2) group, many of which contain intact open reading frames (ORFs) in some or all genes, sometimes encoding functional proteins that are expressed in various tissues. Interestingly, this expression is upregulated in many tumors ranging from breast and ovarian tissues to lymphomas and melanomas, as well as schizophrenia, rheumatoid arthritis, and other disorders. Results No study to date has characterized all HML-2 elements in the genome, an essential step towards determining a possible functional role of HML-2 expression in disease. We present here the most comprehensive and accurate catalog of all full-length and partial HML-2 proviruses, as well as solo LTR elements, within the published human genome to date. Furthermore, we provide evidence for preferential maintenance of proviruses and solo LTR elements on gene-rich chromosomes of the human genome and in proximity to gene regions. Conclusions Our analysis has found and corrected several errors in the annotation of HML-2 elements in the human genome, including mislabeling of a newly identified group called HML-11. HML-elements have been implicated in a wide array of diseases, and characterization of these elements will play a fundamental role to understand the relationship between endogenous retrovirus expression and disease. PMID:22067224

  20. Evolvable synthetic neural system

    NASA Technical Reports Server (NTRS)

    Curtis, Steven A. (Inventor)

    2009-01-01

    An evolvable synthetic neural system includes an evolvable neural interface operably coupled to at least one neural basis function. Each neural basis function includes an evolvable neural interface operably coupled to a heuristic neural system to perform high-level functions and an autonomic neural system to perform low-level functions. In some embodiments, the evolvable synthetic neural system is operably coupled to one or more evolvable synthetic neural systems in a hierarchy.

  1. The king cobra genome reveals dynamic gene evolution and adaptation in the snake venom system

    PubMed Central

    Vonk, Freek J.; Casewell, Nicholas R.; Henkel, Christiaan V.; Heimberg, Alysha M.; Jansen, Hans J.; McCleary, Ryan J. R.; Kerkkamp, Harald M. E.; Vos, Rutger A.; Guerreiro, Isabel; Calvete, Juan J.; Wüster, Wolfgang; Woods, Anthony E.; Logan, Jessica M.; Harrison, Robert A.; Castoe, Todd A.; de Koning, A. P. Jason; Pollock, David D.; Yandell, Mark; Calderon, Diego; Renjifo, Camila; Currier, Rachel B.; Salgado, David; Pla, Davinia; Sanz, Libia; Hyder, Asad S.; Ribeiro, José M. C.; Arntzen, Jan W.; van den Thillart, Guido E. E. J. M.; Boetzer, Marten; Pirovano, Walter; Dirks, Ron P.; Spaink, Herman P.; Duboule, Denis; McGlinn, Edwina; Kini, R. Manjunatha; Richardson, Michael K.

    2013-01-01

    Snakes are limbless predators, and many species use venom to help overpower relatively large, agile prey. Snake venoms are complex protein mixtures encoded by several multilocus gene families that function synergistically to cause incapacitation. To examine venom evolution, we sequenced and interrogated the genome of a venomous snake, the king cobra (Ophiophagus hannah), and compared it, together with our unique transcriptome, microRNA, and proteome datasets from this species, with data from other vertebrates. In contrast to the platypus, the only other venomous vertebrate with a sequenced genome, we find that snake toxin genes evolve through several distinct co-option mechanisms and exhibit surprisingly variable levels of gene duplication and directional selection that correlate with their functional importance in prey capture. The enigmatic accessory venom gland shows a very different pattern of toxin gene expression from the main venom gland and seems to have recruited toxin-like lectin genes repeatedly for new nontoxic functions. In addition, tissue-specific microRNA analyses suggested the co-option of core genetic regulatory components of the venom secretory system from a pancreatic origin. Although the king cobra is limbless, we recovered coding sequences for all Hox genes involved in amniote limb development, with the exception of Hoxd12. Our results provide a unique view of the origin and evolution of snake venom and reveal multiple genome-level adaptive responses to natural selection in this complex biological weapon system. More generally, they provide insight into mechanisms of protein evolution under strong selection. PMID:24297900

  2. Co-localization of the oncogenic transcription factor MYCN and the DNA methyl binding protein MeCP2 at genomic sites in neuroblastoma.

    PubMed

    Murphy, Derek M; Buckley, Patrick G; Das, Sudipto; Watters, Karen M; Bryan, Kenneth; Stallings, Raymond L

    2011-01-01

    MYCN is a transcription factor that is expressed during the development of the neural crest and its dysregulation plays a major role in the pathogenesis of pediatric cancers such as neuroblastoma, medulloblastoma and rhabdomyosarcoma. MeCP2 is a CpG methyl binding protein which has been associated with a number of cancers and developmental disorders, particularly Rett syndrome. Using an integrative global genomics approach involving chromatin immunoprecipitation applied to microarrays, we have determined that MYCN and MeCP2 co-localize to gene promoter regions, as well as inter/intragenic sites, within the neuroblastoma genome (MYCN amplified Kelly cells) at high frequency (70.2% of MYCN sites were also positive for MeCP2). Intriguingly, the frequency of co-localization was significantly less at promoter regions exhibiting substantial hypermethylation (8.7%), as determined by methylated DNA immunoprecipitation (MeDIP) applied to the same microarrays. Co-immunoprecipitation of MYCN using an anti-MeCP2 antibody indicated that a MYCN/MeCP2 interaction occurs at protein level. mRNA expression profiling revealed that the median expression of genes with promoters bound by MYCN was significantly higher than for genes bound by MeCP2, and that genes bound by both proteins had intermediate expression. Pathway analysis was carried out for genes bound by MYCN, MeCP2 or MYCN/MeCP2, revealing higher order functions. Our results indicate that MYCN and MeCP2 protein interact and co-localize to similar genomic sites at very high frequency, and that the patterns of binding of these proteins can be associated with significant differences in transcriptional activity. Although it is not yet known if this interaction contributes to neuroblastoma disease pathogenesis, it is intriguing that the interaction occurs at the promoter regions of several genes important for the development of neuroblastoma, including ALK, AURKA and BDNF.

  3. Improved simulation of group averaged CO2 surface concentrations using GEOS-Chem and fluxes from VEGAS

    NASA Astrophysics Data System (ADS)

    Chen, Z. H.; Zhu, J.; Zeng, N.

    2013-01-01

    CO2 measurements have been combined with simulated CO2 distributions from a transport model in order to produce the optimal estimates of CO2 surface fluxes in inverse modeling. However one persistent problem in using model-observation comparisons for this goal relates to the issue of compatibility. Observations at a single site reflect all underlying processes of various scales that usually cannot be fully resolved by model simulations at the grid points nearest the site due to lack of spatial or temporal resolution or missing processes in models. In this article we group site observations of multiple stations according to atmospheric mixing regimes and surface characteristics. The group averaged values of CO2 concentration from model simulations and observations are used to evaluate the regional model results. Using the group averaged measurements of CO2 reduces the noise of individual stations. The difference of group averaged values between observation and modeled results reflects the uncertainties of the large scale flux in the region where the grouped stations are. We compared the group averaged values between model results with two biospheric fluxes from the model Carnegie-Ames-Stanford-Approach (CASA) and VEgetation-Global-Atmosphere-Soil (VEGAS) and observations to evaluate the regional model results. Results show that the modeling group averaged values of CO2 concentrations in all regions with fluxes from VEGAS have significant improvements for most regions. There is still large difference between two model results and observations for grouped average values in North Atlantic, Indian Ocean, and South Pacific Tropics. This implies possible large uncertainties in the fluxes there.

  4. Mammalian-specific genomic functions: Newly acquired traits generated by genomic imprinting and LTR retrotransposon-derived genes in mammals

    PubMed Central

    KANEKO-ISHINO, Tomoko; ISHINO, Fumitoshi

    2015-01-01

    Mammals, including human beings, have evolved a unique viviparous reproductive system and a highly developed central nervous system. How did these unique characteristics emerge in mammalian evolution, and what kinds of changes did occur in the mammalian genomes as evolution proceeded? A key conceptual term in approaching these issues is “mammalian-specific genomic functions”, a concept covering both mammalian-specific epigenetics and genetics. Genomic imprinting and LTR retrotransposon-derived genes are reviewed as the representative, mammalian-specific genomic functions that are essential not only for the current mammalian developmental system, but also mammalian evolution itself. First, the essential roles of genomic imprinting in mammalian development, especially related to viviparous reproduction via placental function, as well as the emergence of genomic imprinting in mammalian evolution, are discussed. Second, we introduce the novel concept of “mammalian-specific traits generated by mammalian-specific genes from LTR retrotransposons”, based on the finding that LTR retrotransposons served as a critical driving force in the mammalian evolution via generating mammalian-specific genes. PMID:26666304

  5. Evolving Diversity of Hepatitis C Viruses in Yunnan Honghe, China

    PubMed Central

    Yang, Lanhui; Jiang, Chenyan; Hu, Song; Diao, Qiongni; Li, Jia; Si, Wei; Chen, Mei; Zhao, Richard Y.

    2016-01-01

    The Chinese Honghe Autonomous Prefecture (Honghe) in Yunnan Province is a unique ethnic area because it is inhabited by more than ten different minority ethnic groups. Geographically, Honghe directly shares a border with Vietnam. The objective of this study was to investigate genetic diversity and distribution of the Hepatitis C virus (HCV) in Honghe. Ninety nine subjects who were infected with HCV or HCV/HIV (Human Immunodeficiency Virus Type 1) were recruited into this study. HCV genotypes and subtypes were determined based on the sequences of the core/envelope 1 (C/E1) and the nonstructural protein 5B (NS5B) genomic regions. The viral diversity and origins of dissemination were examined by phylogenetic analyses. Three HCV genotypes (1, 3 and 6) with six subtypes (1b, 3b, 3a, 6a, 6n and 6v) were identified. The most predominant form was genotype 3 (54.6%) followed by 6 (34.3%), and 1 (9.1%). The HCV subtype 3b appeared to be the most frequent form (38.4%) followed by 6n (20.2%) and 3a (16.2%). Statistical analyses suggested a possible rise of the genotype 6a in Honghe among intravenous drug users with HCV/HIV co-infections. Further phylogenetic analyses suggested that similar HCV-6a viruses might have been circulating in the Honghe area for more than a decade, which likely originated from Vietnam or vice versa. Two HCV samples with single HCV infection (SC34 and SC45) were isolated that could represent new recombinant variants. Although the genetic prevalence of HCV in Honghe is in general agreement with that of Southwest China and Yunnan Province, the diversity of HCV genotypes and subtypes in Honghe is somewhat unique and evolving. Information presented here should provide useful information for future health surveillance and prevention of HCV infection in this area. PMID:26999127

  6. Molecular and genomic characterization of pathogenic traits of group A Streptococcus pyogenes

    PubMed Central

    HAMADA, Shigeyuki; KAWABATA, Shigetada; NAKAGAWA, Ichiro

    2015-01-01

    Group A streptococcus (GAS) or Streptococcus pyogenes causes various diseases ranging from self-limiting sore throat to deadly invasive diseases. The genome size of GAS is 1.85–1.9 Mb, and genomic rearrangement has been demonstrated. GAS possesses various surface-associated substances such as hyaluronic capsule, M proteins, and fibronectin/laminin/immunoglobulin-binding proteins. These are related to the virulence and play multifaceted and mutually reflected roles in the pathogenesis of GAS infections. Invasion of GAS into epithelial cells and deeper tissues provokes immune and non-immune defense or inflammatory responses including the recruitment of neutrophils, macrophages, and dendritic cells in hosts. GAS frequently evades host defense mechanisms by using its virulence factors. Extracellular products of GAS may perturb cellular and subcellular functions and degrade tissues enzymatically, which leads to the aggravation of local and/or systemic disorders in the host. In this review, we summarize some important cellular and extracellular substances that may affect pathogenic processes during GAS infections, and the host responses to these. PMID:26666305

  7. fac-Re(CO)3L complexes containing tridentate monoanionic ligands (L-) with a seldom-studied sulfonamido group as one terminal ligating group.

    PubMed

    Christoforou, Anna Maria; Fronczek, Frank R; Marzilli, Patricia A; Marzilli, Luigi G

    2007-08-20

    To achieve a net-neutral coordination unit in radiopharmaceuticals with a fac-M(CO)3+ core (M = Tc, Re), facially coordinated monoanionic tridentate ligands are needed. New neutral fac-Re(CO)3L complexes were obtained by treating fac-[Re(CO)3(H2O)3]+ with unsymmetrical tridentate NNN donor ligands (LH) based primarily on a diethylenetriamine (dien) moiety with an aromatic group linked to a terminal nitrogen through a sulfonamide. LHs contain 2,4,6-trimethylbenzenesulfonyl (tmbSO2) and 5-(dimethylamino)naphthalene-1-sulfonyl (DNS) groups. X-ray crystallographic and NMR analyses confirm that in both the solid and the solution states all L- in fac-Re(CO)3L complexes are bound in a tridentate fashion with one donor being nitrogen from a deprotonated sulfonamido group. Another fundamental property that is important in radiopharmaceuticals is shape, which in turn depends on ring pucker. For L- = tmbSO2-dien-, tmbSO2-N'-Medien-, and tmbSO2-N,N-Me2dien-, the two chelate rings have a different pucker chirality, as is commonly found for a broad range of metal complexes. However, for fac-Re(CO)3(DNS-dien), both chelate rings have the same pucker chirality because the sulfonamido ring has an unusual pucker for the absolute configuration at Re; a finding that is attributable to intramolecular and intermolecular hydrogen bonds from the sulfonamido oxygens to the NH2 groups. Averaging of tmb NMR signals, even at -90 degrees C for Re(CO)3(tmbSO2-N,N-Me2dien), indicates rapid dynamic motion in the complexes with this group. However, examination of the structures suggests that free rotation about the S-C(tmb) bond is not possible but that concerted coupled rotations about the N-S and the S-C bonds can explain the NMR data.

  8. Insights into Land Plant Evolution Garnered from the Marchantia polymorpha Genome.

    PubMed

    Bowman, John L; Kohchi, Takayuki; Yamato, Katsuyuki T; Jenkins, Jerry; Shu, Shengqiang; Ishizaki, Kimitsune; Yamaoka, Shohei; Nishihama, Ryuichi; Nakamura, Yasukazu; Berger, Frédéric; Adam, Catherine; Aki, Shiori Sugamata; Althoff, Felix; Araki, Takashi; Arteaga-Vazquez, Mario A; Balasubrmanian, Sureshkumar; Barry, Kerrie; Bauer, Diane; Boehm, Christian R; Briginshaw, Liam; Caballero-Perez, Juan; Catarino, Bruno; Chen, Feng; Chiyoda, Shota; Chovatia, Mansi; Davies, Kevin M; Delmans, Mihails; Demura, Taku; Dierschke, Tom; Dolan, Liam; Dorantes-Acosta, Ana E; Eklund, D Magnus; Florent, Stevie N; Flores-Sandoval, Eduardo; Fujiyama, Asao; Fukuzawa, Hideya; Galik, Bence; Grimanelli, Daniel; Grimwood, Jane; Grossniklaus, Ueli; Hamada, Takahiro; Haseloff, Jim; Hetherington, Alexander J; Higo, Asuka; Hirakawa, Yuki; Hundley, Hope N; Ikeda, Yoko; Inoue, Keisuke; Inoue, Shin-Ichiro; Ishida, Sakiko; Jia, Qidong; Kakita, Mitsuru; Kanazawa, Takehiko; Kawai, Yosuke; Kawashima, Tomokazu; Kennedy, Megan; Kinose, Keita; Kinoshita, Toshinori; Kohara, Yuji; Koide, Eri; Komatsu, Kenji; Kopischke, Sarah; Kubo, Minoru; Kyozuka, Junko; Lagercrantz, Ulf; Lin, Shih-Shun; Lindquist, Erika; Lipzen, Anna M; Lu, Chia-Wei; De Luna, Efraín; Martienssen, Robert A; Minamino, Naoki; Mizutani, Masaharu; Mizutani, Miya; Mochizuki, Nobuyoshi; Monte, Isabel; Mosher, Rebecca; Nagasaki, Hideki; Nakagami, Hirofumi; Naramoto, Satoshi; Nishitani, Kazuhiko; Ohtani, Misato; Okamoto, Takashi; Okumura, Masaki; Phillips, Jeremy; Pollak, Bernardo; Reinders, Anke; Rövekamp, Moritz; Sano, Ryosuke; Sawa, Shinichiro; Schmid, Marc W; Shirakawa, Makoto; Solano, Roberto; Spunde, Alexander; Suetsugu, Noriyuki; Sugano, Sumio; Sugiyama, Akifumi; Sun, Rui; Suzuki, Yutaka; Takenaka, Mizuki; Takezawa, Daisuke; Tomogane, Hirokazu; Tsuzuki, Masayuki; Ueda, Takashi; Umeda, Masaaki; Ward, John M; Watanabe, Yuichiro; Yazaki, Kazufumi; Yokoyama, Ryusuke; Yoshitake, Yoshihiro; Yotsui, Izumi; Zachgo, Sabine; Schmutz, Jeremy

    2017-10-05

    The evolution of land flora transformed the terrestrial environment. Land plants evolved from an ancestral charophycean alga from which they inherited developmental, biochemical, and cell biological attributes. Additional biochemical and physiological adaptations to land, and a life cycle with an alternation between multicellular haploid and diploid generations that facilitated efficient dispersal of desiccation tolerant spores, evolved in the ancestral land plant. We analyzed the genome of the liverwort Marchantia polymorpha, a member of a basal land plant lineage. Relative to charophycean algae, land plant genomes are characterized by genes encoding novel biochemical pathways, new phytohormone signaling pathways (notably auxin), expanded repertoires of signaling pathways, and increased diversity in some transcription factor families. Compared with other sequenced land plants, M. polymorpha exhibits low genetic redundancy in most regulatory pathways, with this portion of its genome resembling that predicted for the ancestral land plant. PAPERCLIP. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.

  9. Dynamix: dynamic visualization by automatic selection of informative tracks from hundreds of genomic datasets.

    PubMed

    Monfort, Matthias; Furlong, Eileen E M; Girardot, Charles

    2017-07-15

    Visualization of genomic data is fundamental for gaining insights into genome function. Yet, co-visualization of a large number of datasets remains a challenge in all popular genome browsers and the development of new visualization methods is needed to improve the usability and user experience of genome browsers. We present Dynamix, a JBrowse plugin that enables the parallel inspection of hundreds of genomic datasets. Dynamix takes advantage of a priori knowledge to automatically display data tracks with signal within a genomic region of interest. As the user navigates through the genome, Dynamix automatically updates data tracks and limits all manual operations otherwise needed to adjust the data visible on screen. Dynamix also introduces a new carousel view that optimizes screen utilization by enabling users to independently scroll through groups of tracks. Dynamix is hosted at http://furlonglab.embl.de/Dynamix . charles.girardot@embl.de. Supplementary data are available at Bioinformatics online. © The Author(s) 2017. Published by Oxford University Press.

  10. Dispersion of the RmInt1 group II intron in the Sinorhizobium meliloti genome upon acquisition by conjugative transfer.

    PubMed

    Nisa-Martínez, Rafael; Jiménez-Zurdo, José I; Martínez-Abarca, Francisco; Muñoz-Adelantado, Estefanía; Toro, Nicolás

    2007-01-01

    RmInt1 is a self-splicing and mobile group II intron initially identified in the bacterium Sinorhizobium meliloti, which encodes a reverse transcriptase-maturase (Intron Encoded Protein, IEP) lacking the C-terminal DNA binding (D) and DNA endonuclease domains (En). RmInt1 invades cognate intronless homing sites (ISRm2011-2) by a mechanism known as retrohoming. This work describes how the RmInt1 intron spreads in the S.meliloti genome upon acquisition by conjugation. This process was revealed by using the wild-type intron RmInt1 and engineered intron-donor constructs based on ribozyme coding sequence (DeltaORF)-derivatives with higher homing efficiency than the wild-type intron. The data demonstrate that RmInt1 propagates into the S.meliloti genome primarily by retrohoming with a strand bias related to replication of the chromosome and symbiotic megaplasmids. Moreover, we show that when expressed in trans from a separate plasmid, the IEP is able to mobilize genomic DeltaORF ribozymes that afterward displayed wild-type levels of retrohoming. Our results contribute to get further understanding of how group II introns spread into bacterial genomes in nature.

  11. Dispersion of the RmInt1 group II intron in the Sinorhizobium meliloti genome upon acquisition by conjugative transfer

    PubMed Central

    Nisa-Martínez, Rafael; Jiménez-Zurdo, José I.; Martínez-Abarca, Francisco; Muñoz-Adelantado, Estefanía; Toro, Nicolás

    2007-01-01

    RmInt1 is a self-splicing and mobile group II intron initially identified in the bacterium Sinorhizobium meliloti, which encodes a reverse transcriptase–maturase (Intron Encoded Protein, IEP) lacking the C-terminal DNA binding (D) and DNA endonuclease domains (En). RmInt1 invades cognate intronless homing sites (ISRm2011-2) by a mechanism known as retrohoming. This work describes how the RmInt1 intron spreads in the S.meliloti genome upon acquisition by conjugation. This process was revealed by using the wild-type intron RmInt1 and engineered intron-donor constructs based on ribozyme coding sequence (ΔORF)-derivatives with higher homing efficiency than the wild-type intron. The data demonstrate that RmInt1 propagates into the S.meliloti genome primarily by retrohoming with a strand bias related to replication of the chromosome and symbiotic megaplasmids. Moreover, we show that when expressed in trans from a separate plasmid, the IEP is able to mobilize genomic ΔORF ribozymes that afterward displayed wild-type levels of retrohoming. Our results contribute to get further understanding of how group II introns spread into bacterial genomes in nature. PMID:17158161

  12. A non-classical phase diagram for virus-bacterial co-evolution mediated by CRISPR

    NASA Astrophysics Data System (ADS)

    Han, Pu; Deem, Michael

    CRISPR is a newly discovered prokaryotic immune system. Bacteria and archaea with this system incorporate genetic material from invading viruses into their genomes, providing protection against future infection by similar viruses. Due to the cost of CRISPR, bacteria can lose the acquired immunity. We will show an intriguing phase diagram of the virus extinction probability, which when the rate of losing the acquired immunity is small, is more complex than that of the classic predator-prey model. As the CRISPR incorporates genetic material, viruses are under pressure to evolve to escape the recognition by CRISPR, and this co-evolution leads to a non-trivial phase structure that cannot be explained by the classical predator-prey model.

  13. "Orphan" retrogenes in the human genome.

    PubMed

    Ciomborowska, Joanna; Rosikiewicz, Wojciech; Szklarczyk, Damian; Makałowski, Wojciech; Makałowska, Izabela

    2013-02-01

    Gene duplicates generated via retroposition were long thought to be pseudogenized and consequently decayed. However, a significant number of these genes escaped their evolutionary destiny and evolved into functional genes. Despite multiple studies, the number of functional retrogenes in human and other genomes remains unclear. We performed a comparative analysis of human, chicken, and worm genomes to identify "orphan" retrogenes, that is, retrogenes that have replaced their progenitors. We located 25 such candidates in the human genome. All of these genes were previously known, and the majority has been intensively studied. Despite this, they have never been recognized as retrogenes. Analysis revealed that the phenomenon of replacing parental genes with their retrocopies has been taking place over the entire span of animal evolution. This process was often species specific and contributed to interspecies differences. Surprisingly, these retrogenes, which should evolve in a more relaxed mode, are subject to a very strong purifying selection, which is, on average, two and a half times stronger than other human genes. Also, for retrogenes, they do not show a typical overall tendency for a testis-specific expression. Notably, seven of them are associated with human diseases. Recognizing them as "orphan" retrocopies, which have different regulatory machinery than their parents, is important for any disease studies in model organisms, especially when discoveries made in one species are transferred to humans.

  14. Genome-scale modeling of the evolutionary path to C4 photosynthesis

    NASA Astrophysics Data System (ADS)

    Myers, Christopher R.; Bogart, Eli

    In C4 photosynthesis, plants maintain a high carbon dioxide level in specialized bundle sheath cells surrounding leaf veins and restrict CO2 assimilation to those cells, favoring CO2 over O2 in competition for Rubisco active sites. In C3 plants, which do not possess such a carbon concentrating mechanism, CO2 fixation is reduced due to this competition. Despite the complexity of the C4 system, it has evolved convergently from more than 60 independent origins in diverse families of plants around the world over the last 30 million years. We study the evolution of the C4 system in a genome-scale model of plant metabolism that describes interacting mesophyll and bundle sheath cells and enforces key nonlinear kinetic relationships. Adapting the zero-temperature string method for simulating transition paths in physics and chemistry, we find the highest-fitness paths connecting C3 and C4 positions in the model's high-dimensional parameter space, and show that they reproduce known aspects of the C3-C4 transition while making additional predictions about metabolic changes along the path. We explore the relationship between evolutionary history and C4 biochemical subtype, and the effects of atmospheric carbon dioxide levels.

  15. Social and behavioral research in genomic sequencing: approaches from the Clinical Sequencing Exploratory Research Consortium Outcomes and Measures Working Group.

    PubMed

    Gray, Stacy W; Martins, Yolanda; Feuerman, Lindsay Z; Bernhardt, Barbara A; Biesecker, Barbara B; Christensen, Kurt D; Joffe, Steven; Rini, Christine; Veenstra, David; McGuire, Amy L

    2014-10-01

    The routine use of genomic sequencing in clinical medicine has the potential to dramatically alter patient care and medical outcomes. To fully understand the psychosocial and behavioral impact of sequencing integration into clinical practice, it is imperative that we identify the factors that influence sequencing-related decision making and patient outcomes. In an effort to develop a collaborative and conceptually grounded approach to studying sequencing adoption, members of the National Human Genome Research Institute's Clinical Sequencing Exploratory Research Consortium formed the Outcomes and Measures Working Group. Here we highlight the priority areas of investigation and psychosocial and behavioral outcomes identified by the Working Group. We also review some of the anticipated challenges to measurement in social and behavioral research related to genomic sequencing; opportunities for instrument development; and the importance of qualitative, quantitative, and mixed-method approaches. This work represents the early, shared efforts of multiple research teams as we strive to understand individuals' experiences with genomic sequencing. The resulting body of knowledge will guide recommendations for the optimal use of sequencing in clinical practice.

  16. A PLSPM-based test statistic for detecting gene-gene co-association in genome-wide association study with case-control design.

    PubMed

    Zhang, Xiaoshuai; Yang, Xiaowei; Yuan, Zhongshang; Liu, Yanxun; Li, Fangyu; Peng, Bin; Zhu, Dianwen; Zhao, Jinghua; Xue, Fuzhong

    2013-01-01

    For genome-wide association data analysis, two genes in any pathway, two SNPs in the two linked gene regions respectively or in the two linked exons respectively within one gene are often correlated with each other. We therefore proposed the concept of gene-gene co-association, which refers to the effects not only due to the traditional interaction under nearly independent condition but the correlation between two genes. Furthermore, we constructed a novel statistic for detecting gene-gene co-association based on Partial Least Squares Path Modeling (PLSPM). Through simulation, the relationship between traditional interaction and co-association was highlighted under three different types of co-association. Both simulation and real data analysis demonstrated that the proposed PLSPM-based statistic has better performance than single SNP-based logistic model, PCA-based logistic model, and other gene-based methods.

  17. A PLSPM-Based Test Statistic for Detecting Gene-Gene Co-Association in Genome-Wide Association Study with Case-Control Design

    PubMed Central

    Zhang, Xiaoshuai; Yang, Xiaowei; Yuan, Zhongshang; Liu, Yanxun; Li, Fangyu; Peng, Bin; Zhu, Dianwen; Zhao, Jinghua; Xue, Fuzhong

    2013-01-01

    For genome-wide association data analysis, two genes in any pathway, two SNPs in the two linked gene regions respectively or in the two linked exons respectively within one gene are often correlated with each other. We therefore proposed the concept of gene-gene co-association, which refers to the effects not only due to the traditional interaction under nearly independent condition but the correlation between two genes. Furthermore, we constructed a novel statistic for detecting gene-gene co-association based on Partial Least Squares Path Modeling (PLSPM). Through simulation, the relationship between traditional interaction and co-association was highlighted under three different types of co-association. Both simulation and real data analysis demonstrated that the proposed PLSPM-based statistic has better performance than single SNP-based logistic model, PCA-based logistic model, and other gene-based methods. PMID:23620809

  18. Vertebrate Genome Evolution in the Light of Fish Cytogenomics and rDNAomics

    PubMed Central

    Howell, W. Mike

    2018-01-01

    To understand the cytogenomic evolution of vertebrates, we must first unravel the complex genomes of fishes, which were the first vertebrates to evolve and were ancestors to all other vertebrates. We must not forget the immense time span during which the fish genomes had to evolve. Fish cytogenomics is endowed with unique features which offer irreplaceable insights into the evolution of the vertebrate genome. Due to the general DNA base compositional homogeneity of fish genomes, fish cytogenomics is largely based on mapping DNA repeats that still represent serious obstacles in genome sequencing and assembling, even in model species. Localization of repeats on chromosomes of hundreds of fish species and populations originating from diversified environments have revealed the biological importance of this genomic fraction. Ribosomal genes (rDNA) belong to the most informative repeats and in fish, they are subject to a more relaxed regulation than in higher vertebrates. This can result in formation of a literal ‘rDNAome’ consisting of more than 20,000 copies with their high proportion employed in extra-coding functions. Because rDNA has high rates of transcription and recombination, it contributes to genome diversification and can form reproductive barrier. Our overall knowledge of fish cytogenomics grows rapidly by a continuously increasing number of fish genomes sequenced and by use of novel sequencing methods improving genome assembly. The recently revealed exceptional compositional heterogeneity in an ancient fish lineage (gars) sheds new light on the compositional genome evolution in vertebrates generally. We highlight the power of synergy of cytogenetics and genomics in fish cytogenomics, its potential to understand the complexity of genome evolution in vertebrates, which is also linked to clinical applications and the chromosomal backgrounds of speciation. We also summarize the current knowledge on fish cytogenomics and outline its main future avenues. PMID

  19. PSAT: A web tool to compare genomic neighborhoods of multiple prokaryotic genomes

    PubMed Central

    Fong, Christine; Rohmer, Laurence; Radey, Matthew; Wasnick, Michael; Brittnacher, Mitchell J

    2008-01-01

    Background The conservation of gene order among prokaryotic genomes can provide valuable insight into gene function, protein interactions, or events by which genomes have evolved. Although some tools are available for visualizing and comparing the order of genes between genomes of study, few support an efficient and organized analysis between large numbers of genomes. The Prokaryotic Sequence homology Analysis Tool (PSAT) is a web tool for comparing gene neighborhoods among multiple prokaryotic genomes. Results PSAT utilizes a database that is preloaded with gene annotation, BLAST hit results, and gene-clustering scores designed to help identify regions of conserved gene order. Researchers use the PSAT web interface to find a gene of interest in a reference genome and efficiently retrieve the sequence homologs found in other bacterial genomes. The tool generates a graphic of the genomic neighborhood surrounding the selected gene and the corresponding regions for its homologs in each comparison genome. Homologs in each region are color coded to assist users with analyzing gene order among various genomes. In contrast to common comparative analysis methods that filter sequence homolog data based on alignment score cutoffs, PSAT leverages gene context information for homologs, including those with weak alignment scores, enabling a more sensitive analysis. Features for constraining or ordering results are designed to help researchers browse results from large numbers of comparison genomes in an organized manner. PSAT has been demonstrated to be useful for helping to identify gene orthologs and potential functional gene clusters, and detecting genome modifications that may result in loss of function. Conclusion PSAT allows researchers to investigate the order of genes within local genomic neighborhoods of multiple genomes. A PSAT web server for public use is available for performing analyses on a growing set of reference genomes through any web browser with no client

  20. Simple stochastic birth and death models of genome evolution: was there enough time for us to evolve?

    PubMed

    Karev, Georgy P; Wolf, Yuri I; Koonin, Eugene V

    2003-10-12

    The distributions of many genome-associated quantities, including the membership of paralogous gene families can be approximated with power laws. We are interested in developing mathematical models of genome evolution that adequately account for the shape of these distributions and describe the evolutionary dynamics of their formation. We show that simple stochastic models of genome evolution lead to power-law asymptotics of protein domain family size distribution. These models, called Birth, Death and Innovation Models (BDIM), represent a special class of balanced birth-and-death processes, in which domain duplication and deletion rates are asymptotically equal up to the second order. The simplest, linear BDIM shows an excellent fit to the observed distributions of domain family size in diverse prokaryotic and eukaryotic genomes. However, the stochastic version of the linear BDIM explored here predicts that the actual size of large paralogous families is reached on an unrealistically long timescale. We show that introduction of non-linearity, which might be interpreted as interaction of a particular order between individual family members, allows the model to achieve genome evolution rates that are much better compatible with the current estimates of the rates of individual duplication/loss events.

  1. A Feast of Malaria Parasite Genomes.

    PubMed

    Carlton, Jane M; Sullivan, Steven A

    2017-03-08

    The Plasmodium genus has evolved over time and across hosts, complexifying our understanding of malaria. In a recent Nature paper, Rutledge et al. (2017) describe the genome sequences of three major human malaria parasite species, providing insight into Plasmodium evolution and raising the question of how many species there are. Copyright © 2017 Elsevier Inc. All rights reserved.

  2. Genomics and metagenomics in medical microbiology.

    PubMed

    Padmanabhan, Roshan; Mishra, Ajay Kumar; Raoult, Didier; Fournier, Pierre-Edouard

    2013-12-01

    Over the last two decades, sequencing tools have evolved from laborious time-consuming methodologies to real-time detection and deciphering of genomic DNA. Genome sequencing, especially using next generation sequencing (NGS) has revolutionized the landscape of microbiology and infectious disease. This deluge of sequencing data has not only enabled advances in fundamental biology but also helped improve diagnosis, typing of pathogen, virulence and antibiotic resistance detection, and development of new vaccines and culture media. In addition, NGS also enabled efficient analysis of complex human micro-floras, both commensal, and pathological, through metagenomic methods, thus helping the comprehension and management of human diseases such as obesity. This review summarizes technological advances in genomics and metagenomics relevant to the field of medical microbiology. Copyright © 2013 Elsevier B.V. All rights reserved.

  3. Genome-wide selective sweeps and gene-specific sweeps in natural bacterial populations

    DOE PAGES

    Bendall, Matthew L.; Stevens, Sarah L.R.; Chan, Leong-Keat; ...

    2016-01-08

    Multiple models describe the formation and evolution of distinct microbial phylogenetic groups. These evolutionary models make different predictions regarding how adaptive alleles spread through populations and how genetic diversity is maintained. Processes predicted by competing evolutionary models, for example, genome-wide selective sweeps vs gene-specific sweeps, could be captured in natural populations using time-series metagenomics if the approach were applied over a sufficiently long time frame. Direct observations of either process would help resolve how distinct microbial groups evolve. Using a 9-year metagenomic study of a freshwater lake (2005–2013), we explore changes in single-nucleotide polymorphism (SNP) frequencies and patterns of genemore » gain and loss in 30 bacterial populations. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied by >1000-fold among populations. SNP allele frequencies also changed dramatically over time within some populations. Interestingly, nearly all SNP variants were slowly purged over several years from one population of green sulfur bacteria, while at the same time multiple genes either swept through or were lost from this population. Furthermore, these patterns were consistent with a genome-wide selective sweep in progress, a process predicted by the ‘ecotype model’ of speciation but not previously observed in nature. In contrast, other populations contained large, SNP-free genomic regions that appear to have swept independently through the populations prior to the study without purging diversity elsewhere in the genome. Finally, evidence for both genome-wide and gene-specific sweeps suggests that different models of bacterial speciation may apply to different populations coexisting in the same environment.« less

  4. Ongoing Relative Performance Evaluation for a CO2 EOR Asset in a Worldwide Peer Group

    NASA Astrophysics Data System (ADS)

    Zhao, C. F.; Li, X. S.; Wang, G. H.; Li, L.

    2017-10-01

    Abstract. Operators of a CO2 EOR asset need to know the relative performance level of their asset against its peers. The ongoing relative performance evaluation method is appropriate for this purpose. We first choose 52 CO2 assets around the world as the peer group, and then define the four ranking levels in terms of CO2consumption ratio. Only the final values of CO2consumption ratio for the group are obtained, and therefore cannot be used for an ongoing evaluation during a CO2 EOR asset’s life circle. Consequently, numerical reservoir simulation is employed to quantify the process values corresponding to the four ranking levels. Type curve plots are generated on the basis of the process values and utilized for the ongoing relative performance evaluation of a CO2 EOR asset in China.

  5. Co-evolution of atmospheres, life, and climate.

    PubMed

    Grenfell, J Lee; Rauer, Heike; Selsis, Franck; Kaltenegger, Lisa; Beichman, Charles; Danchi, William; Eiroa, Carlos; Fridlund, Malcolm; Henning, Thomas; Herbst, Tom; Lammer, Helmut; Léger, Alain; Liseau, René; Lunine, Jonathan; Paresce, Francesco; Penny, Alan; Quirrenbach, Andreas; Röttgering, Huub; Schneider, Jean; Stam, Daphne; Tinetti, Giovanna; White, Glenn J

    2010-01-01

    After Earth's origin, our host star, the Sun, was shining 20-25% less brightly than today. Without greenhouse-like conditions to warm the atmosphere, our early planet would have been an ice ball, and life may never have evolved. But life did evolve, which indicates that greenhouse gases must have been present on early Earth to warm the planet. Evidence from the geological record indicates an abundance of the greenhouse gas CO(2). CH(4) was probably present as well; and, in this regard, methanogenic bacteria, which belong to a diverse group of anaerobic prokaryotes that ferment CO(2) plus H(2) to CH(4), may have contributed to modification of the early atmosphere. Molecular oxygen was not present, as is indicated by the study of rocks from that era, which contain iron carbonate rather than iron oxide. Multicellular organisms originated as cells within colonies that became increasingly specialized. The development of photosynthesis allowed the Sun's energy to be harvested directly by life-forms. The resultant oxygen accumulated in the atmosphere and formed the ozone layer in the upper atmosphere. Aided by the absorption of harmful UV radiation in the ozone layer, life colonized Earth's surface. Our own planet is a very good example of how life-forms modified the atmosphere over the planets' lifetime. We show that these facts have to be taken into account when we discover and characterize atmospheres of Earth-like exoplanets. If life has originated and evolved on a planet, then it should be expected that a strong co-evolution occurred between life and the atmosphere, the result of which is the planet's climate.

  6. Seed desiccation mechanisms co-opted for vegetative desiccation in the resurrection grass Oropetium thomaeum.

    PubMed

    VanBuren, Robert; Wai, Ching Man; Zhang, Qingwei; Song, Xiaomin; Edger, Patrick P; Bryant, Doug; Michael, Todd P; Mockler, Todd C; Bartels, Dorothea

    2017-10-01

    Resurrection plants desiccate during periods of prolonged drought stress, then resume normal cellular metabolism upon water availability. Desiccation tolerance has multiple origins in flowering plants, and it likely evolved through rewiring seed desiccation pathways. Oropetium thomaeum is an emerging model for extreme drought tolerance, and its genome, which is the smallest among surveyed grasses, was recently sequenced. Combining RNA-seq, targeted metabolite analysis and comparative genomics, we show evidence for co-option of seed-specific pathways during vegetative desiccation. Desiccation-related gene co-expression clusters are enriched in functions related to seed development including several seed-specific transcription factors. Across the metabolic network, pathways involved in programmed cell death inhibition, ABA signalling and others are activated during dehydration. Oleosins and oil bodies that typically function in seed storage are highly abundant in desiccated leaves and may function for membrane stability and storage. Orthologs to seed-specific LEA proteins from rice and maize have neofunctionalized in Oropetium with high expression during desiccation. Accumulation of sucrose, raffinose and stachyose in drying leaves mirrors sugar accumulation patterns in maturing seeds. Together, these results connect vegetative desiccation with existing seed desiccation and drought responsive pathways and provide some key candidate genes for engineering improved drought tolerance in crop plants. © 2017 John Wiley & Sons Ltd.

  7. Seed development and genomic imprinting in plants.

    PubMed

    Köhler, Claudia; Grossniklaus, Ueli

    2005-01-01

    Genomic imprinting refers to an epigenetic phenomenon where the activity of an allele depends on its parental origin. Imprinting at individual genes has only been described in mammals and seed plants. We will discuss the role imprinted genes play in seed development and compare the situation in plants with that in mammals. Interestingly, many imprinted genes appear to control cell proliferation and growth in both groups of organisms although imprinting in plants may also be involved in the cellular differentiation of the two pairs of gametes involved in double fertilization. DNA methylation plays some role in the control of parent-of-origin-specific expression in both mammals and plants. Thus, although imprinting evolved independently in mammals and plants, there are striking similarities at the phenotypic and possibly also mechanistic level.

  8. Genomic medicine: health care issues and the unresolved ethical and social dilemmas.

    PubMed

    Idemyor, Vincent

    2014-01-01

    Our perception of the mechanism by which single genes can cause disease is evolving. This has led to the understanding of the pathophysiological basis of common diseases. Genomic Medicine continues to contribute to the understanding of the molecular basis of disease. Medicine has strived to achieve the goal of tailoring interventions to individual variations in risk and treatment response and advances in medical genomics will facilitate this process. Relevant to present-day practice is the use of genomic information to classify individuals according to disease susceptibility or expected responsiveness to a pharmacologic treatment and to provide targeted interventions. By investigating the genetic profile of individuals, medical professionals are able to select patients and use the information obtained to plan out a course of treatment that is much more in step with the way their body works. However, society is concerned about the effect genetic knowledge will have on ethnic or racial groups. Currently, the Health Insurance Portability and Accountability Act prohibits discrimination based on genetics. There is a need to increase the understanding of the social and ethical challenges that genomics information may pose to clinicians and scientists. This review is not meant to be exhaustive; rather, clinically relevant examples are used to illustrate how genomic medicine can facilitate the provision of molecular diagnostic methods that improve drug therapy. Finally, the rapid pace of change in genomics may likely make my conclusions today obsolete tomorrow.

  9. An intronic open reading frame was released from one of group II introns in the mitochondrial genome of the haptophyte Chrysochromulina sp. NIES-1333

    PubMed Central

    Nishimura, Yuki; Kamikawa, Ryoma; Hashimoto, Tetsuo; Inagaki, Yuji

    2014-01-01

    Mitochondrial (mt) genome sequences, which often bear introns, have been sampled from phylogenetically diverse eukaryotes. Thus, we can anticipate novel insights into intron evolution from previously unstudied mt genomes. We here investigated the origins and evolution of three introns in the mt genome of the haptophyte Chrysochromulina sp. NIES-1333, which was sequenced completely in this study. All the three introns were characterized as group II, on the basis of predicted secondary structure, and the conserved sequence motifs at the 5′ and 3′ termini. Our comparative studies on diverse mt genomes prompt us to propose that the Chrysochromulina mt genome laterally acquired the introns from mt genomes in distantly related eukaryotes. Many group II introns harbor intronic open reading frames for the proteins (intron-encoded proteins or IEPs), which likely facilitate the splicing of their host introns. However, we propose that a “free-standing,” IEP-like protein, which is not encoded within any introns in the Chrysochromulina mt genome, is involved in the splicing of the first cox1 intron that lacks any open reading frames. PMID:25054084

  10. Identification of candidate genes in Populus cell wall biosynthesis using text-mining, co-expression network and comparative genomics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yang, Xiaohan; Ye, Chuyu; Bisaria, Anjali

    2011-01-01

    Populus is an important bioenergy crop for bioethanol production. A greater understanding of cell wall biosynthesis processes is critical in reducing biomass recalcitrance, a major hindrance in efficient generation of ethanol from lignocellulosic biomass. Here, we report the identification of candidate cell wall biosynthesis genes through the development and application of a novel bioinformatics pipeline. As a first step, via text-mining of PubMed publications, we obtained 121 Arabidopsis genes that had the experimental evidences supporting their involvement in cell wall biosynthesis or remodeling. The 121 genes were then used as bait genes to query an Arabidopsis co-expression database and additionalmore » genes were identified as neighbors of the bait genes in the network, increasing the number of genes to 548. The 548 Arabidopsis genes were then used to re-query the Arabidopsis co-expression database and re-construct a network that captured additional network neighbors, expanding to a total of 694 genes. The 694 Arabidopsis genes were computationally divided into 22 clusters. Queries of the Populus genome using the Arabidopsis genes revealed 817 Populus orthologs. Functional analysis of gene ontology and tissue-specific gene expression indicated that these Arabidopsis and Populus genes are high likelihood candidates for functional genomics in relation to cell wall biosynthesis.« less

  11. Evolution and dynamics of megaplasmids with genome sizes larger than 100 kb in the Bacillus cereus group.

    PubMed

    Zheng, Jinshui; Peng, Donghai; Ruan, Lifang; Sun, Ming

    2013-12-02

    Plasmids play a crucial role in the evolution of bacterial genomes by mediating horizontal gene transfer. However, the origin and evolution of most plasmids remains unclear, especially for megaplasmids. Strains of the Bacillus cereus group contain up to 13 plasmids with genome sizes ranging from 2 kb to 600 kb, and thus can be used to study plasmid dynamics and evolution. This work studied the origin and evolution of 31 B. cereus group megaplasmids (>100 kb) focusing on the most conserved regions on plasmids, minireplicons. Sixty-five putative minireplicons were identified and classified to six types on the basis of proteins that are essential for replication. Twenty-nine of the 31 megaplasmids contained two or more minireplicons. Phylogenetic analysis of the protein sequences showed that different minireplicons on the same megaplasmid have different evolutionary histories. Therefore, we speculated that these megaplasmids are the results of fusion of smaller plasmids. All plasmids of a bacterial strain must be compatible. In megaplasmids of the B. cereus group, individual minireplicons of different megaplasmids in the same strain belong to different types or subtypes. Thus, the subtypes of each minireplicon they contain may determine the incompatibilities of megaplasmids. A broader analysis of all 1285 bacterial plasmids with putative known minireplicons whose complete genome sequences were available from GenBank revealed that 34% (443 plasmids) of the plasmids have two or more minireplicons. This indicates that plasmid fusion events are general among bacterial plasmids. Megaplasmids of B. cereus group are fusion of smaller plasmids, and the fusion of plasmids likely occurs frequently in the B. cereus group and in other bacterial taxa. Plasmid fusion may be one of the major mechanisms for formation of novel megaplasmids in the evolution of bacteria.

  12. Recurrent emergence of structural variants of LTR retrotransposon CsRn1 evolving novel expression strategy and their selective expansion in a carcinogenic liver fluke, Clonorchis sinensis.

    PubMed

    Kim, Seon-Hee; Kong, Yoon; Bae, Young-An

    2017-06-01

    Autonomous retrotransposons, in which replication and transcription are coupled, encode the essential gag and pol genes as a fusion or separate overlapping form(s) that are expressed in single transcripts regulated by a common upstream promoter. The element-specific expression strategies have driven development of relevant translational recoding mechanisms including ribosomal frameshifting to satisfy the protein stoichiometry critical for the assembly of infectious virus-like particles. Retrotransposons with different recoding strategies exhibit a mosaic distribution pattern across the diverse families of reverse transcribing elements, even though their respective distributions are substantially skewed towards certain family groups. However, only a few investigations to date have focused on the emergence of retrotransposons evolving novel expression strategy and causal genetic drivers of the structural variants. In this study, the bulk of genomic and transcribed sequences of a Ty3/gypsy-like CsRn1 retrotransposon in Clonorchis sinensis were analyzed for the comprehensive examination of its expression strategy. Our results demonstrated that structural variants with single open reading frame (ORF) have recurrently emerged from precedential CsRn1 copies encoding overlapping gag-pol ORFs by a single-nucleotide insertion in an upstream region of gag stop codon. In the parasite genome, some of the newly evolved variants appeared to undergo proliferative burst as active master lineages together with their ancestral copies. The genetic event was similarly observed in Opisthorchis viverrini, the closest neighbor of C. sinensis, whereas the resulting structural variants might have failed to overcome purifying selection and comprised minor remnant copies in the Opisthorchis genome. Copyright © 2017 Elsevier B.V. All rights reserved.

  13. Phylogenetic relationship and virulence inference of Streptococcus Anginosus Group: curated annotation and whole-genome comparative analysis support distinct species designation

    PubMed Central

    2013-01-01

    Background The Streptococcus Anginosus Group (SAG) represents three closely related species of the viridans group streptococci recognized as commensal bacteria of the oral, gastrointestinal and urogenital tracts. The SAG also cause severe invasive infections, and are pathogens during cystic fibrosis (CF) pulmonary exacerbation. Little genomic information or description of virulence mechanisms is currently available for SAG. We conducted intra and inter species whole-genome comparative analyses with 59 publically available Streptococcus genomes and seven in-house closed high quality finished SAG genomes; S. constellatus (3), S. intermedius (2), and S. anginosus (2). For each SAG species, we sequenced at least one numerically dominant strain from CF airways recovered during acute exacerbation and an invasive, non-lung isolate. We also evaluated microevolution that occurred within two isolates that were cultured from one individual one year apart. Results The SAG genomes were most closely related to S. gordonii and S. sanguinis, based on shared orthologs and harbor a similar number of proteins within each COG category as other Streptococcus species. Numerous characterized streptococcus virulence factor homologs were identified within the SAG genomes including; adherence, invasion, spreading factors, LPxTG cell wall proteins, and two component histidine kinases known to be involved in virulence gene regulation. Mobile elements, primarily integrative conjugative elements and bacteriophage, account for greater than 10% of the SAG genomes. S. anginosus was the most variable species sequenced in this study, yielding both the smallest and the largest SAG genomes containing multiple genomic rearrangements, insertions and deletions. In contrast, within the S. constellatus and S. intermedius species, there was extensive continuous synteny, with only slight differences in genome size between strains. Within S. constellatus we were able to determine important SNPs and changes in

  14. Insights from genomic comparisons of genetically monomorphic bacterial pathogens

    PubMed Central

    Achtman, Mark

    2012-01-01

    Some of the most deadly bacterial diseases, including leprosy, anthrax and plague, are caused by bacterial lineages with extremely low levels of genetic diversity, the so-called ‘genetically monomorphic bacteria’. It has only become possible to analyse the population genetics of such bacteria since the recent advent of high-throughput comparative genomics. The genomes of genetically monomorphic lineages contain very few polymorphic sites, which often reflect unambiguous clonal genealogies. Some genetically monomorphic lineages have evolved in the last decades, e.g. antibiotic-resistant Staphylococcus aureus, whereas others have evolved over several millennia, e.g. the cause of plague, Yersinia pestis. Based on recent results, it is now possible to reconstruct the sources and the history of pandemic waves of plague by a combined analysis of phylogeographic signals in Y. pestis plus polymorphisms found in ancient DNA. Different from historical accounts based exclusively on human disease, Y. pestis evolved in China, or the vicinity, and has spread globally on multiple occasions. These routes of transmission can be reconstructed from the genealogy, most precisely for the most recent pandemic that was spread from Hong Kong in multiple independent waves in 1894. PMID:22312053

  15. Within-host whole genome analysis of an antibiotic resistant Pseudomonas aeruginosa strain sub-type in cystic fibrosis.

    PubMed

    Sherrard, Laura J; Tai, Anna S; Wee, Bryan A; Ramsay, Kay A; Kidd, Timothy J; Ben Zakour, Nouri L; Whiley, David M; Beatson, Scott A; Bell, Scott C

    2017-01-01

    A Pseudomonas aeruginosa AUST-02 strain sub-type (M3L7) has been identified in Australia, infects the lungs of some people with cystic fibrosis and is associated with antibiotic resistance. Multiple clonal lineages may emerge during treatment with mutations in chromosomally encoded antibiotic resistance genes commonly observed. Here we describe the within-host diversity and antibiotic resistance of M3L7 during and after antibiotic treatment of an acute pulmonary exacerbation using whole genome sequencing and show both variation and shared mutations in important genes. Eleven isolates from an M3L7 population (n = 134) isolated over 3 months from an individual with cystic fibrosis underwent whole genome sequencing. A phylogeny based on core genome SNPs identified three distinct phylogenetic groups comprising two groups with higher rates of mutation (hypermutators) and one non-hypermutator group. Genomes were screened for acquired antibiotic resistance genes with the result suggesting that M3L7 resistance is principally driven by chromosomal mutations as no acquired mechanisms were detected. Small genetic variations, shared by all 11 isolates, were found in 49 genes associated with antibiotic resistance including frame-shift mutations (mexA, mexT), premature stop codons (oprD, mexB) and mutations in quinolone-resistance determining regions (gyrA, parE). However, whole genome sequencing also revealed mutations in 21 genes that were acquired following divergence of groups, which may also impact the activity of antibiotics and multi-drug efflux pumps. Comparison of mutations with minimum inhibitory concentrations of anti-pseudomonal antibiotics could not easily explain all resistance profiles observed. These data further demonstrate the complexity of chronic and antibiotic resistant P. aeruginosa infection where a multitude of co-existing genotypically diverse sub-lineages might co-exist during and after intravenous antibiotic treatment.

  16. Ground-based photometric support for the CoRoT mission by the CoRoT-Hungarian Asteroseismology Group

    NASA Astrophysics Data System (ADS)

    Bognár, Zs.; Paparó, M.

    2012-12-01

    The CoRoT-Hungarian Asteroseismology Group was established in 2005 and joined the preparatory work of the CoRoT Mission via an ESA PECS project. After the successful launch of the telescope, we have continued our work of ground-based multi-colour photometric observations and contributed to the analyses of CoRoT data. Our observations were focused on δ Scuti, γ Doradus, and RR Lyrae stars. The follow-up of some selected targets' pulsations in different wavelengths has provided valuable information for mode identification. We provided additional support by the confirmation of relatively faint variables' spectral types. We proved that our ground-based observations can help in the interpretation of a target with a contaminated CoRoT light curve. In this paper, we summarize our most important results of the photometric support for the CoRoT Mission. The CoRoT space mission was developed and is operated by the French space agency CNES, with participation of ESA's RSSD and Science Programmes, Austria, Belgium, Brazil, Germany, and Spain.

  17. Genetic and environmental factors affecting early rooting of six Populus genomic groups: implications for tree improvement

    Treesearch

    Ronald S., Jr. Zalesny

    2006-01-01

    Genetic and environmental factors affect the early rooting of Populus planted as unrooted hardwood cuttings. Populus genotypes of six genomic groups were tested in numerous studies for the quantitative genetics of rooting, along with effects of preplanting treatments and soil temperature. Genetics data (e.g. heritabilities,...

  18. Complete plastid genome sequences suggest strong selection for retention of photosynthetic genes in the parasitic plant genus Cuscuta.

    PubMed

    McNeal, Joel R; Kuehl, Jennifer V; Boore, Jeffrey L; de Pamphilis, Claude W

    2007-10-24

    Plastid genome content and protein sequence are highly conserved across land plants and their closest algal relatives. Parasitic plants, which obtain some or all of their nutrition through an attachment to a host plant, are often a striking exception. Heterotrophy can lead to relaxed constraint on some plastid genes or even total gene loss. We sequenced plastid genomes of two species in the parasitic genus Cuscuta along with a non-parasitic relative, Ipomoea purpurea, to investigate changes in the plastid genome that may result from transition to the parasitic lifestyle. Aside from loss of all ndh genes, Cuscuta exaltata retains photosynthetic and photorespiratory genes that evolve under strong selective constraint. Cuscuta obtusiflora has incurred substantially more change to its plastid genome, including loss of all genes for the plastid-encoded RNA polymerase. Despite extensive change in gene content and greatly increased rate of overall nucleotide substitution, C. obtusiflora also retains all photosynthetic and photorespiratory genes with only one minor exception. Although Epifagus virginiana, the only other parasitic plant with its plastid genome sequenced to date, has lost a largely overlapping set of transfer-RNA and ribosomal genes as Cuscuta, it has lost all genes related to photosynthesis and maintains a set of genes which are among the most divergent in Cuscuta. Analyses demonstrate photosynthetic genes are under the highest constraint of any genes within the plastid genomes of Cuscuta, indicating a function involving RuBisCo and electron transport through photosystems is still the primary reason for retention of the plastid genome in these species.

  19. Complete plastid genome sequences suggest strong selection for retention of photosynthetic genes in the parasitic plant genus Cuscuta

    PubMed Central

    McNeal, Joel R; Kuehl, Jennifer V; Boore, Jeffrey L; de Pamphilis, Claude W

    2007-01-01

    Background Plastid genome content and protein sequence are highly conserved across land plants and their closest algal relatives. Parasitic plants, which obtain some or all of their nutrition through an attachment to a host plant, are often a striking exception. Heterotrophy can lead to relaxed constraint on some plastid genes or even total gene loss. We sequenced plastid genomes of two species in the parasitic genus Cuscuta along with a non-parasitic relative, Ipomoea purpurea, to investigate changes in the plastid genome that may result from transition to the parasitic lifestyle. Results Aside from loss of all ndh genes, Cuscuta exaltata retains photosynthetic and photorespiratory genes that evolve under strong selective constraint. Cuscuta obtusiflora has incurred substantially more change to its plastid genome, including loss of all genes for the plastid-encoded RNA polymerase. Despite extensive change in gene content and greatly increased rate of overall nucleotide substitution, C. obtusiflora also retains all photosynthetic and photorespiratory genes with only one minor exception. Conclusion Although Epifagus virginiana, the only other parasitic plant with its plastid genome sequenced to date, has lost a largely overlapping set of transfer-RNA and ribosomal genes as Cuscuta, it has lost all genes related to photosynthesis and maintains a set of genes which are among the most divergent in Cuscuta. Analyses demonstrate photosynthetic genes are under the highest constraint of any genes within the plastid genomes of Cuscuta, indicating a function involving RuBisCo and electron transport through photosystems is still the primary reason for retention of the plastid genome in these species. PMID:17956636

  20. Co-evolution of Mycobacterium tuberculosis and Homo sapiens

    PubMed Central

    Brites, Daniela; Gagneux, Sebastien

    2015-01-01

    The causative agent of human tuberculosis (TB), Mycobacterium tuberculosis, is an obligate pathogen that evolved to exclusively persist in human populations. For M. tuberculosis to transmit from person to person, it has to cause pulmonary disease. Therefore, M. tuberculosis virulence has likely been a significant determinant of the association between M. tuberculosis and humans. Indeed, the evolutionary success of some M. tuberculosis genotypes seems at least partially attributable to their increased virulence. The latter possibly evolved as a consequence of human demographic expansions. If co-evolution occurred, humans would have counteracted to minimize the deleterious effects of M. tuberculosis virulence. The fact that human resistance to infection has a strong genetic basis is a likely consequence of such a counter-response. The genetic architecture underlying human resistance to M. tuberculosis remains largely elusive. However, interactions between human genetic polymorphisms and M. tuberculosis genotypes have been reported. Such interactions are consistent with local adaptation and allow for a better understanding of protective immunity in TB. Future ‘genome-to-genome’ studies, in which locally associated human and M. tuberculosis genotypes are interrogated in conjunction, will help identify new protective antigens for the development of better TB vaccines. PMID:25703549

  1. Genetic basis for rapidly evolved tolerance in the wild ...

    EPA Pesticide Factsheets

    Atlantic killifish (Fundulus heteroclitus) residing in some urban and industrialized estuaries of the US eastern seaboard demonstrate recently evolved and extreme tolerance to toxic aryl hydrocarbon pollutants, characterized as dioxin-like compounds (DLCs). Here we provide an unusually comprehensive accounting (69%) through Quantitative Trait Locus (QTL) analysis of the genetic basis for DLC tolerance in killifish inhabiting an urban estuary contaminated with PCB congeners, the most toxic of which are DLCs. Consistent with mechanistic knowledge of DLC toxicity in fish and other vertebrates, the Aryl Hydrocarbon Receptor (ahr2) region accounts for 17% of trait variation; however, QTLs on independent linkage groups and their interactions have even greater explanatory power (44%). QTLs interpreted within the context of recently available Fundulus genomic resources and shared synteny among fish species suggest adaptation via inter-acting components of a complex stress response network. Some QTLs were also enriched in other killifish populations characterized as DLC tolerant and residing in distant urban estuaries contaminated with unique mixtures of pollutants. Together, our results suggest that DLC tolerance in killifish represents an emerging example of parallel contemporary evolution that has been driven by intense human-mediated selection on natural populations. This manuscript describes experimental studies that contribute to our understanding of the ecological

  2. Comparative Omics-Driven Genome Annotation Refinement: Application across Yersiniae

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Rutledge, Alexandra C.; Jones, Marcus B.; Chauhan, Sadhana

    2012-03-27

    Genome sequencing continues to be a rapidly evolving technology, yet most downstream aspects of genome annotation pipelines remain relatively stable or are even being abandoned. To date, the perceived value of manual curation for genome annotations is not offset by the real cost and time associated with the process. In order to balance the large number of sequences generated, the annotation process is now performed almost exclusively in an automated fashion for most genome sequencing projects. One possible way to reduce errors inherent to automated computational annotations is to apply data from 'omics' measurements (i.e. transcriptional and proteomic) to themore » un-annotated genome with a proteogenomic-based approach. This approach does require additional experimental and bioinformatics methods to include omics technologies; however, the approach is readily automatable and can benefit from rapid developments occurring in those research domains as well. The annotation process can be improved by experimental validation of transcription and translation and aid in the discovery of annotation errors. Here the concept of annotation refinement has been extended to include a comparative assessment of genomes across closely related species, as is becoming common in sequencing efforts. Transcriptomic and proteomic data derived from three highly similar pathogenic Yersiniae (Y. pestis CO92, Y. pestis pestoides F, and Y. pseudotuberculosis PB1/+) was used to demonstrate a comprehensive comparative omic-based annotation methodology. Peptide and oligo measurements experimentally validated the expression of nearly 40% of each strain's predicted proteome and revealed the identification of 28 novel and 68 previously incorrect protein-coding sequences (e.g., observed frameshifts, extended start sites, and translated pseudogenes) within the three current Yersinia genome annotations. Gene loss is presumed to play a major role in Y. pestis acquiring its niche as a virulent

  3. Genome-wide evolutionary dynamics of influenza B viruses on a global scale

    PubMed Central

    Langat, Pinky; Bowden, Thomas A.; Edwards, Stephanie; Gall, Astrid; Rambaut, Andrew; Daniels, Rodney S.; Russell, Colin A.; Pybus, Oliver G.; McCauley, John

    2017-01-01

    The global-scale epidemiology and genome-wide evolutionary dynamics of influenza B remain poorly understood compared with influenza A viruses. We compiled a spatio-temporally comprehensive dataset of influenza B viruses, comprising over 2,500 genomes sampled worldwide between 1987 and 2015, including 382 newly-sequenced genomes that fill substantial gaps in previous molecular surveillance studies. Our contributed data increase the number of available influenza B virus genomes in Europe, Africa and Central Asia, improving the global context to study influenza B viruses. We reveal Yamagata-lineage diversity results from co-circulation of two antigenically-distinct groups that also segregate genetically across the entire genome, without evidence of intra-lineage reassortment. In contrast, Victoria-lineage diversity stems from geographic segregation of different genetic clades, with variability in the degree of geographic spread among clades. Differences between the lineages are reflected in their antigenic dynamics, as Yamagata-lineage viruses show alternating dominance between antigenic groups, while Victoria-lineage viruses show antigenic drift of a single lineage. Structural mapping of amino acid substitutions on trunk branches of influenza B gene phylogenies further supports these antigenic differences and highlights two potential mechanisms of adaptation for polymerase activity. Our study provides new insights into the epidemiological and molecular processes shaping influenza B virus evolution globally. PMID:29284042

  4. Genomic Aspects of Research Involving Polyploid Plants

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yang, Xiaohan; Ye, Chuyu; Tschaplinski, Timothy J

    2011-01-01

    Almost all extant plant species have spontaneously doubled their genomes at least once in their evolutionary histories, resulting in polyploidy which provided a rich genomic resource for evolutionary processes. Moreover, superior polyploid clones have been created during the process of crop domestication. Polyploid plants generated by evolutionary processes and/or crop domestication have been the intentional or serendipitous focus of research dealing with the dynamics and consequences of genome evolution. One of the new trends in genomics research is to create synthetic polyploid plants which provide materials for studying the initial genomic changes/responses immediately after polyploid formation. Polyploid plants are alsomore » used in functional genomics research to study gene expression in a complex genomic background. In this review, we summarize the recent progress in genomics research involving ancient, young, and synthetic polyploid plants, with a focus on genome size evolution, genomics diversity, genomic rearrangement, genetic and epigenetic changes in duplicated genes, gene discovery, and comparative genomics. Implications on plant sciences including evolution, functional genomics, and plant breeding are presented. It is anticipated that polyploids will be a regular subject of genomics research in the foreseeable future as the rapid advances in DNA sequencing technology create unprecedented opportunities for discovering and monitoring genomic and transcriptomic changes in polyploid plants. The fast accumulation of knowledge on polyploid formation, maintenance, and divergence at whole-genome and subgenome levels will not only help plant biologists understand how plants have evolved and diversified, but also assist plant breeders in designing new strategies for crop improvement.« less

  5. Evolving Ideas on the Origin and Evolution of Flowers: New Perspectives in the Genomic Era

    PubMed Central

    Chanderbali, Andre S.; Berger, Brent A.; Howarth, Dianella G.; Soltis, Pamela S.; Soltis, Douglas E.

    2016-01-01

    The origin of the flower was a key innovation in the history of complex organisms, dramatically altering Earth’s biota. Advances in phylogenetics, developmental genetics, and genomics during the past 25 years have substantially advanced our understanding of the evolution of flowers, yet crucial aspects of floral evolution remain, such as the series of genetic and morphological changes that gave rise to the first flowers; the factors enabling the origin of the pentamerous eudicot flower, which characterizes ∼70% of all extant angiosperm species; and the role of gene and genome duplications in facilitating floral innovations. A key early concept was the ABC model of floral organ specification, developed by Elliott Meyerowitz and Enrico Coen and based on two model systems, Arabidopsis thaliana and Antirrhinum majus. Yet it is now clear that these model systems are highly derived species, whose molecular genetic-developmental organization must be very different from that of ancestral, as well as early, angiosperms. In this article, we will discuss how new research approaches are illuminating the early events in floral evolution and the prospects for further progress. In particular, advancing the next generation of research in floral evolution will require the development of one or more functional model systems from among the basal angiosperms and basal eudicots. More broadly, we urge the development of “model clades” for genomic and evolutionary-developmental analyses, instead of the primary use of single “model organisms.” We predict that new evolutionary models will soon emerge as genetic/genomic models, providing unprecedented new insights into floral evolution. PMID:27053123

  6. Evolutionary genomics of LysM genes in land plants.

    PubMed

    Zhang, Xue-Cheng; Cannon, Steven B; Stacey, Gary

    2009-08-03

    The ubiquitous LysM motif recognizes peptidoglycan, chitooligosaccharides (chitin) and, presumably, other structurally-related oligosaccharides. LysM-containing proteins were first shown to be involved in bacterial cell wall degradation and, more recently, were implicated in perceiving chitin (one of the established pathogen-associated molecular patterns) and lipo-chitin (nodulation factors) in flowering plants. However, the majority of LysM genes in plants remain functionally uncharacterized and the evolutionary history of complex LysM genes remains elusive. We show that LysM-containing proteins display a wide range of complex domain architectures. However, only a simple core architecture is conserved across kingdoms. Each individual kingdom appears to have evolved a distinct array of domain architectures. We show that early plant lineages acquired four characteristic architectures and progressively lost several primitive architectures. We report plant LysM phylogenies and associated gene, protein and genomic features, and infer the relative timing of duplications of LYK genes. We report a domain architecture catalogue of LysM proteins across all kingdoms. The unique pattern of LysM protein domain architectures indicates the presence of distinctive evolutionary paths in individual kingdoms. We describe a comparative and evolutionary genomics study of LysM genes in plant kingdom. One of the two groups of tandemly arrayed plant LYK genes likely resulted from an ancient genome duplication followed by local genomic rearrangement, while the origin of the other groups of tandemly arrayed LYK genes remains obscure. Given the fact that no animal LysM motif-containing genes have been functionally characterized, this study provides clues to functional characterization of plant LysM genes and is also informative with regard to evolutionary and functional studies of animal LysM genes.

  7. Genome and evolution of the shade-requiring medicinal herb Panax ginseng.

    PubMed

    Kim, Nam-Hoon; Jayakodi, Murukarthick; Lee, Sang-Choon; Choi, Beom-Soon; Jang, Woojong; Lee, Junki; Kim, Hyun Hee; Waminal, Nomar E; Lakshmanan, Meiyappan; van Nguyen, Binh; Lee, Yun Sun; Park, Hyun-Seung; Koo, Hyun Jo; Park, Jee Young; Perumal, Sampath; Joh, Ho Jun; Lee, Hana; Kim, Jinkyung; Kim, In Seo; Kim, Kyunghee; Koduru, Lokanand; Kang, Kyo Bin; Sung, Sang Hyun; Yu, Yeisoo; Park, Daniel S; Choi, Doil; Seo, Eunyoung; Kim, Seungill; Kim, Young-Chang; Hyun, Dong Yun; Park, Youn-Il; Kim, Changsoo; Lee, Tae-Ho; Kim, Hyun Uk; Soh, Moon Soo; Lee, Yi; In, Jun Gyo; Kim, Heui-Soo; Kim, Yong-Min; Yang, Deok-Chun; Wing, Rod A; Lee, Dong-Yup; Paterson, Andrew H; Yang, Tae-Jin

    2018-03-31

    Panax ginseng C. A. Meyer, reputed as the king of medicinal herbs, has slow growth, long generation time, low seed production and complicated genome structure that hamper its study. Here, we unveil the genomic architecture of tetraploid P. ginseng by de novo genome assembly, representing 2.98 Gbp with 59 352 annotated genes. Resequencing data indicated that diploid Panax species diverged in association with global warming in Southern Asia, and two North American species evolved via two intercontinental migrations. Two whole genome duplications (WGD) occurred in the family Araliaceae (including Panax) after divergence with the Apiaceae, the more recent one contributing to the ability of P. ginseng to overwinter, enabling it to spread broadly through the Northern Hemisphere. Functional and evolutionary analyses suggest that production of pharmacologically important dammarane-type ginsenosides originated in Panax and are produced largely in shoot tissues and transported to roots; that newly evolved P. ginseng fatty acid desaturases increase freezing tolerance; and that unprecedented retention of chlorophyll a/b binding protein genes enables efficient photosynthesis under low light. A genome-scale metabolic network provides a holistic view of Panax ginsenoside biosynthesis. This study provides valuable resources for improving medicinal values of ginseng either through genomics-assisted breeding or metabolic engineering. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.

  8. Non-viral delivery of genome-editing nucleases for gene therapy.

    PubMed

    Wang, M; Glass, Z A; Xu, Q

    2017-03-01

    Manipulating the genetic makeup of mammalian cells using programmable nuclease-based genome-editing technology has recently evolved into a powerful avenue that holds great potential for treating genetic disorders. There are four types of genome-editing nucleases, including meganucleases, zinc finger nucleases, transcription activator-like effector nucleases and clustered, regularly interspaced, short palindromic repeat-associated nucleases such as Cas9. These nucleases have been harnessed to introduce precise and specific changes of the genome sequence at virtually any genome locus of interest. The therapeutic relevance of these genome-editing technologies, however, is challenged by the safe and efficient delivery of nuclease into targeted cells. Herein, we summarize recent advances that have been made on non-viral delivery of genome-editing nucleases. In particular, we focus on non-viral delivery of Cas9/sgRNA ribonucleoproteins for genome editing. In addition, the future direction for developing non-viral delivery of programmable nucleases for genome editing is discussed.

  9. Evolving together: the biology of symbiosis, part 2

    PubMed Central

    2000-01-01

    Symbiotic trade-offs dominate the world of biology and medicine in colonist-host relationships and between separate, mutually dependent organisms of different species. Infectious and parasitic diseases can be better understood by exploring the dynamic continuum between pathogenicity and mutualism, between antagonism and cooperation—the sliding scale along which microorganisms can move in a moment's notice with a single nucleotide substitution. Organisms practicing piracy or pastoralism may be close genetic relatives. Mergers occur not only between cells but also between genomes; viruses co-opt host genes and in turn insert themselves into host genomes. Separate organisms, from ants to fungi to plants, establish symbiotic ties with each other that bind over deep time, generating much of the diversity we see in nature. PMID:16389348

  10. Mitochondrial genome sequencing helps show the evolutionary mechanism of mitochondrial genome formation in Brassica

    PubMed Central

    2011-01-01

    Background Angiosperm mitochondrial genomes are more complex than those of other organisms. Analyses of the mitochondrial genome sequences of at least 11 angiosperm species have showed several common properties; these cannot easily explain, however, how the diverse mitotypes evolved within each genus or species. We analyzed the evolutionary relationships of Brassica mitotypes by sequencing. Results We sequenced the mitotypes of cam (Brassica rapa), ole (B. oleracea), jun (B. juncea), and car (B. carinata) and analyzed them together with two previously sequenced mitotypes of B. napus (pol and nap). The sizes of whole single circular genomes of cam, jun, ole, and car are 219,747 bp, 219,766 bp, 360,271 bp, and 232,241 bp, respectively. The mitochondrial genome of ole is largest as a resulting of the duplication of a 141.8 kb segment. The jun mitotype is the result of an inherited cam mitotype, and pol is also derived from the cam mitotype with evolutionary modifications. Genes with known functions are conserved in all mitotypes, but clear variation in open reading frames (ORFs) with unknown functions among the six mitotypes was observed. Sequence relationship analysis showed that there has been genome compaction and inheritance in the course of Brassica mitotype evolution. Conclusions We have sequenced four Brassica mitotypes, compared six Brassica mitotypes and suggested a mechanism for mitochondrial genome formation in Brassica, including evolutionary events such as inheritance, duplication, rearrangement, genome compaction, and mutation. PMID:21988783

  11. Genome structure and primitive sex chromosome revealed in Populus

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tuskan, Gerald A; Yin, Tongming; Gunter, Lee E

    We constructed a comprehensive genetic map for Populus and ordered 332 Mb of sequence scaffolds along the 19 haploid chromosomes in order to compare chromosomal regions among diverse members of the genus. These efforts lead us to conclude that chromosome XIX in Populus is evolving into a sex chromosome. Consistent segregation distortion in favor of the sub-genera Tacamahaca alleles provided evidence of divergent selection among species, particularly at the proximal end of chromosome XIX. A large microsatellite marker (SSR) cluster was detected in the distorted region even though the genome-wide distribute SSR sites was uniform across the physical map. Themore » differences between the genetic map and physical sequence data suggested recombination suppression was occurring in the distorted region. A gender-determination locus and an overabundance of NBS-LRR genes were also co-located to the distorted region and were put forth as the cause for divergent selection and recombination suppression. This hypothesis was verified by using fine-scale mapping of an integrated scaffold in the vicinity of the gender-determination locus. As such it appears that chromosome XIX in Populus is in the process of evolving from an autosome into a sex chromosome and that NBS-LRR genes may play important role in the chromosomal diversification process in Populus.« less

  12. The Brassica oleracea genome reveals the asymmetrical evolution of polyploid genomes

    PubMed Central

    Liu, Shengyi; Liu, Yumei; Yang, Xinhua; Tong, Chaobo; Edwards, David; Parkin, Isobel A. P.; Zhao, Meixia; Ma, Jianxin; Yu, Jingyin; Huang, Shunmou; Wang, Xiyin; Wang, Junyi; Lu, Kun; Fang, Zhiyuan; Bancroft, Ian; Yang, Tae-Jin; Hu, Qiong; Wang, Xinfa; Yue, Zhen; Li, Haojie; Yang, Linfeng; Wu, Jian; Zhou, Qing; Wang, Wanxin; King, Graham J; Pires, J. Chris; Lu, Changxin; Wu, Zhangyan; Sampath, Perumal; Wang, Zhuo; Guo, Hui; Pan, Shengkai; Yang, Limei; Min, Jiumeng; Zhang, Dong; Jin, Dianchuan; Li, Wanshun; Belcram, Harry; Tu, Jinxing; Guan, Mei; Qi, Cunkou; Du, Dezhi; Li, Jiana; Jiang, Liangcai; Batley, Jacqueline; Sharpe, Andrew G; Park, Beom-Seok; Ruperao, Pradeep; Cheng, Feng; Waminal, Nomar Espinosa; Huang, Yin; Dong, Caihua; Wang, Li; Li, Jingping; Hu, Zhiyong; Zhuang, Mu; Huang, Yi; Huang, Junyan; Shi, Jiaqin; Mei, Desheng; Liu, Jing; Lee, Tae-Ho; Wang, Jinpeng; Jin, Huizhe; Li, Zaiyun; Li, Xun; Zhang, Jiefu; Xiao, Lu; Zhou, Yongming; Liu, Zhongsong; Liu, Xuequn; Qin, Rui; Tang, Xu; Liu, Wenbin; Wang, Yupeng; Zhang, Yangyong; Lee, Jonghoon; Kim, Hyun Hee; Denoeud, France; Xu, Xun; Liang, Xinming; Hua, Wei; Wang, Xiaowu; Wang, Jun; Chalhoub, Boulos; Paterson, Andrew H

    2014-01-01

    Polyploidization has provided much genetic variation for plant adaptive evolution, but the mechanisms by which the molecular evolution of polyploid genomes establishes genetic architecture underlying species differentiation are unclear. Brassica is an ideal model to increase knowledge of polyploid evolution. Here we describe a draft genome sequence of Brassica oleracea, comparing it with that of its sister species B. rapa to reveal numerous chromosome rearrangements and asymmetrical gene loss in duplicated genomic blocks, asymmetrical amplification of transposable elements, differential gene co-retention for specific pathways and variation in gene expression, including alternative splicing, among a large number of paralogous and orthologous genes. Genes related to the production of anticancer phytochemicals and morphological variations illustrate consequences of genome duplication and gene divergence, imparting biochemical and morphological variation to B. oleracea. This study provides insights into Brassica genome evolution and will underpin research into the many important crops in this genus. PMID:24852848

  13. Conserved Non-Coding Regulatory Signatures in Arabidopsis Co-Expressed Gene Modules

    PubMed Central

    Spangler, Jacob B.; Ficklin, Stephen P.; Luo, Feng; Freeling, Michael; Feltus, F. Alex

    2012-01-01

    Complex traits and other polygenic processes require coordinated gene expression. Co-expression networks model mRNA co-expression: the product of gene regulatory networks. To identify regulatory mechanisms underlying coordinated gene expression in a tissue-enriched context, ten Arabidopsis thaliana co-expression networks were constructed after manually sorting 4,566 RNA profiling datasets into aerial, flower, leaf, root, rosette, seedling, seed, shoot, whole plant, and global (all samples combined) groups. Collectively, the ten networks contained 30% of the measurable genes of Arabidopsis and were circumscribed into 5,491 modules. Modules were scrutinized for cis regulatory mechanisms putatively encoded in conserved non-coding sequences (CNSs) previously identified as remnants of a whole genome duplication event. We determined the non-random association of 1,361 unique CNSs to 1,904 co-expression network gene modules. Furthermore, the CNS elements were placed in the context of known gene regulatory networks (GRNs) by connecting 250 CNS motifs with known GRN cis elements. Our results provide support for a regulatory role of some CNS elements and suggest the functional consequences of CNS activation of co-expression in specific gene sets dispersed throughout the genome. PMID:23024789

  14. Conserved non-coding regulatory signatures in Arabidopsis co-expressed gene modules.

    PubMed

    Spangler, Jacob B; Ficklin, Stephen P; Luo, Feng; Freeling, Michael; Feltus, F Alex

    2012-01-01

    Complex traits and other polygenic processes require coordinated gene expression. Co-expression networks model mRNA co-expression: the product of gene regulatory networks. To identify regulatory mechanisms underlying coordinated gene expression in a tissue-enriched context, ten Arabidopsis thaliana co-expression networks were constructed after manually sorting 4,566 RNA profiling datasets into aerial, flower, leaf, root, rosette, seedling, seed, shoot, whole plant, and global (all samples combined) groups. Collectively, the ten networks contained 30% of the measurable genes of Arabidopsis and were circumscribed into 5,491 modules. Modules were scrutinized for cis regulatory mechanisms putatively encoded in conserved non-coding sequences (CNSs) previously identified as remnants of a whole genome duplication event. We determined the non-random association of 1,361 unique CNSs to 1,904 co-expression network gene modules. Furthermore, the CNS elements were placed in the context of known gene regulatory networks (GRNs) by connecting 250 CNS motifs with known GRN cis elements. Our results provide support for a regulatory role of some CNS elements and suggest the functional consequences of CNS activation of co-expression in specific gene sets dispersed throughout the genome.

  15. Evolving fuzzy rules in a learning classifier system

    NASA Technical Reports Server (NTRS)

    Valenzuela-Rendon, Manuel

    1993-01-01

    The fuzzy classifier system (FCS) combines the ideas of fuzzy logic controllers (FLC's) and learning classifier systems (LCS's). It brings together the expressive powers of fuzzy logic as it has been applied in fuzzy controllers to express relations between continuous variables, and the ability of LCS's to evolve co-adapted sets of rules. The goal of the FCS is to develop a rule-based system capable of learning in a reinforcement regime, and that can potentially be used for process control.

  16. Population genomics reveals the origin and asexual evolution of human infective trypanosomes

    PubMed Central

    Weir, William; Capewell, Paul; Foth, Bernardo; Clucas, Caroline; Pountain, Andrew; Steketee, Pieter; Veitch, Nicola; Koffi, Mathurin; De Meeûs, Thierry; Kaboré, Jacques; Camara, Mamadou; Cooper, Anneli; Tait, Andy; Jamonneau, Vincent; Bucheton, Bruno; Berriman, Matt; MacLeod, Annette

    2016-01-01

    Evolutionary theory predicts that the lack of recombination and chromosomal re-assortment in strictly asexual organisms results in homologous chromosomes irreversibly accumulating mutations and thus evolving independently of each other, a phenomenon termed the Meselson effect. We apply a population genomics approach to examine this effect in an important human pathogen, Trypanosoma brucei gambiense. We determine that T.b. gambiense is evolving strictly asexually and is derived from a single progenitor, which emerged within the last 10,000 years. We demonstrate the Meselson effect for the first time at the genome-wide level in any organism and show large regions of loss of heterozygosity, which we hypothesise to be a short-term compensatory mechanism for counteracting deleterious mutations. Our study sheds new light on the genomic and evolutionary consequences of strict asexuality, which this pathogen uses as it exploits a new biological niche, the human population. DOI: http://dx.doi.org/10.7554/eLife.11473.001 PMID:26809473

  17. Genome fluctuations in cyanobacteria reflect evolutionary, developmental and adaptive traits.

    PubMed

    Larsson, John; Nylander, Johan Aa; Bergman, Birgitta

    2011-06-30

    Cyanobacteria belong to an ancient group of photosynthetic prokaryotes with pronounced variations in their cellular differentiation strategies, physiological capacities and choice of habitat. Sequencing efforts have shown that genomes within this phylum are equally diverse in terms of size and protein-coding capacity. To increase our understanding of genomic changes in the lineage, the genomes of 58 contemporary cyanobacteria were analysed for shared and unique orthologs. A total of 404 protein families, present in all cyanobacterial genomes, were identified. Two of these are unique to the phylum, corresponding to an AbrB family transcriptional regulator and a gene that escapes functional annotation although its genomic neighbourhood is conserved among the organisms examined. The evolution of cyanobacterial genome sizes involves a mix of gains and losses in the clade encompassing complex cyanobacteria, while a single event of reduction is evident in a clade dominated by unicellular cyanobacteria. Genome sizes and gene family copy numbers evolve at a higher rate in the former clade, and multi-copy genes were predominant in large genomes. Orthologs unique to cyanobacteria exhibiting specific characteristics, such as filament formation, heterocyst differentiation, diazotrophy and symbiotic competence, were also identified. An ancestral character reconstruction suggests that the most recent common ancestor of cyanobacteria had a genome size of approx. 4.5 Mbp and 1678 to 3291 protein-coding genes, 4%-6% of which are unique to cyanobacteria today. The different rates of genome-size evolution and multi-copy gene abundance suggest two routes of genome development in the history of cyanobacteria. The expansion strategy is driven by gene-family enlargment and generates a broad adaptive potential; while the genome streamlining strategy imposes adaptations to highly specific niches, also reflected in their different functional capacities. A few genomes display extreme

  18. Consumer co-evolution as an important component of the eco-evolutionary feedback.

    PubMed

    Hiltunen, Teppo; Becks, Lutz

    2014-10-22

    Rapid evolution in ecologically relevant traits has recently been recognized to significantly alter the interaction between consumers and their resources, a key interaction in all ecological communities. While these eco-evolutionary dynamics have been shown to occur when prey populations are evolving, little is known about the role of predator evolution and co-evolution between predator and prey in this context. Here, we investigate the role of consumer co-evolution for eco-evolutionary feedback in bacteria-ciliate microcosm experiments by manipulating the initial trait variation in the predator populations. With co-evolved predators, prey evolve anti-predatory defences faster, trait values are more variable, and predator and prey population sizes are larger at the end of the experiment compared with the non-co-evolved predators. Most importantly, differences in predator traits results in a shift from evolution driving ecology, to ecology driving evolution. Thus we demonstrate that predator co-evolution has important effects on eco-evolutionary dynamics.

  19. Advances in computer simulation of genome evolution: toward more realistic evolutionary genomics analysis by approximate bayesian computation.

    PubMed

    Arenas, Miguel

    2015-04-01

    NGS technologies present a fast and cheap generation of genomic data. Nevertheless, ancestral genome inference is not so straightforward due to complex evolutionary processes acting on this material such as inversions, translocations, and other genome rearrangements that, in addition to their implicit complexity, can co-occur and confound ancestral inferences. Recently, models of genome evolution that accommodate such complex genomic events are emerging. This letter explores these novel evolutionary models and proposes their incorporation into robust statistical approaches based on computer simulations, such as approximate Bayesian computation, that may produce a more realistic evolutionary analysis of genomic data. Advantages and pitfalls in using these analytical methods are discussed. Potential applications of these ancestral genomic inferences are also pointed out.

  20. Comparative genome analysis of Pseudomonas genomes including Populus-associated isolates

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jun, Se Ran; Wassenaar, Trudy; Nookaew, Intawat

    The Pseudomonas genus contains a metabolically versatile group of organisms that are known to occupy numerous ecological niches including the rhizosphere and endosphere of many plants influencing phylogenetic diversity and heterogeneity. In this study, comparative genome analysis was performed on over one thousand Pseudomonas genomes, including 21 Pseudomonas strains isolated from the roots of native Populus deltoides. Based on average amino acid identity, genomic clusters were identified within the Pseudomonas genus, which showed agreements with clades by NCBI and cliques by IMG. The P. fluorescens group was organized into 20 distinct genomic clusters, representing enormous diversity and heterogeneity. The speciesmore » P. aeruginosa showed clear distinction in their genomic relatedness compared to other Pseudomonas species groups based on the pan and core genome analysis. The 19 isolates of our 21 Populus-associated isolates formed three distinct subgroups within the P. fluorescens major group, supported by pathway profiles analysis, while two isolates were more closely related to P. chlororaphis and P. putida. The specific genes to Populus-associated subgroups were identified where genes specific to subgroup 1 include several sensory systems such as proteins which act in two-component signal transduction, a TonB-dependent receptor, and a phosphorelay sensor; specific genes to subgroup 2 contain unique hypothetical genes; and genes specific to subgroup 3 organisms have a different hydrolase activity. IMPORTANCE The comparative genome analyses of the genus Pseudomonas that included Populus-associated isolates resulted in novel insights into high diversity of Pseudomonas. Consistent and robust genomic clusters with phylogenetic homogeneity were identified, which resolved species-clades that are not clearly defined by 16S rRNA gene sequence analysis alone. The genomic clusters may be reflective of distinct ecological niches to which the organisms have adapted, but

  1. Comparative genome analysis of Pseudomonas genomes including Populus-associated isolates

    DOE PAGES

    Jun, Se Ran; Wassenaar, Trudy; Nookaew, Intawat; ...

    2016-01-01

    The Pseudomonas genus contains a metabolically versatile group of organisms that are known to occupy numerous ecological niches including the rhizosphere and endosphere of many plants influencing phylogenetic diversity and heterogeneity. In this study, comparative genome analysis was performed on over one thousand Pseudomonas genomes, including 21 Pseudomonas strains isolated from the roots of native Populus deltoides. Based on average amino acid identity, genomic clusters were identified within the Pseudomonas genus, which showed agreements with clades by NCBI and cliques by IMG. The P. fluorescens group was organized into 20 distinct genomic clusters, representing enormous diversity and heterogeneity. The speciesmore » P. aeruginosa showed clear distinction in their genomic relatedness compared to other Pseudomonas species groups based on the pan and core genome analysis. The 19 isolates of our 21 Populus-associated isolates formed three distinct subgroups within the P. fluorescens major group, supported by pathway profiles analysis, while two isolates were more closely related to P. chlororaphis and P. putida. The specific genes to Populus-associated subgroups were identified where genes specific to subgroup 1 include several sensory systems such as proteins which act in two-component signal transduction, a TonB-dependent receptor, and a phosphorelay sensor; specific genes to subgroup 2 contain unique hypothetical genes; and genes specific to subgroup 3 organisms have a different hydrolase activity. IMPORTANCE The comparative genome analyses of the genus Pseudomonas that included Populus-associated isolates resulted in novel insights into high diversity of Pseudomonas. Consistent and robust genomic clusters with phylogenetic homogeneity were identified, which resolved species-clades that are not clearly defined by 16S rRNA gene sequence analysis alone. The genomic clusters may be reflective of distinct ecological niches to which the organisms have adapted, but

  2. Evolving Strategies for Cancer and Autoimmunity: Back to the Future

    PubMed Central

    Lane, Peter J. L.; McConnell, Fiona M.; Anderson, Graham; Nawaf, Maher G.; Gaspal, Fabrina M.; Withers, David R.

    2014-01-01

    Although current thinking has focused on genetic variation between individuals and environmental influences as underpinning susceptibility to both autoimmunity and cancer, an alternative view is that human susceptibility to these diseases is a consequence of the way the immune system evolved. It is important to remember that the immunological genes that we inherit and the systems that they control were shaped by the drive for reproductive success rather than for individual survival. It is our view that human susceptibility to autoimmunity and cancer is the evolutionarily acceptable side effect of the immune adaptations that evolved in early placental mammals to accommodate a fundamental change in reproductive strategy. Studies of immune function in mammals show that high affinity antibodies and CD4 memory, along with its regulation, co-evolved with placentation. By dissection of the immunologically active genes and proteins that evolved to regulate this step change in the mammalian immune system, clues have emerged that may reveal ways of de-tuning both effector and regulatory arms of the immune system to abrogate autoimmune responses whilst preserving protection against infection. Paradoxically, it appears that such a detuned and deregulated immune system is much better equipped to mount anti-tumor immune responses against cancers. PMID:24782861

  3. Admixture patterns and genetic differentiation in negrito groups from West Malaysia estimated from genome-wide SNP data.

    PubMed

    Jinam, Timothy A; Phipps, Maude E; Saitou, Naruya

    2013-01-01

    Southeast Asia houses various culturally and linguistically diverse ethnic groups. In Malaysia, where the Malay, Chinese, and Indian ethnic groups form the majority, there exist minority groups such as the "negritos" who are believed to be descendants of the earliest settlers of Southeast Asia. Here we report patterns of genetic substructure and admixture in two Malaysian negrito populations (Jehai and Kensiu), using ~50,000 genome-wide single-nucleotide polymorphism (SNP) data. We found traces of recent admixture in both the negrito populations, particularly in the Jehai, with the Malay through principal component analysis and STRUCTURE analysis software, which suggested that the admixture was as recent as one generation ago. We also identified significantly differentiated nonsynonymous SNPs and haplotype blocks related to intracellular transport, metabolic processes, and detection of stimulus. These results highlight the different levels of admixture experienced by the two Malaysian negritos. Delineating admixture and differentiated genomic regions should be of importance in designing and interpretation of molecular anthropology and disease association studies. Copyright © 2013 Wayne State University Press, Detroit, Michigan 48201-1309.

  4. IMG 4 version of the integrated microbial genomes comparative analysis system

    PubMed Central

    Markowitz, Victor M.; Chen, I-Min A.; Palaniappan, Krishna; Chu, Ken; Szeto, Ernest; Pillay, Manoj; Ratner, Anna; Huang, Jinghua; Woyke, Tanja; Huntemann, Marcel; Anderson, Iain; Billis, Konstantinos; Varghese, Neha; Mavromatis, Konstantinos; Pati, Amrita; Ivanova, Natalia N.; Kyrpides, Nikos C.

    2014-01-01

    The Integrated Microbial Genomes (IMG) data warehouse integrates genomes from all three domains of life, as well as plasmids, viruses and genome fragments. IMG provides tools for analyzing and reviewing the structural and functional annotations of genomes in a comparative context. IMG’s data content and analytical capabilities have increased continuously since its first version released in 2005. Since the last report published in the 2012 NAR Database Issue, IMG’s annotation and data integration pipelines have evolved while new tools have been added for recording and analyzing single cell genomes, RNA Seq and biosynthetic cluster data. Different IMG datamarts provide support for the analysis of publicly available genomes (IMG/W: http://img.jgi.doe.gov/w), expert review of genome annotations (IMG/ER: http://img.jgi.doe.gov/er) and teaching and training in the area of microbial genome analysis (IMG/EDU: http://img.jgi.doe.gov/edu). PMID:24165883

  5. IMG 4 version of the integrated microbial genomes comparative analysis system.

    PubMed

    Markowitz, Victor M; Chen, I-Min A; Palaniappan, Krishna; Chu, Ken; Szeto, Ernest; Pillay, Manoj; Ratner, Anna; Huang, Jinghua; Woyke, Tanja; Huntemann, Marcel; Anderson, Iain; Billis, Konstantinos; Varghese, Neha; Mavromatis, Konstantinos; Pati, Amrita; Ivanova, Natalia N; Kyrpides, Nikos C

    2014-01-01

    The Integrated Microbial Genomes (IMG) data warehouse integrates genomes from all three domains of life, as well as plasmids, viruses and genome fragments. IMG provides tools for analyzing and reviewing the structural and functional annotations of genomes in a comparative context. IMG's data content and analytical capabilities have increased continuously since its first version released in 2005. Since the last report published in the 2012 NAR Database Issue, IMG's annotation and data integration pipelines have evolved while new tools have been added for recording and analyzing single cell genomes, RNA Seq and biosynthetic cluster data. Different IMG datamarts provide support for the analysis of publicly available genomes (IMG/W: http://img.jgi.doe.gov/w), expert review of genome annotations (IMG/ER: http://img.jgi.doe.gov/er) and teaching and training in the area of microbial genome analysis (IMG/EDU: http://img.jgi.doe.gov/edu).

  6. Friends Drinking Together: Young Adults' Evolving Support Practices

    ERIC Educational Resources Information Center

    Dresler, Emma; Anderson, Margaret

    2018-01-01

    Purpose: Young adult's drinking is about pleasure, a communal practice of socialising together in a friendship group. The purpose of this paper is to investigate the evolving support practices of drinking groups for better targeting of health communications messages. Design/methodology/approach: This qualitative descriptive study examined the…

  7. Population Genomics of Daphnia pulex

    PubMed Central

    Lynch, Michael; Gutenkunst, Ryan; Ackerman, Matthew; Spitze, Ken; Ye, Zhiqiang; Maruki, Takahiro; Jia, Zhiyuan

    2017-01-01

    Using data from 83 isolates from a single population, the population genomics of the microcrustacean Daphnia pulex are described and compared to current knowledge for the only other well-studied invertebrate, Drosophila melanogaster. These two species are quite similar with respect to effective population sizes and mutation rates, although some features of recombination appear to be different, with linkage disequilibrium being elevated at short (<100 bp) distances in D. melanogaster and at long distances in D. pulex. The study population adheres closely to the expectations under Hardy–Weinberg equilibrium, and reflects a past population history of no more than a twofold range of variation in effective population size. Fourfold redundant silent sites and a restricted region of intronic sites appear to evolve in a nearly neutral fashion, providing a powerful tool for population genetic analyses. Amino acid replacement sites are predominantly under strong purifying selection, as are a large fraction of sites in UTRs and intergenic regions, but the majority of SNPs at such sites that rise to frequencies >0.05 appear to evolve in a nearly neutral fashion. All forms of genomic sites (including replacement sites within codons, and intergenic and UTR regions) appear to be experiencing an ∼2× higher level of selection scaled to the power of drift in D. melanogaster, but this may in part be a consequence of recent demographic changes. These results establish D. pulex as an excellent system for future work on the evolutionary genomics of natural populations. PMID:27932545

  8. Identification of cyanobacterial non-coding RNAs by comparative genome analysis.

    PubMed

    Axmann, Ilka M; Kensche, Philip; Vogel, Jörg; Kohl, Stefan; Herzel, Hanspeter; Hess, Wolfgang R

    2005-01-01

    Whole genome sequencing of marine cyanobacteria has revealed an unprecedented degree of genomic variation and streamlining. With a size of 1.66 megabase-pairs, Prochlorococcus sp. MED4 has the most compact of these genomes and it is enigmatic how the few identified regulatory proteins efficiently sustain the lifestyle of an ecologically successful marine microorganism. Small non-coding RNAs (ncRNAs) control a plethora of processes in eukaryotes as well as in bacteria; however, systematic searches for ncRNAs are still lacking for most eubacterial phyla outside the enterobacteria. Based on a computational prediction we show the presence of several ncRNAs (cyanobacterial functional RNA or Yfr) in several different cyanobacteria of the Prochlorococcus-Synechococcus lineage. Some ncRNA genes are present only in two or three of the four strains investigated, whereas the RNAs Yfr2 through Yfr5 are structurally highly related and are encoded by a rapidly evolving gene family as their genes exist in different copy numbers and at different sites in the four investigated genomes. One ncRNA, Yfr7, is present in at least seven other cyanobacteria. In addition, control elements for several ribosomal operons were predicted as well as riboswitches for thiamine pyrophosphate and cobalamin. This is the first genome-wide and systematic screen for ncRNAs in cyanobacteria. Several ncRNAs were both computationally predicted and their presence was biochemically verified. These RNAs may have regulatory functions and each shows a distinct phylogenetic distribution. Our approach can be applied to any group of microorganisms for which more than one total genome sequence is available for comparative analysis.

  9. Comparative genomics and evolution of the amylase-binding proteins of oral streptococci.

    PubMed

    Haase, Elaine M; Kou, Yurong; Sabharwal, Amarpreet; Liao, Yu-Chieh; Lan, Tianying; Lindqvist, Charlotte; Scannapieco, Frank A

    2017-04-20

    Successful commensal bacteria have evolved to maintain colonization in challenging environments. The oral viridans streptococci are pioneer colonizers of dental plaque biofilm. Some of these bacteria have adapted to life in the oral cavity by binding salivary α-amylase, which hydrolyzes dietary starch, thus providing a source of nutrition. Oral streptococcal species bind α-amylase by expressing a variety of amylase-binding proteins (ABPs). Here we determine the genotypic basis of amylase binding where proteins of diverse size and function share a common phenotype. ABPs were detected in culture supernatants of 27 of 59 strains representing 13 oral Streptococcus species screened using the amylase-ligand binding assay. N-terminal sequences from ABPs of diverse size were obtained from 18 strains representing six oral streptococcal species. Genome sequencing and BLAST searches using N-terminal sequences, protein size, and key words identified the gene associated with each ABP. Among the sequenced ABPs, 14 matched amylase-binding protein A (AbpA), 6 matched amylase-binding protein B (AbpB), and 11 unique ABPs were identified as peptidoglycan-binding, glutamine ABC-type transporter, hypothetical, or choline-binding proteins. Alignment and phylogenetic analyses performed to ascertain evolutionary relationships revealed that ABPs cluster into at least six distinct, unrelated families (AbpA, AbpB, and four novel ABPs) with no phylogenetic evidence that one group evolved from another, and no single ancestral gene found within each group. AbpA-like sequences can be divided into five subgroups based on the N-terminal sequences. Comparative genomics focusing on the abpA gene locus provides evidence of horizontal gene transfer. The acquisition of an ABP by oral streptococci provides an interesting example of adaptive evolution.

  10. Genomic selection accuracies within and between environments and small breeding groups in white spruce.

    PubMed

    Beaulieu, Jean; Doerksen, Trevor K; MacKay, John; Rainville, André; Bousquet, Jean

    2014-12-02

    Genomic selection (GS) may improve selection response over conventional pedigree-based selection if markers capture more detailed information than pedigrees in recently domesticated tree species and/or make it more cost effective. Genomic prediction accuracies using 1748 trees and 6932 SNPs representative of as many distinct gene loci were determined for growth and wood traits in white spruce, within and between environments and breeding groups (BG), each with an effective size of Ne ≈ 20. Marker subsets were also tested. Model fits and/or cross-validation (CV) prediction accuracies for ridge regression (RR) and the least absolute shrinkage and selection operator models approached those of pedigree-based models. With strong relatedness between CV sets, prediction accuracies for RR within environment and BG were high for wood (r = 0.71-0.79) and moderately high for growth (r = 0.52-0.69) traits, in line with trends in heritabilities. For both classes of traits, these accuracies achieved between 83% and 92% of those obtained with phenotypes and pedigree information. Prediction into untested environments remained moderately high for wood (r ≥ 0.61) but dropped significantly for growth (r ≥ 0.24) traits, emphasizing the need to phenotype in all test environments and model genotype-by-environment interactions for growth traits. Removing relatedness between CV sets sharply decreased prediction accuracies for all traits and subpopulations, falling near zero between BGs with no known shared ancestry. For marker subsets, similar patterns were observed but with lower prediction accuracies. Given the need for high relatedness between CV sets to obtain good prediction accuracies, we recommend to build GS models for prediction within the same breeding population only. Breeding groups could be merged to build genomic prediction models as long as the total effective population size does not exceed 50 individuals in order to obtain high prediction accuracy such as that

  11. Analysis tools for the interplay between genome layout and regulation.

    PubMed

    Bouyioukos, Costas; Elati, Mohamed; Képès, François

    2016-06-06

    Genome layout and gene regulation appear to be interdependent. Understanding this interdependence is key to exploring the dynamic nature of chromosome conformation and to engineering functional genomes. Evidence for non-random genome layout, defined as the relative positioning of either co-functional or co-regulated genes, stems from two main approaches. Firstly, the analysis of contiguous genome segments across species, has highlighted the conservation of gene arrangement (synteny) along chromosomal regions. Secondly, the study of long-range interactions along a chromosome has emphasised regularities in the positioning of microbial genes that are co-regulated, co-expressed or evolutionarily correlated. While one-dimensional pattern analysis is a mature field, it is often powerless on biological datasets which tend to be incomplete, and partly incorrect. Moreover, there is a lack of comprehensive, user-friendly tools to systematically analyse, visualise, integrate and exploit regularities along genomes. Here we present the Genome REgulatory and Architecture Tools SCAN (GREAT:SCAN) software for the systematic study of the interplay between genome layout and gene expression regulation. SCAN is a collection of related and interconnected applications currently able to perform systematic analyses of genome regularities as well as to improve transcription factor binding sites (TFBS) and gene regulatory network predictions based on gene positional information. We demonstrate the capabilities of these tools by studying on one hand the regular patterns of genome layout in the major regulons of the bacterium Escherichia coli. On the other hand, we demonstrate the capabilities to improve TFBS prediction in microbes. Finally, we highlight, by visualisation of multivariate techniques, the interplay between position and sequence information for effective transcription regulation.

  12. Peer-led and professional-led group interventions for people with co-occurring disorders: a qualitative study.

    PubMed

    Pallaveshi, Luljeta; Balachandra, Krishna; Subramanian, Priya; Rudnick, Abraham

    2014-05-01

    This pilot study evaluated the experience of people with co-occurring disorders (mental illness and addiction) in relation to peer-led and professional-led group interventions. The study used a qualitative (phenomenological) approach to evaluate the experience of a convenience sample of 6 individuals with co-occurring disorders who participated in up to 8 sessions each of both peer-led and professional-led group interventions (with a similar rate of attendance in both groups). The semi-structured interview data were coded and thematically analyzed. We found 5 themes within and across the 2 interventions. In both groups, participants experienced a positive environment and personal growth, and learned, albeit different things. They were more comfortable in the peer-led group and acquired more knowledge and skills in the professional-led group. Offering both peer-led and professional-led group interventions to people with co-occurring disorders may be better than offering either alone.

  13. Whole-genome analyses resolve early branches in the tree of life of modern birds.

    PubMed

    Jarvis, Erich D; Mirarab, Siavash; Aberer, Andre J; Li, Bo; Houde, Peter; Li, Cai; Ho, Simon Y W; Faircloth, Brant C; Nabholz, Benoit; Howard, Jason T; Suh, Alexander; Weber, Claudia C; da Fonseca, Rute R; Li, Jianwen; Zhang, Fang; Li, Hui; Zhou, Long; Narula, Nitish; Liu, Liang; Ganapathy, Ganesh; Boussau, Bastien; Bayzid, Md Shamsuzzoha; Zavidovych, Volodymyr; Subramanian, Sankar; Gabaldón, Toni; Capella-Gutiérrez, Salvador; Huerta-Cepas, Jaime; Rekepalli, Bhanu; Munch, Kasper; Schierup, Mikkel; Lindow, Bent; Warren, Wesley C; Ray, David; Green, Richard E; Bruford, Michael W; Zhan, Xiangjiang; Dixon, Andrew; Li, Shengbin; Li, Ning; Huang, Yinhua; Derryberry, Elizabeth P; Bertelsen, Mads Frost; Sheldon, Frederick H; Brumfield, Robb T; Mello, Claudio V; Lovell, Peter V; Wirthlin, Morgan; Schneider, Maria Paula Cruz; Prosdocimi, Francisco; Samaniego, José Alfredo; Vargas Velazquez, Amhed Missael; Alfaro-Núñez, Alonzo; Campos, Paula F; Petersen, Bent; Sicheritz-Ponten, Thomas; Pas, An; Bailey, Tom; Scofield, Paul; Bunce, Michael; Lambert, David M; Zhou, Qi; Perelman, Polina; Driskell, Amy C; Shapiro, Beth; Xiong, Zijun; Zeng, Yongli; Liu, Shiping; Li, Zhenyu; Liu, Binghang; Wu, Kui; Xiao, Jin; Yinqi, Xiong; Zheng, Qiuemei; Zhang, Yong; Yang, Huanming; Wang, Jian; Smeds, Linnea; Rheindt, Frank E; Braun, Michael; Fjeldsa, Jon; Orlando, Ludovic; Barker, F Keith; Jønsson, Knud Andreas; Johnson, Warren; Koepfli, Klaus-Peter; O'Brien, Stephen; Haussler, David; Ryder, Oliver A; Rahbek, Carsten; Willerslev, Eske; Graves, Gary R; Glenn, Travis C; McCormack, John; Burt, Dave; Ellegren, Hans; Alström, Per; Edwards, Scott V; Stamatakis, Alexandros; Mindell, David P; Cracraft, Joel; Braun, Edward L; Warnow, Tandy; Jun, Wang; Gilbert, M Thomas P; Zhang, Guojie

    2014-12-12

    To better determine the history of modern birds, we performed a genome-scale phylogenetic analysis of 48 species representing all orders of Neoaves using phylogenomic methods created to handle genome-scale data. We recovered a highly resolved tree that confirms previously controversial sister or close relationships. We identified the first divergence in Neoaves, two groups we named Passerea and Columbea, representing independent lineages of diverse and convergently evolved land and water bird species. Among Passerea, we infer the common ancestor of core landbirds to have been an apex predator and confirm independent gains of vocal learning. Among Columbea, we identify pigeons and flamingoes as belonging to sister clades. Even with whole genomes, some of the earliest branches in Neoaves proved challenging to resolve, which was best explained by massive protein-coding sequence convergence and high levels of incomplete lineage sorting that occurred during a rapid radiation after the Cretaceous-Paleogene mass extinction event about 66 million years ago. Copyright © 2014, American Association for the Advancement of Science.

  14. Cell size, genome size and the dominance of Angiosperms

    NASA Astrophysics Data System (ADS)

    Simonin, K. A.; Roddy, A. B.

    2016-12-01

    Angiosperms are capable of maintaining the highest rates of photosynthetic gas exchange of all land plants. High rates of photosynthesis depends mechanistically both on efficiently transporting water to the sites of evaporation in the leaf and on regulating the loss of that water to the atmosphere as CO2 diffuses into the leaf. Angiosperm leaves are unique in their ability to sustain high fluxes of liquid and vapor phase water transport due to high vein densities and numerous, small stomata. Despite the ubiquity of studies characterizing the anatomical and physiological adaptations that enable angiosperms to maintain high rates of photosynthesis, the underlying mechanism explaining why they have been able to develop such high leaf vein densities, and such small and abundant stomata, is still incomplete. Here we ask whether the scaling of genome size and cell size places a fundamental constraint on the photosynthetic metabolism of land plants, and whether genome downsizing among the angiosperms directly contributed to their greater potential and realized primary productivity relative to the other major groups of terrestrial plants. Using previously published data we show that a single relationship can predict guard cell size from genome size across the major groups of terrestrial land plants (e.g. angiosperms, conifers, cycads and ferns). Similarly, a strong positive correlation exists between genome size and both stomatal density and vein density that together ultimately constrains maximum potential (gs, max) and operational stomatal conductance (gs, op). Further the difference in the slopes describing the covariation between genome size and both gs, max and gs, op suggests that genome downsizing brings gs, op closer to gs, max. Taken together the data presented here suggests that the smaller genomes of angiosperms allow their final cell sizes to vary more widely and respond more directly to environmental conditions and in doing so bring operational photosynthetic

  15. Two-component signal transduction systems of Xanthomonas spp.: a lesson from genomics.

    PubMed

    Qian, Wei; Han, Zhong-Ji; He, Chaozu

    2008-02-01

    The two-component signal transduction systems (TCSTSs), consisting of a histidine kinase sensor (HK) and a response regulator (RR), are the dominant molecular mechanisms by which prokaryotes sense and respond to environmental stimuli. Genomes of Xanthomonas generally contain a large repertoire of TCSTS genes (approximately 92 to 121 for each genome), which encode diverse structural groups of HKs and RRs. Among them, although a core set of 70 TCSTS genes (about two-thirds in total) which accumulates point mutations with a slow rate are shared by these genomes, the other genes, especially hybrid HKs, experienced extensive genetic recombination, including genomic rearrangement, gene duplication, addition or deletion, and fusion or fission. The recombinations potentially promote the efficiency and complexity of TCSTSs in regulating gene expression. In addition, our analysis suggests that a co-evolutionary model, rather than a selfish operon model, is the major mechanism for the maintenance and microevolution of TCSTS genes in the genomes of Xanthomonas. Genomic annotation, secondary protein structure prediction, and comparative genomic analyses of TCSTS genes reviewed here provide insights into our understanding of signal networks in these important phytopathogenic bacteria.

  16. Evolving Communicative Complexity: Insight from Rodents and Beyond

    DTIC Science & Technology

    2012-01-01

    Group size in animal societies: the potential role of social and ecological limitations in the group-living fish , Paragobiodon xanthosomus. Ethology... Ecology and Evolutionary Biology, University of California, Los Angeles, CA 90095, USA 2Human Research and Engineering Directorate, Perceptual Sciences...evolve is an active question in behavioural ecology . Sciurid rodents (ground squirrels, prairie dogs and marmots) provide an excellent model system for

  17. The Ciona intestinalis genome: when the constraints are off

    NASA Technical Reports Server (NTRS)

    Holland, Linda Z.; Gibson-Brown, Jeremy J.

    2003-01-01

    The recent genome sequencing of a non-vertebrate deuterostome, the ascidian tunicate Ciona intestinalis, makes a substantial contribution to the fields of evolutionary and developmental biology.1 Tunicates have some of the smallest bilaterian genomes, embryos with relatively few cells, fixed lineages and early determination of cell fates. Initial analyses of the C. intestinalis genome indicate that it has been evolving rapidly. Comparisons with other bilaterians show that C. intestinalis has lost a number of genes, and that many genes linked together in most other bilaterians have become uncoupled. In addition, a number of independent, lineage-specific gene duplications have been detected. These new results, although interesting in themselves, will take on a deeper significance once the genomes of additional invertebrate deuterostomes (e.g. echinoderms, hemichordates and amphioxus) have been sequenced. With such a broadened database, comparative genomics can begin to ask pointed questions about the relationship between the evolution of genomes and the evolution of body plans. Copyright 2003 Wiley Periodicals, Inc.

  18. New Turaev braided group categories and weak (co)quasi-Turaev group coalgebras

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhang, Xiaohui, E-mail: zxhhhhh@gmail.com; Wang, Shuanhong, E-mail: shuanhwang2002@yahoo.com

    In order to construct a class of new braided crossed G-categories with nontrivial associativity and unit constraints, we study the G-graded monoidal category over a family of algebras (H{sub α}){sub α∈G} and introduce the notion of a weak (co)quasi-Turaev G-(co)algebra. Then we prove that the category of (co)representations of (co)quasitriangular weak (co)quasi-Turaev π-(co)algebras is exactly a braided crossed G-category. In fact, this (co)quasitriangular structure provides a solution to a generalized quantum Yang-Baxter type equation.

  19. Extending information retrieval methods to personalized genomic-based studies of disease.

    PubMed

    Ye, Shuyun; Dawson, John A; Kendziorski, Christina

    2014-01-01

    Genomic-based studies of disease now involve diverse types of data collected on large groups of patients. A major challenge facing statistical scientists is how best to combine the data, extract important features, and comprehensively characterize the ways in which they affect an individual's disease course and likelihood of response to treatment. We have developed a survival-supervised latent Dirichlet allocation (survLDA) modeling framework to address these challenges. Latent Dirichlet allocation (LDA) models have proven extremely effective at identifying themes common across large collections of text, but applications to genomics have been limited. Our framework extends LDA to the genome by considering each patient as a "document" with "text" detailing his/her clinical events and genomic state. We then further extend the framework to allow for supervision by a time-to-event response. The model enables the efficient identification of collections of clinical and genomic features that co-occur within patient subgroups, and then characterizes each patient by those features. An application of survLDA to The Cancer Genome Atlas ovarian project identifies informative patient subgroups showing differential response to treatment, and validation in an independent cohort demonstrates the potential for patient-specific inference.

  20. From genomes to metabolomes: Understanding mechanisms of symbiosis and cell-cell signaling using the archaeal system Ignicoccus-Nanoarchaeum

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Podar, Mircea; Hettich, Robert; Copie, Valerie

    The main objective of this project was to use symbiotic Nanoarchaeaota, a group of thermophilic Archaea that are obligate symbionts/parasites on other Archaea, to develop an integrated multi-omic approach to study inter-species interactions as well as to understand fundamental mechanism that enable such relationships. As part of this grant we have achieved a number of important milestone on both technical and scientific levels. On the technical side, we developed immunofluorescence labeling and tracking methods to follow Nanoarchaeota in cultures and in environmental samples, we applied such methods in conjunction with flow cytometry to quantify and isolate uncultured representatives from themore » environment and characterized them by single cell genomics. On the proteomics side, we developed a more efficient and sensitive method to recover and semi-quantitatively measure membrane proteins, while achieving high total cellular proteome coverage (70-80% of the predicted proteome). Metabolomic analyses used complementary NMR and LC/GC mass spectrometry and led to the identification of novel lipids in these organisms as well as quantification of some of the major metabolites. Importantly, using several informatics approaches we were also able to integrate the transcriptomic, proteomic and metabolomic datasets, revealing aspects of the interspecies interaction that were not evident in the single omic analyses (manuscript in review). On the science side we determined that N. equitans and I. hospitalis are metabolically coupled and that N. equitans is strictly dependent on its host both for metabolic precursors and energetic needs. The actual mechanism by which small molecules move across the cell membrane remains unknown. The Ignicoccus host responds to the metabolic and energetic burned by upregulating of key primary metabolism steps and ATP synthesis. The two species have co-evolved, aspect that we determined by comparative genomics with other species of Ignicoccus

  1. Nonhuman genetics. Genomic basis for the convergent evolution of electric organs.

    PubMed

    Gallant, Jason R; Traeger, Lindsay L; Volkening, Jeremy D; Moffett, Howell; Chen, Po-Hao; Novina, Carl D; Phillips, George N; Anand, Rene; Wells, Gregg B; Pinch, Matthew; Güth, Robert; Unguez, Graciela A; Albert, James S; Zakon, Harold H; Samanta, Manoj P; Sussman, Michael R

    2014-06-27

    Little is known about the genetic basis of convergent traits that originate repeatedly over broad taxonomic scales. The myogenic electric organ has evolved six times in fishes to produce electric fields used in communication, navigation, predation, or defense. We have examined the genomic basis of the convergent anatomical and physiological origins of these organs by assembling the genome of the electric eel (Electrophorus electricus) and sequencing electric organ and skeletal muscle transcriptomes from three lineages that have independently evolved electric organs. Our results indicate that, despite millions of years of evolution and large differences in the morphology of electric organ cells, independent lineages have leveraged similar transcription factors and developmental and cellular pathways in the evolution of electric organs. Copyright © 2014, American Association for the Advancement of Science.

  2. Analysis of the entire genomes of fifteen torque teno midi virus variants classifiable into a third group of genus Anellovirus.

    PubMed

    Ninomiya, M; Takahashi, M; Shimosegawa, T; Okamoto, H

    2007-01-01

    Recently, we identified a novel human virus with a circular DNA genome of 3.2 kb, tentatively designated as torque teno midi virus (TTMDV), with a genomic organization resembling those of torque teno virus (TTV) of 3.8-3.9 kb and torque teno mini virus (TTMV) of 2.8-2.9 kb. To investigate the extent of genomic variability of TTMDV genomes, the full-length sequence was determined for 15 TTMDV isolates obtained from viremic individuals in Japan. The 15 TTMDV isolates comprised 3175-3230 bases and shared 67.0-90.3% identities with each other, and were only 68.4-73.0% identical to the 3 reported TTMDV isolates over the entire genome. TTMDV possessed a genomic organization with four open reading frames (ORF1-ORF4) with characteristic sequence motifs and stem and loop structures with high GC content, similar to TTV and TTMV. The total of 18 TTMDV genomes differed by up to 60.7% from each other in the amino acid sequence of ORF1 (658-677 amino acids), but segregated phylogenetically into the same cluster, which was distantly related to the TTVs and TTMVs. These results indicate that TTMDV with a circular DNA genome of 3.2 kb, has an extremely high degree of genomic variability, and is classifiable into a third group in the genus Anellovirus.

  3. Co-occurring genomic alterations define major subsets of KRAS - mutant lung adenocarcinoma with distinct biology, immune profiles, and therapeutic vulnerabilities

    PubMed Central

    Skoulidis, Ferdinandos; Byers, Lauren A.; Diao, Lixia; Papadimitrakopoulou, Vassiliki A.; Tong, Pan; Izzo, Julie; Behrens, Carmen; Kadara, Humam; Parra, Edwin R.; Canales, Jaime Rodriguez; Zhang, Jianjun; Giri, Uma; Gudikote, Jayanthi; Cortez, Maria A.; Yang, Chao; Fan, You Hong; Peyton, Michael; Girard, Luc; Coombes, Kevin R.; Toniatti, Carlo; Heffernan, Timothy P.; Choi, Murim; Frampton, Garrett M.; Miller, Vincent; Weinstein, John N.; Herbst, Roy S.; Wong, Kwok-Kin; Zhang, Jianhua; Sharma, Padmanee; Mills, Gordon B.; Hong, Waun K.; Minna, John D.; Allison, James P.; Futreal, Andrew; Wang, Jing; Wistuba, Ignacio I.; Heymach, John V.

    2015-01-01

    The molecular underpinnings that drive the heterogeneity of KRAS-mutant lung adenocarcinoma (LUAC) are poorly characterized. We performed an integrative analysis of genomic, transcriptomic and proteomic data from early-stage and chemo-refractory LUAC and identified three robust subsets of KRAS-mutant LUAC dominated, respectively, by co-occurring genetic events in STK11/LKB1 (the KL subgroup), TP53 (KP) and CDKN2A/B inactivation coupled with low expression of the NKX2-1 (TTF1) transcription factor (KC). We further reveal biologically and therapeutically relevant differences between the subgroups. KC tumors frequently exhibited mucinous histology and suppressed mTORC1 signaling. KL tumors had high rates of KEAP1 mutational inactivation and expressed lower levels of immune markers, including PD-L1. KP tumors demonstrated higher levels of somatic mutations, inflammatory markers, immune checkpoint effector molecules and improved relapse-free survival. Differences in drug sensitivity patterns were also observed; notably, KL cells showed increased vulnerability to HSP90-inhibitor therapy. This work provides evidence that co-occurring genomic alterations identify subgroups of KRAS-mutant LUAC with distinct biology and therapeutic vulnerabilities. PMID:26069186

  4. Co-Localization of the Oncogenic Transcription Factor MYCN and the DNA Methyl Binding Protein MeCP2 at Genomic Sites in Neuroblastoma

    PubMed Central

    Murphy, Derek M.; Buckley, Patrick G.; Das, Sudipto; Watters, Karen M.; Bryan, Kenneth; Stallings, Raymond L.

    2011-01-01

    Background MYCN is a transcription factor that is expressed during the development of the neural crest and its dysregulation plays a major role in the pathogenesis of pediatric cancers such as neuroblastoma, medulloblastoma and rhabdomyosarcoma. MeCP2 is a CpG methyl binding protein which has been associated with a number of cancers and developmental disorders, particularly Rett syndrome. Methods and Findings Using an integrative global genomics approach involving chromatin immunoprecipitation applied to microarrays, we have determined that MYCN and MeCP2 co-localize to gene promoter regions, as well as inter/intragenic sites, within the neuroblastoma genome (MYCN amplified Kelly cells) at high frequency (70.2% of MYCN sites were also positive for MeCP2). Intriguingly, the frequency of co-localization was significantly less at promoter regions exhibiting substantial hypermethylation (8.7%), as determined by methylated DNA immunoprecipitation (MeDIP) applied to the same microarrays. Co-immunoprecipitation of MYCN using an anti-MeCP2 antibody indicated that a MYCN/MeCP2 interaction occurs at protein level. mRNA expression profiling revealed that the median expression of genes with promoters bound by MYCN was significantly higher than for genes bound by MeCP2, and that genes bound by both proteins had intermediate expression. Pathway analysis was carried out for genes bound by MYCN, MeCP2 or MYCN/MeCP2, revealing higher order functions. Conclusions Our results indicate that MYCN and MeCP2 protein interact and co-localize to similar genomic sites at very high frequency, and that the patterns of binding of these proteins can be associated with significant differences in transcriptional activity. Although it is not yet known if this interaction contributes to neuroblastoma disease pathogenesis, it is intriguing that the interaction occurs at the promoter regions of several genes important for the development of neuroblastoma, including ALK, AURKA and BDNF. PMID:21731748

  5. CoLIde

    PubMed Central

    Mohorianu, Irina; Stocks, Matthew Benedict; Wood, John; Dalmay, Tamas; Moulton, Vincent

    2013-01-01

    Small RNAs (sRNAs) are 20–25 nt non-coding RNAs that act as guides for the highly sequence-specific regulatory mechanism known as RNA silencing. Due to the recent increase in sequencing depth, a highly complex and diverse population of sRNAs in both plants and animals has been revealed. However, the exponential increase in sequencing data has also made the identification of individual sRNA transcripts corresponding to biological units (sRNA loci) more challenging when based exclusively on the genomic location of the constituent sRNAs, hindering existing approaches to identify sRNA loci.   To infer the location of significant biological units, we propose an approach for sRNA loci detection called CoLIde (Co-expression based sRNA Loci Identification) that combines genomic location with the analysis of other information such as variation in expression levels (expression pattern) and size class distribution. For CoLIde, we define a locus as a union of regions sharing the same pattern and located in close proximity on the genome. Biological relevance, detected through the analysis of size class distribution, is also calculated for each locus. CoLIde can be applied on ordered (e.g., time-dependent) or un-ordered (e.g., organ, mutant) series of samples both with or without biological/technical replicates. The method reliably identifies known types of loci and shows improved performance on sequencing data from both plants (e.g., A. thaliana, S. lycopersicum) and animals (e.g., D. melanogaster) when compared with existing locus detection techniques. CoLIde is available for use within the UEA Small RNA Workbench which can be downloaded from: http://srna-workbench.cmp.uea.ac.uk. PMID:23851377

  6. Genomic features of bacterial adaptation to plants

    DOE PAGES

    Levy, Asaf; Salas Gonzalez, Isai; Mittelviefhaus, Maximilian; ...

    2017-12-18

    Plants intimately associate with diverse bacteria. Plant-associated bacteria have ostensibly evolved genes that enable them to adapt to plant environments. However, the identities of such genes are mostly unknown, and their functions are poorly characterized. In this study, we sequenced 484 genomes of bacterial isolates from roots of Brassicaceae, poplar, and maize. We then compared 3,837 bacterial genomes to identify thousands of plant-associated gene clusters. Genomes of plant-associated bacteria encode more carbohydrate metabolism functions and fewer mobile elements than related non-plant-associated genomes do. We experimentally validated candidates from two sets of plant-associated genes: one involved in plant colonization, and themore » other serving in microbe–microbe competition between plant-associated bacteria. We also identified 64 plant-associated protein domains that potentially mimic plant domains; some are shared with plant-associated fungi and oomycetes. In conclusion, this work expands the genome-based understanding of plant–microbe interactions and provides potential leads for efficient and sustainable agriculture through microbiome engineering.« less

  7. Genomic features of bacterial adaptation to plants

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Levy, Asaf; Salas Gonzalez, Isai; Mittelviefhaus, Maximilian

    Plants intimately associate with diverse bacteria. Plant-associated bacteria have ostensibly evolved genes that enable them to adapt to plant environments. However, the identities of such genes are mostly unknown, and their functions are poorly characterized. In this study, we sequenced 484 genomes of bacterial isolates from roots of Brassicaceae, poplar, and maize. We then compared 3,837 bacterial genomes to identify thousands of plant-associated gene clusters. Genomes of plant-associated bacteria encode more carbohydrate metabolism functions and fewer mobile elements than related non-plant-associated genomes do. We experimentally validated candidates from two sets of plant-associated genes: one involved in plant colonization, and themore » other serving in microbe–microbe competition between plant-associated bacteria. We also identified 64 plant-associated protein domains that potentially mimic plant domains; some are shared with plant-associated fungi and oomycetes. In conclusion, this work expands the genome-based understanding of plant–microbe interactions and provides potential leads for efficient and sustainable agriculture through microbiome engineering.« less

  8. Direct-to-consumer personalized genomic testing

    PubMed Central

    Bloss, Cinnamon S.; Darst, Burcu F.; Topol, Eric J.; Schork, Nicholas J.

    2011-01-01

    Over the past 18 months, there have been notable developments in the direct-to-consumer (DTC) genomic testing arena, in particular with regard to issues surrounding governmental regulation in the USA. While commentaries continue to proliferate on this topic, actual empirical research remains relatively scant. In terms of DTC genomic testing for disease susceptibility, most of the research has centered on uptake, perceptions and attitudes toward testing among health care professionals and consumers. Only a few available studies have examined actual behavioral response among consumers, and we are not aware of any studies that have examined response to DTC genetic testing for ancestry or for drug response. We propose that further research in this area is desperately needed, despite challenges in designing appropriate studies given the rapid pace at which the field is evolving. Ultimately, DTC genomic testing for common markers and conditions is only a precursor to the eventual cost-effectiveness and wide availability of whole genome sequencing of individuals, although it remains unclear whether DTC genomic information will still be attainable. Either way, however, current knowledge needs to be extended and enhanced with respect to the delivery, impact and use of increasingly accurate and comprehensive individualized genomic data. PMID:21828075

  9. The genome of the fire ant Solenopsis invicta

    USDA-ARS?s Scientific Manuscript database

    Ants have evolved very complex societies and are key ecosystem members. Some of them are also major pests, as exemplified by the fire ant Solenopsis invicta. We present here the draft genome of S. invicta, assembled from 454 and Illumina reads obtained from a focal haploid male and his brothers. In ...

  10. Clustering analysis of proteins from microbial genomes at multiple levels of resolution.

    PubMed

    Zaslavsky, Leonid; Ciufo, Stacy; Fedorov, Boris; Tatusova, Tatiana

    2016-08-31

    Microbial genomes at the National Center for Biotechnology Information (NCBI) represent a large collection of more than 35,000 assemblies. There are several complexities associated with the data: a great variation in sampling density since human pathogens are densely sampled while other bacteria are less represented; different protein families occur in annotations with different frequencies; and the quality of genome annotation varies greatly. In order to extract useful information from these sophisticated data, the analysis needs to be performed at multiple levels of phylogenomic resolution and protein similarity, with an adequate sampling strategy. Protein clustering is used to construct meaningful and stable groups of similar proteins to be used for analysis and functional annotation. Our approach is to create protein clusters at three levels. First, tight clusters in groups of closely-related genomes (species-level clades) are constructed using a combined approach that takes into account both sequence similarity and genome context. Second, clustroids of conservative in-clade clusters are organized into seed global clusters. Finally, global protein clusters are built around the the seed clusters. We propose filtering strategies that allow limiting the protein set included in global clustering. The in-clade clustering procedure, subsequent selection of clustroids and organization into seed global clusters provides a robust representation and high rate of compression. Seed protein clusters are further extended by adding related proteins. Extended seed clusters include a significant part of the data and represent all major known cell machinery. The remaining part, coming from either non-conservative (unique) or rapidly evolving proteins, from rare genomes, or resulting from low-quality annotation, does not group together well. Processing these proteins requires significant computational resources and results in a large number of questionable clusters. The developed

  11. Co-occurring genomic alterations define major subsets of KRAS-mutant lung adenocarcinoma with distinct biology, immune profiles, and therapeutic vulnerabilities.

    PubMed

    Skoulidis, Ferdinandos; Byers, Lauren A; Diao, Lixia; Papadimitrakopoulou, Vassiliki A; Tong, Pan; Izzo, Julie; Behrens, Carmen; Kadara, Humam; Parra, Edwin R; Canales, Jaime Rodriguez; Zhang, Jianjun; Giri, Uma; Gudikote, Jayanthi; Cortez, Maria A; Yang, Chao; Fan, Youhong; Peyton, Michael; Girard, Luc; Coombes, Kevin R; Toniatti, Carlo; Heffernan, Timothy P; Choi, Murim; Frampton, Garrett M; Miller, Vincent; Weinstein, John N; Herbst, Roy S; Wong, Kwok-Kin; Zhang, Jianhua; Sharma, Padmanee; Mills, Gordon B; Hong, Waun K; Minna, John D; Allison, James P; Futreal, Andrew; Wang, Jing; Wistuba, Ignacio I; Heymach, John V

    2015-08-01

    The molecular underpinnings that drive the heterogeneity of KRAS-mutant lung adenocarcinoma are poorly characterized. We performed an integrative analysis of genomic, transcriptomic, and proteomic data from early-stage and chemorefractory lung adenocarcinoma and identified three robust subsets of KRAS-mutant lung adenocarcinoma dominated, respectively, by co-occurring genetic events in STK11/LKB1 (the KL subgroup), TP53 (KP), and CDKN2A/B inactivation coupled with low expression of the NKX2-1 (TTF1) transcription factor (KC). We further revealed biologically and therapeutically relevant differences between the subgroups. KC tumors frequently exhibited mucinous histology and suppressed mTORC1 signaling. KL tumors had high rates of KEAP1 mutational inactivation and expressed lower levels of immune markers, including PD-L1. KP tumors demonstrated higher levels of somatic mutations, inflammatory markers, immune checkpoint effector molecules, and improved relapse-free survival. Differences in drug sensitivity patterns were also observed; notably, KL cells showed increased vulnerability to HSP90-inhibitor therapy. This work provides evidence that co-occurring genomic alterations identify subgroups of KRAS-mutant lung adenocarcinoma with distinct biology and therapeutic vulnerabilities. Co-occurring genetic alterations in STK11/LKB1, TP53, and CDKN2A/B-the latter coupled with low TTF1 expression-define three major subgroups of KRAS-mutant lung adenocarcinoma with distinct biology, patterns of immune-system engagement, and therapeutic vulnerabilities. ©2015 American Association for Cancer Research.

  12. Haemonchus contortus: Genome Structure, Organization and Comparative Genomics.

    PubMed

    Laing, R; Martinelli, A; Tracey, A; Holroyd, N; Gilleard, J S; Cotton, J A

    2016-01-01

    One of the first genome sequencing projects for a parasitic nematode was that for Haemonchus contortus. The open access data from the Wellcome Trust Sanger Institute provided a valuable early resource for the research community, particularly for the identification of specific genes and genetic markers. Later, a second sequencing project was initiated by the University of Melbourne, and the two draft genome sequences for H. contortus were published back-to-back in 2013. There is a pressing need for long-range genomic information for genetic mapping, population genetics and functional genomic studies, so we are continuing to improve the Wellcome Trust Sanger Institute assembly to provide a finished reference genome for H. contortus. This review describes this process, compares the H. contortus genome assemblies with draft genomes from other members of the strongylid group and discusses future directions for parasite genomics using the H. contortus model. Copyright © 2016 Elsevier Ltd. All rights reserved.

  13. Metabolic Environments and Genomic Features Associated with Pathogenic and Mutualistic Interactions between Bacteria and Plants is accepted for publication in MPMI

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Karpinets, Tatiana V; Park, Byung H; Syed, Mustafa H

    Most bacterial symbionts of plants are phenotypically characterized by their parasitic or matualistic relationship with the host; however, the genomic characteristics that likely discriminate mutualistic symbionts from pathogens of plants are poorly understood. This study comparatively analyzed the genomes of 54 plant-symbiontic bacteria, 27 mutualists and 27 pathogens, to discover genomic determinants of their parasitic and mutualistic nature in terms of protein family domains, KEGG orthologous groups, metabolic pathways and families of carbohydrate-active enzymes (CAZymes). We further used all bacteria with sequenced genomesl, published microarrays and transcriptomics experimental datasets, and literature to validate and to explore results of the comparison.more » The analysis revealed that genomes of mutualists are larger in size and higher in GC content and encode greater molecular, functional and metabolic diversity than the investigated genomes of pathogens. This enriched molecular and functional enzyme diversity included constructive biosynthetic signatures of CAZymes and metabolic pathways in genomes of mutualists compared with catabolic signatures dominant in the genomes of pathogens. Another discriminative characteristic of mutualists is the co-occurence of gene clusters required for the expression and function of nitrogenase and RuBisCO. Analysis of previously published experimental data indicate that nitrogen-fixing mutualists may employ Rubisco to fix CO2 not in the canonical Calvin-Benson-Basham cycle but in a novel metabolic pathway, here called Rubisco-based glycolysis , to increase efficiency of sugar utilization during the symbiosis with plants. An important discriminative characteristic of plant pathogenic bacteria is two groups of genes likely encoding effector proteins involved in host invasion and a genomic locus encoding a putative secretion system that includes a DUF1525 domain protein conserved in pathogens of plants and of other

  14. Genome annotation provides insight into carbon monoxide and hydrogen metabolism in Rubrivivax gelatinosus

    DOE PAGES

    Wawrousek, Karen; Noble, Scott; Korlach, Jonas; ...

    2014-12-05

    In this article, we report here the sequencing and analysis of the genome of the purple non-sulfur photosynthetic bacterium Rubrivivax gelatinosus CBS. This microbe is a model for studies of its carboxydotrophic life style under anaerobic condition, based on its ability to utilize carbon monoxide (CO) as the sole carbon substrate and water as the electron acceptor, yielding CO 2 and H 2 as the end products. The CO-oxidation reaction is known to be catalyzed by two enzyme complexes, the CO dehydrogenase and hydrogenase. As expected, analysis of the genome of Rx. gelatinosus CBS reveals the presence of genes encodingmore » both enzyme complexes. The CO-oxidation reaction is CO-inducible, which is consistent with the presence of two putative CO-sensing transcription factors in its genome. Genome analysis also reveals the presence of two additional hydrogenases, an uptake hydrogenase that liberates the electrons in H 2 in support of cell growth, and a regulatory hydrogenase that senses H 2 and relays the signal to a two-component system that ultimately controls synthesis of the uptake hydrogenase. The genome also contains two sets of hydrogenase maturation genes which are known to assemble the catalytic metallocluster of the hydrogenase NiFe active site. Finally and collectively, the genome sequence and analysis information reveals the blueprint of an intricate network of signal transduction pathways and its underlying regulation that enables Rx. gelatinosus CBS to thrive on CO or H 2 in support of cell growth.« less

  15. IMG 4 version of the integrated microbial genomes comparative analysis system

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Markowitz, Victor M.; Chen, I-Min A.; Palaniappan, Krishna

    The Integrated Microbial Genomes (IMG) data warehouse integrates genomes from all three domains of life, as well as plasmids, viruses and genome fragments. IMG provides tools for analyzing and reviewing the structural and functional annotations of genomes in a comparative context. IMG’s data content and analytical capabilities have increased continuously since its first version released in 2005. Since the last report published in the 2012 NAR Database Issue, IMG’s annotation and data integration pipelines have evolved while new tools have been added for recording and analyzing single cell genomes, RNA Seq and biosynthetic cluster data. Finally, different IMG datamarts providemore » support for the analysis of publicly available genomes (IMG/W: http://img.jgi.doe.gov/w), expert review of genome annotations (IMG/ER: http://img.jgi.doe.gov/er) and teaching and training in the area of microbial genome analysis (IMG/EDU: http://img.jgi.doe.gov/edu).« less

  16. Opportunities and challenges associated with clinical diagnostic genome sequencing: a report of the Association for Molecular Pathology.

    PubMed

    Schrijver, Iris; Aziz, Nazneen; Farkas, Daniel H; Furtado, Manohar; Gonzalez, Andrea Ferreira; Greiner, Timothy C; Grody, Wayne W; Hambuch, Tina; Kalman, Lisa; Kant, Jeffrey A; Klein, Roger D; Leonard, Debra G B; Lubin, Ira M; Mao, Rong; Nagan, Narasimhan; Pratt, Victoria M; Sobel, Mark E; Voelkerding, Karl V; Gibson, Jane S

    2012-11-01

    This report of the Whole Genome Analysis group of the Association for Molecular Pathology illuminates the opportunities and challenges associated with clinical diagnostic genome sequencing. With the reality of clinical application of next-generation sequencing, technical aspects of molecular testing can be accomplished at greater speed and with higher volume, while much information is obtained. Although this testing is a next logical step for molecular pathology laboratories, the potential impact on the diagnostic process and clinical correlations is extraordinary and clinical interpretation will be challenging. We review the rapidly evolving technologies; provide application examples; discuss aspects of clinical utility, ethics, and consent; and address the analytic, postanalytic, and professional implications. Copyright © 2012 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.

  17. Comparative genomic analysis of clinical and environmental Vibrio vulnificus isolates revealed biotype 3 evolutionary relationships.

    PubMed

    Koton, Yael; Gordon, Michal; Chalifa-Caspi, Vered; Bisharat, Naiel

    2014-01-01

    In 1996 a common-source outbreak of severe soft tissue and bloodstream infections erupted among Israeli fish farmers and fish consumers due to changes in fish marketing policies. The causative pathogen was a new strain of Vibrio vulnificus, named biotype 3, which displayed a unique biochemical and genotypic profile. Initial observations suggested that the pathogen erupted as a result of genetic recombination between two distinct populations. We applied a whole genome shotgun sequencing approach using several V. vulnificus strains from Israel in order to study the pan genome of V. vulnificus and determine the phylogenetic relationship of biotype 3 with existing populations. The core genome of V. vulnificus based on 16 draft and complete genomes consisted of 3068 genes, representing between 59 and 78% of the whole genome of 16 strains. The accessory genome varied in size from 781 to 2044 kbp. Phylogenetic analysis based on whole, core, and accessory genomes displayed similar clustering patterns with two main clusters, clinical (C) and environmental (E), all biotype 3 strains formed a distinct group within the E cluster. Annotation of accessory genomic regions found in biotype 3 strains and absent from the core genome yielded 1732 genes, of which the vast majority encoded hypothetical proteins, phage-related proteins, and mobile element proteins. A total of 1916 proteins (including 713 hypothetical proteins) were present in all human pathogenic strains (both biotype 3 and non-biotype 3) and absent from the environmental strains. Clustering analysis of the non-hypothetical proteins revealed 148 protein clusters shared by all human pathogenic strains; these included transcriptional regulators, arylsulfatases, methyl-accepting chemotaxis proteins, acetyltransferases, GGDEF family proteins, transposases, type IV secretory system (T4SS) proteins, and integrases. Our study showed that V. vulnificus biotype 3 evolved from environmental populations and formed a genetically

  18. Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.

    PubMed

    Paterson, Andrew H; Wendel, Jonathan F; Gundlach, Heidrun; Guo, Hui; Jenkins, Jerry; Jin, Dianchuan; Llewellyn, Danny; Showmaker, Kurtis C; Shu, Shengqiang; Udall, Joshua; Yoo, Mi-jeong; Byers, Robert; Chen, Wei; Doron-Faigenboim, Adi; Duke, Mary V; Gong, Lei; Grimwood, Jane; Grover, Corrinne; Grupp, Kara; Hu, Guanjing; Lee, Tae-ho; Li, Jingping; Lin, Lifeng; Liu, Tao; Marler, Barry S; Page, Justin T; Roberts, Alison W; Romanel, Elisson; Sanders, William S; Szadkowski, Emmanuel; Tan, Xu; Tang, Haibao; Xu, Chunming; Wang, Jinpeng; Wang, Zining; Zhang, Dong; Zhang, Lan; Ashrafi, Hamid; Bedon, Frank; Bowers, John E; Brubaker, Curt L; Chee, Peng W; Das, Sayan; Gingle, Alan R; Haigler, Candace H; Harker, David; Hoffmann, Lucia V; Hovav, Ran; Jones, Donald C; Lemke, Cornelia; Mansoor, Shahid; ur Rahman, Mehboob; Rainville, Lisa N; Rambani, Aditi; Reddy, Umesh K; Rong, Jun-kang; Saranga, Yehoshua; Scheffler, Brian E; Scheffler, Jodi A; Stelly, David M; Triplett, Barbara A; Van Deynze, Allen; Vaslin, Maite F S; Waghmare, Vijay N; Walford, Sally A; Wright, Robert J; Zaki, Essam A; Zhang, Tianzhen; Dennis, Elizabeth S; Mayer, Klaus F X; Peterson, Daniel G; Rokhsar, Daniel S; Wang, Xiyin; Schmutz, Jeremy

    2012-12-20

    Polyploidy often confers emergent properties, such as the higher fibre productivity and quality of tetraploid cottons than diploid cottons bred for the same environments. Here we show that an abrupt five- to sixfold ploidy increase approximately 60 million years (Myr) ago, and allopolyploidy reuniting divergent Gossypium genomes approximately 1-2 Myr ago, conferred about 30-36-fold duplication of ancestral angiosperm (flowering plant) genes in elite cottons (Gossypium hirsutum and Gossypium barbadense), genetic complexity equalled only by Brassica among sequenced angiosperms. Nascent fibre evolution, before allopolyploidy, is elucidated by comparison of spinnable-fibred Gossypium herbaceum A and non-spinnable Gossypium longicalyx F genomes to one another and the outgroup D genome of non-spinnable Gossypium raimondii. The sequence of a G. hirsutum A(t)D(t) (in which 't' indicates tetraploid) cultivar reveals many non-reciprocal DNA exchanges between subgenomes that may have contributed to phenotypic innovation and/or other emergent properties such as ecological adaptation by polyploids. Most DNA-level novelty in G. hirsutum recombines alleles from the D-genome progenitor native to its New World habitat and the Old World A-genome progenitor in which spinnable fibre evolved. Coordinated expression changes in proximal groups of functionally distinct genes, including a nuclear mitochondrial DNA block, may account for clusters of cotton-fibre quantitative trait loci affecting diverse traits. Opportunities abound for dissecting emergent properties of other polyploids, particularly angiosperms, by comparison to diploid progenitors and outgroups.

  19. Water Contact Angle Dependence with Hydroxyl Functional Groups on Silica Surfaces under CO2 Sequestration Conditions.

    PubMed

    Chen, Cong; Zhang, Ning; Li, Weizhong; Song, Yongchen

    2015-12-15

    Functional groups on silica surfaces under CO2 sequestration conditions are complex due to reactions among supercritical CO2, brine and silica. Molecular dynamics simulations have been performed to investigate the effects of hydroxyl functional groups on wettability. It has been found that wettability shows a strong dependence on functional groups on silica surfaces: silanol number density, space distribution, and deprotonation/protonation degree. For neutral silica surfaces with crystalline structure (Q(3), Q(3)/Q(4), Q(4)), as silanol number density decreases, contact angle increases from 33.5° to 146.7° at 10.5 MPa and 318 K. When Q(3) surface changes to an amorphous structure, water contact angle increases 20°. Water contact angle decreases about 12° when 9% of silanol groups on Q(3) surface are deprotonated. When the deprotonation degree increases to 50%, water contact angle decreases to 0. The dependence of wettability on silica surface functional groups was used to analyze contact angle measurement ambiguity in literature. The composition of silica surfaces is complicated under CO2 sequestration conditions, the results found in this study may help to better understand wettability of CO2/brine/silica system.

  20. Genomic architecture of adaptive color pattern divergence and convergence in Heliconius butterflies

    PubMed Central

    Supple, Megan A.; Hines, Heather M.; Dasmahapatra, Kanchon K.; Lewis, James J.; Nielsen, Dahlia M.; Lavoie, Christine; Ray, David A.; Salazar, Camilo; McMillan, W. Owen; Counterman, Brian A.

    2013-01-01

    Identifying the genetic changes driving adaptive variation in natural populations is key to understanding the origins of biodiversity. The mosaic of mimetic wing patterns in Heliconius butterflies makes an excellent system for exploring adaptive variation using next-generation sequencing. In this study, we use a combination of techniques to annotate the genomic interval modulating red color pattern variation, identify a narrow region responsible for adaptive divergence and convergence in Heliconius wing color patterns, and explore the evolutionary history of these adaptive alleles. We use whole genome resequencing from four hybrid zones between divergent color pattern races of Heliconius erato and two hybrid zones of the co-mimic Heliconius melpomene to examine genetic variation across 2.2 Mb of a partial reference sequence. In the intergenic region near optix, the gene previously shown to be responsible for the complex red pattern variation in Heliconius, population genetic analyses identify a shared 65-kb region of divergence that includes several sites perfectly associated with phenotype within each species. This region likely contains multiple cis-regulatory elements that control discrete expression domains of optix. The parallel signatures of genetic differentiation in H. erato and H. melpomene support a shared genetic architecture between the two distantly related co-mimics; however, phylogenetic analysis suggests mimetic patterns in each species evolved independently. Using a combination of next-generation sequencing analyses, we have refined our understanding of the genetic architecture of wing pattern variation in Heliconius and gained important insights into the evolution of novel adaptive phenotypes in natural populations. PMID:23674305

  1. Evolving gene regulation networks into cellular networks guiding adaptive behavior: an outline how single cells could have evolved into a centralized neurosensory system

    PubMed Central

    Fritzsch, Bernd; Jahan, Israt; Pan, Ning; Elliott, Karen L.

    2014-01-01

    Understanding the evolution of the neurosensory system of man, able to reflect on its own origin, is one of the major goals of comparative neurobiology. Details of the origin of neurosensory cells, their aggregation into central nervous systems and associated sensory organs, their localized patterning into remarkably different cell types aggregated into variably sized parts of the central nervous system begin to emerge. Insights at the cellular and molecular level begin to shed some light on the evolution of neurosensory cells, partially covered in this review. Molecular evidence suggests that high mobility group (HMG) proteins of pre-metazoans evolved into the definitive Sox [SRY (sex determining region Y)-box] genes used for neurosensory precursor specification in metazoans. Likewise, pre-metazoan basic helix-loop-helix (bHLH) genes evolved in metazoans into the group A bHLH genes dedicated to neurosensory differentiation in bilaterians. Available evidence suggests that the Sox and bHLH genes evolved a cross-regulatory network able to synchronize expansion of precursor populations and their subsequent differentiation into novel parts of the brain or sensory organs. Molecular evidence suggests metazoans evolved patterning gene networks early and not dedicated to neuronal development. Only later in evolution were these patterning gene networks tied into the increasing complexity of diffusible factors, many of which were already present in pre-metazoans, to drive local patterning events. It appears that the evolving molecular basis of neurosensory cell development may have led, in interaction with differentially expressed patterning genes, to local network modifications guiding unique specializations of neurosensory cells into sensory organs and various areas of the central nervous system. PMID:25416504

  2. Evolving gene regulatory networks into cellular networks guiding adaptive behavior: an outline how single cells could have evolved into a centralized neurosensory system.

    PubMed

    Fritzsch, Bernd; Jahan, Israt; Pan, Ning; Elliott, Karen L

    2015-01-01

    Understanding the evolution of the neurosensory system of man, able to reflect on its own origin, is one of the major goals of comparative neurobiology. Details of the origin of neurosensory cells, their aggregation into central nervous systems and associated sensory organs and their localized patterning leading to remarkably different cell types aggregated into variably sized parts of the central nervous system have begun to emerge. Insights at the cellular and molecular level have begun to shed some light on the evolution of neurosensory cells, partially covered in this review. Molecular evidence suggests that high mobility group (HMG) proteins of pre-metazoans evolved into the definitive Sox [SRY (sex determining region Y)-box] genes used for neurosensory precursor specification in metazoans. Likewise, pre-metazoan basic helix-loop-helix (bHLH) genes evolved in metazoans into the group A bHLH genes dedicated to neurosensory differentiation in bilaterians. Available evidence suggests that the Sox and bHLH genes evolved a cross-regulatory network able to synchronize expansion of precursor populations and their subsequent differentiation into novel parts of the brain or sensory organs. Molecular evidence suggests metazoans evolved patterning gene networks early, which were not dedicated to neuronal development. Only later in evolution were these patterning gene networks tied into the increasing complexity of diffusible factors, many of which were already present in pre-metazoans, to drive local patterning events. It appears that the evolving molecular basis of neurosensory cell development may have led, in interaction with differentially expressed patterning genes, to local network modifications guiding unique specializations of neurosensory cells into sensory organs and various areas of the central nervous system.

  3. End-Group Effects on the Properties of PEG-co-PGA Hydrogels

    PubMed Central

    Bencherif, Sidi A.; Srinivasan, Abiraman; Sheehan, Jeffrey A.; Walker, Lynn M.; Gayathri, Chakicherla; Gil, Roberto; Hollinger, Jeffrey O.; Matyjaszewski, Krzysztof; Washburn, Newell R.

    2009-01-01

    A series of resorbable poly(ethylene glycol)-co-poly(glycolic acid) macromonomers have been synthesized with the chemistries from three different photopolymerizable end-groups (acrylates, methacrylates, and urethane methacrylates). The aim of the study is to examine the effects of the chemistry of the cross-linker group on the properties of photocross-linkable hydrogels. PEG-co-PGA (4KG5) hydrogels were prepared by photopolymerization with high vinyl group conversion as confirmed by 1H NMR spectroscopy using DOSY 1D pulse sequence. Our study reveals that the nature of end-groups in a moderately amphiphilic polymer can adjust the distribution and size of the micellar configuration in water leading to changes in the macroscopic structure of hydrogels. By varying the chemistry of the cross-linker group (diacrylates; DA, dimethacrylates; DM, and urethane dimethacrylates; UDM), we determined that the hydrophobocity of a single core polymer consisting of poly(glycolic acid) could be fine-tuned leading to significant variations in the mechanical, swelling, and degradation properties of the gels. In addition, the effects of cross-linker chemistry on cytotoxicity and proliferation were examined. Cytotoxicity assays showed that all the three types of hydrogels (4KG5 DA, DM, and UDM) were biocompatible and the introduction of RGD ligand enhanced cell adhesion. However, differences in gel properties and stability differentially affected the spreading and proliferation of myoblast C2C12 cells. PMID:19328754

  4. Evolution and phylogeny of the mud shrimps (Crustacea: Decapoda) revealed from complete mitochondrial genomes.

    PubMed

    Lin, Feng-Jiau; Liu, Yuan; Sha, Zhongli; Tsang, Ling Ming; Chu, Ka Hou; Chan, Tin-Yam; Liu, Ruiyu; Cui, Zhaoxia

    2012-11-16

    The evolutionary history and relationships of the mud shrimps (Crustacea: Decapoda: Gebiidea and Axiidea) are contentious, with previous attempts revealing mixed results. The mud shrimps were once classified in the infraorder Thalassinidea. Recent molecular phylogenetic analyses, however, suggest separation of the group into two individual infraorders, Gebiidea and Axiidea. Mitochondrial (mt) genome sequence and structure can be especially powerful in resolving higher systematic relationships that may offer new insights into the phylogeny of the mud shrimps and the other decapod infraorders, and test the hypothesis of dividing the mud shrimps into two infraorders. We present the complete mitochondrial genome sequences of five mud shrimps, Austinogebia edulis, Upogebia major, Thalassina kelanang (Gebiidea), Nihonotrypaea thermophilus and Neaxius glyptocercus (Axiidea). All five genomes encode a standard set of 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes and a putative control region. Except for T. kelanang, mud shrimp mitochondrial genomes exhibited rearrangements and novel patterns compared to the pancrustacean ground pattern. Each of the two Gebiidea species (A. edulis and U. major) and two Axiidea species (N. glyptocercus and N. thermophiles) share unique gene order specific to their infraorders and analyses further suggest these two derived gene orders have evolved independently. Phylogenetic analyses based on the concatenated nucleotide and amino acid sequences of 13 protein-coding genes indicate the possible polyphyly of mud shrimps, supporting the division of the group into two infraorders. However, the infraordinal relationships among the Gebiidea and Axiidea, and other reptants are poorly resolved. The inclusion of mt genome from more taxa, in particular the reptant infraorders Polychelida and Glypheidea is required in further analysis. Phylogenetic analyses on the mt genome sequences and the distinct gene orders provide further

  5. Holistic Nursing in the Genetic/Genomic Era.

    PubMed

    Sharoff, Leighsa

    2016-06-01

    Holistic nursing practice is an ever-evolving transformative process with core values that require continued growth, professional leadership, and advocacy. Holistic nurses are required to stay current with all new required competencies, such as the Core Competencies in Genetics for Health Professional, and, as such, be adept at translating scientific evidence relating to genetics/genomics in the clinical setting. Knowledge of genetics/genomics in relation to nursing practice, policy, utilization, and research influence nurses' responsibilities. In addition to holistic nursing competencies, the holistic nurse must have basic knowledge and skills to integrate genetics/genomics aspects. It is important for holistic nurses to enhance their overall knowledge foundation, skills, and attitudes about genetics to prepare for the transformation in health care that is already underway. Holistic nurses can provide an important perspective to the application of genetics and genomics, focusing on health promotion, caring, and understanding the relationship between caring and families, community, and society. Yet there may be a lack of genetic and genomic knowledge to fully participate in the current genomic era. This article will explore the required core competencies for all health care professionals, share linkage of holistic nurses in practice with genetic/genomic conditions, and provide resources to further one's knowledge base. © The Author(s) 2015.

  6. Dictyostelium mobile elements: strategies to amplify in a compact genome.

    PubMed

    Winckler, T; Dingermann, T; Glöckner, G

    2002-12-01

    Dictyostelium discoideum is a eukaryotic microorganism that is attractive for the study of fundamental biological phenomena such as cell-cell communication, formation of multicellularity, cell differentiation and morphogenesis. Large-scale sequencing of the D. discoideum genome has provided new insights into evolutionary strategies evolved by transposable elements (TEs) to settle in compact microbial genomes and to maintain active populations over evolutionary time. The high gene density (about 1 gene/2.6 kb) of the D. discoideum genome leaves limited space for selfish molecular invaders to move and amplify without causing deleterious mutations that eradicate their host. Targeting of transfer RNA (tRNA) gene loci appears to be a generally successful strategy for TEs residing in compact genomes to insert away from coding regions. In D. discoideum, tRNA gene-targeted retrotransposition has evolved independently at least three times by both non-long terminal repeat (LTR) retrotransposons and retrovirus-like LTR retrotransposons. Unlike the nonspecifically inserting D. discoideum TEs, which have a strong tendency to insert into preexisting TE copies and form large and complex clusters near the ends of chromosomes, the tRNA gene-targeted retrotransposons have managed to occupy 75% of the tRNA gene loci spread on chromosome 2 and represent 80% of the TEs recognized on the assembled central 6.5-Mb part of chromosome 2. In this review we update the available information about D. discoideum TEs which emerges both from previous work and current large-scale genome sequencing, with special emphasis on the fact that tRNA genes are principal determinants of retrotransposon insertions into the D. discoideum genome.

  7. Genomic characterization, phylogenetic comparison and differential expression of the cyclic nucleotide-gated channels gene family in pear (Pyrus bretchneideri Rehd.).

    PubMed

    Chen, Jianqing; Yin, Hao; Gu, Jinping; Li, Leiting; Liu, Zhe; Jiang, Xueting; Zhou, Hongsheng; Wei, Shuwei; Zhang, Shaoling; Wu, Juyou

    2015-01-01

    The cyclic nucleotide-gated channel (CNGC) family is involved in the uptake of various cations, such as Ca(2+), to regulate plant growth and respond to biotic and abiotic stresses. However, there is far less information about this family in woody plants such as pear. Here, we provided a genome-wide identification and analysis of the CNGC gene family in pear. Phylogenetic analysis showed that the 21 pear CNGC genes could be divided into five groups (I, II, III, IVA and IVB). The majority of gene duplications in pear appeared to have been caused by segmental duplication and occurred 32.94-39.14 million years ago. Evolutionary analysis showed that positive selection had driven the evolution of pear CNGCs. Motif analyses showed that Group I CNGCs generally contained 26 motifs, which was the greatest number of motifs in all CNGC groups. Among these, eight motifs were shared by each group, suggesting that these domains play a conservative role in CNGC activity. Tissue-specific expression analysis indicated that functional diversification of the duplicated CNGC genes was a major feature of long-term evolution. Our results also suggested that the P-S6 and PBC & hinge domains had co-evolved during the evolution. These results provide valuable information to increase our understanding of the function, evolution and expression analyses of the CNGC gene family in higher plants. Copyright © 2014 Elsevier Inc. All rights reserved.

  8. The evolution of genomic imprinting: theories, predictions and empirical tests

    PubMed Central

    Patten, M M; Ross, L; Curley, J P; Queller, D C; Bonduriansky, R; Wolf, J B

    2014-01-01

    The epigenetic phenomenon of genomic imprinting has motivated the development of numerous theories for its evolutionary origins and genomic distribution. In this review, we examine the three theories that have best withstood theoretical and empirical scrutiny. These are: Haig and colleagues' kinship theory; Day and Bonduriansky's sexual antagonism theory; and Wolf and Hager's maternal–offspring coadaptation theory. These theories have fundamentally different perspectives on the adaptive significance of imprinting. The kinship theory views imprinting as a mechanism to change gene dosage, with imprinting evolving because of the differential effect that gene dosage has on the fitness of matrilineal and patrilineal relatives. The sexual antagonism and maternal–offspring coadaptation theories view genomic imprinting as a mechanism to modify the resemblance of an individual to its two parents, with imprinting evolving to increase the probability of expressing the fitter of the two alleles at a locus. In an effort to stimulate further empirical work on the topic, we carefully detail the logic and assumptions of all three theories, clarify the specific predictions of each and suggest tests to discriminate between these alternative theories for why particular genes are imprinted. PMID:24755983

  9. Genetic addiction: selfish gene's strategy for symbiosis in the genome.

    PubMed

    Mochizuki, Atsushi; Yahara, Koji; Kobayashi, Ichizo; Iwasa, Yoh

    2006-02-01

    The evolution and maintenance of the phenomenon of postsegregational host killing or genetic addiction are paradoxical. In this phenomenon, a gene complex, once established in a genome, programs death of a host cell that has eliminated it. The intact form of the gene complex would survive in other members of the host population. It is controversial as to why these genetic elements are maintained, due to the lethal effects of host killing, or perhaps some other properties are beneficial to the host. We analyzed their population dynamics by analytical methods and computer simulations. Genetic addiction turned out to be advantageous to the gene complex in the presence of a competitor genetic element. The advantage is, however, limited in a population without spatial structure, such as that in a well-mixed liquid culture. In contrast, in a structured habitat, such as the surface of a solid medium, the addiction gene complex can increase in frequency, irrespective of its initial density. Our demonstration that genomes can evolve through acquisition of addiction genes has implications for the general question of how a genome can evolve as a community of potentially selfish genes.

  10. Signatures of adaptation in the weedy rice genome

    USDA-ARS?s Scientific Manuscript database

    Weedy rice is a common problem of by product of domestication that has evolved multiple times from cultivated and wild rice relatives. Here we use whole genome sequences to examine the origin and adaptation of the two major US weedy red rice strains, with a comparison to Chinese weedy red rice. We f...

  11. Evolving Digital Ecological Networks

    PubMed Central

    Wagner, Aaron P.; Ofria, Charles

    2013-01-01

    “It is hard to realize that the living world as we know it is just one among many possibilities” [1]. Evolving digital ecological networks are webs of interacting, self-replicating, and evolving computer programs (i.e., digital organisms) that experience the same major ecological interactions as biological organisms (e.g., competition, predation, parasitism, and mutualism). Despite being computational, these programs evolve quickly in an open-ended way, and starting from only one or two ancestral organisms, the formation of ecological networks can be observed in real-time by tracking interactions between the constantly evolving organism phenotypes. These phenotypes may be defined by combinations of logical computations (hereafter tasks) that digital organisms perform and by expressed behaviors that have evolved. The types and outcomes of interactions between phenotypes are determined by task overlap for logic-defined phenotypes and by responses to encounters in the case of behavioral phenotypes. Biologists use these evolving networks to study active and fundamental topics within evolutionary ecology (e.g., the extent to which the architecture of multispecies networks shape coevolutionary outcomes, and the processes involved). PMID:23533370

  12. Discordance between genomic divergence and phenotypic variation in a rapidly evolving avian genus (Motacilla).

    PubMed

    Harris, Rebecca B; Alström, Per; Ödeen, Anders; Leaché, Adam D

    2018-03-01

    Generally, genotypes and phenotypes are expected to be spatially congruent; however, in widespread species complexes with few barriers to dispersal, multiple contact zones, and limited reproductive isolation, discordance between phenotypes and phylogeographic groups is more probable. Wagtails (Motacilla) are a genus of birds with striking plumage pattern variation across the Old World. Up to 13 subspecies are recognized within a single species, yet previous studies using mitochondrial DNA have supported polyphyletic phylogeographic groups that are inconsistent with subspecies plumage characteristics. In this study, we investigate the link between phenotypes and genotype by taking a phylogenetic approach. We use genome-wide SNPs, nuclear introns, and mitochondrial DNA to estimate population structure, isolation by distance, and species relationships. Together, our genetic sampling includes complete species-level sampling and comprehensive coverage of the three most phenotypically diverse Palearctic species. Our study provides strong evidence for species-level patterns of differentiation, however population-level differentiation is less pronounced. SNPs provide a robust estimate of species-level relationships, which are mostly corroborated by a combined analysis of mtDNA and nuclear introns (the first time-calibrated species tree for the genus). However, the mtDNA tree is strongly incongruent and is considered to misrepresent the species phylogeny. The extant wagtail lineages originated during the Pliocene and the Eurasian lineage underwent rapid diversification during the Pleistocene. Three of four widespread Eurasian species exhibit an east-west divide that contradicts both subspecies taxonomy and phenotypic variation. Indeed, SNPs fail to distinguish between phenotypically distinct subspecies within the M. alba and M. flava complexes, and instead support geographical regions, each of which is home to two or more different looking subspecies. This is a major step

  13. The possible evolution and future of CO2-concentrating mechanisms.

    PubMed

    Raven, John A; Beardall, John; Sánchez-Baracaldo, Patricia

    2017-06-01

    CO2-concentrating mechanisms (CCMs), based either on active transport of inorganic carbon (biophysical CCMs) or on biochemistry involving supplementary carbon fixation into C4 acids (C4 and CAM), play a major role in global primary productivity. However, the ubiquitous CO2-fixing enzyme in autotrophs, Rubisco, evolved at a time when atmospheric CO2 levels were very much higher than today and O2 was very low and, as CO2 and O2 approached (by no means monotonically), today's levels, at some time subsequently many organisms evolved a CCM that increased the supply of CO2 and decreased Rubisco oxygenase activity. Given that CO2 levels and other environmental factors have altered considerably between when autotrophs evolved and the present day, and are predicted to continue to change into the future, we here examine the drivers for, and possible timing of, evolution of CCMs. CCMs probably evolved when CO2 fell to 2-16 times the present atmospheric level, depending on Rubisco kinetics. We also assess the effects of other key environmental factors such as temperature and nutrient levels on CCM activity and examine the evidence for evolutionary changes in CCM activity and related cellular processes as well as limitations on continuity of CCMs through environmental variations. © The Author 2017. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  14. Comparative genomic analysis of the genus Staphylococcus including Staphylococcus aureus and its newly described sister species Staphylococcus simiae

    PubMed Central

    2012-01-01

    Background Staphylococcus belongs to the Gram-positive low G + C content group of the Firmicutes division of bacteria. Staphylococcus aureus is an important human and veterinary pathogen that causes a broad spectrum of diseases, and has developed important multidrug resistant forms such as methicillin-resistant S. aureus (MRSA). Staphylococcus simiae was isolated from South American squirrel monkeys in 2000, and is a coagulase-negative bacterium, closely related, and possibly the sister group, to S. aureus. Comparative genomic analyses of closely related bacteria with different phenotypes can provide information relevant to understanding adaptation to host environment and mechanisms of pathogenicity. Results We determined a Roche/454 draft genome sequence for S. simiae and included it in comparative genomic analyses with 11 other Staphylococcus species including S. aureus. A genome based phylogeny of the genus confirms that S. simiae is the sister group to S. aureus and indicates that the most basal Staphylococcus lineage is Staphylococcus pseudintermedius, followed by Staphylococcus carnosus. Given the primary niche of these two latter taxa, compared to the other species in the genus, this phylogeny suggests that human adaptation evolved after the split of S. carnosus. The two coagulase-positive species (S. aureus and S. pseudintermedius) are not phylogenetically closest but share many virulence factors exclusively, suggesting that these genes were acquired by horizontal transfer. Enrichment in genes related to mobile elements such as prophage in S. aureus relative to S. simiae suggests that pathogenesis in the S. aureus group has developed by gene gain through horizontal transfer, after the split of S. aureus and S. simiae from their common ancestor. Conclusions Comparative genomic analyses across 12 Staphylococcus species provide hypotheses about lineages in which human adaptation has taken place and contributions of horizontal transfer in pathogenesis. PMID

  15. Genomic comparison of the closely-related Salmonella enterica serovars enteritidis, dublin and gallinarum

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Matthews, T. David; Schmieder, Robert; Silva, Genivaldo G. Z.

    The Salmonella enterica serovars Enteritidis, Dublin, and Gallinarum are closely related but differ in virulence and host range. To identify the genetic elements responsible for these differences and to better understand how these serovars are evolving, we sequenced the genomes of Enteritidis strain LK5 and Dublin strain SARB12 and compared these genomes to the publicly available Enteritidis P125109, Dublin CT 02021853 and Dublin SD3246 genome sequences. We also compared the publicly available Gallinarum genome sequences from biotype Gallinarum 287/91 and Pullorum RKS5078. Using bioinformatic approaches, we identified single nucleotide polymorphisms, insertions, deletions, and differences in prophage and pseudogene content betweenmore » strains belonging to the same serovar. Through our analysis we also identified several prophage cargo genes and pseudogenes that affect virulence and may contribute to a host-specific, systemic lifestyle. These results strongly argue that the Enteritidis, Dublin and Gallinarum serovars of Salmonella enterica evolve by acquiring new genes through horizontal gene transfer, followed by the formation of pseudogenes. As a result, the loss of genes necessary for a gastrointestinal lifestyle ultimately leads to a systemic lifestyle and niche exclusion in the host-specific serovars.« less

  16. Genomic comparison of the closely-related Salmonella enterica serovars enteritidis, dublin and gallinarum

    DOE PAGES

    Matthews, T. David; Schmieder, Robert; Silva, Genivaldo G. Z.; ...

    2015-06-03

    The Salmonella enterica serovars Enteritidis, Dublin, and Gallinarum are closely related but differ in virulence and host range. To identify the genetic elements responsible for these differences and to better understand how these serovars are evolving, we sequenced the genomes of Enteritidis strain LK5 and Dublin strain SARB12 and compared these genomes to the publicly available Enteritidis P125109, Dublin CT 02021853 and Dublin SD3246 genome sequences. We also compared the publicly available Gallinarum genome sequences from biotype Gallinarum 287/91 and Pullorum RKS5078. Using bioinformatic approaches, we identified single nucleotide polymorphisms, insertions, deletions, and differences in prophage and pseudogene content betweenmore » strains belonging to the same serovar. Through our analysis we also identified several prophage cargo genes and pseudogenes that affect virulence and may contribute to a host-specific, systemic lifestyle. These results strongly argue that the Enteritidis, Dublin and Gallinarum serovars of Salmonella enterica evolve by acquiring new genes through horizontal gene transfer, followed by the formation of pseudogenes. As a result, the loss of genes necessary for a gastrointestinal lifestyle ultimately leads to a systemic lifestyle and niche exclusion in the host-specific serovars.« less

  17. Genomic Comparison of the Closely-Related Salmonella enterica Serovars Enteritidis, Dublin and Gallinarum

    PubMed Central

    Matthews, T. David; Schmieder, Robert; Silva, Genivaldo G. Z.; Busch, Julia; Cassman, Noriko; Dutilh, Bas E.; Green, Dawn; Matlock, Brian; Heffernan, Brian; Olsen, Gary J.; Farris Hanna, Leigh; Schifferli, Dieter M.; Maloy, Stanley; Dinsdale, Elizabeth A.; Edwards, Robert A.

    2015-01-01

    The Salmonella enterica serovars Enteritidis, Dublin, and Gallinarum are closely related but differ in virulence and host range. To identify the genetic elements responsible for these differences and to better understand how these serovars are evolving, we sequenced the genomes of Enteritidis strain LK5 and Dublin strain SARB12 and compared these genomes to the publicly available Enteritidis P125109, Dublin CT 02021853 and Dublin SD3246 genome sequences. We also compared the publicly available Gallinarum genome sequences from biotype Gallinarum 287/91 and Pullorum RKS5078. Using bioinformatic approaches, we identified single nucleotide polymorphisms, insertions, deletions, and differences in prophage and pseudogene content between strains belonging to the same serovar. Through our analysis we also identified several prophage cargo genes and pseudogenes that affect virulence and may contribute to a host-specific, systemic lifestyle. These results strongly argue that the Enteritidis, Dublin and Gallinarum serovars of Salmonella enterica evolve by acquiring new genes through horizontal gene transfer, followed by the formation of pseudogenes. The loss of genes necessary for a gastrointestinal lifestyle ultimately leads to a systemic lifestyle and niche exclusion in the host-specific serovars. PMID:26039056

  18. Genome-scale rates of evolutionary change in bacteria

    PubMed Central

    Duchêne, Sebastian; Holt, Kathryn E.; Weill, François-Xavier; Le Hello, Simon; Hawkey, Jane; Edwards, David J.; Fourment, Mathieu

    2016-01-01

    Estimating the rates at which bacterial genomes evolve is critical to understanding major evolutionary and ecological processes such as disease emergence, long-term host–pathogen associations and short-term transmission patterns. The surge in bacterial genomic data sets provides a new opportunity to estimate these rates and reveal the factors that shape bacterial evolutionary dynamics. For many organisms estimates of evolutionary rate display an inverse association with the time-scale over which the data are sampled. However, this relationship remains unexplored in bacteria due to the difficulty in estimating genome-wide evolutionary rates, which are impacted by the extent of temporal structure in the data and the prevalence of recombination. We collected 36 whole genome sequence data sets from 16 species of bacterial pathogens to systematically estimate and compare their evolutionary rates and assess the extent of temporal structure in the absence of recombination. The majority (28/36) of data sets possessed sufficient clock-like structure to robustly estimate evolutionary rates. However, in some species reliable estimates were not possible even with ‘ancient DNA’ data sampled over many centuries, suggesting that they evolve very slowly or that they display extensive rate variation among lineages. The robustly estimated evolutionary rates spanned several orders of magnitude, from approximately 10−5 to 10−8 nucleotide substitutions per site year−1. This variation was negatively associated with sampling time, with this relationship best described by an exponential decay curve. To avoid potential estimation biases, such time-dependency should be considered when inferring evolutionary time-scales in bacteria. PMID:28348834

  19. Complete genome sequence of Syntrophobacter fumaroxidans strain (MPOBT)

    PubMed Central

    Plugge, Caroline M.; Henstra, Anne M.; Worm, Petra; Swarts, Daan C.; Paulitsch-Fuchs, Astrid H.; Scholten, Johannes C.M.; Lykidis, Athanasios; Lapidus, Alla L.; Goltsman, Eugene; Kim, Edwin; McDonald, Erin; Rohlin, Lars; Crable, Bryan R.; Gunsalus, Robert P.; Stams, Alfons J.M.; McInerney, Michael J.

    2012-01-01

    Syntrophobacter fumaroxidans strain MPOBT is the best-studied species of the genus Syntrophobacter. The species is of interest because of its anaerobic syntrophic lifestyle, its involvement in the conversion of propionate to acetate, H2 and CO2 during the overall degradation of organic matter, and its release of products that serve as substrates for other microorganisms. The strain is able to ferment fumarate in pure culture to CO2 and succinate, and is also able to grow as a sulfate reducer with propionate as an electron donor. This is the first complete genome sequence of a member of the genus Syntrophobacter and a member genus in the family Syntrophobacteraceae. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 4,990,251 bp long genome with its 4,098 protein-coding and 81 RNA genes is a part of the Microbial Genome Program (MGP) and the Genomes to Life (GTL) Program project. PMID:23450070

  20. Predicting Protein Function by Genomic Context: Quantitative Evaluation and Qualitative Inferences

    PubMed Central

    Huynen, Martijn; Snel, Berend; Lathe, Warren; Bork, Peer

    2000-01-01

    Various new methods have been proposed to predict functional interactions between proteins based on the genomic context of their genes. The types of genomic context that they use are Type I: the fusion of genes; Type II: the conservation of gene-order or co-occurrence of genes in potential operons; and Type III: the co-occurrence of genes across genomes (phylogenetic profiles). Here we compare these types for their coverage, their correlations with various types of functional interaction, and their overlap with homology-based function assignment. We apply the methods to Mycoplasma genitalium, the standard benchmarking genome in computational and experimental genomics. Quantitatively, conservation of gene order is the technique with the highest coverage, applying to 37% of the genes. By combining gene order conservation with gene fusion (6%), the co-occurrence of genes in operons in absence of gene order conservation (8%), and the co-occurrence of genes across genomes (11%), significant context information can be obtained for 50% of the genes (the categories overlap). Qualitatively, we observe that the functional interactions between genes are stronger as the requirements for physical neighborhood on the genome are more stringent, while the fraction of potential false positives decreases. Moreover, only in cases in which gene order is conserved in a substantial fraction of the genomes, in this case six out of twenty-five, does a single type of functional interaction (physical interaction) clearly dominate (>80%). In other cases, complementary function information from homology searches, which is available for most of the genes with significant genomic context, is essential to predict the type of interaction. Using a combination of genomic context and homology searches, new functional features can be predicted for 10% of M. genitalium genes. PMID:10958638