Science.gov

Sample records for space reveals genome

  1. Genome-wide comparative analysis of the Brassica rapa gene space reveals genome shrinkage and differential loss of duplicated genes after whole genome triplication

    PubMed Central

    2009-01-01

    Background Brassica rapa is one of the most economically important vegetable crops worldwide. Owing to its agronomic importance and phylogenetic position, B. rapa provides a crucial reference to understand polyploidy-related crop genome evolution. The high degree of sequence identity and remarkably conserved genome structure between Arabidopsis and Brassica genomes enables comparative tiling sequencing using Arabidopsis sequences as references to select the counterpart regions in B. rapa, which is a strong challenge of structural and comparative crop genomics. Results We assembled 65.8 megabase-pairs of non-redundant euchromatic sequence of B. rapa and compared this sequence to the Arabidopsis genome to investigate chromosomal relationships, macrosynteny blocks, and microsynteny within blocks. The triplicated B. rapa genome contains only approximately twice the number of genes as in Arabidopsis because of genome shrinkage. Genome comparisons suggest that B. rapa has a distinct organization of ancestral genome blocks as a result of recent whole genome triplication followed by a unique diploidization process. A lack of the most recent whole genome duplication (3R) event in the B. rapa genome, atypical of other Brassica genomes, may account for the emergence of B. rapa from the Brassica progenitor around 8 million years ago. Conclusions This work demonstrates the potential of using comparative tiling sequencing for genome analysis of crop species. Based on a comparative analysis of the B. rapa sequences and the Arabidopsis genome, it appears that polyploidy and chromosomal diploidization are ongoing processes that collectively stabilize the B. rapa genome and facilitate its evolution. PMID:19821981

  2. The cattle genome reveals its secrets

    PubMed Central

    Burt, David W

    2009-01-01

    The domesticated cow is the latest farm animal to have its genome sequenced and deciphered. The members of the Bovine Genome Consortium have published a series of papers on the assembly and what the sequence reveals so far about the biology of this ruminant and the consequences of its domestication. PMID:19439025

  3. Open chromatin reveals the functional maize genome

    PubMed Central

    Rodgers-Melnick, Eli; Vera, Daniel L.; Bass, Hank W.

    2016-01-01

    Cellular processes mediated through nuclear DNA must contend with chromatin. Chromatin structural assays can efficiently integrate information across diverse regulatory elements, revealing the functional noncoding genome. In this study, we use a differential nuclease sensitivity assay based on micrococcal nuclease (MNase) digestion to discover open chromatin regions in the maize genome. We find that maize MNase-hypersensitive (MNase HS) regions localize around active genes and within recombination hotspots, focusing biased gene conversion at their flanks. Although MNase HS regions map to less than 1% of the genome, they consistently explain a remarkably large amount (∼40%) of heritable phenotypic variance in diverse complex traits. MNase HS regions are therefore on par with coding sequences as annotations that demarcate the functional parts of the maize genome. These results imply that less than 3% of the maize genome (coding and MNase HS regions) may give rise to the overwhelming majority of phenotypic variation, greatly narrowing the scope of the functional genome. PMID:27185945

  4. Open chromatin reveals the functional maize genome.

    PubMed

    Rodgers-Melnick, Eli; Vera, Daniel L; Bass, Hank W; Buckler, Edward S

    2016-05-31

    Cellular processes mediated through nuclear DNA must contend with chromatin. Chromatin structural assays can efficiently integrate information across diverse regulatory elements, revealing the functional noncoding genome. In this study, we use a differential nuclease sensitivity assay based on micrococcal nuclease (MNase) digestion to discover open chromatin regions in the maize genome. We find that maize MNase-hypersensitive (MNase HS) regions localize around active genes and within recombination hotspots, focusing biased gene conversion at their flanks. Although MNase HS regions map to less than 1% of the genome, they consistently explain a remarkably large amount (∼40%) of heritable phenotypic variance in diverse complex traits. MNase HS regions are therefore on par with coding sequences as annotations that demarcate the functional parts of the maize genome. These results imply that less than 3% of the maize genome (coding and MNase HS regions) may give rise to the overwhelming majority of phenotypic variation, greatly narrowing the scope of the functional genome. PMID:27185945

  5. Comparative genomics reveals insights into avian genome evolution and adaptation

    PubMed Central

    Zhang, Guojie; Li, Cai; Li, Qiye; Li, Bo; Larkin, Denis M.; Lee, Chul; Storz, Jay F.; Antunes, Agostinho; Greenwold, Matthew J.; Meredith, Robert W.; Ödeen, Anders; Cui, Jie; Zhou, Qi; Xu, Luohao; Pan, Hailin; Wang, Zongji; Jin, Lijun; Zhang, Pei; Hu, Haofu; Yang, Wei; Hu, Jiang; Xiao, Jin; Yang, Zhikai; Liu, Yang; Xie, Qiaolin; Yu, Hao; Lian, Jinmin; Wen, Ping; Zhang, Fang; Li, Hui; Zeng, Yongli; Xiong, Zijun; Liu, Shiping; Zhou, Long; Huang, Zhiyong; An, Na; Wang, Jie; Zheng, Qiumei; Xiong, Yingqi; Wang, Guangbiao; Wang, Bo; Wang, Jingjing; Fan, Yu; da Fonseca, Rute R.; Alfaro-Núñez, Alonzo; Schubert, Mikkel; Orlando, Ludovic; Mourier, Tobias; Howard, Jason T.; Ganapathy, Ganeshkumar; Pfenning, Andreas; Whitney, Osceola; Rivas, Miriam V.; Hara, Erina; Smith, Julia; Farré, Marta; Narayan, Jitendra; Slavov, Gancho; Romanov, Michael N; Borges, Rui; Machado, João Paulo; Khan, Imran; Springer, Mark S.; Gatesy, John; Hoffmann, Federico G.; Opazo, Juan C.; Håstad, Olle; Sawyer, Roger H.; Kim, Heebal; Kim, Kyu-Won; Kim, Hyeon Jeong; Cho, Seoae; Li, Ning; Huang, Yinhua; Bruford, Michael W.; Zhan, Xiangjiang; Dixon, Andrew; Bertelsen, Mads F.; Derryberry, Elizabeth; Warren, Wesley; Wilson, Richard K; Li, Shengbin; Ray, David A.; Green, Richard E.; O’Brien, Stephen J.; Griffin, Darren; Johnson, Warren E.; Haussler, David; Ryder, Oliver A.; Willerslev, Eske; Graves, Gary R.; Alström, Per; Fjeldså, Jon; Mindell, David P.; Edwards, Scott V.; Braun, Edward L.; Rahbek, Carsten; Burt, David W.; Houde, Peter; Zhang, Yong; Yang, Huanming; Wang, Jian; Jarvis, Erich D.; Gilbert, M. Thomas P.; Wang, Jun

    2015-01-01

    Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits. PMID:25504712

  6. Comparative genomics reveals insights into avian genome evolution and adaptation.

    PubMed

    Zhang, Guojie; Li, Cai; Li, Qiye; Li, Bo; Larkin, Denis M; Lee, Chul; Storz, Jay F; Antunes, Agostinho; Greenwold, Matthew J; Meredith, Robert W; Ödeen, Anders; Cui, Jie; Zhou, Qi; Xu, Luohao; Pan, Hailin; Wang, Zongji; Jin, Lijun; Zhang, Pei; Hu, Haofu; Yang, Wei; Hu, Jiang; Xiao, Jin; Yang, Zhikai; Liu, Yang; Xie, Qiaolin; Yu, Hao; Lian, Jinmin; Wen, Ping; Zhang, Fang; Li, Hui; Zeng, Yongli; Xiong, Zijun; Liu, Shiping; Zhou, Long; Huang, Zhiyong; An, Na; Wang, Jie; Zheng, Qiumei; Xiong, Yingqi; Wang, Guangbiao; Wang, Bo; Wang, Jingjing; Fan, Yu; da Fonseca, Rute R; Alfaro-Núñez, Alonzo; Schubert, Mikkel; Orlando, Ludovic; Mourier, Tobias; Howard, Jason T; Ganapathy, Ganeshkumar; Pfenning, Andreas; Whitney, Osceola; Rivas, Miriam V; Hara, Erina; Smith, Julia; Farré, Marta; Narayan, Jitendra; Slavov, Gancho; Romanov, Michael N; Borges, Rui; Machado, João Paulo; Khan, Imran; Springer, Mark S; Gatesy, John; Hoffmann, Federico G; Opazo, Juan C; Håstad, Olle; Sawyer, Roger H; Kim, Heebal; Kim, Kyu-Won; Kim, Hyeon Jeong; Cho, Seoae; Li, Ning; Huang, Yinhua; Bruford, Michael W; Zhan, Xiangjiang; Dixon, Andrew; Bertelsen, Mads F; Derryberry, Elizabeth; Warren, Wesley; Wilson, Richard K; Li, Shengbin; Ray, David A; Green, Richard E; O'Brien, Stephen J; Griffin, Darren; Johnson, Warren E; Haussler, David; Ryder, Oliver A; Willerslev, Eske; Graves, Gary R; Alström, Per; Fjeldså, Jon; Mindell, David P; Edwards, Scott V; Braun, Edward L; Rahbek, Carsten; Burt, David W; Houde, Peter; Zhang, Yong; Yang, Huanming; Wang, Jian; Jarvis, Erich D; Gilbert, M Thomas P; Wang, Jun

    2014-12-12

    Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits. PMID:25504712

  7. An Exploration into Fern Genome Space.

    PubMed

    Wolf, Paul G; Sessa, Emily B; Marchant, Daniel Blaine; Li, Fay-Wei; Rothfels, Carl J; Sigel, Erin M; Gitzendanner, Matthew A; Visger, Clayton J; Banks, Jo Ann; Soltis, Douglas E; Soltis, Pamela S; Pryer, Kathleen M; Der, Joshua P

    2015-09-01

    Ferns are one of the few remaining major clades of land plants for which a complete genome sequence is lacking. Knowledge of genome space in ferns will enable broad-scale comparative analyses of land plant genes and genomes, provide insights into genome evolution across green plants, and shed light on genetic and genomic features that characterize ferns, such as their high chromosome numbers and large genome sizes. As part of an initial exploration into fern genome space, we used a whole genome shotgun sequencing approach to obtain low-density coverage (∼0.4X to 2X) for six fern species from the Polypodiales (Ceratopteris, Pteridium, Polypodium, Cystopteris), Cyatheales (Plagiogyria), and Gleicheniales (Dipteris). We explore these data to characterize the proportion of the nuclear genome represented by repetitive sequences (including DNA transposons, retrotransposons, ribosomal DNA, and simple repeats) and protein-coding genes, and to extract chloroplast and mitochondrial genome sequences. Such initial sweeps of fern genomes can provide information useful for selecting a promising candidate fern species for whole genome sequencing. We also describe variation of genomic traits across our sample and highlight some differences and similarities in repeat structure between ferns and seed plants. PMID:26311176

  8. An Exploration into Fern Genome Space

    PubMed Central

    Wolf, Paul G.; Sessa, Emily B.; Marchant, Daniel Blaine; Li, Fay-Wei; Rothfels, Carl J.; Sigel, Erin M.; Gitzendanner, Matthew A.; Visger, Clayton J.; Banks, Jo Ann; Soltis, Douglas E.; Soltis, Pamela S.; Pryer, Kathleen M.; Der, Joshua P.

    2015-01-01

    Ferns are one of the few remaining major clades of land plants for which a complete genome sequence is lacking. Knowledge of genome space in ferns will enable broad-scale comparative analyses of land plant genes and genomes, provide insights into genome evolution across green plants, and shed light on genetic and genomic features that characterize ferns, such as their high chromosome numbers and large genome sizes. As part of an initial exploration into fern genome space, we used a whole genome shotgun sequencing approach to obtain low-density coverage (∼0.4X to 2X) for six fern species from the Polypodiales (Ceratopteris, Pteridium, Polypodium, Cystopteris), Cyatheales (Plagiogyria), and Gleicheniales (Dipteris). We explore these data to characterize the proportion of the nuclear genome represented by repetitive sequences (including DNA transposons, retrotransposons, ribosomal DNA, and simple repeats) and protein-coding genes, and to extract chloroplast and mitochondrial genome sequences. Such initial sweeps of fern genomes can provide information useful for selecting a promising candidate fern species for whole genome sequencing. We also describe variation of genomic traits across our sample and highlight some differences and similarities in repeat structure between ferns and seed plants. PMID:26311176

  9. Genome-Wide Scan Reveals Mutation Associated with Melanoma

    MedlinePlus

    ... 1999 Spotlight on Research 2012 July 2012 (historical) Genome-Wide Scan Reveals Mutation Associated with Melanoma A ... out to see if a technology called whole genome sequencing would help them find other genetic risk ...

  10. Genomics reveals new landscapes for crop improvement

    PubMed Central

    2013-01-01

    The sequencing of large and complex genomes of crop species, facilitated by new sequencing technologies and bioinformatic approaches, has provided new opportunities for crop improvement. Current challenges include understanding how genetic variation translates into phenotypic performance in the field. PMID:23796126

  11. Advancing Eucalyptus Genomics: Cytogenomics Reveals Conservation of Eucalyptus Genomes.

    PubMed

    Ribeiro, Teresa; Barrela, Ricardo M; Bergès, Hélène; Marques, Cristina; Loureiro, João; Morais-Cecílio, Leonor; Paiva, Jorge A P

    2016-01-01

    The genus Eucalyptus encloses several species with high ecological and economic value, being the subgenus Symphyomyrtus one of the most important. Species such as E. grandis and E. globulus are well characterized at the molecular level but knowledge regarding genome and chromosome organization is very scarce. Here we characterized and compared the karyotypes of three economically important species, E. grandis, E. globulus, and E. calmadulensis, and three with ecological relevance, E. pulverulenta, E. cornuta, and E. occidentalis, through an integrative approach including genome size estimation, fluorochrome banding, rDNA FISH, and BAC landing comprising genes involved in lignin biosynthesis. All karyotypes show a high degree of conservation with pericentromeric 35S and 5S rDNA loci in the first and third pairs, respectively. GC-rich heterochromatin was restricted to the 35S rDNA locus while the AT-rich heterochromatin pattern was species-specific. The slight differences in karyotype formulas and distribution of AT-rich heterochromatin, along with genome sizes estimations, support the idea of Eucalyptus genome evolution by local expansions of heterochromatin clusters. The unusual co-localization of both rDNA with AT-rich heterochromatin was attributed mainly to the presence of silent transposable elements in those loci. The cinnamoyl CoA reductase gene (CCR1) previously assessed to linkage group 10 (LG10) was clearly localized distally at the long arm of chromosome 9 establishing an unexpected correlation between the cytogenetic chromosome 9 and the LG10. Our work is novel and contributes to the understanding of Eucalyptus genome organization which is essential to develop successful advanced breeding strategies for this genus. PMID:27148332

  12. Advancing Eucalyptus Genomics: Cytogenomics Reveals Conservation of Eucalyptus Genomes

    PubMed Central

    Ribeiro, Teresa; Barrela, Ricardo M.; Bergès, Hélène; Marques, Cristina; Loureiro, João; Morais-Cecílio, Leonor; Paiva, Jorge A. P.

    2016-01-01

    The genus Eucalyptus encloses several species with high ecological and economic value, being the subgenus Symphyomyrtus one of the most important. Species such as E. grandis and E. globulus are well characterized at the molecular level but knowledge regarding genome and chromosome organization is very scarce. Here we characterized and compared the karyotypes of three economically important species, E. grandis, E. globulus, and E. calmadulensis, and three with ecological relevance, E. pulverulenta, E. cornuta, and E. occidentalis, through an integrative approach including genome size estimation, fluorochrome banding, rDNA FISH, and BAC landing comprising genes involved in lignin biosynthesis. All karyotypes show a high degree of conservation with pericentromeric 35S and 5S rDNA loci in the first and third pairs, respectively. GC-rich heterochromatin was restricted to the 35S rDNA locus while the AT-rich heterochromatin pattern was species-specific. The slight differences in karyotype formulas and distribution of AT-rich heterochromatin, along with genome sizes estimations, support the idea of Eucalyptus genome evolution by local expansions of heterochromatin clusters. The unusual co-localization of both rDNA with AT-rich heterochromatin was attributed mainly to the presence of silent transposable elements in those loci. The cinnamoyl CoA reductase gene (CCR1) previously assessed to linkage group 10 (LG10) was clearly localized distally at the long arm of chromosome 9 establishing an unexpected correlation between the cytogenetic chromosome 9 and the LG10. Our work is novel and contributes to the understanding of Eucalyptus genome organization which is essential to develop successful advanced breeding strategies for this genus. PMID:27148332

  13. Genes but Not Genomes Reveal Bacterial Domestication of Lactococcus Lactis

    PubMed Central

    Passerini, Delphine; Beltramo, Charlotte; Coddeville, Michele; Quentin, Yves; Ritzenthaler, Paul

    2010-01-01

    Background The population structure and diversity of Lactococcus lactis subsp. lactis, a major industrial bacterium involved in milk fermentation, was determined at both gene and genome level. Seventy-six lactococcal isolates of various origins were studied by different genotyping methods and thirty-six strains displaying unique macrorestriction fingerprints were analyzed by a new multilocus sequence typing (MLST) scheme. This gene-based analysis was compared to genomic characteristics determined by pulsed-field gel electrophoresis (PFGE). Methodology/Principal Findings The MLST analysis revealed that L. lactis subsp. lactis is essentially clonal with infrequent intra- and intergenic recombination; also, despite its taxonomical classification as a subspecies, it displays a genetic diversity as substantial as that within several other bacterial species. Genome-based analysis revealed a genome size variability of 20%, a value typical of bacteria inhabiting different ecological niches, and that suggests a large pan-genome for this subspecies. However, the genomic characteristics (macrorestriction pattern, genome or chromosome size, plasmid content) did not correlate to the MLST-based phylogeny, with strains from the same sequence type (ST) differing by up to 230 kb in genome size. Conclusion/Significance The gene-based phylogeny was not fully consistent with the traditional classification into dairy and non-dairy strains but supported a new classification based on ecological separation between “environmental” strains, the main contributors to the genetic diversity within the subspecies, and “domesticated” strains, subject to recent genetic bottlenecks. Comparison between gene- and genome-based analyses revealed little relationship between core and dispensable genome phylogenies, indicating that clonal diversification and phenotypic variability of the “domesticated” strains essentially arose through substantial genomic flux within the dispensable genome

  14. Hybridization Reveals the Evolving Genomic Architecture of Speciation

    PubMed Central

    Kronforst, Marcus R.; Hansen, Matthew E.B.; Crawford, Nicholas G.; Gallant, Jason R.; Zhang, Wei; Kulathinal, Rob J.; Kapan, Durrell D.; Mullen, Sean P.

    2014-01-01

    SUMMARY The rate at which genomes diverge during speciation is unknown, as are the physical dynamics of the process. Here, we compare full genome sequences of 32 butterflies, representing five species from a hybridizing Heliconius butterfly community, to examine genome-wide patterns of introgression and infer how divergence evolves during the speciation process. Our analyses reveal that initial divergence is restricted to a small fraction of the genome, largely clustered around known wing-patterning genes. Over time, divergence evolves rapidly, due primarily to the origin of new divergent regions. Furthermore, divergent genomic regions display signatures of both selection and adaptive introgression, demonstrating the link between microevolutionary processes acting within species and the origin of species across macroevolutionary timescales. Our results provide a uniquely comprehensive portrait of the evolving species boundary due to the role that hybridization plays in reducing the background accumulation of divergence at neutral sites. PMID:24183670

  15. The genome of Tetranychus urticae reveals herbivorous pest adaptations

    PubMed Central

    Grbić, Miodrag; Van Leeuwen, Thomas; Clark, Richard M.; Rombauts, Stephane; Rouzé, Pierre; Grbić, Vojislava; Osborne, Edward J.; Dermauw, Wannes; Ngoc, Phuong Cao Thi; Ortego, Félix; Hernández-Crespo, Pedro; Diaz, Isabel; Martinez, Manuel; Navajas, Maria; Sucena, Élio; Magalhães, Sara; Nagy, Lisa; Pace, Ryan M.; Djuranović, Sergej; Smagghe, Guy; Iga, Masatoshi; Christiaens, Olivier; Veenstra, Jan A.; Ewer, John; Villalobos, Rodrigo Mancilla; Hutter, Jeffrey L.; Hudson, Stephen D.; Velez, Marisela; Yi, Soojin V.; Zeng, Jia; Pires-daSilva, Andre; Roch, Fernando; Cazaux, Marc; Navarro, Marie; Zhurov, Vladimir; Acevedo, Gustavo; Bjelica, Anica; Fawcett, Jeffrey A.; Bonnet, Eric; Martens, Cindy; Baele, Guy; Wissler, Lothar; Sanchez-Rodriguez, Aminael; Tirry, Luc; Blais, Catherine; Demeestere, Kristof; Henz, Stefan R.; Gregory, T. Ryan; Mathieu, Johannes; Verdon, Lou; Farinelli, Laurent; Schmutz, Jeremy; Lindquist, Erika; Feyereisen, René; Van de Peer, Yves

    2016-01-01

    The spider mite Tetranychus urticae is a cosmopolitan agricultural pest with an extensive host plant range and an extreme record of pesticide resistance. Here we present the completely sequenced and annotated spider mite genome, representing the first complete chelicerate genome. At 90 megabases T. urticae has the smallest sequenced arthropod genome. Compared with other arthropods, the spider mite genome shows unique changes in the hormonal environment and organization of the Hox complex, and also reveals evolutionary innovation of silk production. We find strong signatures of polyphagy and detoxification in gene families associated with feeding on different hosts and in new gene families acquired by lateral gene transfer. Deep transcriptome analysis of mites feeding on different plants shows how this pest responds to a changing host environment. The T. urticae genome thus offers new insights into arthropod evolution and plant–herbivore interactions, and provides unique opportunities for developing novel plant protection strategies. PMID:22113690

  16. Genome Sequencing Reveals a Phage in Helicobacter pylori

    PubMed Central

    Lehours, Philippe; Vale, Filipa F.; Bjursell, Magnus K.; Melefors, Ojar; Advani, Reza; Glavas, Steve; Guegueniat, Julia; Gontier, Etienne; Lacomme, Sabrina; Alves Matos, António; Menard, Armelle; Mégraud, Francis; Engstrand, Lars; Andersson, Anders F.

    2011-01-01

    ABSTRACT Helicobacter pylori chronically infects the gastric mucosa in more than half of the human population; in a subset of this population, its presence is associated with development of severe disease, such as gastric cancer. Genomic analysis of several strains has revealed an extensive H. pylori pan-genome, likely to grow as more genomes are sampled. Here we describe the draft genome sequence (63 contigs; 26× mean coverage) of H. pylori strain B45, isolated from a patient with gastric mucosa-associated lymphoid tissue (MALT) lymphoma. The major finding was a 24.6-kb prophage integrated in the bacterial genome. The prophage shares most of its genes (22/27) with prophage region II of Helicobacter acinonychis strain Sheeba. After UV treatment of liquid cultures, circular DNA carrying the prophage integrase gene could be detected, and intracellular tailed phage-like particles were observed in H. pylori cells by transmission electron microscopy, indicating that phage production can be induced from the prophage. PCR amplification and sequencing of the integrase gene from 341 H. pylori strains from different geographic regions revealed a high prevalence of the prophage (21.4%). Phylogenetic reconstruction showed four distinct clusters in the integrase gene, three of which tended to be specific for geographic regions. Our study implies that phages may play important roles in the ecology and evolution of H. pylori. PMID:22086490

  17. Camelid genomes reveal evolution and adaptation to desert environments.

    PubMed

    Wu, Huiguang; Guang, Xuanmin; Al-Fageeh, Mohamed B; Cao, Junwei; Pan, Shengkai; Zhou, Huanmin; Zhang, Li; Abutarboush, Mohammed H; Xing, Yanping; Xie, Zhiyuan; Alshanqeeti, Ali S; Zhang, Yanru; Yao, Qiulin; Al-Shomrani, Badr M; Zhang, Dong; Li, Jiang; Manee, Manee M; Yang, Zili; Yang, Linfeng; Liu, Yiyi; Zhang, Jilin; Altammami, Musaad A; Wang, Shenyuan; Yu, Lili; Zhang, Wenbin; Liu, Sanyang; Ba, La; Liu, Chunxia; Yang, Xukui; Meng, Fanhua; Wang, Shaowei; Li, Lu; Li, Erli; Li, Xueqiong; Wu, Kaifeng; Zhang, Shu; Wang, Junyi; Yin, Ye; Yang, Huanming; Al-Swailem, Abdulaziz M; Wang, Jun

    2014-01-01

    Bactrian camel (Camelus bactrianus), dromedary (Camelus dromedarius) and alpaca (Vicugna pacos) are economically important livestock. Although the Bactrian camel and dromedary are large, typically arid-desert-adapted mammals, alpacas are adapted to plateaus. Here we present high-quality genome sequences of these three species. Our analysis reveals the demographic history of these species since the Tortonian Stage of the Miocene and uncovers a striking correlation between large fluctuations in population size and geological time boundaries. Comparative genomic analysis reveals complex features related to desert adaptations, including fat and water metabolism, stress responses to heat, aridity, intense ultraviolet radiation and choking dust. Transcriptomic analysis of Bactrian camels further reveals unique osmoregulation, osmoprotection and compensatory mechanisms for water reservation underpinned by high blood glucose levels. We hypothesize that these physiological mechanisms represent kidney evolutionary adaptations to the desert environment. This study advances our understanding of camelid evolution and the adaptation of camels to arid-desert environments. PMID:25333821

  18. DEFINING THE CHEMICAL SPACE OF PUBLIC GENOMIC DATA (S)

    EPA Science Inventory

    The current project aims to chemically index the genomics content of public genomic databases to make these data accessible in relation to other publicly available, chemically-indexed toxicological information. By defining the chemical space of public genomic data, it is possibl...

  19. Population genomic analysis reveals highly conserved mitochondrial genomes in the yeast species Lachancea thermotolerans.

    PubMed

    Freel, Kelle C; Friedrich, Anne; Hou, Jing; Schacherer, Joseph

    2014-10-01

    The increasing availability of mitochondrial (mt) sequence data from various yeasts provides a tool to study genomic evolution within and between different species. While the genomes from a range of lineages are available, there is a lack of information concerning intraspecific mtDNA diversity. Here, we analyzed the mt genomes of 50 strains from Lachancea thermotolerans, a protoploid yeast species that has been isolated from several locations (Europe, Asia, Australia, South Africa, and North / South America) and ecological sources (fruit, tree exudate, plant material, and grape and agave fermentations). Protein-coding genes from the mtDNA were used to construct a phylogeny, which reflected a similar, yet less resolved topology than the phylogenetic tree of 50 nuclear genes. In comparison to its sister species Lachancea kluyveri, L. thermotolerans has a smaller mt genome. This is due to shorter intergenic regions and fewer introns, of which the latter are only found in COX1. We revealed that L. kluyveri and L. thermotolerans share similar levels of intraspecific divergence concerning the nuclear genomes. However, L. thermotolerans has a more highly conserved mt genome with the coding regions characterized by low rates of nonsynonymous substitution. Thus, in the mt genomes of L. thermotolerans, stronger purifying selection and lower mutation rates potentially shape genome diversity in contract to what was found for L. kluyveri, demonstrating that the factors driving mt genome evolution are different even between closely related species. PMID:25212859

  20. Comparative genomics reveals mobile pathogenicity chromosomes in Fusarium

    PubMed Central

    Ma, Li-Jun; van der Does, H. Charlotte; Borkovich, Katherine A.; Coleman, Jeffrey J.; Daboussi, Marie-Josée; Di Pietro, Antonio; Dufresne, Marie; Freitag, Michael; Grabherr, Manfred; Henrissat, Bernard; Houterman, Petra M.; Kang, Seogchan; Shim, Won-Bo; Woloshuk, Charles; Xie, Xiaohui; Xu, Jin-Rong; Antoniw, John; Baker, Scott E.; Bluhm, Burton H.; Breakspear, Andrew; Brown, Daren W.; Butchko, Robert A. E.; Chapman, Sinead; Coulson, Richard; Coutinho, Pedro M.; Danchin, Etienne G. J.; Diener, Andrew; Gale, Liane R.; Gardiner, Donald M.; Goff, Stephen; Hammond-Kosack, Kim E.; Hilburn, Karen; Hua-Van, Aurélie; Jonkers, Wilfried; Kazan, Kemal; Kodira, Chinnappa D.; Koehrsen, Michael; Kumar, Lokesh; Lee, Yong-Hwan; Li, Liande; Manners, John M.; Miranda-Saavedra, Diego; Mukherjee, Mala; Park, Gyungsoon; Park, Jongsun; Park, Sook-Young; Proctor, Robert H.; Regev, Aviv; Ruiz-Roldan, M. Carmen; Sain, Divya; Sakthikumar, Sharadha; Sykes, Sean; Schwartz, David C.; Turgeon, B. Gillian; Wapinski, Ilan; Yoder, Olen; Young, Sarah; Zeng, Qiandong; Zhou, Shiguo; Galagan, James; Cuomo, Christina A.; Kistler, H. Corby; Rep, Martijn

    2011-01-01

    Fusarium species are among the most important phytopathogenic and toxigenic fungi. To understand the molecular underpinnings of pathogenicity in the genus Fusarium, we compared the genomes of three phenotypically diverse species: Fusarium graminearum, Fusarium verticillioides and Fusarium oxysporum f. sp. lycopersici. Our analysis revealed lineage-specific (LS) genomic regions in F. oxysporum that include four entire chromosomes and account for more than one-quarter of the genome. LS regions are rich in transposons and genes with distinct evolutionary profiles but related to pathogenicity, indicative of horizontal acquisition. Experimentally, we demonstrate the transfer of two LS chromosomes between strains of F. oxysporum, converting a non-pathogenic strain into a pathogen. Transfer of LS chromosomes between otherwise genetically isolated strains explains the polyphyletic origin of host specificity and the emergence of new pathogenic lineages in F. oxysporum. These findings put the evolution of fungal pathogenicity into a new perspective. PMID:20237561

  1. Cytoscape: the network visualization tool for GenomeSpace workflows

    PubMed Central

    Demchak, Barry; Hull, Tim; Reich, Michael; Liefeld, Ted; Smoot, Michael; Ideker, Trey; Mesirov, Jill P.

    2014-01-01

    Modern genomic analysis often requires workflows incorporating multiple best-of-breed tools. GenomeSpace is a web-based visual workbench that combines a selection of these tools with mechanisms that create data flows between them. One such tool is Cytoscape 3, a popular application that enables analysis and visualization of graph-oriented genomic networks. As Cytoscape runs on the desktop, and not in a web browser, integrating it into GenomeSpace required special care in creating a seamless user experience and enabling appropriate data flows. In this paper, we present the design and operation of the Cytoscape GenomeSpace app, which accomplishes this integration, thereby providing critical analysis and visualization functionality for GenomeSpace users. It has been downloaded over 850 times since the release of its first version in September, 2013. PMID:25165537

  2. African Relapsing Fever Borreliae Genomospecies Revealed by Comparative Genomics

    PubMed Central

    Elbir, Haitham; Abi-Rached, Laurent; Pontarotti, Pierre; Yoosuf, Niyaz; Drancourt, Michel

    2014-01-01

    Background: Relapsing fever borreliae are vector-borne bacteria responsible for febrile infection in humans in North America, Africa, Asia, and in the Iberian Peninsula in Europe. Relapsing fever borreliae are phylogenetically closely related, yet they differ in pathogenicity and vectors. Their long-term taxonomy, based on geography and vector grouping, needs to be re-apprised in a genomic context. We therefore embarked into genomic analyses of relapsing fever borreliae, focusing on species found in Africa. Results: Genome-wide phylogenetic analyses group Old World Borrelia crocidurae, Borrelia hispanica, B. duttonii, and B. recurrentis in one clade, and New World Borrelia turicatae and Borrelia hermsii in a second clade. Accordingly, average nucleotide identity is 99% among B. duttonii, B. recurrentis, and B. crocidurae and 96% between latter borreliae and B. hispanica while the similarity is 86% between Old World and New World borreliae. Comparative genomics indicates that the Old World relapsing fever B. duttonii, B. recurrentis, B. crocidurae, and B. hispanica have a 2,514-gene pan genome and a 933-gene core genome that includes 788 chromosomal and 145 plasmidic genes. Analyzing the role that natural selection has played in the evolution of Old World borreliae species revealed that 55 loci were under positive diversifying selection, including loci coding for membrane, flagellar, and chemotaxis proteins, three categories associated with adaption to specific niches. Conclusion: Genomic analyses led to a reappraisal of the taxonomy of relapsing fever borreliae in Africa. These analyses suggest that B. crocidurae, B. duttonii, and B. recurrentis are ecotypes of a unique genomospecies, while B. hispanica is a distinct species. PMID:25229054

  3. Distinctive Genome Reduction Rates Revealed by Genomic Analyses of Two Coxiella-Like Endosymbionts in Ticks

    PubMed Central

    Gottlieb, Yuval; Lalzar, Itai; Klasson, Lisa

    2015-01-01

    Genome reduction is a hallmark of symbiotic genomes, and the rate and patterns of gene loss associated with this process have been investigated in several different symbiotic systems. However, in long-term host-associated coevolving symbiont clades, the genome size differences between strains are normally quite small and hence patterns of large-scale genome reduction can only be inferred from distant relatives. Here we present the complete genome of a Coxiella-like symbiont from Rhipicephalus turanicus ticks (CRt), and compare it with other genomes from the genus Coxiella in order to investigate the process of genome reduction in a genus consisting of intracellular host-associated bacteria with variable genome sizes. The 1.7-Mb CRt genome is larger than the genomes of most obligate mutualists but has a very low protein-coding content (48.5%) and an extremely high number of identifiable pseudogenes, indicating that it is currently undergoing genome reduction. Analysis of encoded functions suggests that CRt is an obligate tick mutualist, as indicated by the possible provisioning of the tick with biotin (B7), riboflavin (B2) and other cofactors, and by the loss of most genes involved in host cell interactions, such as secretion systems. Comparative analyses between CRt and the 2.5 times smaller genome of Coxiella from the lone star tick Amblyomma americanum (CLEAA) show that many of the same gene functions are lost and suggest that the large size difference might be due to a higher rate of genome evolution in CLEAA generated by the loss of the mismatch repair genes mutSL. Finally, sequence polymorphisms in the CRt population sampled from field collected ticks reveal up to one distinct strain variant per tick, and analyses of mutational patterns within the population suggest that selection might be acting on synonymous sites. The CRt genome is an extreme example of a symbiont genome caught in the act of genome reduction, and the comparison between CLEAA and CRt

  4. Genome evolution in the eremothecium clade of the Saccharomyces complex revealed by comparative genomics.

    PubMed

    Wendland, Jürgen; Walther, Andrea

    2011-12-01

    We used comparative genomics to elucidate the genome evolution within the pre-whole-genome duplication genus Eremothecium. To this end, we sequenced and assembled the complete genome of Eremothecium cymbalariae, a filamentous ascomycete representing the Eremothecium type strain. Genome annotation indicated 4712 gene models and 143 tRNAs. We compared the E. cymbalariae genome with that of its relative, the riboflavin overproducer Ashbya (Eremothecium) gossypii, and the reconstructed yeast ancestor. Decisive changes in the Eremothecium lineage leading to the evolution of the A. gossypii genome include the reduction from eight to seven chromosomes, the downsizing of the genome by removal of 10% or 900 kb of DNA, mostly in intergenic regions, the loss of a TY3-Gypsy-type transposable element, the re-arrangement of mating-type loci, and a massive increase of its GC content. Key species-specific events are the loss of MNN1-family of mannosyltransferases required to add the terminal fourth and fifth α-1,3-linked mannose residue to O-linked glycans and genes of the Ehrlich pathway in E. cymbalariae and the loss of ZMM-family of meiosis-specific proteins and acquisition of riboflavin overproduction in A. gossypii. This reveals that within the Saccharomyces complex genome, evolution is not only based on genome duplication with subsequent gene deletions and chromosomal rearrangements but also on fungi associated with specific environments (e.g. involving fungal-insect interactions as in Eremothecium), which have encountered challenges that may be reflected both in genome streamlining and their biosynthetic potential. PMID:22384365

  5. Genome Evolution in the Eremothecium Clade of the Saccharomyces Complex Revealed by Comparative Genomics

    PubMed Central

    Wendland, Jürgen; Walther, Andrea

    2011-01-01

    We used comparative genomics to elucidate the genome evolution within the pre–whole-genome duplication genus Eremothecium. To this end, we sequenced and assembled the complete genome of Eremothecium cymbalariae, a filamentous ascomycete representing the Eremothecium type strain. Genome annotation indicated 4712 gene models and 143 tRNAs. We compared the E. cymbalariae genome with that of its relative, the riboflavin overproducer Ashbya (Eremothecium) gossypii, and the reconstructed yeast ancestor. Decisive changes in the Eremothecium lineage leading to the evolution of the A. gossypii genome include the reduction from eight to seven chromosomes, the downsizing of the genome by removal of 10% or 900 kb of DNA, mostly in intergenic regions, the loss of a TY3-Gypsy–type transposable element, the re-arrangement of mating-type loci, and a massive increase of its GC content. Key species-specific events are the loss of MNN1-family of mannosyltransferases required to add the terminal fourth and fifth α-1,3-linked mannose residue to O-linked glycans and genes of the Ehrlich pathway in E. cymbalariae and the loss of ZMM-family of meiosis-specific proteins and acquisition of riboflavin overproduction in A. gossypii. This reveals that within the Saccharomyces complex genome, evolution is not only based on genome duplication with subsequent gene deletions and chromosomal rearrangements but also on fungi associated with specific environments (e.g. involving fungal-insect interactions as in Eremothecium), which have encountered challenges that may be reflected both in genome streamlining and their biosynthetic potential. PMID:22384365

  6. Comparative Whole-Genome Hybridization Reveals Genomic Islands in Brucella Species†

    PubMed Central

    Rajashekara, Gireesh; Glasner, Jeremy D.; Glover, David A.; Splitter, Gary A.

    2004-01-01

    Brucella species are responsible for brucellosis, a worldwide zoonotic disease causing abortion in domestic animals and Malta fever in humans. Based on host preference, the genus is divided into six species. Brucella abortus, B. melitensis, and B. suis are pathogenic to humans, whereas B. ovis and B. neotomae are nonpathogenic to humans and B. canis human infections are rare. Limited genome diversity exists among Brucella species. Comparison of Brucella species whole genomes is, therefore, likely to identify factors responsible for differences in host preference and virulence restriction. To facilitate such studies, we used the complete genome sequence of B. melitensis 16M, the species highly pathogenic to humans, to construct a genomic microarray. Hybridization of labeled genomic DNA from Brucella species to this microarray revealed a total of 217 open reading frames (ORFs) altered in five Brucella species analyzed. These ORFs are often found in clusters (islands) in the 16M genome. Examination of the genomic context of these islands suggests that many are horizontally acquired. Deletions of genetic content identified in Brucella species are conserved in multiple strains of the same species, and genomic islands missing in a given species are often restricted to that particular species. These findings suggest that, whereas the loss or gain of genetic material may be related to the host range and virulence restriction of certain Brucella species for humans, independent mechanisms involving gene inactivation or altered expression of virulence determinants may also contribute to these differences. PMID:15262941

  7. Comparative genomics reveals diversity among xanthomonads infecting tomato and pepper

    PubMed Central

    2011-01-01

    Background Bacterial spot of tomato and pepper is caused by four Xanthomonas species and is a major plant disease in warm humid climates. The four species are distinct from each other based on physiological and molecular characteristics. The genome sequence of strain 85-10, a member of one of the species, Xanthomonas euvesicatoria (Xcv) has been previously reported. To determine the relationship of the four species at the genome level and to investigate the molecular basis of their virulence and differing host ranges, draft genomic sequences of members of the other three species were determined and compared to strain 85-10. Results We sequenced the genomes of X. vesicatoria (Xv) strain 1111 (ATCC 35937), X. perforans (Xp) strain 91-118 and X. gardneri (Xg) strain 101 (ATCC 19865). The genomes were compared with each other and with the previously sequenced Xcv strain 85-10. In addition, the molecular features were predicted that may be required for pathogenicity including the type III secretion apparatus, type III effectors, other secretion systems, quorum sensing systems, adhesins, extracellular polysaccharide, and lipopolysaccharide determinants. Several novel type III effectors from Xg strain 101 and Xv strain 1111 genomes were computationally identified and their translocation was validated using a reporter gene assay. A homolog to Ax21, the elicitor of XA21-mediated resistance in rice, and a functional Ax21 sulfation system were identified in Xcv. Genes encoding proteins with functions mediated by type II and type IV secretion systems have also been compared, including enzymes involved in cell wall deconstruction, as contributors to pathogenicity. Conclusions Comparative genomic analyses revealed considerable diversity among bacterial spot pathogens, providing new insights into differences and similarities that may explain the diverse nature of these strains. Genes specific to pepper pathogens, such as the O-antigen of the lipopolysaccharide cluster, and genes

  8. Genomic affinities revealed by GISH suggests intergenomic restructuring between parental genomes of the paleopolyploid genus Zea.

    PubMed

    González, Graciela Esther; Poggio, Lidia

    2015-10-01

    The present work compares the molecular affinities, revealed by GISH, with the analysis of meiotic pairing in intra- and interspecific hybrids between species of Zea obtained in previous works. The joint analysis of these data provided evidence about the evolutionary relationships among the species from the paleopolyploid genus Zea (maize and teosintes). GISH and meiotic pairing of intraspecific hybrids revealed high genomic affinity between maize (Zea mays subsp. mays) and both Zea mays subsp. parviglumis and Zea mays subsp. mexicana. On the other hand, when Zea mays subsp. huehuetenanguensis DNA was probed on maize chromosomes, a lower affinity was detected, and the pattern of hybridization suggested intergenomical restructuring between the parental genomes of maize. When DNA from Zea luxurians was used as probe, homogeneous hybridization signals were observed through all maize chromosomes. Lower genomic affinity was observed when DNA from Zea diploperennis was probed on maize chromosomes, especially at knob regions. Maize chromosomes hybridized with Zea perennis DNA showed hybridization signals on four chromosome pairs: two chromosome pairs presented hybridization signal in only one chromosomal arm, whereas four chromosome pairs did not show any hybridization. These results are in agreement with previous GISH studies, which have identified the genomic source of the chromosomes involved in the meiotic configurations of Z. perennis × maize hybrids. These findings allow postulating that maize has a parental genome not shared with Z. perennis, and the existence of intergenomic restructuring between the parental genomes of maize. Moreover, the absence of hybridization signals in all maize knobs indicate that these heterochromatic regions were lost during the Z. perennis genome evolution. PMID:26506040

  9. Comparative Genomic Indexing Reveals the Phylogenomics of Escherichia coli Pathogens

    PubMed Central

    Anjum, Muna F.; Lucchini, Sacha; Thompson, Arthur; Hinton, Jay C. D.; Woodward, Martin J.

    2003-01-01

    The Escherichia coli O26 serogroup includes important food-borne pathogens associated with human and animal diarrheal disease. Current typing methods have revealed great genetic heterogeneity within the O26 group; the data are often inconsistent and focus only on verotoxin (VT)-positive O26 isolates. To improve current understanding of diversity within this serogroup, the genomic relatedness of VT-positive and -negative O26 strains was assessed by comparative genomic indexing. Our results clearly demonstrate that irrespective of virulence characteristics and pathotype designation, the O26 strains show greater genomic similarity to each other than to any other strain included in this study. Our data suggest that enteropathogenic and VT-expressing E. coli O26 strains represent the same clonal lineage and that VT-expressing E. coli O26 strains have gained additional virulence characteristics. Using this approach, we established the core genes which are central to the E. coli species and identified regions of variation from the E. coli K-12 chromosomal backbone. PMID:12874348

  10. Mitochondrial Genome Sequences Effectively Reveal the Phylogeny of Hylobates Gibbons

    PubMed Central

    Chan, Yi-Chiao; Roos, Christian; Inoue-Murayama, Miho; Inoue, Eiji; Shih, Chih-Chin; Pei, Kurtis Jai-Chyi; Vigilant, Linda

    2010-01-01

    Background Uniquely among hominoids, gibbons exist as multiple geographically contiguous taxa exhibiting distinctive behavioral, morphological, and karyotypic characteristics. However, our understanding of the evolutionary relationships of the various gibbons, especially among Hylobates species, is still limited because previous studies used limited taxon sampling or short mitochondrial DNA (mtDNA) sequences. Here we use mtDNA genome sequences to reconstruct gibbon phylogenetic relationships and reveal the pattern and timing of divergence events in gibbon evolutionary history. Methodology/Principal Findings We sequenced the mitochondrial genomes of 51 individuals representing 11 species belonging to three genera (Hylobates, Nomascus and Symphalangus) using the high-throughput 454 sequencing system with the parallel tagged sequencing approach. Three phylogenetic analyses (maximum likelihood, Bayesian analysis and neighbor-joining) depicted the gibbon phylogenetic relationships congruently and with strong support values. Most notably, we recover a well-supported phylogeny of the Hylobates gibbons. The estimation of divergence times using Bayesian analysis with relaxed clock model suggests a much more rapid speciation process in Hylobates than in Nomascus. Conclusions/Significance Use of more than 15 kb sequences of the mitochondrial genome provided more informative and robust data than previous studies of short mitochondrial segments (e.g., control region or cytochrome b) as shown by the reliable reconstruction of divergence patterns among Hylobates gibbons. Moreover, molecular dating of the mitogenomic divergence times implied that biogeographic change during the last five million years may be a factor promoting the speciation of Sundaland animals, including Hylobates species. PMID:21203450

  11. Comparative genomics reveals mobile pathogenicity chromosomes in Fusarium

    SciTech Connect

    Ma, Li Jun; van der Does, H. C.; Borkovich, Katherine A.; Coleman, Jeffrey J.; Daboussi, Marie-Jose; Di Pietro, Antonio; Dufresne, Marie; Freitag, Michael; Grabherr, Manfred; Henrissat, Bernard; Houterman, Petra M.; Kang, Seogchan; Shim, Won-Bo; Wolochuk, Charles; Xie, Xiaohui; Xu, Jin Rong; Antoniw, John; Baker, Scott E.; Bluhm, Burton H.; Breakspear, Andrew; Brown, Daren W.; Butchko, Robert A.; Chapman, Sinead; Coulson, Richard; Coutinho, Pedro M.; Danchin, Etienne G.; Diener, Andrew; Gale, Liane R.; Gardiner, Donald; Goff, Steven; Hammond-Kossack, Kim; Hilburn, Karen; Hua-Van, Aurelie; Jonkers, Wilfried; Kazan, Kemal; Kodira, Chinnappa D.; Koehrsen, Michael; Kumar, Lokesh; Lee, Yong Hwan; Li, Liande; Manners, John M.; Miranda-Saavedra, Diego; Mukherjee, Mala; Park, Gyungsoon; Park, Jongsun; Park, Sook Young; Proctor, Robert H.; Regev, Aviv; Ruiz-Roldan, M. C.; Sain, Divya; Sakthikumar, Sharadha; Sykes, Sean; Schwartz, David C.; Turgeon, Barbara G.; Wapinski, Ilan; Yoder, Olen; Young, Sarah; Zeng, Qiandong; Zhou, Shiguo; Galagan, James; Cuomo, Christina A.; Kistler, H. Corby; Rep, Martijn

    2010-03-18

    Fusarium species are among the most important phytopathogenic and toxigenic fungi, having significant impact on crop production and animal health. Distinctively, members of the F. oxysporum species complex exhibit wide host range but discontinuously distributed host specificity, reflecting remarkable genetic adaptability. To understand the molecular underpinnings of diverse phenotypic traits and their evolution in Fusarium, we compared the genomes of three economically important and phylogenetically related, yet phenotypically diverse plant-pathogenic species, F. graminearum, F. verticillioides and F. oxysporum f. sp. lycopersici. Our analysis revealed greatly expanded lineage-specific (LS) genomic regions in F. oxysporum that include four entire chromosomes, accounting for more than one-quarter of the genome. LS regions are rich in transposons and genes with distinct evolutionary profiles but related to pathogenicity. Experimentally, we demonstrate for the first time the transfer of two LS chromosomes between strains of F. oxysporum, resulting in the conversion of a non-pathogenic strain into a pathogen. Transfer of LS chromosomes between otherwise genetically isolated strains explains the polyphyletic origin of host specificity and the emergence of new pathogenic lineages in the F. oxysporum species complex, putting the evolution of fungal pathogenicity into a new perspective.

  12. Microsporidian genome analysis reveals evolutionary strategies for obligate intracellular growth.

    PubMed

    Cuomo, Christina A; Desjardins, Christopher A; Bakowski, Malina A; Goldberg, Jonathan; Ma, Amy T; Becnel, James J; Didier, Elizabeth S; Fan, Lin; Heiman, David I; Levin, Joshua Z; Young, Sarah; Zeng, Qiandong; Troemel, Emily R

    2012-12-01

    Microsporidia comprise a large phylum of obligate intracellular eukaryotes that are fungal-related parasites responsible for widespread disease, and here we address questions about microsporidia biology and evolution. We sequenced three microsporidian genomes from two species, Nematocida parisii and Nematocida sp1, which are natural pathogens of Caenorhabditis nematodes and provide model systems for studying microsporidian pathogenesis. We performed deep sequencing of transcripts from a time course of N. parisii infection. Examination of pathogen gene expression revealed compact transcripts and a dramatic takeover of host cells by Nematocida. We also performed phylogenomic analyses of Nematocida and other microsporidian genomes to refine microsporidian phylogeny and identify evolutionary events of gene loss, acquisition, and modification. In particular, we found that all microsporidia lost the tumor-suppressor gene retinoblastoma, which we speculate could accelerate the parasite cell cycle and increase the mutation rate. We also found that microsporidia acquired transporters that could import nucleosides to fuel rapid growth. In addition, microsporidian hexokinases gained secretion signal sequences, and in a functional assay these were sufficient to export proteins out of the cell; thus hexokinase may be targeted into the host cell to reprogram it toward biosynthesis. Similar molecular changes appear during formation of cancer cells and may be evolutionary strategies adopted independently by microsporidia to proliferate rapidly within host cells. Finally, analysis of genome polymorphisms revealed evidence for a sexual cycle that may provide genetic diversity to alleviate problems caused by clonal growth. Together these events may explain the emergence and success of these diverse intracellular parasites. PMID:22813931

  13. Microsporidian genome analysis reveals evolutionary strategies for obligate intracellular growth

    PubMed Central

    Cuomo, Christina A.; Desjardins, Christopher A.; Bakowski, Malina A.; Goldberg, Jonathan; Ma, Amy T.; Becnel, James J.; Didier, Elizabeth S.; Fan, Lin; Heiman, David I.; Levin, Joshua Z.; Young, Sarah; Zeng, Qiandong; Troemel, Emily R.

    2012-01-01

    Microsporidia comprise a large phylum of obligate intracellular eukaryotes that are fungal-related parasites responsible for widespread disease, and here we address questions about microsporidia biology and evolution. We sequenced three microsporidian genomes from two species, Nematocida parisii and Nematocida sp1, which are natural pathogens of Caenorhabditis nematodes and provide model systems for studying microsporidian pathogenesis. We performed deep sequencing of transcripts from a time course of N. parisii infection. Examination of pathogen gene expression revealed compact transcripts and a dramatic takeover of host cells by Nematocida. We also performed phylogenomic analyses of Nematocida and other microsporidian genomes to refine microsporidian phylogeny and identify evolutionary events of gene loss, acquisition, and modification. In particular, we found that all microsporidia lost the tumor-suppressor gene retinoblastoma, which we speculate could accelerate the parasite cell cycle and increase the mutation rate. We also found that microsporidia acquired transporters that could import nucleosides to fuel rapid growth. In addition, microsporidian hexokinases gained secretion signal sequences, and in a functional assay these were sufficient to export proteins out of the cell; thus hexokinase may be targeted into the host cell to reprogram it toward biosynthesis. Similar molecular changes appear during formation of cancer cells and may be evolutionary strategies adopted independently by microsporidia to proliferate rapidly within host cells. Finally, analysis of genome polymorphisms revealed evidence for a sexual cycle that may provide genetic diversity to alleviate problems caused by clonal growth. Together these events may explain the emergence and success of these diverse intracellular parasites. PMID:22813931

  14. Population-based 3D genome structure analysis reveals driving forces in spatial genome organization

    PubMed Central

    Li, Wenyuan; Kalhor, Reza; Dai, Chao; Hao, Shengli; Gong, Ke; Zhou, Yonggang; Li, Haochen; Zhou, Xianghong Jasmine; Le Gros, Mark A.; Larabell, Carolyn A.; Chen, Lin; Alber, Frank

    2016-01-01

    Conformation capture technologies (e.g., Hi-C) chart physical interactions between chromatin regions on a genome-wide scale. However, the structural variability of the genome between cells poses a great challenge to interpreting ensemble-averaged Hi-C data, particularly for long-range and interchromosomal interactions. Here, we present a probabilistic approach for deconvoluting Hi-C data into a model population of distinct diploid 3D genome structures, which facilitates the detection of chromatin interactions likely to co-occur in individual cells. Our approach incorporates the stochastic nature of chromosome conformations and allows a detailed analysis of alternative chromatin structure states. For example, we predict and experimentally confirm the presence of large centromere clusters with distinct chromosome compositions varying between individual cells. The stability of these clusters varies greatly with their chromosome identities. We show that these chromosome-specific clusters can play a key role in the overall chromosome positioning in the nucleus and stabilizing specific chromatin interactions. By explicitly considering genome structural variability, our population-based method provides an important tool for revealing novel insights into the key factors shaping the spatial genome organization. PMID:26951677

  15. New study reveals relatively few mutations in AML genomes - TCGA

    Cancer.gov

    Investigators for The Cancer Genome Atlas (TCGA) Research Network have detailed and broadly classified the genomic alterations that frequently underlie the development of acute myeloid leukemia (AML).

  16. Plastid genome sequences of Gymnochlora stellata, Lotharella vacuolata, and Partenskyella glossopodia reveal remarkable structural conservation among chlorarachniophyte species.

    PubMed

    Suzuki, Shigekatsu; Hirakawa, Yoshihisa; Kofuji, Rumiko; Sugita, Mamoru; Ishida, Ken-Ichiro

    2016-07-01

    Chlorarachniophyte algae have complex plastids acquired by the uptake of a green algal endosymbiont, and this event is called secondary endosymbiosis. Interestingly, the plastids possess a relict endosymbiont nucleus, referred to as the nucleomorph, in the intermembrane space, and the nucleomorphs contain an extremely reduced and compacted genome in comparison with green algal nuclear genomes. Therefore, chlorarachniophyte plastids consist of two endosymbiotically derived genomes, i.e., the plastid and nucleomorph genomes. To date, complete nucleomorph genomes have been sequenced in four different species, whereas plastid genomes have been reported in only two species in chlorarachniophytes. To gain further insight into the evolution of endosymbiotic genomes in chlorarachniophytes, we newly sequenced the plastid genomes of three species, Gymnochlora stellata, Lotharella vacuolata, and Partenskyella glossopodia. Our findings reveal that chlorarachniophyte plastid genomes are highly conserved in size, gene content, and gene order among species, but their nucleomorph genomes are divergent in such features. Accordingly, the current architecture of the plastid genomes of chlorarachniophytes evolved in a common ancestor, and changed very little during their subsequent diversification. Furthermore, our phylogenetic analyses using multiple plastid genes suggest that chlorarachniophyte plastids are derived from a green algal lineage that is closely related to Bryopsidales in the Ulvophyceae group. PMID:26920842

  17. Single-Cell (Meta-)Genomics of a Dimorphic Candidatus Thiomargarita nelsonii Reveals Genomic Plasticity

    PubMed Central

    Flood, Beverly E.; Fliss, Palmer; Jones, Daniel S.; Dick, Gregory J.; Jain, Sunit; Kaster, Anne-Kristin; Winkel, Matthias; Mußmann, Marc; Bailey, Jake

    2016-01-01

    The genus Thiomargarita includes the world's largest bacteria. But as uncultured organisms, their physiology, metabolism, and basis for their gigantism are not well understood. Thus, a genomics approach, applied to a single Candidatus Thiomargarita nelsonii cell was employed to explore the genetic potential of one of these enigmatic giant bacteria. The Thiomargarita cell was obtained from an assemblage of budding Ca. T. nelsonii attached to a provannid gastropod shell from Hydrate Ridge, a methane seep offshore of Oregon, USA. Here we present a manually curated genome of Bud S10 resulting from a hybrid assembly of long Pacific Biosciences and short Illumina sequencing reads. With respect to inorganic carbon fixation and sulfur oxidation pathways, the Ca. T. nelsonii Hydrate Ridge Bud S10 genome was similar to marine sister taxa within the family Beggiatoaceae. However, the Bud S10 genome contains genes suggestive of the genetic potential for lithotrophic growth on arsenite and perhaps hydrogen. The genome also revealed that Bud S10 likely respires nitrate via two pathways: a complete denitrification pathway and a dissimilatory nitrate reduction to ammonia pathway. Both pathways have been predicted, but not previously fully elucidated, in the genomes of other large, vacuolated, sulfur-oxidizing bacteria. Surprisingly, the genome also had a high number of unusual features for a bacterium to include the largest number of metacaspases and introns ever reported in a bacterium. Also present, are a large number of other mobile genetic elements, such as insertion sequence (IS) transposable elements and miniature inverted-repeat transposable elements (MITEs). In some cases, mobile genetic elements disrupted key genes in metabolic pathways. For example, a MITE interrupts hupL, which encodes the large subunit of the hydrogenase in hydrogen oxidation. Moreover, we detected a group I intron in one of the most critical genes in the sulfur oxidation pathway, dsrA. The dsrA group

  18. Algal genomes reveal evolutionary mosaicism and the fate of nucleomorphs

    SciTech Connect

    Curtis, Bruce A.; Tanifuji, Goro; Burki, Fabien; Gruber, Ansgar; Irimia, Manuuel; Maruyama, Shinichiro; Arias, Maria C.; Ball, Steven G.; Gile, Gillian H.; Hirakawa, Yoshihisa; Hopkins, Julia F.; Kuo, Alan; Rensing, Stefan A.; Schmutz, Jeremy; Symeonidi, Aikaterini; Elias, Marek; Eveleigh, Robert J. M.; Herman, Emily K.; Klute, Mary J.; Nakayama, Takuro; Obornik, Miroslav; Reyes-Prieto, Adrian; Armbrust, E. Virginia; Aves, Stephen J.; Beiko, Robert G.; Coutinho, Pedro; Dacks, Joel B.; Durnford, Dion G.; Fast, Naomi M.; Green, Beverley R.; Grisdale, Cameron J.; Hempel, Franziska; Henrissat, Bernard; Hoppner, Marc P.; Ishida, Ken-Ichiro; Kim, Eunsoo; Koreny, Ludek; Kroth, Peter G.; Liu, Yuan; Malik, Shehre-Banoo; Maier, Uwe G.; McRose, Darcy; Mock, Thomas; Neilson, Jonathan A. D.; Onodera, Naoko T.; Poole, Anthony M.; Pritham, Ellen J.; Richards, Thomas A.; Rocap, Gabrielle; Roy, Scott W.; Sarai, Chihiro; Schaack, Sarah; Shirato, Shu; Slamovits, Claudio H.; Spencer, Davie F.; Suzuki, Shigekatsu; Worden, Alexandra Z.; Zauner, Stefan; Barry, Kerrie; Bell, Callum; Bharti, Arvind K.; Crow, John A.; Grimwood, Jane; Kramer, Robin; Lindquist, Erika; Lucas, Susan; Salamov, Asaf; McFadden, Geoffrey I.; Lane, Christopher E.; Keeling, Patrick J.; Gray, Michael W.; Grigoriev, Igor V.; Archibald, John M.

    2012-08-10

    Cryptophyte and chlorarachniophyte algae are transitional forms in the widespread secondary endosymbiotic acquisition of photosynthesis by engulfment of eukaryotic algae. Unlike most secondary plastid-bearing algae, miniaturized versions of the endosymbiont nuclei (nucleomorphs) persist in cryptophytes and chlorarachniophytes. To determine why, and to address other fundamental questions about eukaryote eukaryote endosymbiosis, we sequenced the nuclear genomes of the cryptophyte Guillardia theta and the chlorarachniophyte Bigelowiella natans. Both genomes have 21,000 protein genes and are intron rich, and B. natans exhibits unprecedented alternative splicing for a single-celled organism. Phylogenomic analyses and subcellular targeting predictions reveal extensive genetic and biochemical mosaicism, with both host- and endosymbiont-derived genes servicing the mitochondrion, the host cell cytosol, the plastid and the remnant endosymbiont cytosol of both algae. Mitochondrion-to-nucleus gene transfer still occurs in both organisms but plastid-to-nucleus and nucleomorph-to-nucleus transfers do not, which explains why a small residue of essential genes remains locked in each nucleomorph.

  19. Transcriptome profiling reveals mosaic genomic origins of modern cultivated barley

    PubMed Central

    Dai, Fei; Chen, Zhong-Hua; Wang, Xiaolei; Li, Zefeng; Jin, Gulei; Wu, Dezhi; Cai, Shengguan; Wang, Ning; Wu, Feibo; Nevo, Eviatar; Zhang, Guoping

    2014-01-01

    The domestication of cultivated barley has been used as a model system for studying the origins and early spread of agrarian culture. Our previous results indicated that the Tibetan Plateau and its vicinity is one of the centers of domestication of cultivated barley. Here we reveal multiple origins of domesticated barley using transcriptome profiling of cultivated and wild-barley genotypes. Approximately 48-Gb of clean transcript sequences in 12 Hordeum spontaneum and 9 Hordeum vulgare accessions were generated. We reported 12,530 de novo assembled transcripts in all of the 21 samples. Population structure analysis showed that Tibetan hulless barley (qingke) might have existed in the early stage of domestication. Based on the large number of unique genomic regions showing the similarity between cultivated and wild-barley groups, we propose that the genomic origin of modern cultivated barley is derived from wild-barley genotypes in the Fertile Crescent (mainly in chromosomes 1H, 2H, and 3H) and Tibet (mainly in chromosomes 4H, 5H, 6H, and 7H). This study indicates that the domestication of barley may have occurred over time in geographically distinct regions. PMID:25197090

  20. Genetic investigation within Lactococcus garvieae revealed two genomic lineages.

    PubMed

    Ferrario, Chiara; Ricci, Giovanni; Borgo, Francesca; Rollando, Alessandro; Fortina, Maria Grazia

    2012-07-01

    The diversity of a collection of 49 Lactococcus garvieae strains, including isolates of dairy, fish, meat, vegetable and cereal origin, was explored using a molecular polyphasic approach comprising PCR-ribotyping, REP and RAPD-PCR analyses and a multilocus restriction typing (MLRT) carried out on six partial genes (atpA, tuf, dltA, als, gapC, and galP). This approach allowed high-resolution cluster analysis in which two major groups were distinguishable: one group included dairy isolates, the other group meat isolates. Unexpectedly, of the 12 strains coming from fish, four grouped with dairy isolates, whereas the others with meat isolates. Likewise, strains isolated from vegetables allocated between the two main groups. These findings revealed high variability within the species at both gene and genome levels. The observed genetic heterogeneity among L. garvieae strains was not entirely coherent with the ecological niche of origin of the strains, but rather supports the idea of an early separation of L. garvieae population into two independent genomic lineages. PMID:22568590

  1. Genomic analysis of primordial dwarfism reveals novel disease genes.

    PubMed

    Shaheen, Ranad; Faqeih, Eissa; Ansari, Shinu; Abdel-Salam, Ghada; Al-Hassnan, Zuhair N; Al-Shidi, Tarfa; Alomar, Rana; Sogaty, Sameera; Alkuraya, Fowzan S

    2014-02-01

    Primordial dwarfism (PD) is a disease in which severely impaired fetal growth persists throughout postnatal development and results in stunted adult size. The condition is highly heterogeneous clinically, but the use of certain phenotypic aspects such as head circumference and facial appearance has proven helpful in defining clinical subgroups. In this study, we present the results of clinical and genomic characterization of 16 new patients in whom a broad definition of PD was used (e.g., 3M syndrome was included). We report a novel PD syndrome with distinct facies in two unrelated patients, each with a different homozygous truncating mutation in CRIPT. Our analysis also reveals, in addition to mutations in known PD disease genes, the first instance of biallelic truncating BRCA2 mutation causing PD with normal bone marrow analysis. In addition, we have identified a novel locus for Seckel syndrome based on a consanguineous multiplex family and identified a homozygous truncating mutation in DNA2 as the likely cause. An additional novel PD disease candidate gene XRCC4 was identified by autozygome/exome analysis, and the knockout mouse phenotype is highly compatible with PD. Thus, we add a number of novel genes to the growing list of PD-linked genes, including one which we show to be linked to a novel PD syndrome with a distinct facial appearance. PD is extremely heterogeneous genetically and clinically, and genomic tools are often required to reach a molecular diagnosis. PMID:24389050

  2. Transcriptome profiling reveals mosaic genomic origins of modern cultivated barley.

    PubMed

    Dai, Fei; Chen, Zhong-Hua; Wang, Xiaolei; Li, Zefeng; Jin, Gulei; Wu, Dezhi; Cai, Shengguan; Wang, Ning; Wu, Feibo; Nevo, Eviatar; Zhang, Guoping

    2014-09-16

    The domestication of cultivated barley has been used as a model system for studying the origins and early spread of agrarian culture. Our previous results indicated that the Tibetan Plateau and its vicinity is one of the centers of domestication of cultivated barley. Here we reveal multiple origins of domesticated barley using transcriptome profiling of cultivated and wild-barley genotypes. Approximately 48-Gb of clean transcript sequences in 12 Hordeum spontaneum and 9 Hordeum vulgare accessions were generated. We reported 12,530 de novo assembled transcripts in all of the 21 samples. Population structure analysis showed that Tibetan hulless barley (qingke) might have existed in the early stage of domestication. Based on the large number of unique genomic regions showing the similarity between cultivated and wild-barley groups, we propose that the genomic origin of modern cultivated barley is derived from wild-barley genotypes in the Fertile Crescent (mainly in chromosomes 1H, 2H, and 3H) and Tibet (mainly in chromosomes 4H, 5H, 6H, and 7H). This study indicates that the domestication of barley may have occurred over time in geographically distinct regions. PMID:25197090

  3. Mapping the Space of Genomic Signatures

    PubMed Central

    Kari, Lila; Hill, Kathleen A.; Sayem, Abu S.; Karamichalis, Rallis; Bryans, Nathaniel; Davis, Katelyn; Dattani, Nikesh S.

    2015-01-01

    We propose a computational method to measure and visualize interrelationships among any number of DNA sequences allowing, for example, the examination of hundreds or thousands of complete mitochondrial genomes. An "image distance" is computed for each pair of graphical representations of DNA sequences, and the distances are visualized as a Molecular Distance Map: Each point on the map represents a DNA sequence, and the spatial proximity between any two points reflects the degree of structural similarity between the corresponding sequences. The graphical representation of DNA sequences utilized, Chaos Game Representation (CGR), is genome- and species-specific and can thus act as a genomic signature. Consequently, Molecular Distance Maps could inform species identification, taxonomic classifications and, to a certain extent, evolutionary history. The image distance employed, Structural Dissimilarity Index (DSSIM), implicitly compares the occurrences of oligomers of length up to k (herein k = 9) in DNA sequences. We computed DSSIM distances for more than 5 million pairs of complete mitochondrial genomes, and used Multi-Dimensional Scaling (MDS) to obtain Molecular Distance Maps that visually display the sequence relatedness in various subsets, at different taxonomic levels. This general-purpose method does not require DNA sequence alignment and can thus be used to compare similar or vastly different DNA sequences, genomic or computer-generated, of the same or different lengths. We illustrate potential uses of this approach by applying it to several taxonomic subsets: phylum Vertebrata, (super)kingdom Protista, classes Amphibia-Insecta-Mammalia, class Amphibia, and order Primates. This analysis of an extensive dataset confirms that the oligomer composition of full mtDNA sequences can be a source of taxonomic information. This method also correctly finds the mtDNA sequences most closely related to that of the anatomically modern human (the Neanderthal, the Denisovan

  4. Genome sequence of Thermofilum pendens reveals an exceptional loss of biosynthetic pathways without genome reduction

    SciTech Connect

    Kyrpides, Nikos; Anderson, Iain; Rodriguez, Jason; Susanti, Dwi; Porat, Iris; Reich, Claudia; Ulrich, Luke E.; Elkins, James G.; Mavromatis, Kostas; Lykidis, Athanasios; Kim, Edwin; Thompson, Linda S.; Nolan, Matt; Land, Miriam; Copeland, Alex; Lapidus, Alla; Lucas, Susan; Detter, Chris; Zhulin, Igor B.; Olsen, Gary J.; Whitman, William; Mukhopadhyay, Biswarup; Bristow, James; Kyrpides, Nikos

    2008-01-01

    We report the complete genome of Thermofilum pendens, a deep-branching, hyperthermophilic member of the order Thermoproteales within the archaeal kingdom Crenarchaeota. T. pendens is a sulfur-dependent, anaerobic heterotroph isolated from a solfatara in Iceland. It is an extracellular commensal, requiring an extract of Thermoproteus tenax for growth, and the genome sequence reveals that biosynthetic pathways for purines, most amino acids, and most cofactors are absent. In fact T. pendens has fewer biosynthetic enzymes than obligate intracellular parasites, although it does not display other features common among obligate parasites and thus does not appear to be in the process of becoming a parasite. It appears that T. pendens has adapted to life in an environment rich in nutrients. T. pendens was known to utilize peptides as an energy source, but the genome reveals substantial ability to grow on carbohydrates. T. pendens is the first crenarchaeote and only the second archaeon found to have a transporter of the phosphotransferase system. In addition to fermentation, T. pendens may gain energy from sulfur reduction with hydrogen and formate as electron donors. It may also be capable of sulfur-independent growth on formate with formate hydrogenlyase. Additional novel features are the presence of a monomethylamine:corrinoid methyltransferase, the first time this enzyme has been found outside of Methanosarcinales, and a presenilin-related protein. Predicted highly expressed proteins do not include housekeeping genes, and instead include ABC transporters for carbohydrates and peptides, and CRISPR-associated proteins.

  5. Genome Sequence of Thermofilum pendens Reveals an Exceptional Loss of Biosynthetic Pathways without Genome Reduction

    SciTech Connect

    Anderson, Iain; Rodriquez, Jason; Susanti, Dwi; Porat, I.; Reich, Claudia; Ulrich, Luke; Elkins, James G; Mavromatis, K; Lykidis, A; Kim, Edwin; Thompson, Linda S; Nolan, Matt; Land, Miriam L; Copeland, A; Lapidus, Alla L.; Lucas, Susan; Detter, J C; Zhulin, Igor B; Olsen, Gary; Whitman, W. B.; Mukhopadhyay, Biswarup; Bristow, James; Kyrpides, Nikos C

    2008-01-01

    We report the complete genome of Thermofilum pendens, a deep-branching member of class Thermoproteales of Crenarchaeota. T. pendens is a sulfur-dependent, anaerobic heterotroph isolated from a solfatara in Iceland. It was known to utilize peptides as an energy source, but the genome reveals substantial ability to grow on carbohydrates. T. pendens is the first Crenarchaeote and only the second archaeon found to have transporters of the phosphotransferase system. T. pendens is known to require an extract of Thermoproteus tenax for growth, and the genome sequence reveals that biosynthetic pathways for purines, most amino acids, and most cofactors are absent. T. pendens has fewer biosynthetic enzymes than any other free-living organism. In addition to heterotrophy, T. pendens may gain energy from sulfur reduction with hydrogen and formate as electron donors. It may also be capable of sulfur-independent growth on formate with formate hydrogenlyase. Additional novel features are the presence of a monomethylamine:corrinoid methyltransferase, the first time this enzyme has been found outside of Methanosarcinales, and a presenilin-related protein from a new subfamily. Predicted highly expressed proteins include ABC transporters for carbohydrates and peptides, and CRISPR-associated proteins, suggesting that defense against viruses is a high priority.

  6. Integrative genomic analysis by interoperation of bioinformatics tools in GenomeSpace

    PubMed Central

    Thorvaldsdottir, Helga; Liefeld, Ted; Ocana, Marco; Borges-Rivera, Diego; Pochet, Nathalie; Robinson, James T.; Demchak, Barry; Hull, Tim; Ben-Artzi, Gil; Blankenberg, Daniel; Barber, Galt P.; Lee, Brian T.; Kuhn, Robert M.; Nekrutenko, Anton; Segal, Eran; Ideker, Trey; Reich, Michael; Regev, Aviv; Chang, Howard Y.; Mesirov, Jill P.

    2015-01-01

    Integrative analysis of multiple data types to address complex biomedical questions requires the use of multiple software tools in concert and remains an enormous challenge for most of the biomedical research community. Here we introduce GenomeSpace (http://www.genomespace.org), a cloud-based, cooperative community resource. Seeded as a collaboration of six of the most popular genomics analysis tools, GenomeSpace now supports the streamlined interaction of 20 bioinformatics tools and data resources. To facilitate the ability of non-programming users’ to leverage GenomeSpace in integrative analysis, it offers a growing set of ‘recipes’, short workflows involving a few tools and steps to guide investigators through high utility analysis tasks. PMID:26780094

  7. Genomic View of Bipolar Disorder Revealed by Whole Genome Sequencing in a Genetic Isolate

    PubMed Central

    Georgi, Benjamin; Craig, David; Kember, Rachel L.; Liu, Wencheng; Lindquist, Ingrid; Nasser, Sara; Brown, Christopher; Egeland, Janice A.; Paul, Steven M.; Bućan, Maja

    2014-01-01

    Bipolar disorder is a common, heritable mental illness characterized by recurrent episodes of mania and depression. Despite considerable effort to elucidate the genetic underpinnings of bipolar disorder, causative genetic risk factors remain elusive. We conducted a comprehensive genomic analysis of bipolar disorder in a large Old Order Amish pedigree. Microsatellite genotypes and high-density SNP-array genotypes of 388 family members were combined with whole genome sequence data for 50 of these subjects, comprising 18 parent-child trios. This study design permitted evaluation of candidate variants within the context of haplotype structure by resolving the phase in sequenced parent-child trios and by imputation of variants into multiple unsequenced siblings. Non-parametric and parametric linkage analysis of the entire pedigree as well as on smaller clusters of families identified several nominally significant linkage peaks, each of which included dozens of predicted deleterious variants. Close inspection of exonic and regulatory variants in genes under the linkage peaks using family-based association tests revealed additional credible candidate genes for functional studies and further replication in population-based cohorts. However, despite the in-depth genomic characterization of this unique, large and multigenerational pedigree from a genetic isolate, there was no convergence of evidence implicating a particular set of risk loci or common pathways. The striking haplotype and locus heterogeneity we observed has profound implications for the design of studies of bipolar and other related disorders. PMID:24625924

  8. Genome sequence surveys of Brachiola algerae and Edhazardia aedis reveal microsporidia with low gene densities

    PubMed Central

    Williams, Bryony AP; Lee, Renny CH; Becnel, James J; Weiss, Louis M; Fast, Naomi M; Keeling, Patrick J

    2008-01-01

    Background Microsporidia are well known models of extreme nuclear genome reduction and compaction. The smallest microsporidian genomes have received the most attention, but genomes of different species range in size from 2.3 Mb to 19.5 Mb and the nature of the larger genomes remains unknown. Results Here we have undertaken genome sequence surveys of two diverse microsporidia, Brachiola algerae and Edhazardia aedis. In both species we find very large intergenic regions, many transposable elements, and a low gene-density, all in contrast to the small, model microsporidian genomes. We also find no recognizable genes that are not also found in other surveyed or sequenced microsporidian genomes. Conclusion Our results demonstrate that microsporidian genome architecture varies greatly between microsporidia. Much of the genome size difference could be accounted for by non-coding material, such as intergenic spaces and retrotransposons, and this suggests that the forces dictating genome size may vary across the phylum. PMID:18445287

  9. Gorilla genome structural variation reveals evolutionary parallelisms with chimpanzee.

    PubMed

    Ventura, Mario; Catacchio, Claudia R; Alkan, Can; Marques-Bonet, Tomas; Sajjadian, Saba; Graves, Tina A; Hormozdiari, Fereydoun; Navarro, Arcadi; Malig, Maika; Baker, Carl; Lee, Choli; Turner, Emily H; Chen, Lin; Kidd, Jeffrey M; Archidiacono, Nicoletta; Shendure, Jay; Wilson, Richard K; Eichler, Evan E

    2011-10-01

    Structural variation has played an important role in the evolutionary restructuring of human and great ape genomes. Recent analyses have suggested that the genomes of chimpanzee and human have been particularly enriched for this form of genetic variation. Here, we set out to assess the extent of structural variation in the gorilla lineage by generating 10-fold genomic sequence coverage from a western lowland gorilla and integrating these data into a physical and cytogenetic framework of structural variation. We discovered and validated over 7665 structural changes within the gorilla lineage, including sequence resolution of inversions, deletions, duplications, and mobile element insertions. A comparison with human and other ape genomes shows that the gorilla genome has been subjected to the highest rate of segmental duplication. We show that both the gorilla and chimpanzee genomes have experienced independent yet convergent patterns of structural mutation that have not occurred in humans, including the formation of subtelomeric heterochromatic caps, the hyperexpansion of segmental duplications, and bursts of retroviral integrations. Our analysis suggests that the chimpanzee and gorilla genomes are structurally more derived than either orangutan or human genomes. PMID:21685127

  10. Genome Comparisons Reveal a Dominant Mechanism of Chromosome Number Reduction in Grasses and Accelerated Genome Evolution in Triticeae

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Single nucleotide polymorphism was employed in the construction of a high-resolution, expressed sequence tag (EST) map of Aegilops tauschii, the diploid source of the wheat D genome. Comparison of the map with the rice and sorghum genome sequences revealed 50 inversions and translocations; 2, 8, and...

  11. The Capsaspora genome reveals a complex unicellular prehistory of animals

    PubMed Central

    Suga, Hiroshi; Chen, Zehua; de Mendoza, Alex; Sebé-Pedrós, Arnau; Brown, Matthew W.; Kramer, Eric; Carr, Martin; Kerner, Pierre; Vervoort, Michel; Sánchez-Pons, Núria; Torruella, Guifré; Derelle, Romain; Manning, Gerard; Lang, B. Franz; Russ, Carsten; Haas, Brian J.; Roger, Andrew J.; Nusbaum, Chad; Ruiz-Trillo, Iñaki

    2013-01-01

    To reconstruct the evolutionary origin of multicellular animals from their unicellular ancestors, the genome sequences of diverse unicellular relatives are essential. However, only the genome of the choanoflagellate Monosiga brevicollis has been reported to date. Here we completely sequence the genome of the filasterean Capsaspora owczarzaki, the closest known unicellular relative of metazoans besides choanoflagellates. Analyses of this genome alter our understanding of the molecular complexity of metazoans’ unicellular ancestors showing that they had a richer repertoire of proteins involved in cell adhesion and transcriptional regulation than previously inferred only with the choanoflagellate genome. Some of these proteins were secondarily lost in choanoflagellates. In contrast, most intercellular signalling systems controlling development evolved later concomitant with the emergence of the first metazoans. We propose that the acquisition of these metazoan-specific developmental systems and the co-option of pre-existing genes drove the evolutionary transition from unicellular protists to metazoans. PMID:23942320

  12. Functional Genomics Reveals Linkers Critical for Influenza Virus Polymerase

    PubMed Central

    Wang, Lulan; Wu, Aiping; Wang, Yao E.; Quanquin, Natalie; Li, Chunfeng; Wang, Jingfeng; Chen, Hsiang-Wen; Liu, Suyang; Liu, Ping; Zhang, Hong; Qin, F. Xiao-Feng

    2015-01-01

    ABSTRACT Influenza virus mRNA synthesis by the RNA-dependent RNA polymerase involves binding and cleavage of capped cellular mRNA by the PB2 and PA subunits, respectively, and extension of viral mRNA by PB1. However, the mechanism for such a dynamic process is unclear. Using high-throughput mutagenesis and sequencing analysis, we have not only generated a comprehensive functional map for the microdomains of individual subunits but also have revealed the PA linker to be critical for polymerase activity. This PA linker binds to PB1 and also forms ionic interactions with the PA C-terminal channel. Nearly all mutants with five-amino-acid insertions in the linker were nonviable. Our model further suggests that the PA linker plays an important role in the conformational changes that occur between stages that favor capped mRNA binding and cleavage and those associated with viral mRNA synthesis. IMPORTANCE The RNA-dependent RNA polymerase of influenza virus consists of the PB1, PB2, and PA subunits. By combining genome-wide mutagenesis analysis with the recently discovered crystal structure of the influenza polymerase heterotrimer, we generated a comprehensive functional map of the entire influenza polymerase complex. We identified the microdomains of individual subunits, including the catalytic domains, the interaction interfaces between subunits, and nine linkers interconnecting different domains. Interestingly, we found that mutants with five-amino-acid insertions in individual linkers were nonviable, suggesting the critical roles these linkers play in coordinating spatial relationships between the subunits. We further identified an extended PA linker that binds to PB1 and also forms ionic interactions with the PA C-terminal channel. PMID:26719244

  13. Diverse circovirus-like genome architectures revealed by environmental metagenomics.

    PubMed

    Rosario, Karyna; Duffy, Siobain; Breitbart, Mya

    2009-10-01

    Single-stranded DNA (ssDNA) viruses with circular genomes are the smallest viruses known to infect eukaryotes. The present study identified 10 novel genomes similar to ssDNA circoviruses through data-mining of public viral metagenomes. The metagenomic libraries included samples from reclaimed water and three different marine environments (Chesapeake Bay, British Columbia coastal waters and Sargasso Sea). All the genomes have similarities to the replication (Rep) protein of circoviruses; however, only half have genomic features consistent with known circoviruses. Some of the genomes exhibit a mixture of genomic features associated with different families of ssDNA viruses (i.e. circoviruses, geminiviruses and parvoviruses). Unique genome architectures and phylogenetic analysis of the Rep protein suggest that these viruses belong to novel genera and/or families. Investigating the complex community of ssDNA viruses in the environment can lead to the discovery of divergent species and help elucidate evolutionary links between ssDNA viruses. PMID:19570956

  14. Nannochloropsis Genomes Reveal Evolution of Microalgal Oleaginous Traits

    PubMed Central

    Hu, Jianqiang; Han, Danxiang; Wang, Hui; Zeng, Xiaowei; Jing, Xiaoyan; Zhou, Qian; Su, Xiaoquan; Chang, Xingzhi; Wang, Anhui; Wang, Wei; Jia, Jing; Wei, Li; Xin, Yi; Qiao, Yinghe; Huang, Ranran; Chen, Jie; Han, Bo; Yoon, Kangsup; Hill, Russell T.; Zohar, Yonathan; Chen, Feng; Hu, Qiang; Xu, Jian

    2014-01-01

    Oleaginous microalgae are promising feedstock for biofuels, yet the genetic diversity, origin and evolution of oleaginous traits remain largely unknown. Here we present a detailed phylogenomic analysis of five oleaginous Nannochloropsis species (a total of six strains) and one time-series transcriptome dataset for triacylglycerol (TAG) synthesis on one representative strain. Despite small genome sizes, high coding potential and relative paucity of mobile elements, the genomes feature small cores of ca. 2,700 protein-coding genes and a large pan-genome of >38,000 genes. The six genomes share key oleaginous traits, such as the enrichment of selected lipid biosynthesis genes and certain glycoside hydrolase genes that potentially shift carbon flux from chrysolaminaran to TAG synthesis. The eleven type II diacylglycerol acyltransferase genes (DGAT-2) in every strain, each expressed during TAG synthesis, likely originated from three ancient genomes, including the secondary endosymbiosis host and the engulfed green and red algae. Horizontal gene transfers were inferred in most lipid synthesis nodes with expanded gene doses and many glycoside hydrolase genes. Thus multiple genome pooling and horizontal genetic exchange, together with selective inheritance of lipid synthesis genes and species-specific gene loss, have led to the enormous genetic apparatus for oleaginousness and the wide genomic divergence among present-day Nannochloropsis. These findings have important implications in the screening and genetic engineering of microalgae for biofuels. PMID:24415958

  15. Comparative Genomic Analyses of the Human NPHP1 Locus Reveal Complex Genomic Architecture and Its Regional Evolution in Primates

    PubMed Central

    Yuan, Bo; Liu, Pengfei; Gupta, Aditya; Beck, Christine R.; Tejomurtula, Anusha; Campbell, Ian M.; Gambin, Tomasz; Simmons, Alexandra D.; Withers, Marjorie A.; Harris, R. Alan; Rogers, Jeffrey; Schwartz, David C.; Lupski, James R.

    2015-01-01

    Many loci in the human genome harbor complex genomic structures that can result in susceptibility to genomic rearrangements leading to various genomic disorders. Nephronophthisis 1 (NPHP1, MIM# 256100) is an autosomal recessive disorder that can be caused by defects of NPHP1; the gene maps within the human 2q13 region where low copy repeats (LCRs) are abundant. Loss of function of NPHP1 is responsible for approximately 85% of the NPHP1 cases—about 80% of such individuals carry a large recurrent homozygous NPHP1 deletion that occurs via nonallelic homologous recombination (NAHR) between two flanking directly oriented ~45 kb LCRs. Published data revealed a non-pathogenic inversion polymorphism involving the NPHP1 gene flanked by two inverted ~358 kb LCRs. Using optical mapping and array-comparative genomic hybridization, we identified three potential novel structural variant (SV) haplotypes at the NPHP1 locus that may protect a haploid genome from the NPHP1 deletion. Inter-species comparative genomic analyses among primate genomes revealed massive genomic changes during evolution. The aggregated data suggest that dynamic genomic rearrangements occurred historically within the NPHP1 locus and generated SV haplotypes observed in the human population today, which may confer differential susceptibility to genomic instability and the NPHP1 deletion within a personal genome. Our study documents diverse SV haplotypes at a complex LCR-laden human genomic region. Comparative analyses provide a model for how this complex region arose during primate evolution, and studies among humans suggest that intra-species polymorphism may potentially modulate an individual’s susceptibility to acquiring disease-associated alleles. PMID:26641089

  16. Microsporidian genome analysis reveals evolutionary strategies for obligate intracellular growth

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Microsporidia comprise a large phylum of obligate intracellular eukaryotes that are fungalrelated parasites responsible for widespread disease, and here we address questions about microsporidia biology and evolution. We sequenced three microsporidian genomes from two species, Nematocida parisii and...

  17. The cavefish genome reveals candidate genes for eye loss.

    PubMed

    McGaugh, Suzanne E; Gross, Joshua B; Aken, Bronwen; Blin, Maryline; Borowsky, Richard; Chalopin, Domitille; Hinaux, Hélène; Jeffery, William R; Keene, Alex; Ma, Li; Minx, Patrick; Murphy, Daniel; O'Quin, Kelly E; Rétaux, Sylvie; Rohner, Nicolas; Searle, Steve M J; Stahl, Bethany A; Tabin, Cliff; Volff, Jean-Nicolas; Yoshizawa, Masato; Warren, Wesley C

    2014-01-01

    Natural populations subjected to strong environmental selection pressures offer a window into the genetic underpinnings of evolutionary change. Cavefish populations, Astyanax mexicanus (Teleostei: Characiphysi), exhibit repeated, independent evolution for a variety of traits including eye degeneration, pigment loss, increased size and number of taste buds and mechanosensory organs, and shifts in many behavioural traits. Surface and cave forms are interfertile making this system amenable to genetic interrogation; however, lack of a reference genome has hampered efforts to identify genes responsible for changes in cave forms of A. mexicanus. Here we present the first de novo genome assembly for Astyanax mexicanus cavefish, contrast repeat elements to other teleost genomes, identify candidate genes underlying quantitative trait loci (QTL), and assay these candidate genes for potential functional and expression differences. We expect the cavefish genome to advance understanding of the evolutionary process, as well as, analogous human disease including retinal dysfunction. PMID:25329095

  18. The cavefish genome reveals candidate genes for eye loss

    PubMed Central

    McGaugh, Suzanne E.; Gross, Joshua B.; Aken, Bronwen; Blin, Maryline; Borowsky, Richard; Chalopin, Domitille; Hinaux, Hélène; Jeffery, William R.; Keene, Alex; Ma, Li; Minx, Patrick; Murphy, Daniel; O’Quin, Kelly E.; Rétaux, Sylvie; Rohner, Nicolas; Searle, Steve M. J.; Stahl, Bethany A.; Tabin, Cliff; Volff, Jean-Nicolas; Yoshizawa, Masato; Warren, Wesley C.

    2014-01-01

    Natural populations subjected to strong environmental selection pressures offer a window into the genetic underpinnings of evolutionary change. Cavefish populations, Astyanax mexicanus (Teleostei: Characiphysi), exhibit repeated, independent evolution for a variety of traits including eye degeneration, pigment loss, increased size and number of taste buds and mechanosensory organs, and shifts in many behavioural traits. Surface and cave forms are interfertile making this system amenable to genetic interrogation; however, lack of a reference genome has hampered efforts to identify genes responsible for changes in cave forms of A. mexicanus. Here we present the first de novo genome assembly for Astyanax mexicanus cavefish, contrast repeat elements to other teleost genomes, identify candidate genes underlying quantitative trait loci (QTL), and assay these candidate genes for potential functional and expression differences. We expect the cavefish genome to advance understanding of the evolutionary process, as well as, analogous human disease including retinal dysfunction. PMID:25329095

  19. Genomic Mining Reveals Deep Evolutionary Relationships between Bornaviruses and Bats

    PubMed Central

    Cui, Jie; Wang, Lin-Fa

    2015-01-01

    Bats globally harbor viruses in order Mononegavirales, such as lyssaviruses and henipaviruses; however, little is known about their relationships with bornaviruses. Previous studies showed that viral fossils of bornaviral origin are embedded in the genomes of several mammalian species such as primates, indicative of an ancient origin of exogenous bornaviruses. In this study, we mined the available 10 bat genomes and recreated a clear evolutionary relationship of endogenous bornaviral elements and bats. Comparative genomics showed that endogenization of bornaviral elements frequently occurred in vesper bats, harboring EBLLs (endogenous bornavirus-like L elements) in their genomes. Molecular dating uncovered a continuous bornavirus-bat interaction spanning 70 million years. We conclude that better understanding of modern exogenous bornaviral circulation in bat populations is warranted. PMID:26569285

  20. Butterfly genome reveals promiscuous exchange of mimicry adaptations among species

    PubMed Central

    Dasmahapatra, Kanchon K; Walters, James R.; Briscoe, Adriana D.; Davey, John W.; Whibley, Annabel; Nadeau, Nicola J.; Zimin, Aleksey V.; Hughes, Daniel S. T.; Ferguson, Laura C.; Martin, Simon H.; Salazar, Camilo; Lewis, James J.; Adler, Sebastian; Ahn, Seung-Joon; Baker, Dean A.; Baxter, Simon W.; Chamberlain, Nicola L.; Chauhan, Ritika; Counterman, Brian A.; Dalmay, Tamas; Gilbert, Lawrence E.; Gordon, Karl; Heckel, David G.; Hines, Heather M.; Hoff, Katharina J.; Holland, Peter W.H.; Jacquin-Joly, Emmanuelle; Jiggins, Francis M.; Jones, Robert T.; Kapan, Durrell D.; Kersey, Paul; Lamas, Gerardo; Lawson, Daniel; Mapleson, Daniel; Maroja, Luana S.; Martin, Arnaud; Moxon, Simon; Palmer, William J.; Papa, Riccardo; Papanicolaou, Alexie; Pauchet, Yannick; Ray, David A.; Rosser, Neil; Salzberg, Steven L.; Supple, Megan A.; Surridge, Alison; Tenger-Trolander, Ayse; Vogel, Heiko; Wilkinson, Paul A.; Wilson, Derek; Yorke, James A.; Yuan, Furong; Balmuth, Alexi L.; Eland, Cathlene; Gharbi, Karim; Thomson, Marian; Gibbs, Richard A.; Han, Yi; Jayaseelan, Joy C.; Kovar, Christie; Mathew, Tittu; Muzny, Donna M.; Ongeri, Fiona; Pu, Ling-Ling; Qu, Jiaxin; Thornton, Rebecca L.; Worley, Kim C.; Wu, Yuan-Qing; Linares, Mauricio; Blaxter, Mark L.; Constant, Richard H. ffrench; Joron, Mathieu; Kronforst, Marcus R.; Mullen, Sean P.; Reed, Robert D.; Scherer, Steven E.; Richards, Stephen; Mallet, James; McMillan, W. Owen; Jiggins, Chris D.

    2012-01-01

    The evolutionary importance of hybridization and introgression has long been debated1. We used genomic tools to investigate introgression in Heliconius, a rapidly radiating genus of neotropical butterflies widely used in studies of ecology, behaviour, mimicry and speciation2-5 . We sequenced the genome of Heliconius melpomene and compared it with other taxa to investigate chromosomal evolution in Lepidoptera and gene flow among multiple Heliconius species and races. Among 12,657 predicted genes for Heliconius, biologically important expansions of families of chemosensory and Hox genes are particularly noteworthy. Chromosomal organisation has remained broadly conserved since the Cretaceous, when butterflies split from the silkmoth lineage. Using genomic resequencing, we show hybrid exchange of genes between three co-mimics, H. melpomene, H. timareta, and H. elevatus, especially at two genomic regions that control mimicry pattern. Closely related Heliconius species clearly exchange protective colour pattern genes promiscuously, implying a major role for hybridization in adaptive radiation. PMID:22722851

  1. Three crocodilian genomes reveal ancestral patterns of evolution among archosaurs

    PubMed Central

    Green, Richard E; Braun, Edward L; Armstrong, Joel; Earl, Dent; Nguyen, Ngan; Hickey, Glenn; Vandewege, Michael W; St John, John A; Capella-Gutiérrez, Salvador; Castoe, Todd A; Kern, Colin; Fujita, Matthew K; Opazo, Juan C; Jurka, Jerzy; Kojima, Kenji K; Caballero, Juan; Hubley, Robert M; Smit, Arian F; Platt, Roy N; Lavoie, Christine A; Ramakodi, Meganathan P; Finger, John W; Suh, Alexander; Isberg, Sally R; Miles, Lee; Chong, Amanda Y; Jaratlerdsiri, Weerachai; Gongora, Jaime; Moran, Christopher; Iriarte, Andrés; McCormack, John; Burgess, Shane C; Edwards, Scott V; Lyons, Eric; Williams, Christina; Breen, Matthew; Howard, Jason T; Gresham, Cathy R; Peterson, Daniel G; Schmitz, Jürgen; Pollock, David D; Haussler, David; Triplett, Eric W; Zhang, Guojie; Irie, Naoki; Jarvis, Erich D; Brochu, Christopher A; Schmidt, Carl J; McCarthy, Fiona M; Faircloth, Brant C; Hoffmann, Federico G; Glenn, Travis C; Gabaldón, Toni; Paten, Benedict; Ray, David A

    2015-01-01

    To provide context for the diversifications of archosaurs, the group that includes crocodilians, dinosaurs and birds, we generated draft genomes of three crocodilians, Alligator mississippiensis (the American alligator), Crocodylus porosus (the saltwater crocodile), and Gavialis gangeticus (the Indian gharial). We observed an exceptionally slow rate of genome evolution within crocodilians at all levels, including nucleotide substitutions, indels, transposable element content and movement, gene family evolution, and chromosomal synteny. When placed within the context of related taxa including birds and turtles, this suggests that the common ancestor of all of these taxa also exhibited slow genome evolution and that the relatively rapid evolution of bird genomes represents an autapomorphy within that clade. The data also provided the opportunity to analyze heterozygosity in crocodilians, which indicates a likely reduction in population size for all three taxa through the Pleistocene. Finally, these new data combined with newly published bird genomes allowed us to reconstruct the partial genome of the common ancestor of archosaurs providing a tool to investigate the genetic starting material of crocodilians, birds, and dinosaurs. PMID:25504731

  2. Three crocodilian genomes reveal ancestral patterns of evolution among archosaurs.

    PubMed

    Green, Richard E; Braun, Edward L; Armstrong, Joel; Earl, Dent; Nguyen, Ngan; Hickey, Glenn; Vandewege, Michael W; St John, John A; Capella-Gutiérrez, Salvador; Castoe, Todd A; Kern, Colin; Fujita, Matthew K; Opazo, Juan C; Jurka, Jerzy; Kojima, Kenji K; Caballero, Juan; Hubley, Robert M; Smit, Arian F; Platt, Roy N; Lavoie, Christine A; Ramakodi, Meganathan P; Finger, John W; Suh, Alexander; Isberg, Sally R; Miles, Lee; Chong, Amanda Y; Jaratlerdsiri, Weerachai; Gongora, Jaime; Moran, Christopher; Iriarte, Andrés; McCormack, John; Burgess, Shane C; Edwards, Scott V; Lyons, Eric; Williams, Christina; Breen, Matthew; Howard, Jason T; Gresham, Cathy R; Peterson, Daniel G; Schmitz, Jürgen; Pollock, David D; Haussler, David; Triplett, Eric W; Zhang, Guojie; Irie, Naoki; Jarvis, Erich D; Brochu, Christopher A; Schmidt, Carl J; McCarthy, Fiona M; Faircloth, Brant C; Hoffmann, Federico G; Glenn, Travis C; Gabaldón, Toni; Paten, Benedict; Ray, David A

    2014-12-12

    To provide context for the diversification of archosaurs--the group that includes crocodilians, dinosaurs, and birds--we generated draft genomes of three crocodilians: Alligator mississippiensis (the American alligator), Crocodylus porosus (the saltwater crocodile), and Gavialis gangeticus (the Indian gharial). We observed an exceptionally slow rate of genome evolution within crocodilians at all levels, including nucleotide substitutions, indels, transposable element content and movement, gene family evolution, and chromosomal synteny. When placed within the context of related taxa including birds and turtles, this suggests that the common ancestor of all of these taxa also exhibited slow genome evolution and that the comparatively rapid evolution is derived in birds. The data also provided the opportunity to analyze heterozygosity in crocodilians, which indicates a likely reduction in population size for all three taxa through the Pleistocene. Finally, these data combined with newly published bird genomes allowed us to reconstruct the partial genome of the common ancestor of archosaurs, thereby providing a tool to investigate the genetic starting material of crocodilians, birds, and dinosaurs. PMID:25504731

  3. Signatures of selection in tilapia revealed by whole genome resequencing

    PubMed Central

    Hong Xia, Jun; Bai, Zhiyi; Meng, Zining; Zhang, Yong; Wang, Le; Liu, Feng; Jing, Wu; Yi Wan, Zi; Li, Jiale; Lin, Haoran; Hua Yue, Gen

    2015-01-01

    Natural selection and selective breeding for genetic improvement have left detectable signatures within the genome of a species. Identification of selection signatures is important in evolutionary biology and for detecting genes that facilitate to accelerate genetic improvement. However, selection signatures, including artificial selection and natural selection, have only been identified at the whole genome level in several genetically improved fish species. Tilapia is one of the most important genetically improved fish species in the world. Using next-generation sequencing, we sequenced the genomes of 47 tilapia individuals. We identified a total of 1.43 million high-quality SNPs and found that the LD block sizes ranged from 10–100 kb in tilapia. We detected over a hundred putative selective sweep regions in each line of tilapia. Most selection signatures were located in non-coding regions of the tilapia genome. The Wnt signaling, gonadotropin-releasing hormone receptor and integrin signaling pathways were under positive selection in all improved tilapia lines. Our study provides a genome-wide map of genetic variation and selection footprints in tilapia, which could be important for genetic studies and accelerating genetic improvement of tilapia. PMID:26373374

  4. The Architecture of a Scrambled Genome Reveals Massive Levels of Genomic Rearrangement during Development

    PubMed Central

    Chen, Xiao; Bracht, John R.; Goldman, Aaron David; Dolzhenko, Egor; Clay, Derek M.; Swart, Estienne C.; Perlman, David H.; Doak, Thomas G.; Stuart, Andrew; Amemiya, Chris T.; Sebra, Robert P.; Landweber, Laura F.

    2014-01-01

    SUMMARY Programmed DNA rearrangements in the single-celled eukaryote Oxytricha trifallax completely rewire its germline into a somatic nucleus during development. This elaborate, RNA-mediated pathway eliminates noncoding DNA sequences that interrupt gene loci and reorganizes the remaining fragments by inversions and permutations to produce functional genes. Here, we report the Oxytricha germline genome and compare it to the somatic genome to present a global view of its massive scale of genome rearrangements. The remarkably encrypted genome architecture contains >3,500 scrambled genes, as well as >800 predicted germline-limited genes expressed, and some posttranslationally modified, during genome rearrangements. Gene segments for different somatic loci often interweave with each other. Single gene segments can contribute to multiple, distinct somatic loci. Terminal precursor segments from neighboring somatic loci map extremely close to each other, often overlapping. This genome assembly provides a draft of a scrambled genome and a powerful model for studies of genome rearrangement. PMID:25171416

  5. PSI-2: Structural Genomics to Cover Protein Domain Family Space

    PubMed Central

    Dessailly, Benoît H.; Nair, Rajesh; Jaroszewski, Lukasz; Fajardo, J. Eduardo; Kouranov, Andrei; Lee, David; Fiser, Andras; Godzik, Adam; Rost, Burkhard; Orengo, Christine

    2010-01-01

    Summary One major objective of structural genomics efforts, including the NIH-funded Protein Structure Initiative (PSI), has been to increase the structural coverage of protein sequence space. Here, we present the target selection strategy used during the second phase of PSI (PSI-2). This strategy, jointly devised by the bioinformatics groups associated with the PSI-2 large-scale production centres, targets representatives from large, structurally uncharacterised protein domain families, and from structurally uncharacterised subfamilies in very large and diverse families with incomplete structural coverage. These very large families are extremely diverse both structurally and functionally, and are highly over-represented in known proteomes. On the basis of several metrics, we then discuss to what extent PSI-2, during its first three years, has increased the structural coverage of genomes, and contributed structural and functional novelty. Together, the results presented here suggest that PSI-2 is successfully meeting its objectives and provides useful insights into structural and functional space. PMID:19523904

  6. The genomes of four tapeworm species reveal adaptations to parasitism

    PubMed Central

    Sánchez-Flores, Alejandro; Brooks, Karen L.; Tracey, Alan; Bobes, Raúl J.; Fragoso, Gladis; Sciutto, Edda; Aslett, Martin; Beasley, Helen; Bennett, Hayley M.; Cai, Xuepeng; Camicia, Federico; Clark, Richard; Cucher, Marcela; De Silva, Nishadi; Day, Tim A; Deplazes, Peter; Estrada, Karel; Fernández, Cecilia; Holland, Peter W. H.; Hou, Junling; Hu, Songnian; Huckvale, Thomas; Hung, Stacy S.; Kamenetzky, Laura; Keane, Jacqueline A.; Kiss, Ferenc; Koziol, Uriel; Lambert, Olivia; Liu, Kan; Luo, Xuenong; Luo, Yingfeng; Macchiaroli, Natalia; Nichol, Sarah; Paps, Jordi; Parkinson, John; Pouchkina-Stantcheva, Natasha; Riddiford, Nick; Rosenzvit, Mara; Salinas, Gustavo; Wasmuth, James D.; Zamanian, Mostafa; Zheng, Yadong; Cai, Jianping; Soberón, Xavier; Olson, Peter D.; Laclette, Juan P.; Brehm, Klaus; Berriman, Matthew

    2014-01-01

    Summary Tapeworms cause debilitating neglected diseases that can be deadly and often require surgery due to ineffective drugs. Here we present the first analysis of tapeworm genome sequences using the human-infective species Echinococcus multilocularis, E. granulosus, Taenia solium and the laboratory model Hymenolepis microstoma as examples. The 115-141 megabase genomes offer insights into the evolution of parasitism. Synteny is maintained with distantly related blood flukes but we find extreme losses of genes and pathways ubiquitous in other animals, including 34 homeobox families and several determinants of stem cell fate. Tapeworms have species-specific expansions of non-canonical heat shock proteins and families of known antigens; specialised detoxification pathways, and metabolism finely tuned to rely on nutrients scavenged from their hosts. We identify new potential drug targets, including those on which existing pharmaceuticals may act. The genomes provide a rich resource to underpin the development of urgently needed treatments and control. PMID:23485966

  7. Genome analysis of the platypus reveals unique signatures of evolution.

    PubMed

    Warren, Wesley C; Hillier, LaDeana W; Marshall Graves, Jennifer A; Birney, Ewan; Ponting, Chris P; Grützner, Frank; Belov, Katherine; Miller, Webb; Clarke, Laura; Chinwalla, Asif T; Yang, Shiaw-Pyng; Heger, Andreas; Locke, Devin P; Miethke, Pat; Waters, Paul D; Veyrunes, Frédéric; Fulton, Lucinda; Fulton, Bob; Graves, Tina; Wallis, John; Puente, Xose S; López-Otín, Carlos; Ordóñez, Gonzalo R; Eichler, Evan E; Chen, Lin; Cheng, Ze; Deakin, Janine E; Alsop, Amber; Thompson, Katherine; Kirby, Patrick; Papenfuss, Anthony T; Wakefield, Matthew J; Olender, Tsviya; Lancet, Doron; Huttley, Gavin A; Smit, Arian F A; Pask, Andrew; Temple-Smith, Peter; Batzer, Mark A; Walker, Jerilyn A; Konkel, Miriam K; Harris, Robert S; Whittington, Camilla M; Wong, Emily S W; Gemmell, Neil J; Buschiazzo, Emmanuel; Vargas Jentzsch, Iris M; Merkel, Angelika; Schmitz, Juergen; Zemann, Anja; Churakov, Gennady; Kriegs, Jan Ole; Brosius, Juergen; Murchison, Elizabeth P; Sachidanandam, Ravi; Smith, Carly; Hannon, Gregory J; Tsend-Ayush, Enkhjargal; McMillan, Daniel; Attenborough, Rosalind; Rens, Willem; Ferguson-Smith, Malcolm; Lefèvre, Christophe M; Sharp, Julie A; Nicholas, Kevin R; Ray, David A; Kube, Michael; Reinhardt, Richard; Pringle, Thomas H; Taylor, James; Jones, Russell C; Nixon, Brett; Dacheux, Jean-Louis; Niwa, Hitoshi; Sekita, Yoko; Huang, Xiaoqiu; Stark, Alexander; Kheradpour, Pouya; Kellis, Manolis; Flicek, Paul; Chen, Yuan; Webber, Caleb; Hardison, Ross; Nelson, Joanne; Hallsworth-Pepin, Kym; Delehaunty, Kim; Markovic, Chris; Minx, Pat; Feng, Yucheng; Kremitzki, Colin; Mitreva, Makedonka; Glasscock, Jarret; Wylie, Todd; Wohldmann, Patricia; Thiru, Prathapan; Nhan, Michael N; Pohl, Craig S; Smith, Scott M; Hou, Shunfeng; Nefedov, Mikhail; de Jong, Pieter J; Renfree, Marilyn B; Mardis, Elaine R; Wilson, Richard K

    2008-05-01

    We present a draft genome sequence of the platypus, Ornithorhynchus anatinus. This monotreme exhibits a fascinating combination of reptilian and mammalian characters. For example, platypuses have a coat of fur adapted to an aquatic lifestyle; platypus females lactate, yet lay eggs; and males are equipped with venom similar to that of reptiles. Analysis of the first monotreme genome aligned these features with genetic innovations. We find that reptile and platypus venom proteins have been co-opted independently from the same gene families; milk protein genes are conserved despite platypuses laying eggs; and immune gene family expansions are directly related to platypus biology. Expansions of protein, non-protein-coding RNA and microRNA families, as well as repeat elements, are identified. Sequencing of this genome now provides a valuable resource for deep mammalian comparative analyses, as well as for monotreme biology and conservation. PMID:18464734

  8. Genome analysis of the platypus reveals unique signatures of evolution

    PubMed Central

    Warren, Wesley C.; Hillier, LaDeana W.; Marshall Graves, Jennifer A.; Birney, Ewan; Ponting, Chris P.; Grützner, Frank; Belov, Katherine; Miller, Webb; Clarke, Laura; Chinwalla, Asif T.; Yang, Shiaw-Pyng; Heger, Andreas; Locke, Devin P.; Miethke, Pat; Waters, Paul D.; Veyrunes, Frédéric; Fulton, Lucinda; Fulton, Bob; Graves, Tina; Wallis, John; Puente, Xose S.; López-Otín, Carlos; Ordóñez, Gonzalo R.; Eichler, Evan E.; Chen, Lin; Cheng, Ze; Deakin, Janine E.; Alsop, Amber; Thompson, Katherine; Kirby, Patrick; Papenfuss, Anthony T.; Wakefield, Matthew J.; Olender, Tsviya; Lancet, Doron; Huttley, Gavin A.; Smit, Arian F. A.; Pask, Andrew; Temple-Smith, Peter; Batzer, Mark A.; Walker, Jerilyn A.; Konkel, Miriam K.; Harris, Robert S.; Whittington, Camilla M.; Wong, Emily S. W.; Gemmell, Neil J.; Buschiazzo, Emmanuel; Vargas Jentzsch, Iris M.; Merkel, Angelika; Schmitz, Juergen; Zemann, Anja; Churakov, Gennady; Kriegs, Jan Ole; Brosius, Juergen; Murchison, Elizabeth P.; Sachidanandam, Ravi; Smith, Carly; Hannon, Gregory J.; Tsend-Ayush, Enkhjargal; McMillan, Daniel; Attenborough, Rosalind; Rens, Willem; Ferguson-Smith, Malcolm; Lefèvre, Christophe M.; Sharp, Julie A.; Nicholas, Kevin R.; Ray, David A.; Kube, Michael; Reinhardt, Richard; Pringle, Thomas H.; Taylor, James; Jones, Russell C.; Nixon, Brett; Dacheux, Jean-Louis; Niwa, Hitoshi; Sekita, Yoko; Huang, Xiaoqiu; Stark, Alexander; Kheradpour, Pouya; Kellis, Manolis; Flicek, Paul; Chen, Yuan; Webber, Caleb; Hardison, Ross; Nelson, Joanne; Hallsworth-Pepin, Kym; Delehaunty, Kim; Markovic, Chris; Minx, Pat; Feng, Yucheng; Kremitzki, Colin; Mitreva, Makedonka; Glasscock, Jarret; Wylie, Todd; Wohldmann, Patricia; Thiru, Prathapan; Nhan, Michael N.; Pohl, Craig S.; Smith, Scott M.; Hou, Shunfeng; Renfree, Marilyn B.; Mardis, Elaine R.; Wilson, Richard K.

    2009-01-01

    We present a draft genome sequence of the platypus, Ornithorhynchus anatinus. This monotreme exhibits a fascinating combination of reptilian and mammalian characters. For example, platypuses have a coat of fur adapted to an aquatic lifestyle; platypus females lactate, yet lay eggs; and males are equipped with venom similar to that of reptiles. Analysis of the first monotreme genome aligned these features with genetic innovations. We find that reptile and platypus venom proteins have been co-opted independently from the same gene families; milk protein genes are conserved despite platypuses laying eggs; and immune gene family expansions are directly related to platypus biology. Expansions of protein, non-protein-coding RNA and microRNA families, as well as repeat elements, are identified. Sequencing of this genome now provides a valuable resource for deep mammalian comparative analyses, as well as for monotreme biology and conservation. PMID:18464734

  9. Klebsormidium flaccidum genome reveals primary factors for plant terrestrial adaptation.

    PubMed

    Hori, Koichi; Maruyama, Fumito; Fujisawa, Takatomo; Togashi, Tomoaki; Yamamoto, Nozomi; Seo, Mitsunori; Sato, Syusei; Yamada, Takuji; Mori, Hiroshi; Tajima, Naoyuki; Moriyama, Takashi; Ikeuchi, Masahiko; Watanabe, Mai; Wada, Hajime; Kobayashi, Koichi; Saito, Masakazu; Masuda, Tatsuru; Sasaki-Sekimoto, Yuko; Mashiguchi, Kiyoshi; Awai, Koichiro; Shimojima, Mie; Masuda, Shinji; Iwai, Masako; Nobusawa, Takashi; Narise, Takafumi; Kondo, Satoshi; Saito, Hikaru; Sato, Ryoichi; Murakawa, Masato; Ihara, Yuta; Oshima-Yamada, Yui; Ohtaka, Kinuka; Satoh, Masanori; Sonobe, Kohei; Ishii, Midori; Ohtani, Ryosuke; Kanamori-Sato, Miyu; Honoki, Rina; Miyazaki, Daichi; Mochizuki, Hitoshi; Umetsu, Jumpei; Higashi, Kouichi; Shibata, Daisuke; Kamiya, Yuji; Sato, Naoki; Nakamura, Yasukazu; Tabata, Satoshi; Ida, Shigeru; Kurokawa, Ken; Ohta, Hiroyuki

    2014-01-01

    The colonization of land by plants was a key event in the evolution of life. Here we report the draft genome sequence of the filamentous terrestrial alga Klebsormidium flaccidum (Division Charophyta, Order Klebsormidiales) to elucidate the early transition step from aquatic algae to land plants. Comparison of the genome sequence with that of other algae and land plants demonstrate that K. flaccidum acquired many genes specific to land plants. We demonstrate that K. flaccidum indeed produces several plant hormones and homologues of some of the signalling intermediates required for hormone actions in higher plants. The K. flaccidum genome also encodes a primitive system to protect against the harmful effects of high-intensity light. The presence of these plant-related systems in K. flaccidum suggests that, during evolution, this alga acquired the fundamental machinery required for adaptation to terrestrial environments. PMID:24865297

  10. Klebsormidium flaccidum genome reveals primary factors for plant terrestrial adaptation

    PubMed Central

    Hori, Koichi; Maruyama, Fumito; Fujisawa, Takatomo; Togashi, Tomoaki; Yamamoto, Nozomi; Seo, Mitsunori; Sato, Syusei; Yamada, Takuji; Mori, Hiroshi; Tajima, Naoyuki; Moriyama, Takashi; Ikeuchi, Masahiko; Watanabe, Mai; Wada, Hajime; Kobayashi, Koichi; Saito, Masakazu; Masuda, Tatsuru; Sasaki-Sekimoto, Yuko; Mashiguchi, Kiyoshi; Awai, Koichiro; Shimojima, Mie; Masuda, Shinji; Iwai, Masako; Nobusawa, Takashi; Narise, Takafumi; Kondo, Satoshi; Saito, Hikaru; Sato, Ryoichi; Murakawa, Masato; Ihara, Yuta; Oshima-Yamada, Yui; Ohtaka, Kinuka; Satoh, Masanori; Sonobe, Kohei; Ishii, Midori; Ohtani, Ryosuke; Kanamori-Sato, Miyu; Honoki, Rina; Miyazaki, Daichi; Mochizuki, Hitoshi; Umetsu, Jumpei; Higashi, Kouichi; Shibata, Daisuke; Kamiya, Yuji; Sato, Naoki; Nakamura, Yasukazu; Tabata, Satoshi; Ida, Shigeru; Kurokawa, Ken; Ohta, Hiroyuki

    2014-01-01

    The colonization of land by plants was a key event in the evolution of life. Here we report the draft genome sequence of the filamentous terrestrial alga Klebsormidium flaccidum (Division Charophyta, Order Klebsormidiales) to elucidate the early transition step from aquatic algae to land plants. Comparison of the genome sequence with that of other algae and land plants demonstrate that K. flaccidum acquired many genes specific to land plants. We demonstrate that K. flaccidum indeed produces several plant hormones and homologues of some of the signalling intermediates required for hormone actions in higher plants. The K. flaccidum genome also encodes a primitive system to protect against the harmful effects of high-intensity light. The presence of these plant-related systems in K. flaccidum suggests that, during evolution, this alga acquired the fundamental machinery required for adaptation to terrestrial environments. PMID:24865297

  11. Plastic architecture of bacterial genome revealed by comparative genomics of Photorhabdus variants

    PubMed Central

    Gaudriault, Sophie; Pages, Sylvie; Lanois, Anne; Laroui, Christine; Teyssier, Corinne; Jumas-Bilak, Estelle; Givaudan, Alain

    2008-01-01

    Background The phenotypic consequences of large genomic architecture modifications within a clonal bacterial population are rarely evaluated because of the difficulties associated with using molecular approaches in a mixed population. Bacterial variants frequently arise among Photorhabdus luminescens, a nematode-symbiotic and insect-pathogenic bacterium. We therefore studied genome plasticity within Photorhabdus variants. Results We used a combination of macrorestriction and DNA microarray experiments to perform a comparative genomic study of different P. luminescens TT01 variants. Prolonged culturing of TT01 strain and a genomic variant, collected from the laboratory-maintained symbiotic nematode, generated bacterial lineages composed of primary and secondary phenotypic variants and colonial variants. The primary phenotypic variants exhibit several characteristics that are absent from the secondary forms. We identify substantial plasticity of the genome architecture of some variants, mediated mainly by deletions in the 'flexible' gene pool of the TT01 reference genome and also by genomic amplification. We show that the primary or secondary phenotypic variant status is independent from global genomic architecture and that the bacterial lineages are genomic lineages. We focused on two unusual genomic changes: a deletion at a new recombination hotspot composed of long approximate repeats; and a 275 kilobase single block duplication belonging to a new class of genomic duplications. Conclusion Our findings demonstrate that major genomic variations occur in Photorhabdus clonal populations. The phenotypic consequences of these genomic changes are cryptic. This study provides insight into the field of bacterial genome architecture and further elucidates the role played by clonal genomic variation in bacterial genome evolution. PMID:18647395

  12. Culture Independent Genomic Comparisons Reveal Environmental Adaptations for Altiarchaeales

    PubMed Central

    Baker, Brett J.; Probst, Alexander J.; Podar, Mircea; Lloyd, Karen G.

    2016-01-01

    The recently proposed candidatus order Altiarchaeales remains an uncultured archaeal lineage composed of genetically diverse, globally widespread organisms frequently observed in anoxic subsurface environments. In spite of 15 years of studies on the psychrophilic biofilm-producing Candidatus Altiarchaeum hamiconexum and its close relatives, very little is known about the phylogenetic and functional diversity of the widespread free-living marine members of this taxon. From methanogenic sediments in the White Oak River Estuary, NC, USA, we sequenced a single cell amplified genome (SAG), WOR_SM1_SCG, and used it to identify and refine two high-quality genomes from metagenomes, WOR_SM1_79 and WOR_SM1_86-2, from the same site. These three genomic reconstructions form a monophyletic group, which also includes three previously published genomes from metagenomes from terrestrial springs and a SAG from Sakinaw Lake in a group previously designated as pMC2A384. A synapomorphic mutation in the Altiarchaeales tRNA synthetase β subunit, pheT, caused the protein to be encoded as two subunits at non-adjacent loci. Consistent with the terrestrial spring clades, our estuarine genomes contained a near-complete autotrophic metabolism, H2 or CO as potential electron donors, a reductive acetyl-CoA pathway for carbon fixation, and methylotroph-like NADP(H)-dependent dehydrogenase. Phylogenies based on 16S rRNA genes and concatenated conserved proteins identified two distinct sub-clades of Altiarchaeales, Alti-1 populated by organisms from actively flowing springs, and Alti-2 which was more widespread, diverse, and not associated with visible mats. The core Alti-1 genome suggested Alti-1 is adapted for the stream environment with lipopolysaccharide production capacity and extracellular hami structures. The core Alti-2 genome suggested members of this clade are free-living with distinct mechanisms for energy maintenance, motility, osmoregulation, and sulfur redox reactions. These data

  13. Culture Independent Genomic Comparisons Reveal Environmental Adaptations for Altiarchaeales.

    PubMed

    Bird, Jordan T; Baker, Brett J; Probst, Alexander J; Podar, Mircea; Lloyd, Karen G

    2016-01-01

    The recently proposed candidatus order Altiarchaeales remains an uncultured archaeal lineage composed of genetically diverse, globally widespread organisms frequently observed in anoxic subsurface environments. In spite of 15 years of studies on the psychrophilic biofilm-producing Candidatus Altiarchaeum hamiconexum and its close relatives, very little is known about the phylogenetic and functional diversity of the widespread free-living marine members of this taxon. From methanogenic sediments in the White Oak River Estuary, NC, USA, we sequenced a single cell amplified genome (SAG), WOR_SM1_SCG, and used it to identify and refine two high-quality genomes from metagenomes, WOR_SM1_79 and WOR_SM1_86-2, from the same site. These three genomic reconstructions form a monophyletic group, which also includes three previously published genomes from metagenomes from terrestrial springs and a SAG from Sakinaw Lake in a group previously designated as pMC2A384. A synapomorphic mutation in the Altiarchaeales tRNA synthetase β subunit, pheT, caused the protein to be encoded as two subunits at non-adjacent loci. Consistent with the terrestrial spring clades, our estuarine genomes contained a near-complete autotrophic metabolism, H2 or CO as potential electron donors, a reductive acetyl-CoA pathway for carbon fixation, and methylotroph-like NADP(H)-dependent dehydrogenase. Phylogenies based on 16S rRNA genes and concatenated conserved proteins identified two distinct sub-clades of Altiarchaeales, Alti-1 populated by organisms from actively flowing springs, and Alti-2 which was more widespread, diverse, and not associated with visible mats. The core Alti-1 genome suggested Alti-1 is adapted for the stream environment with lipopolysaccharide production capacity and extracellular hami structures. The core Alti-2 genome suggested members of this clade are free-living with distinct mechanisms for energy maintenance, motility, osmoregulation, and sulfur redox reactions. These data

  14. Butterfly genome reveals promiscuous exchange of mimicry adaptations among species.

    PubMed

    2012-07-01

    The evolutionary importance of hybridization and introgression has long been debated. Hybrids are usually rare and unfit, but even infrequent hybridization can aid adaptation by transferring beneficial traits between species. Here we use genomic tools to investigate introgression in Heliconius, a rapidly radiating genus of neotropical butterflies widely used in studies of ecology, behaviour, mimicry and speciation. We sequenced the genome of Heliconius melpomene and compared it with other taxa to investigate chromosomal evolution in Lepidoptera and gene flow among multiple Heliconius species and races. Among 12,669 predicted genes, biologically important expansions of families of chemosensory and Hox genes are particularly noteworthy. Chromosomal organization has remained broadly conserved since the Cretaceous period, when butterflies split from the Bombyx (silkmoth) lineage. Using genomic resequencing, we show hybrid exchange of genes between three co-mimics, Heliconius melpomene, Heliconius timareta and Heliconius elevatus, especially at two genomic regions that control mimicry pattern. We infer that closely related Heliconius species exchange protective colour-pattern genes promiscuously, implying that hybridization has an important role in adaptive radiation. PMID:22722851

  15. Genomic Variants Revealed by Invariably Missing Genotypes in Nelore Cattle

    PubMed Central

    da Silva, Joaquim Manoel; Giachetto, Poliana Fernanda; da Silva, Luiz Otávio Campos; Cintra, Leandro Carrijo; Paiva, Samuel Rezende; Caetano, Alexandre Rodrigues; Yamagishi, Michel Eduardo Beleza

    2015-01-01

    High density genotyping panels have been used in a wide range of applications. From population genetics to genome-wide association studies, this technology still offers the lowest cost and the most consistent solution for generating SNP data. However, in spite of the application, part of the generated data is always discarded from final datasets based on quality control criteria used to remove unreliable markers. Some discarded data consists of markers that failed to generate genotypes, labeled as missing genotypes. A subset of missing genotypes that occur in the whole population under study may be caused by technical issues but can also be explained by the presence of genomic variations that are in the vicinity of the assayed SNP and that prevent genotyping probes from annealing. The latter case may contain relevant information because these missing genotypes might be used to identify population-specific genomic variants. In order to assess which case is more prevalent, we used Illumina HD Bovine chip genotypes from 1,709 Nelore (Bos indicus) samples. We found 3,200 missing genotypes among the whole population. NGS re-sequencing data from 8 sires were used to verify the presence of genomic variations within their flanking regions in 81.56% of these missing genotypes. Furthermore, we discovered 3,300 novel SNPs/Indels, 31% of which are located in genes that may affect traits of importance for the genetic improvement of cattle production. PMID:26305794

  16. Phenotypic, genomic, transcriptomic and proteomic changes in Bacillus cereus after a short-term space flight

    NASA Astrophysics Data System (ADS)

    Su, Longxiang; Zhou, Lisha; Liu, Jinwen; Cen, Zhong; Wu, Chunyan; Wang, Tong; Zhou, Tao; Chang, De; Guo, Yinghua; Fang, Xiangqun; Wang, Junfeng; Li, Tianzhi; Yin, Sanjun; Dai, Wenkui; Zhou, Yuping; Zhao, Jiao; Fang, Chengxiang; Yang, Ruifu; Liu, Changting

    2014-01-01

    The environment in space could affect microorganisms by changing a variety of features, including proliferation rate, cell physiology, cell metabolism, biofilm production, virulence, and drug resistance. However, the relevant mechanisms remain unclear. To explore the effect of a space environment on Bacillus cereus, a strain of B. cereus was sent to space for 398 h by ShenZhou VIII from November 1, 2011 to November 17, 2011. A ground simulation with similar temperature conditions was simultaneously performed as a control. After the flight, the flight and control strains were further analyzed using phenotypic, genomic, transcriptomic and proteomic techniques to explore the divergence of B. cereus in a space environment. The flight strains exhibited a significantly slower growth rate, a significantly higher amikacin resistance level, and changes in metabolism relative to the ground control strain. After the space flight, three polymorphic loci were found in the flight strains LCT-BC25 and LCT-BC235. A combined transcriptome and proteome analysis was performed, and this analysis revealed that the flight strains had changes in genes/proteins relevant to metabolism. In addition, certain genes/proteins that are relevant to structural function, gene expression modification and translation, and virulence were also altered. Our study represents the first documented analysis of the phenotypic, genomic, transcriptomic, and proteomic changes that occur in B. cereus during space flight, and our results could be beneficial to the field of space microbiology.

  17. Upper Palaeolithic genomes reveal deep roots of modern Eurasians

    PubMed Central

    Jones, Eppie R.; Gonzalez-Fortes, Gloria; Connell, Sarah; Siska, Veronika; Eriksson, Anders; Martiniano, Rui; McLaughlin, Russell L.; Gallego Llorente, Marcos; Cassidy, Lara M.; Gamba, Cristina; Meshveliani, Tengiz; Bar-Yosef, Ofer; Müller, Werner; Belfer-Cohen, Anna; Matskevich, Zinovi; Jakeli, Nino; Higham, Thomas F. G.; Currat, Mathias; Lordkipanidze, David; Hofreiter, Michael; Manica, Andrea; Pinhasi, Ron; Bradley, Daniel G.

    2015-01-01

    We extend the scope of European palaeogenomics by sequencing the genomes of Late Upper Palaeolithic (13,300 years old, 1.4-fold coverage) and Mesolithic (9,700 years old, 15.4-fold) males from western Georgia in the Caucasus and a Late Upper Palaeolithic (13,700 years old, 9.5-fold) male from Switzerland. While we detect Late Palaeolithic–Mesolithic genomic continuity in both regions, we find that Caucasus hunter-gatherers (CHG) belong to a distinct ancient clade that split from western hunter-gatherers ∼45 kya, shortly after the expansion of anatomically modern humans into Europe and from the ancestors of Neolithic farmers ∼25 kya, around the Last Glacial Maximum. CHG genomes significantly contributed to the Yamnaya steppe herders who migrated into Europe ∼3,000 BC, supporting a formative Caucasus influence on this important Early Bronze age culture. CHG left their imprint on modern populations from the Caucasus and also central and south Asia possibly marking the arrival of Indo-Aryan languages. PMID:26567969

  18. Upper Palaeolithic genomes reveal deep roots of modern Eurasians.

    PubMed

    Jones, Eppie R; Gonzalez-Fortes, Gloria; Connell, Sarah; Siska, Veronika; Eriksson, Anders; Martiniano, Rui; McLaughlin, Russell L; Gallego Llorente, Marcos; Cassidy, Lara M; Gamba, Cristina; Meshveliani, Tengiz; Bar-Yosef, Ofer; Müller, Werner; Belfer-Cohen, Anna; Matskevich, Zinovi; Jakeli, Nino; Higham, Thomas F G; Currat, Mathias; Lordkipanidze, David; Hofreiter, Michael; Manica, Andrea; Pinhasi, Ron; Bradley, Daniel G

    2015-01-01

    We extend the scope of European palaeogenomics by sequencing the genomes of Late Upper Palaeolithic (13,300 years old, 1.4-fold coverage) and Mesolithic (9,700 years old, 15.4-fold) males from western Georgia in the Caucasus and a Late Upper Palaeolithic (13,700 years old, 9.5-fold) male from Switzerland. While we detect Late Palaeolithic-Mesolithic genomic continuity in both regions, we find that Caucasus hunter-gatherers (CHG) belong to a distinct ancient clade that split from western hunter-gatherers ∼45 kya, shortly after the expansion of anatomically modern humans into Europe and from the ancestors of Neolithic farmers ∼25 kya, around the Last Glacial Maximum. CHG genomes significantly contributed to the Yamnaya steppe herders who migrated into Europe ∼3,000 BC, supporting a formative Caucasus influence on this important Early Bronze age culture. CHG left their imprint on modern populations from the Caucasus and also central and south Asia possibly marking the arrival of Indo-Aryan languages. PMID:26567969

  19. New Assembly, Reannotation and Analysis of the Entamoeba histolytica Genome Reveal New Genomic Features and Protein Content Information

    PubMed Central

    Lorenzi, Hernan A.; Puiu, Daniela; Miller, Jason R.; Brinkac, Lauren M.; Amedeo, Paolo; Hall, Neil; Caler, Elisabet V.

    2010-01-01

    Background In order to maintain genome information accurately and relevantly, original genome annotations need to be updated and evaluated regularly. Manual reannotation of genomes is important as it can significantly reduce the propagation of errors and consequently diminishes the time spent on mistaken research. For this reason, after five years from the initial submission of the Entamoeba histolytica draft genome publication, we have re-examined the original 23 Mb assembly and the annotation of the predicted genes. Principal Findings The evaluation of the genomic sequence led to the identification of more than one hundred artifactual tandem duplications that were eliminated by re-assembling the genome. The reannotation was done using a combination of manual and automated genome analysis. The new 20 Mb assembly contains 1,496 scaffolds and 8,201 predicted genes, of which 60% are identical to the initial annotation and the remaining 40% underwent structural changes. Functional classification of 60% of the genes was modified based on recent sequence comparisons and new experimental data. We have assigned putative function to 3,788 proteins (46% of the predicted proteome) based on the annotation of predicted gene families, and have identified 58 protein families of five or more members that share no homology with known proteins and thus could be entamoeba specific. Genome analysis also revealed new features such as the presence of segmental duplications of up to 16 kb flanked by inverted repeats, and the tight association of some gene families with transposable elements. Significance This new genome annotation and analysis represents a more refined and accurate blueprint of the pathogen genome, and provides an upgraded tool as reference for the study of many important aspects of E. histolytica biology, such as genome evolution and pathogenesis. PMID:20559563

  20. Comparative Genomics Reveal Extensive Transposon-Mediated Genomic Plasticity and Diversity among Potential Effector Proteins within the Genus Coxiella▿ †

    PubMed Central

    Beare, Paul A.; Unsworth, Nathan; Andoh, Masako; Voth, Daniel E.; Omsland, Anders; Gilk, Stacey D.; Williams, Kelly P.; Sobral, Bruno W.; Kupko, John J.; Porcella, Stephen F.; Samuel, James E.; Heinzen, Robert A.

    2009-01-01

    Genetically distinct isolates of Coxiella burnetii, the cause of human Q fever, display different phenotypes with respect to in vitro infectivity/cytopathology and pathogenicity for laboratory animals. Moreover, correlations between C. burnetii genomic groups and human disease presentation (acute versus chronic) have been described, suggesting that isolates have distinct virulence characteristics. To provide a more-complete understanding of C. burnetii's genetic diversity, evolution, and pathogenic potential, we deciphered the whole-genome sequences of the K (Q154) and G (Q212) human chronic endocarditis isolates and the naturally attenuated Dugway (5J108-111) rodent isolate. Cross-genome comparisons that included the previously sequenced Nine Mile (NM) reference isolate (RSA493) revealed both novel gene content and disparate collections of pseudogenes that may contribute to isolate virulence and other phenotypes. While C. burnetii genomes are highly syntenous, recombination between abundant insertion sequence (IS) elements has resulted in genome plasticity manifested as chromosomal rearrangement of syntenic blocks and DNA insertions/deletions. The numerous IS elements, genomic rearrangements, and pseudogenes of C. burnetii isolates are consistent with genome structures of other bacterial pathogens that have recently emerged from nonpathogens with expanded niches. The observation that the attenuated Dugway isolate has the largest genome with the fewest pseudogenes and IS elements suggests that this isolate's lineage is at an earlier stage of pathoadaptation than the NM, K, and G lineages. PMID:19047403

  1. Comparative Genomic and Phylogenomic Analyses Reveal a Conserved Core Genome Shared by Estuarine and Oceanic Cyanopodoviruses

    PubMed Central

    Huang, Sijun; Zhang, Si; Jiao, Nianzhi; Chen, Feng

    2015-01-01

    Podoviruses are among the major viral groups that infect marine picocyanobacteria Prochlorococcus and Synechococcus. Here, we reported the genome sequences of five Synechococcus podoviruses isolated from the estuarine environment, and performed comparative genomic and phylogenomic analyses based on a total of 20 cyanopodovirus genomes. The genomes of all the known marine cyanopodoviruses are highly syntenic. A pan-genome of 349 clustered orthologous groups was determined, among which 15 were core genes. These core genes make up nearly half of each genome in length, reflecting the high level of genome conservation among this cyanophage type. The whole genome phylogenies based on concatenated core genes and gene content were highly consistent and confirmed the separation of two discrete marine cyanopodovirus clusters MPP-A and MPP-B. The genomes within cluster MPP-B grouped into subclusters mainly corresponding to Prochlorococcus or Synechococcus host types. Auxiliary metabolic genes tend to occur in a specific phylogenetic group of these cyanopodoviruses. All the MPP-B phages analyzed here encode the photosynthesis gene psbA, which are absent in all the MPP-A genomes thus far. Interestingly, all the MPP-B and two MPP-A Synechococcus podoviruses encode the thymidylate synthase gene thyX, while at the same genome locus all the MPP-B Prochlorococcus podoviruses encode the transaldolase gene talC. Both genes are hypothesized to have the potential to facilitate the biosynthesis of deoxynucleotide for phage replication. Inheritance of specific functional genes could be important to the evolution and ecological fitness of certain cyanophage genotypes. Our analyses demonstrate that cyanopodoviruses of estuarine and oceanic origins share a conserved core genome and suggest that accessory genes may be related to environmental adaptation. PMID:26569403

  2. Genomic and transcriptomic analysis of NDM-1 Klebsiella pneumoniae in spaceflight reveal mechanisms underlying environmental adaptability

    PubMed Central

    Li, Jia; Liu, Fei; Wang, Qi; Ge, Pupu; Woo, Patrick C. Y.; Yan, Jinghua; Zhao, Yanlin; Gao, George F.; Liu, Cui Hua; Liu, Changting

    2014-01-01

    The emergence and rapid spread of New Delhi Metallo-beta-lactamase-1 (NDM-1)-producing Klebsiella pneumoniae strains has caused a great concern worldwide. To better understand the mechanisms underlying environmental adaptation of those highly drug-resistant K. pneumoniae strains, we took advantage of the China's Shenzhou 10 spacecraft mission to conduct comparative genomic and transcriptomic analysis of a NDM-1 K. pneumoniae strain (ATCC BAA-2146) being cultivated under different conditions. The samples were recovered from semisolid medium placed on the ground (D strain), in simulated space condition (M strain), or in Shenzhou 10 spacecraft (T strain) for analysis. Our data revealed multiple variations underlying pathogen adaptation into different environments in terms of changes in morphology, H2O2 tolerance and biofilm formation ability, genomic stability and regulation of metabolic pathways. Additionally, we found a few non-coding RNAs to be differentially regulated. The results are helpful for better understanding the adaptive mechanisms of drug-resistant bacterial pathogens. PMID:25163721

  3. Genomic and transcriptomic analysis of NDM-1 Klebsiella pneumoniae in spaceflight reveal mechanisms underlying environmental adaptability.

    PubMed

    Li, Jia; Liu, Fei; Wang, Qi; Ge, Pupu; Woo, Patrick C Y; Yan, Jinghua; Zhao, Yanlin; Gao, George F; Liu, Cui Hua; Liu, Changting

    2014-01-01

    The emergence and rapid spread of New Delhi Metallo-beta-lactamase-1 (NDM-1)-producing Klebsiella pneumoniae strains has caused a great concern worldwide. To better understand the mechanisms underlying environmental adaptation of those highly drug-resistant K. pneumoniae strains, we took advantage of the China's Shenzhou 10 spacecraft mission to conduct comparative genomic and transcriptomic analysis of a NDM-1 K. pneumoniae strain (ATCC BAA-2146) being cultivated under different conditions. The samples were recovered from semisolid medium placed on the ground (D strain), in simulated space condition (M strain), or in Shenzhou 10 spacecraft (T strain) for analysis. Our data revealed multiple variations underlying pathogen adaptation into different environments in terms of changes in morphology, H2O2 tolerance and biofilm formation ability, genomic stability and regulation of metabolic pathways. Additionally, we found a few non-coding RNAs to be differentially regulated. The results are helpful for better understanding the adaptive mechanisms of drug-resistant bacterial pathogens. PMID:25163721

  4. High Resolution Genetic Mapping by Genome Sequencing Reveals Genome Duplication and Tetraploid Genetic Structure of the Diploid Miscanthus sinensis

    PubMed Central

    Ma, Xue-Feng; Jensen, Elaine; Alexandrov, Nickolai; Troukhan, Maxim; Zhang, Liping; Thomas-Jones, Sian; Farrar, Kerrie; Clifton-Brown, John; Donnison, Iain; Swaller, Timothy; Flavell, Richard

    2012-01-01

    We have created a high-resolution linkage map of Miscanthus sinensis, using genotyping-by-sequencing (GBS), identifying all 19 linkage groups for the first time. The result is technically significant since Miscanthus has a very large and highly heterozygous genome, but has no or limited genomics information to date. The composite linkage map containing markers from both parental linkage maps is composed of 3,745 SNP markers spanning 2,396 cM on 19 linkage groups with a 0.64 cM average resolution. Comparative genomics analyses of the M. sinensis composite linkage map to the genomes of sorghum, maize, rice, and Brachypodium distachyon indicate that sorghum has the closest syntenic relationship to Miscanthus compared to other species. The comparative results revealed that each pair of the 19 M. sinensis linkages aligned to one sorghum chromosome, except for LG8, which mapped to two sorghum chromosomes (4 and 7), presumably due to a chromosome fusion event after genome duplication. The data also revealed several other chromosome rearrangements relative to sorghum, including two telomere-centromere inversions of the sorghum syntenic chromosome 7 in LG8 of M. sinensis and two paracentric inversions of sorghum syntenic chromosome 4 in LG7 and LG8 of M. sinensis. The results clearly demonstrate, for the first time, that the diploid M. sinensis is tetraploid origin consisting of two sub-genomes. This complete and high resolution composite linkage map will not only serve as a useful resource for novel QTL discoveries, but also enable informed deployment of the wealth of existing genomics resources of other species to the improvement of Miscanthus as a high biomass energy crop. In addition, it has utility as a reference for genome sequence assembly for the forthcoming whole genome sequencing of the Miscanthus genus. PMID:22439001

  5. High resolution genetic mapping by genome sequencing reveals genome duplication and tetraploid genetic structure of the diploid Miscanthus sinensis.

    PubMed

    Ma, Xue-Feng; Jensen, Elaine; Alexandrov, Nickolai; Troukhan, Maxim; Zhang, Liping; Thomas-Jones, Sian; Farrar, Kerrie; Clifton-Brown, John; Donnison, Iain; Swaller, Timothy; Flavell, Richard

    2012-01-01

    We have created a high-resolution linkage map of Miscanthus sinensis, using genotyping-by-sequencing (GBS), identifying all 19 linkage groups for the first time. The result is technically significant since Miscanthus has a very large and highly heterozygous genome, but has no or limited genomics information to date. The composite linkage map containing markers from both parental linkage maps is composed of 3,745 SNP markers spanning 2,396 cM on 19 linkage groups with a 0.64 cM average resolution. Comparative genomics analyses of the M. sinensis composite linkage map to the genomes of sorghum, maize, rice, and Brachypodium distachyon indicate that sorghum has the closest syntenic relationship to Miscanthus compared to other species. The comparative results revealed that each pair of the 19 M. sinensis linkages aligned to one sorghum chromosome, except for LG8, which mapped to two sorghum chromosomes (4 and 7), presumably due to a chromosome fusion event after genome duplication. The data also revealed several other chromosome rearrangements relative to sorghum, including two telomere-centromere inversions of the sorghum syntenic chromosome 7 in LG8 of M. sinensis and two paracentric inversions of sorghum syntenic chromosome 4 in LG7 and LG8 of M. sinensis. The results clearly demonstrate, for the first time, that the diploid M. sinensis is tetraploid origin consisting of two sub-genomes. This complete and high resolution composite linkage map will not only serve as a useful resource for novel QTL discoveries, but also enable informed deployment of the wealth of existing genomics resources of other species to the improvement of Miscanthus as a high biomass energy crop. In addition, it has utility as a reference for genome sequence assembly for the forthcoming whole genome sequencing of the Miscanthus genus. PMID:22439001

  6. The Laccaria and Tuber Genomes Reveal Unique Signatures of Mycorrhizal Symbiosis Evolution (2010 JGI User Meeting)

    SciTech Connect

    Knapp, Steve

    2010-03-24

    Francis Martin from the French agricultural research institute INRA talks on how "The Laccaria and Tuber genomes reveal unique signatures of mycorrhizal symbiosis evolution" on March 24, 2010 at the 5th Annual DOE JGI User Meeting

  7. Integrative genomic analysis by interoperation of bioinformatics tools in GenomeSpace.

    PubMed

    Qu, Kun; Garamszegi, Sara; Wu, Felix; Thorvaldsdottir, Helga; Liefeld, Ted; Ocana, Marco; Borges-Rivera, Diego; Pochet, Nathalie; Robinson, James T; Demchak, Barry; Hull, Tim; Ben-Artzi, Gil; Blankenberg, Daniel; Barber, Galt P; Lee, Brian T; Kuhn, Robert M; Nekrutenko, Anton; Segal, Eran; Ideker, Trey; Reich, Michael; Regev, Aviv; Chang, Howard Y; Mesirov, Jill P

    2016-03-01

    Complex biomedical analyses require the use of multiple software tools in concert and remain challenging for much of the biomedical research community. We introduce GenomeSpace (http://www.genomespace.org), a cloud-based, cooperative community resource that currently supports the streamlined interaction of 20 bioinformatics tools and data resources. To facilitate integrative analysis by non-programmers, it offers a growing set of 'recipes', short workflows to guide investigators through high-utility analysis tasks. PMID:26780094

  8. Comparative genomics reveals conserved positioning of essential genomic clusters in highly rearranged Thermococcales chromosomes

    PubMed Central

    Cossu, Matteo; Da Cunha, Violette; Toffano-Nioche, Claire; Forterre, Patrick; Oberto, Jacques

    2015-01-01

    The genomes of the 21 completely sequenced Thermococcales display a characteristic high level of rearrangements. As a result, the prediction of their origin and termination of replication on the sole basis of chromosomal DNA composition or skew is inoperative. Using a different approach based on biologically relevant sequences, we were able to determine oriC position in all 21 genomes. The position of dif, the site where chromosome dimers are resolved before DNA segregation could be predicted in 19 genomes. Computation of the core genome uncovered a number of essential gene clusters with a remarkably stable chromosomal position across species, in sharp contrast with the scrambled nature of their genomes. The active chromosomal reorganization of numerous genes acquired by horizontal transfer, mainly from mobile elements, could explain this phenomenon. PMID:26166067

  9. Joint assembly and genetic mapping of the Atlantic horseshoe crab genome reveals ancient whole genome duplication

    PubMed Central

    2014-01-01

    Background Horseshoe crabs are marine arthropods with a fossil record extending back approximately 450 million years. They exhibit remarkable morphological stability over their long evolutionary history, retaining a number of ancestral arthropod traits, and are often cited as examples of “living fossils.” As arthropods, they belong to the Ecdysozoa, an ancient super-phylum whose sequenced genomes (including insects and nematodes) have thus far shown more divergence from the ancestral pattern of eumetazoan genome organization than cnidarians, deuterostomes and lophotrochozoans. However, much of ecdysozoan diversity remains unrepresented in comparative genomic analyses. Results Here we apply a new strategy of combined de novo assembly and genetic mapping to examine the chromosome-scale genome organization of the Atlantic horseshoe crab, Limulus polyphemus. We constructed a genetic linkage map of this 2.7 Gbp genome by sequencing the nuclear DNA of 34 wild-collected, full-sibling embryos and their parents at a mean redundancy of 1.1x per sample. The map includes 84,307 sequence markers grouped into 1,876 distinct genetic intervals and 5,775 candidate conserved protein coding genes. Conclusions Comparison with other metazoan genomes shows that the L. polyphemus genome preserves ancestral bilaterian linkage groups, and that a common ancestor of modern horseshoe crabs underwent one or more ancient whole genome duplications 300 million years ago, followed by extensive chromosome fusion. These results provide a counter-example to the often noted correlation between whole genome duplication and evolutionary radiations. The new, low-cost genetic mapping method for obtaining a chromosome-scale view of non-model organism genomes that we demonstrate here does not require laboratory culture, and is potentially applicable to a broad range of other species. PMID:24987520

  10. Genomic Characterization of Methanomicrobiales Reveals Three Classes of Methanogens

    SciTech Connect

    Anderson, Iain; Ulrich, Luke; Lupa, Boguslaw; Susanti, Dwi; Porat, I.; Hooper, Sean; Lykidis, A; Sieprawska-Lupa, Magdalena; Dharmarajan, Lakshmi; Goltsman, Eugene; Lapidus, Alla L.; Saunders, Elizabeth H; Han, Cliff; Land, Miriam L; Lucas, Susan; Mukhopadhyay, Biswarup; Whitman, William; Woese, Carl; Bristow, James; Kyrpides, Nikos C

    2009-01-01

    Background Methanomicrobiales is the least studied order of methanogens. While these organisms appear to be more closely related to the Methanosarcinales in ribosomal-based phylogenetic analyses, they are metabolically more similar to Class I methanogens. Methodology/Principal Findings In order to improve our understanding of this lineage, we have completely sequenced the genomes of two members of this order, Methanocorpusculum labreanum Z and Methanoculleus marisnigri JR1, and compared them with the genome of a third, Methanospirillum hungatei JF-1. Similar to Class I methanogens, Methanomicrobiales use a partial reductive citric acid cycle for 2-oxoglutarate biosynthesis, and they have the Eha energy-converting hydrogenase. In common with Methanosarcinales, Methanomicrobiales possess the Ech hydrogenase and at least some of them may couple formylmethanofuran formation and heterodisulfide reduction to transmembrane ion gradients. Uniquely, M. labreanum and M. hungatei contain hydrogenases similar to the Pyrococcus furiosus Mbh hydrogenase, and all three Methanomicrobiales have anti-sigma factor and anti-anti-sigma factor regulatory proteins not found in other methanogens. Phylogenetic analysis based on seven core proteins of methanogenesis and cofactor biosynthesis places the Methanomicrobiales equidistant from Class I methanogens and Methanosarcinales. Conclusions/Significance Our results indicate that Methanomicrobiales, rather than being similar to Class I methanogens or Methanomicrobiales, share some features of both and have some unique properties. We find that there are three distinct classes of methanogens: the Class I methanogens, the Methanomicrobiales (Class II), and the Methanosarcinales (Class III).

  11. Genomic Characterization of Methanomicrobiales Reveals Three Classes of Methanogens

    SciTech Connect

    Anderson, Iain; Ulrich, Luke E.; Lupa, Boguslaw; Susanti, Dwi; Porat, Iris; Hooper, Sean D.; Lykidis, Athanasios; Sieprawska-Lupa, Magdalena; Dharmarajan, Lakshmi; Goltsman, Eugene; Lapidus, Alla; Saunders, Elizabeth; Han, Cliff; Land, Miriam; Lucas, Susan; Mukhopadhyay, Biswarup; Whitman, William B.; Woese, Carl; Bristow, James; Kyrpides, Nikos

    2009-05-01

    Methanomicrobiales is the least studied order of methanogens. While these organisms appear to be more closely related to the Methanosarcinales in ribosomal-based phylogenetic analyses, they are metabolically more similar to Class I methanogens. In order to improve our understanding of this lineage, we have completely sequenced the genomes of two members of this order, Methanocorpusculum labreanum Z and Methanoculleus marisnigri JR1, and compared them with the genome of a third, Methanospirillum hungatei JF-1. Similar to Class I methanogens, Methanomicrobiales use a partial reductive citric acid cycle for 2-oxoglutarate biosynthesis, and they have the Eha energy-converting hydrogenase. In common with Methanosarcinales, Methanomicrobiales possess the Ech hydrogenase and at least some of them may couple formylmethanofuran formation and heterodisulfide reduction to transmembrane ion gradients. Uniquely, M. labreanum and M. hungatei contain hydrogenases similar to the Pyrococcus furiosus Mbh hydrogenase, and all three Methanomicrobiales have anti-sigma factor and anti-anti-sigma factor regulatory proteins not found in other methanogens. Phylogenetic analysis based on seven core proteins of methanogenesis and cofactor biosynthesis places the Methanomicrobiales equidistant from Class I methanogens and Methanosarcinales. Our results indicate that Methanomicrobiales, rather than being similar to Class I methanogens or Methanomicrobiales, share some features of both and have some unique properties. We find that there are three distinct classes of methanogens: the Class I methanogens, the Methanomicrobiales (Class II), and the Methanosarcinales (Class III).

  12. Comparative Genomics Analysis of Streptomyces Species Reveals Their Adaptation to the Marine Environment and Their Diversity at the Genomic Level

    PubMed Central

    Tian, Xinpeng; Zhang, Zhewen; Yang, Tingting; Chen, Meili; Li, Jie; Chen, Fei; Yang, Jin; Li, Wenjie; Zhang, Bing; Zhang, Zhang; Wu, Jiayan; Zhang, Changsheng; Long, Lijuan; Xiao, Jingfa

    2016-01-01

    Over 200 genomes of streptomycete strains that were isolated from various environments are available from the NCBI. However, little is known about the characteristics that are linked to marine adaptation in marine-derived streptomycetes. The particularity and complexity of the marine environment suggest that marine streptomycetes are genetically diverse. Here, we sequenced nine strains from the Streptomyces genus that were isolated from different longitudes, latitudes, and depths of the South China Sea. Then we compared these strains to 22 NCBI downloaded streptomycete strains. Thirty-one streptomycete strains are clearly grouped into a marine-derived subgroup and multiple source subgroup-based phylogenetic tree. The phylogenetic analyses have revealed the dynamic process underlying streptomycete genome evolution, and lateral gene transfer is an important driving force during the process. Pan-genomics analyses have revealed that streptomycetes have an open pan-genome, which reflects the diversity of these streptomycetes and guarantees the species a quick and economical response to diverse environments. Functional and comparative genomics analyses indicate that the marine-derived streptomycetes subgroup possesses some common characteristics of marine adaptation. Our findings have expanded our knowledge of how ocean isolates of streptomycete strains adapt to marine environments. The availability of streptomycete genomes from the South China Sea will be beneficial for further analysis on marine streptomycetes and will enrich the South China Sea’s genetic data sources. PMID:27446038

  13. Comparative Genomics Analysis of Streptomyces Species Reveals Their Adaptation to the Marine Environment and Their Diversity at the Genomic Level.

    PubMed

    Tian, Xinpeng; Zhang, Zhewen; Yang, Tingting; Chen, Meili; Li, Jie; Chen, Fei; Yang, Jin; Li, Wenjie; Zhang, Bing; Zhang, Zhang; Wu, Jiayan; Zhang, Changsheng; Long, Lijuan; Xiao, Jingfa

    2016-01-01

    Over 200 genomes of streptomycete strains that were isolated from various environments are available from the NCBI. However, little is known about the characteristics that are linked to marine adaptation in marine-derived streptomycetes. The particularity and complexity of the marine environment suggest that marine streptomycetes are genetically diverse. Here, we sequenced nine strains from the Streptomyces genus that were isolated from different longitudes, latitudes, and depths of the South China Sea. Then we compared these strains to 22 NCBI downloaded streptomycete strains. Thirty-one streptomycete strains are clearly grouped into a marine-derived subgroup and multiple source subgroup-based phylogenetic tree. The phylogenetic analyses have revealed the dynamic process underlying streptomycete genome evolution, and lateral gene transfer is an important driving force during the process. Pan-genomics analyses have revealed that streptomycetes have an open pan-genome, which reflects the diversity of these streptomycetes and guarantees the species a quick and economical response to diverse environments. Functional and comparative genomics analyses indicate that the marine-derived streptomycetes subgroup possesses some common characteristics of marine adaptation. Our findings have expanded our knowledge of how ocean isolates of streptomycete strains adapt to marine environments. The availability of streptomycete genomes from the South China Sea will be beneficial for further analysis on marine streptomycetes and will enrich the South China Sea's genetic data sources. PMID:27446038

  14. High-resolution genomic profiling of chronic lymphocytic leukemia reveals new recurrent genomic alterations.

    PubMed

    Edelmann, Jennifer; Holzmann, Karlheinz; Miller, Florian; Winkler, Dirk; Bühler, Andreas; Zenz, Thorsten; Bullinger, Lars; Kühn, Michael W M; Gerhardinger, Andreas; Bloehdorn, Johannes; Radtke, Ina; Su, Xiaoping; Ma, Jing; Pounds, Stanley; Hallek, Michael; Lichter, Peter; Korbel, Jan; Busch, Raymonde; Mertens, Daniel; Downing, James R; Stilgenbauer, Stephan; Döhner, Hartmut

    2012-12-01

    To identify genomic alterations in chronic lymphocytic leukemia (CLL), we performed single-nucleotide polymorphism-array analysis using Affymetrix Version 6.0 on 353 samples from untreated patients entered in the CLL8 treatment trial. Based on paired-sample analysis (n = 144), a mean of 1.8 copy number alterations per patient were identified; approximately 60% of patients carried no copy number alterations other than those detected by fluorescence in situ hybridization analysis. Copy-neutral loss-of-heterozygosity was detected in 6% of CLL patients and was found most frequently on 13q, 17p, and 11q. Minimally deleted regions were refined on 13q14 (deleted in 61% of patients) to the DLEU1 and DLEU2 genes, on 11q22.3 (27% of patients) to ATM, on 2p16.1-2p15 (gained in 7% of patients) to a 1.9-Mb fragment containing 9 genes, and on 8q24.21 (5% of patients) to a segment 486 kb proximal to the MYC locus. 13q deletions exhibited proximal and distal breakpoint cluster regions. Among the most common novel lesions were deletions at 15q15.1 (4% of patients), with the smallest deletion (70.48 kb) found in the MGA locus. Sequence analysis of MGA in 59 samples revealed a truncating mutation in one CLL patient lacking a 15q deletion. MNT at 17p13.3, which in addition to MGA and MYC encodes for the network of MAX-interacting proteins, was also deleted recurrently. PMID:23047824

  15. Plasmodium knowlesi Genome Sequences from Clinical Isolates Reveal Extensive Genomic Dimorphism

    PubMed Central

    Millar, Scott B.; Sanderson, Theo; Otto, Thomas D.; Lu, Woon Chan; Krishna, Sanjeev; Rayner, Julian C.; Cox-Singh, Janet

    2015-01-01

    Plasmodium knowlesi is a newly described zoonosis that causes malaria in the human population that can be severe and fatal. The study of P. knowlesi parasites from human clinical isolates is relatively new and, in order to obtain maximum information from patient sample collections, we explored the possibility of generating P. knowlesi genome sequences from archived clinical isolates. Our patient sample collection consisted of frozen whole blood samples that contained excessive human DNA contamination and, in that form, were not suitable for parasite genome sequencing. We developed a method to reduce the amount of human DNA in the thawed blood samples in preparation for high throughput parasite genome sequencing using Illumina HiSeq and MiSeq sequencing platforms. Seven of fifteen samples processed had sufficiently pure P. knowlesi DNA for whole genome sequencing. The reads were mapped to the P. knowlesi H strain reference genome and an average mapping of 90% was obtained. Genes with low coverage were removed leaving 4623 genes for subsequent analyses. Previously we identified a DNA sequence dimorphism on a small fragment of the P. knowlesi normocyte binding protein xa gene on chromosome 14. We used the genome data to assemble full-length Pknbpxa sequences and discovered that the dimorphism extended along the gene. An in-house algorithm was developed to detect SNP sites co-associating with the dimorphism. More than half of the P. knowlesi genome was dimorphic, involving genes on all chromosomes and suggesting that two distinct types of P. knowlesi infect the human population in Sarawak, Malaysian Borneo. We use P. knowlesi clinical samples to demonstrate that Plasmodium DNA from archived patient samples can produce high quality genome data. We show that analyses, of even small numbers of difficult clinical malaria isolates, can generate comprehensive genomic information that will improve our understanding of malaria parasite diversity and pathobiology. PMID:25830531

  16. Single-Molecule FISH Reveals Non-selective Packaging of Rift Valley Fever Virus Genome Segments

    PubMed Central

    Wichgers Schreur, Paul J.; Kortekaas, Jeroen

    2016-01-01

    The bunyavirus genome comprises a small (S), medium (M), and large (L) RNA segment of negative polarity. Although genome segmentation confers evolutionary advantages by enabling genome reassortment events with related viruses, genome segmentation also complicates genome replication and packaging. Accumulating evidence suggests that genomes of viruses with eight or more genome segments are incorporated into virions by highly selective processes. Remarkably, little is known about the genome packaging process of the tri-segmented bunyaviruses. Here, we evaluated, by single-molecule RNA fluorescence in situ hybridization (FISH), the intracellular spatio-temporal distribution and replication kinetics of the Rift Valley fever virus (RVFV) genome and determined the segment composition of mature virions. The results reveal that the RVFV genome segments start to replicate near the site of infection before spreading and replicating throughout the cytoplasm followed by translocation to the virion assembly site at the Golgi network. Despite the average intracellular S, M and L genome segments approached a 1:1:1 ratio, major differences in genome segment ratios were observed among cells. We also observed a significant amount of cells lacking evidence of M-segment replication. Analysis of two-segmented replicons and four-segmented viruses subsequently confirmed the previous notion that Golgi recruitment is mediated by the Gn glycoprotein. The absence of colocalization of the different segments in the cytoplasm and the successful rescue of a tri-segmented variant with a codon shuffled M-segment suggested that inter-segment interactions are unlikely to drive the copackaging of the different segments into a single virion. The latter was confirmed by direct visualization of RNPs inside mature virions which showed that the majority of virions lack one or more genome segments. Altogether, this study suggests that RVFV genome packaging is a non-selective process. PMID:27548280

  17. Single-Molecule FISH Reveals Non-selective Packaging of Rift Valley Fever Virus Genome Segments.

    PubMed

    Wichgers Schreur, Paul J; Kortekaas, Jeroen

    2016-08-01

    The bunyavirus genome comprises a small (S), medium (M), and large (L) RNA segment of negative polarity. Although genome segmentation confers evolutionary advantages by enabling genome reassortment events with related viruses, genome segmentation also complicates genome replication and packaging. Accumulating evidence suggests that genomes of viruses with eight or more genome segments are incorporated into virions by highly selective processes. Remarkably, little is known about the genome packaging process of the tri-segmented bunyaviruses. Here, we evaluated, by single-molecule RNA fluorescence in situ hybridization (FISH), the intracellular spatio-temporal distribution and replication kinetics of the Rift Valley fever virus (RVFV) genome and determined the segment composition of mature virions. The results reveal that the RVFV genome segments start to replicate near the site of infection before spreading and replicating throughout the cytoplasm followed by translocation to the virion assembly site at the Golgi network. Despite the average intracellular S, M and L genome segments approached a 1:1:1 ratio, major differences in genome segment ratios were observed among cells. We also observed a significant amount of cells lacking evidence of M-segment replication. Analysis of two-segmented replicons and four-segmented viruses subsequently confirmed the previous notion that Golgi recruitment is mediated by the Gn glycoprotein. The absence of colocalization of the different segments in the cytoplasm and the successful rescue of a tri-segmented variant with a codon shuffled M-segment suggested that inter-segment interactions are unlikely to drive the copackaging of the different segments into a single virion. The latter was confirmed by direct visualization of RNPs inside mature virions which showed that the majority of virions lack one or more genome segments. Altogether, this study suggests that RVFV genome packaging is a non-selective process. PMID:27548280

  18. Comparative Genomics Reveals Biomarkers to Identify Lactobacillus Species.

    PubMed

    Koul, Shikha; Kalia, Vipin Chandra

    2016-09-01

    Bacteria possessing multiple copies of 16S rRNA (rrs) gene demonstrate high intragenomic heterogeneity. It hinders clear distinction at species level and even leads to overestimation of the bacterial diversity. Fifty completely sequenced genomes belonging to 19 species of Lactobacillus species were found to possess 4-9 copies of rrs each. Multiple sequence alignment of 268 rrs genes from all the 19 species could be classified into 20 groups. Lactobacillus sanfranciscensis TMW 1.1304 was the only species where all the 7 copies of rrs were exactly similar and thus formed a distinct group. In order to circumvent the problem of high heterogeneity arising due to multiple copies of rrs, 19 additional genes (732-3645 nucleotides in size) common to Lactobacillus genomes, were selected and digested with 10 Type II restriction endonucleases (RE), under in silico conditions. The following unique gene-RE combinations: recA (1098 nts)-HpyCH4 V, CviAII, BfuCI and RsaI were found to be useful in identifying 29 strains representing 17 species. Digestion patterns of genes-ruvB (1020 nts), dnaA (1368 nts), purA (1290 nts), dnaJ (1140 nts), and gyrB (1944 nts) in combination with REs-AluI, BfuCI, CviAI, Taq1, and Tru9I allowed clear identification of an additional 14 strains belonging to 8 species. Digestion pattern of genes recA, ruvB, dnaA, purA, dnaJ and gyrB can be used as biomarkers for identifying different species of Lactobacillus. PMID:27407290

  19. Efficient analysis of mouse genome sequences reveal many nonsense variants.

    PubMed

    Steeland, Sophie; Timmermans, Steven; Van Ryckeghem, Sara; Hulpiau, Paco; Saeys, Yvan; Van Montagu, Marc; Vandenbroucke, Roosmarijn E; Libert, Claude

    2016-05-17

    Genetic polymorphisms in coding genes play an important role when using mouse inbred strains as research models. They have been shown to influence research results, explain phenotypical differences between inbred strains, and increase the amount of interesting gene variants present in the many available inbred lines. SPRET/Ei is an inbred strain derived from Mus spretus that has ∼1% sequence difference with the C57BL/6J reference genome. We obtained a listing of all SNPs and insertions/deletions (indels) present in SPRET/Ei from the Mouse Genomes Project (Wellcome Trust Sanger Institute) and processed these data to obtain an overview of all transcripts having nonsynonymous coding sequence variants. We identified 8,883 unique variants affecting 10,096 different transcripts from 6,328 protein-coding genes, which is about 28% of all coding genes. Because only a subset of these variants results in drastic changes in proteins, we focused on variations that are nonsense mutations that ultimately resulted in a gain of a stop codon. These genes were identified by in silico changing the C57BL/6J coding sequences to the SPRET/Ei sequences, converting them to amino acid (AA) sequences, and comparing the AA sequences. All variants and transcripts affected were also stored in a database, which can be browsed using a SPRET/Ei M. spretus variants web tool (www.spretus.org), including a manual. We validated the tool by demonstrating the loss of function of three proteins predicted to be severely truncated, namely Fas, IRAK2, and IFNγR1. PMID:27147605

  20. Decelerated genome evolution in modern vertebrates revealed by analysis of multiple lancelet genomes

    PubMed Central

    Huang, Shengfeng; Chen, Zelin; Yan, Xinyu; Yu, Ting; Huang, Guangrui; Yan, Qingyu; Pontarotti, Pierre Antoine; Zhao, Hongchen; Li, Jie; Yang, Ping; Wang, Ruihua; Li, Rui; Tao, Xin; Deng, Ting; Wang, Yiquan; Li, Guang; Zhang, Qiujin; Zhou, Sisi; You, Leiming; Yuan, Shaochun; Fu, Yonggui; Wu, Fenfang; Dong, Meiling; Chen, Shangwu; Xu, Anlong

    2014-01-01

    Vertebrates diverged from other chordates ~500 Myr ago and experienced successful innovations and adaptations, but the genomic basis underlying vertebrate origins are not fully understood. Here we suggest, through comparison with multiple lancelet (amphioxus) genomes, that ancient vertebrates experienced high rates of protein evolution, genome rearrangement and domain shuffling and that these rates greatly slowed down after the divergence of jawed and jawless vertebrates. Compared with lancelets, modern vertebrates retain, at least relatively, less protein diversity, fewer nucleotide polymorphisms, domain combinations and conserved non-coding elements (CNE). Modern vertebrates also lost substantial transposable element (TE) diversity, whereas lancelets preserve high TE diversity that includes even the long-sought RAG transposon. Lancelets also exhibit rapid gene turnover, pervasive transcription, fastest exon shuffling in metazoans and substantial TE methylation not observed in other invertebrates. These new lancelet genome sequences provide new insights into the chordate ancestral state and the vertebrate evolution. PMID:25523484

  1. Decelerated genome evolution in modern vertebrates revealed by analysis of multiple lancelet genomes.

    PubMed

    Huang, Shengfeng; Chen, Zelin; Yan, Xinyu; Yu, Ting; Huang, Guangrui; Yan, Qingyu; Pontarotti, Pierre Antoine; Zhao, Hongchen; Li, Jie; Yang, Ping; Wang, Ruihua; Li, Rui; Tao, Xin; Deng, Ting; Wang, Yiquan; Li, Guang; Zhang, Qiujin; Zhou, Sisi; You, Leiming; Yuan, Shaochun; Fu, Yonggui; Wu, Fenfang; Dong, Meiling; Chen, Shangwu; Xu, Anlong

    2014-01-01

    Vertebrates diverged from other chordates ~500 Myr ago and experienced successful innovations and adaptations, but the genomic basis underlying vertebrate origins are not fully understood. Here we suggest, through comparison with multiple lancelet (amphioxus) genomes, that ancient vertebrates experienced high rates of protein evolution, genome rearrangement and domain shuffling and that these rates greatly slowed down after the divergence of jawed and jawless vertebrates. Compared with lancelets, modern vertebrates retain, at least relatively, less protein diversity, fewer nucleotide polymorphisms, domain combinations and conserved non-coding elements (CNE). Modern vertebrates also lost substantial transposable element (TE) diversity, whereas lancelets preserve high TE diversity that includes even the long-sought RAG transposon. Lancelets also exhibit rapid gene turnover, pervasive transcription, fastest exon shuffling in metazoans and substantial TE methylation not observed in other invertebrates. These new lancelet genome sequences provide new insights into the chordate ancestral state and the vertebrate evolution. PMID:25523484

  2. Integrated Genomic Analysis of Pancreatic Ductal Adenocarcinomas Reveals Genomic Rearrangement Events as Significant Drivers of Disease.

    PubMed

    Murphy, Stephen J; Hart, Steven N; Halling, Geoffrey C; Johnson, Sarah H; Smadbeck, James B; Drucker, Travis; Lima, Joema Felipe; Rohakhtar, Fariborz Rakhshan; Harris, Faye R; Kosari, Farhad; Subramanian, Subbaya; Petersen, Gloria M; Wiltshire, Timothy D; Kipp, Benjamin R; Truty, Mark J; McWilliams, Robert R; Couch, Fergus J; Vasmatzis, George

    2016-02-01

    Many somatic mutations have been detected in pancreatic ductal adenocarcinoma (PDAC), leading to the identification of some key drivers of disease progression, but the involvement of large genomic rearrangements has often been overlooked. In this study, we performed mate pair sequencing (MPseq) on genomic DNA from 24 PDAC tumors, including 15 laser-captured microdissected PDAC and 9 patient-derived xenografts, to identify genome-wide rearrangements. Large genomic rearrangements with intragenic breakpoints altering key regulatory genes involved in PDAC progression were detected in all tumors. SMAD4, ZNF521, and FHIT were among the most frequently hit genes. Conversely, commonly reported genes with copy number gains, including MYC and GATA6, were frequently observed in the absence of direct intragenic breakpoints, suggesting a requirement for sustaining oncogenic function during PDAC progression. Integration of data from MPseq, exome sequencing, and transcriptome analysis of primary PDAC cases identified limited overlap in genes affected by both rearrangements and point mutations. However, significant overlap was observed in major PDAC-associated signaling pathways, with all PDAC exhibiting reduced SMAD4 expression, reduced SMAD-dependent TGFβ signaling, and increased WNT and Hedgehog signaling. The frequent loss of SMAD4 and FHIT due to genomic rearrangements strongly implicates these genes as key drivers of PDAC, thus highlighting the strengths of an integrated genomic and transcriptomic approach for identifying mechanisms underlying disease initiation and progression. PMID:26676757

  3. Pancreatic cancer genomes reveal aberrations in axon guidance pathway genes

    PubMed Central

    Biankin, Andrew V.; Waddell, Nicola; Kassahn, Karin S.; Gingras, Marie-Claude; Muthuswamy, Lakshmi B.; Johns, Amber L.; Miller, David K.; Wilson, Peter J.; Patch, Ann-Marie; Wu, Jianmin; Chang, David K.; Cowley, Mark J.; Gardiner, Brooke B.; Song, Sarah; Harliwong, Ivon; Idrisoglu, Senel; Nourse, Craig; Nourbakhsh, Ehsan; Manning, Suzanne; Wani, Shivangi; Gongora, Milena; Pajic, Marina; Scarlett, Christopher J.; Gill, Anthony J.; Pinho, Andreia V.; Rooman, Ilse; Anderson, Matthew; Holmes, Oliver; Leonard, Conrad; Taylor, Darrin; Wood, Scott; Xu, Qinying; Nones, Katia; Fink, J. Lynn; Christ, Angelika; Bruxner, Tim; Cloonan, Nicole; Kolle, Gabriel; Newell, Felicity; Pinese, Mark; Mead, R. Scott; Humphris, Jeremy L.; Kaplan, Warren; Jones, Marc D.; Colvin, Emily K.; Nagrial, Adnan M.; Humphrey, Emily S.; Chou, Angela; Chin, Venessa T.; Chantrill, Lorraine A.; Mawson, Amanda; Samra, Jaswinder S.; Kench, James G.; Lovell, Jessica A.; Daly, Roger J.; Merrett, Neil D.; Toon, Christopher; Epari, Krishna; Nguyen, Nam Q.; Barbour, Andrew; Zeps, Nikolajs; Kakkar, Nipun; Zhao, Fengmei; Wu, Yuan Qing; Wang, Min; Muzny, Donna M.; Fisher, William E.; Brunicardi, F. Charles; Hodges, Sally E.; Reid, Jeffrey G.; Drummond, Jennifer; Chang, Kyle; Han, Yi; Lewis, Lora R.; Dinh, Huyen; Buhay, Christian J.; Beck, Timothy; Timms, Lee; Sam, Michelle; Begley, Kimberly; Brown, Andrew; Pai, Deepa; Panchal, Ami; Buchner, Nicholas; De Borja, Richard; Denroche, Robert E.; Yung, Christina K.; Serra, Stefano; Onetto, Nicole; Mukhopadhyay, Debabrata; Tsao, Ming-Sound; Shaw, Patricia A.; Petersen, Gloria M.; Gallinger, Steven; Hruban, Ralph H.; Maitra, Anirban; Iacobuzio-Donahue, Christine A.; Schulick, Richard D.; Wolfgang, Christopher L.; Morgan, Richard A.; Lawlor, Rita T.; Capelli, Paola; Corbo, Vincenzo; Scardoni, Maria; Tortora, Giampaolo; Tempero, Margaret A.; Mann, Karen M.; Jenkins, Nancy A.; Perez-Mancera, Pedro A.; Adams, David J.; Largaespada, David A.; Wessels, Lodewyk F. A.; Rust, Alistair G.; Stein, Lincoln D.; Tuveson, David A.; Copeland, Neal G.; Musgrove, Elizabeth A.; Scarpa, Aldo; Eshleman, James R.; Hudson, Thomas J.; Sutherland, Robert L.; Wheeler, David A.; Pearson, John V.; McPherson, John D.; Gibbs, Richard A.; Grimmond, Sean M.

    2012-01-01

    Pancreatic cancer is a highly lethal malignancy with few effective therapies. We performed exome sequencing and copy number analysis to define genomic aberrations in a prospectively accrued clinical cohort (n = 142) of early (stage I and II) sporadic pancreatic ductal adenocarcinoma. Detailed analysis of 99 informative tumours identified substantial heterogeneity with 2,016 non-silent mutations and 1,628 copy-number variations. We define 16 significantly mutated genes, reaffirming known mutations (KRAS, TP53, CDKN2A, SMAD4, MLL3, TGFBR2, ARID1A and SF3B1), and uncover novel mutated genes including additional genes involved in chromatin modification (EPC1 and ARID2), DNA damage repair (ATM) and other mechanisms (ZIM2, MAP2K4, NALCN, SLC16A4 and MAGEA6). Integrative analysis with in vitro functional data and animal models provided supportive evidence for potential roles for these genetic aberrations in carcinogenesis. Pathway-based analysis of recurrently mutated genes recapitulated clustering in core signalling pathways in pancreatic ductal adenocarcinoma, and identified new mutated genes in each pathway. We also identified frequent and diverse somatic aberrations in genes described traditionally as embryonic regulators of axon guidance, particularly SLIT/ROBO signalling, which was also evident in murine Sleeping Beauty transposon-mediated somatic mutagenesis models of pancreatic cancer, providing further supportive evidence for the potential involvement of axon guidance genes in pancreatic carcinogenesis. PMID:23103869

  4. Genome structure and primitive sex chromosome revealed in Populus

    SciTech Connect

    Tuskan, Gerald A; Yin, Tongming; Gunter, Lee E; Blaudez, D

    2008-01-01

    We constructed a comprehensive genetic map for Populus and ordered 332 Mb of sequence scaffolds along the 19 haploid chromosomes in order to compare chromosomal regions among diverse members of the genus. These efforts lead us to conclude that chromosome XIX in Populus is evolving into a sex chromosome. Consistent segregation distortion in favor of the sub-genera Tacamahaca alleles provided evidence of divergent selection among species, particularly at the proximal end of chromosome XIX. A large microsatellite marker (SSR) cluster was detected in the distorted region even though the genome-wide distribute SSR sites was uniform across the physical map. The differences between the genetic map and physical sequence data suggested recombination suppression was occurring in the distorted region. A gender-determination locus and an overabundance of NBS-LRR genes were also co-located to the distorted region and were put forth as the cause for divergent selection and recombination suppression. This hypothesis was verified by using fine-scale mapping of an integrated scaffold in the vicinity of the gender-determination locus. As such it appears that chromosome XIX in Populus is in the process of evolving from an autosome into a sex chromosome and that NBS-LRR genes may play important role in the chromosomal diversification process in Populus.

  5. Whole-genome sequencing reveals oncogenic mutations in mycosis fungoides

    PubMed Central

    McGirt, Laura Y.; Jia, Peilin; Baerenwald, Devin A.; Duszynski, Robert J.; Dahlman, Kimberly B.; Zic, John A.; Zwerner, Jeffrey P.; Hucks, Donald; Dave, Utpal; Zhao, Zhongming

    2015-01-01

    The pathogenesis of mycosis fungoides (MF), the most common cutaneous T-cell lymphoma (CTCL), is unknown. Although genetic alterations have been identified, none are considered consistently causative in MF. To identify potential drivers of MF, we performed whole-genome sequencing of MF tumors and matched normal skin. Targeted ultra-deep sequencing of MF samples and exome sequencing of CTCL cell lines were also performed. Multiple mutations were identified that affected the same pathways, including epigenetic, cell-fate regulation, and cytokine signaling, in MF tumors and CTCL cell lines. Specifically, interleukin-2 signaling pathway mutations, including activating Janus kinase 3 (JAK3) mutations, were detected. Treatment with a JAK3 inhibitor significantly reduced CTCL cell survival. Additionally, the mutation data identified 2 other potential contributing factors to MF, ultraviolet light, and a polymorphism in the tumor suppressor p53 (TP53). Therefore, genetic alterations in specific pathways in MF were identified that may be viable, effective new targets for treatment. PMID:26082451

  6. Genomic analysis of regulatory network dynamics reveals large topological changes

    NASA Astrophysics Data System (ADS)

    Luscombe, Nicholas M.; Madan Babu, M.; Yu, Haiyuan; Snyder, Michael; Teichmann, Sarah A.; Gerstein, Mark

    2004-09-01

    Network analysis has been applied widely, providing a unifying language to describe disparate systems ranging from social interactions to power grids. It has recently been used in molecular biology, but so far the resulting networks have only been analysed statically. Here we present the dynamics of a biological network on a genomic scale, by integrating transcriptional regulatory information and gene-expression data for multiple conditions in Saccharomyces cerevisiae. We develop an approach for the statistical analysis of network dynamics, called SANDY, combining well-known global topological measures, local motifs and newly derived statistics. We uncover large changes in underlying network architecture that are unexpected given current viewpoints and random simulations. In response to diverse stimuli, transcription factors alter their interactions to varying degrees, thereby rewiring the network. A few transcription factors serve as permanent hubs, but most act transiently only during certain conditions. By studying sub-network structures, we show that environmental responses facilitate fast signal propagation (for example, with short regulatory cascades), whereas the cell cycle and sporulation direct temporal progression through multiple stages (for example, with highly inter-connected transcription factors). Indeed, to drive the latter processes forward, phase-specific transcription factors inter-regulate serially, and ubiquitously active transcription factors layer above them in a two-tiered hierarchy. We anticipate that many of the concepts presented here-particularly the large-scale topological changes and hub transience-will apply to other biological networks, including complex sub-systems in higher eukaryotes.

  7. Genomic investigation reveals evolution and lifestyle adaptation of endophytic Staphylococcus epidermidis

    PubMed Central

    Chaudhry, Vasvi; Patil, Prabhu B.

    2016-01-01

    Staphylococcus epidermidis is a major human associated bacterium and also an emerging nosocomial pathogen. There are reports of its association to rodents, sheep and plants. However, comparative and evolutionary studies of ecologically diverse strains of S. epidermidis are lacking. Here, we report the whole genome sequences of four S. epidermidis strains isolated from surface sterilized rice seeds along with genome sequence of type strain. Phylogenomic analysis of rice endophytic S. epidermidis (RESE) with “type strain” unequivocally established their species identity. Whole genome based tree of 93 strains of S. epidermidis revealed RESE as distinct sub-lineage which is more related to rodent sub-lineage than to majority of human lineage strains. Furthermore, comparative genomics revealed 20% variable gene-pool in S. epidermidis, suggesting that genomes of ecologically diverse strains are under flux. Interestingly, we were also able to map several genomic regions that are under flux and gave rise to RESE strains. The largest of these genomic regions encodes a cluster of genes unique to RESE that are known to be required for survival and stress tolerance, apart from those required for adaptation to plant habitat. The genomes and genes of RESE represent distinct ecological resource/sequences and provided first evolutionary insights into adaptation of S. epidermidis to plants. PMID:26758912

  8. Comprehensive Genomic Characterization of Campylobacter Genus Reveals Some Underlying Mechanisms for its Genomic Diversification

    PubMed Central

    Zhou, Yizhuang; Bu, Lijing; Guo, Min; Zhou, Chengran; Wang, Yongdong; Chen, Liyu; Liu, Jie

    2013-01-01

    Campylobacter species.are phenotypically diverse in many aspects including host habitats and pathogenicities, which demands comprehensive characterization of the entire Campylobacter genus to study their underlying genetic diversification. Up to now, 34 Campylobacter strains have been sequenced and published in public databases, providing good opportunity to systemically analyze their genomic diversities. In this study, we first conducted genomic characterization, which includes genome-wide alignments, pan-genome analysis, and phylogenetic identification, to depict the genetic diversity of Campylobacter genus. Afterward, we improved the tetranucleotide usage pattern-based naïve Bayesian classifier to identify the abnormal composition fragments (ACFs, fragments with significantly different tetranucleotide frequency profiles from its genomic tetranucleotide frequency profiles) including horizontal gene transfers (HGTs) to explore the mechanisms for the genetic diversity of this organism. Finally, we analyzed the HGTs transferred via bacteriophage transductions. To our knowledge, this study is the first to use single nucleotide polymorphism information to construct liable microevolution phylogeny of 21 Campylobacter jejuni strains. Combined with the phylogeny of all the collected Campylobacter species based on genome-wide core gene information, comprehensive phylogenetic inference of all 34 Campylobacter organisms was determined. It was found that C. jejuni harbors a high fraction of ACFs possibly through intraspecies recombination, whereas other Campylobacter members possess numerous ACFs possibly via intragenus recombination. Furthermore, some Campylobacter strains have undergone significant ancient viral integration during their evolution process. The improved method is a powerful tool for bacterial genomic analysis. Moreover, the findings would provide useful information for future research on Campylobacter genus. PMID:23940551

  9. A SNP based linkage map of the turkey genome reveals multiple intrachromosomal rearrangements between the Turkey and Chicken genomes

    PubMed Central

    2010-01-01

    Background The turkey (Meleagris gallopavo) is an important agricultural species that is the second largest contributor to the world's poultry meat production. The genomic resources of turkey provide turkey breeders with tools needed for the genetic improvement of commercial breeds of turkey for economically important traits. A linkage map of turkey is essential not only for the mapping of quantitative trait loci, but also as a framework to enable the assignment of sequence contigs to specific chromosomes. Comparative genomics with chicken provides insight into mechanisms of genome evolution and helps in identifying rare genomic events such as genomic rearrangements and duplications/deletions. Results Eighteen full sib families, comprising 1008 (35 F1 and 973 F2) birds, were genotyped for 775 single nucleotide polymorphisms (SNPs). Of the 775 SNPs, 570 were informative and used to construct a linkage map in turkey. The final map contains 531 markers in 28 linkage groups. The total genetic distance covered by these linkage groups is 2,324 centimorgans (cM) with the largest linkage group (81 loci) measuring 326 cM. Average marker interval for all markers across the 28 linkage groups is 4.6 cM. Comparative mapping of turkey and chicken revealed two inter-, and 57 intrachromosomal rearrangements between these two species. Conclusion Our turkey genetic map of 531 markers reveals a genome length of 2,324 cM. Our linkage map provides an improvement of previously published maps because of the more even distribution of the markers and because the map is completely based on SNP markers enabling easier and faster genotyping assays than the microsatellitemarkers used in previous linkage maps. Turkey and chicken are shown to have a highly conserved genomic structure with a relatively low number of inter-, and intrachromosomal rearrangements. PMID:21092123

  10. Genome comparison of two Magnaporthe oryzae field isolates reveals genome variations and potential virulence effectors

    PubMed Central

    2013-01-01

    Background Rice blast caused by the fungus Magnaporthe oryzae is an important disease in virtually every rice growing region of the world, which leads to significant annual decreases of grain quality and yield. To prevent disease, resistance genes in rice have been cloned and introduced into susceptible cultivars. However, introduced resistance can often be broken within few years of release, often due to mutation of cognate avirulence genes in fungal field populations. Results To better understand the pattern of mutation of M. oryzae field isolates under natural selection forces, we used a next generation sequencing approach to analyze the genomes of two field isolates FJ81278 and HN19311, as well as the transcriptome of FJ81278. By comparing the de novo genome assemblies of the two isolates against the finished reference strain 70–15, we identified extensive polymorphisms including unique genes, SNPs (single nucleotide polymorphism) and indels, structural variations, copy number variations, and loci under strong positive selection. The 1.75 MB of isolate-specific genome content carrying 118 novel genes from FJ81278, and 0.83 MB from HN19311 were also identified. By analyzing secreted proteins carrying polymorphisms, in total 256 candidate virulence effectors were found and 6 were chosen for functional characterization. Conclusions We provide results from genome comparison analysis showing extensive genome variation, and generated a list of M. oryzae candidate virulence effectors for functional characterization. PMID:24341723

  11. Comparative genome sequencing reveals genomic signature of extreme desiccation tolerance in the anhydrobiotic midge.

    PubMed

    Gusev, Oleg; Suetsugu, Yoshitaka; Cornette, Richard; Kawashima, Takeshi; Logacheva, Maria D; Kondrashov, Alexey S; Penin, Aleksey A; Hatanaka, Rie; Kikuta, Shingo; Shimura, Sachiko; Kanamori, Hiroyuki; Katayose, Yuichi; Matsumoto, Takashi; Shagimardanova, Elena; Alexeev, Dmitry; Govorun, Vadim; Wisecaver, Jennifer; Mikheyev, Alexander; Koyanagi, Ryo; Fujie, Manabu; Nishiyama, Tomoaki; Shigenobu, Shuji; Shibata, Tomoko F; Golygina, Veronika; Hasebe, Mitsuyasu; Okuda, Takashi; Satoh, Nori; Kikawada, Takahiro

    2014-01-01

    Anhydrobiosis represents an extreme example of tolerance adaptation to water loss, where an organism can survive in an ametabolic state until water returns. Here we report the first comparative analysis examining the genomic background of extreme desiccation tolerance, which is exclusively found in larvae of the only anhydrobiotic insect, Polypedilum vanderplanki. We compare the genomes of P. vanderplanki and a congeneric desiccation-sensitive midge P. nubifer. We determine that the genome of the anhydrobiotic species specifically contains clusters of multi-copy genes with products that act as molecular shields. In addition, the genome possesses several groups of genes with high similarity to known protective proteins. However, these genes are located in distinct paralogous clusters in the genome apart from the classical orthologues of the corresponding genes shared by both chironomids and other insects. The transcripts of these clustered paralogues contribute to a large majority of the mRNA pool in the desiccating larvae and most likely define successful anhydrobiosis. Comparison of expression patterns of orthologues between two chironomid species provides evidence for the existence of desiccation-specific gene expression systems in P. vanderplanki. PMID:25216354

  12. Genome resequencing in Populus: Revealing large-scale genome variation and implications on specialized-trait genomics

    SciTech Connect

    Muchero, Wellington; Labbe, Jessy L; Priya, Ranjan; DiFazio, Steven P; Tuskan, Gerald A

    2014-01-01

    To date, Populus ranks among a few plant species with a complete genome sequence and other highly developed genomic resources. With the first genome sequence among all tree species, Populus has been adopted as a suitable model organism for genomic studies in trees. However, far from being just a model species, Populus is a key renewable economic resource that plays a significant role in providing raw materials for the biofuel and pulp and paper industries. Therefore, aside from leading frontiers of basic tree molecular biology and ecological research, Populus leads frontiers in addressing global economic challenges related to fuel and fiber production. The latter fact suggests that research aimed at improving quality and quantity of Populus as a raw material will likely drive the pursuit of more targeted and deeper research in order to unlock the economic potential tied in molecular biology processes that drive this tree species. Advances in genome sequence-driven technologies, such as resequencing individual genotypes, which in turn facilitates large scale SNP discovery and identification of large scale polymorphisms are key determinants of future success in these initiatives. In this treatise we discuss implications of genome sequence-enable technologies on Populus genomic and genetic studies of complex and specialized-traits.

  13. Genome sequencing and comparative genomics of honey bee microsporidia, Nosema apis reveal novel insights into host-parasite interactions

    PubMed Central

    2013-01-01

    Background The microsporidia parasite Nosema contributes to the steep global decline of honey bees that are critical pollinators of food crops. There are two species of Nosema that have been found to infect honey bees, Nosema apis and N. ceranae. Genome sequencing of N. apis and comparative genome analysis with N. ceranae, a fully sequenced microsporidia species, reveal novel insights into host-parasite interactions underlying the parasite infections. Results We applied the whole-genome shotgun sequencing approach to sequence and assemble the genome of N. apis which has an estimated size of 8.5 Mbp. We predicted 2,771 protein- coding genes and predicted the function of each putative protein using the Gene Ontology. The comparative genomic analysis led to identification of 1,356 orthologs that are conserved between the two Nosema species and genes that are unique characteristics of the individual species, thereby providing a list of virulence factors and new genetic tools for studying host-parasite interactions. We also identified a highly abundant motif in the upstream promoter regions of N. apis genes. This motif is also conserved in N. ceranae and other microsporidia species and likely plays a role in gene regulation across the microsporidia. Conclusions The availability of the N. apis genome sequence is a significant addition to the rapidly expanding body of microsprodian genomic data which has been improving our understanding of eukaryotic genome diversity and evolution in a broad sense. The predicted virulent genes and transcriptional regulatory elements are potential targets for innovative therapeutics to break down the life cycle of the parasite. PMID:23829473

  14. Genomic sequencing reveals gene content, genomic organization, and recombination relationships in barley.

    PubMed

    Rostoks, Nils; Park, Yong-Jin; Ramakrishna, Wusirika; Ma, Jianxin; Druka, Arnis; Shiloff, Bryan A; SanMiguel, Phillip J; Jiang, Zeyu; Brueggeman, Robert; Sandhu, Devinder; Gill, Kulvinder; Bennetzen, Jeffrey L; Kleinhofs, Andris

    2002-05-01

    Barley (Hordeum vulgare L.) is one of the most important large-genome cereals with extensive genetic resources available in the public sector. Studies of genome organization in barley have been limited primarily to genetic markers and sparse sequence data. Here we report sequence analysis of 417.5 kb DNA from four BAC clones from different genomic locations. Sequences were analyzed with respect to gene content, the arrangement of repetitive sequences and the relationship of gene density to recombination frequencies. Gene densities ranged from 1 gene per 12 kb to 1 gene per 103 kb with an average of 1 gene per 21 kb. In general, genes were organized into islands separated by large blocks of nested retrotransposons. Single genes in apparent isolation were also found. Genes occupied 11% of the total sequence, LTR retrotransposons and other repeated elements accounted for 51.9% and the remaining 37.1% could not be annotated. PMID:12021850

  15. Comparative Genomics of Flatworms (Platyhelminthes) Reveals Shared Genomic Features of Ecto- and Endoparastic Neodermata

    PubMed Central

    Hahn, Christoph; Fromm, Bastian; Bachmann, Lutz

    2014-01-01

    The ectoparasitic Monogenea comprise a major part of the obligate parasitic flatworm diversity. Although genomic adaptations to parasitism have been studied in the endoparasitic tapeworms (Cestoda) and flukes (Trematoda), no representative of the Monogenea has been investigated yet. We present the high-quality draft genome of Gyrodactylus salaris, an economically important monogenean ectoparasite of wild Atlantic salmon (Salmo salar). A total of 15,488 gene models were identified, of which 7,102 were functionally annotated. The controversial phylogenetic relationships within the obligate parasitic Neodermata were resolved in a phylogenomic analysis using 1,719 gene models (alignment length of >500,000 amino acids) for a set of 16 metazoan taxa. The Monogenea were found basal to the Cestoda and Trematoda, which implies ectoparasitism being plesiomorphic within the Neodermata and strongly supports a common origin of complex life cycles. Comparative analysis of seven parasitic flatworm genomes identified shared genomic features for the ecto- and endoparasitic lineages, such as a substantial reduction of the core bilaterian gene complement, including the homeodomain-containing genes, and a loss of the piwi and vasa genes, which are considered essential for animal development. Furthermore, the shared loss of functional fatty acid biosynthesis pathways and the absence of peroxisomes, the latter organelles presumed ubiquitous in eukaryotes except for parasitic protozoans, were inferred. The draft genome of G. salaris opens for future in-depth analyses of pathogenicity and host specificity of poorly characterized G. salaris strains, and will enhance studies addressing the genomics of host–parasite interactions and speciation in the highly diverse monogenean flatworms. PMID:24732282

  16. Comparative genomics of parasitic silkworm microsporidia reveal an association between genome expansion and host adaptation

    PubMed Central

    2013-01-01

    Background Microsporidian Nosema bombycis has received much attention because the pébrine disease of domesticated silkworms results in great economic losses in the silkworm industry. So far, no effective treatment could be found for pébrine. Compared to other known Nosema parasites, N. bombycis can unusually parasitize a broad range of hosts. To gain some insights into the underlying genetic mechanism of pathological ability and host range expansion in this parasite, a comparative genomic approach is conducted. The genome of two Nosema parasites, N. bombycis and N. antheraeae (an obligatory parasite to undomesticated silkworms Antheraea pernyi), were sequenced and compared with their distantly related species, N. ceranae (an obligatory parasite to honey bees). Results Our comparative genomics analysis show that the N. bombycis genome has greatly expanded due to the following three molecular mechanisms: 1) the proliferation of host-derived transposable elements, 2) the acquisition of many horizontally transferred genes from bacteria, and 3) the production of abundnant gene duplications. To our knowledge, duplicated genes derived not only from small-scale events (e.g., tandem duplications) but also from large-scale events (e.g., segmental duplications) have never been seen so abundant in any reported microsporidia genomes. Our relative dating analysis further indicated that these duplication events have arisen recently over very short evolutionary time. Furthermore, several duplicated genes involving in the cytotoxic metabolic pathway were found to undergo positive selection, suggestive of the role of duplicated genes on the adaptive evolution of pathogenic ability. Conclusions Genome expansion is rarely considered as the evolutionary outcome acting on those highly reduced and compact parasitic microsporidian genomes. This study, for the first time, demonstrates that the parasitic genomes can expand, instead of shrink, through several common molecular mechanisms

  17. Asymmetric cryo-EM reconstruction of phage MS2 reveals genome structure in situ.

    PubMed

    Koning, Roman I; Gomez-Blanco, Josue; Akopjana, Inara; Vargas, Javier; Kazaks, Andris; Tars, Kaspars; Carazo, José María; Koster, Abraham J

    2016-01-01

    In single-stranded ribonucleic acid (RNA) viruses, virus capsid assembly and genome packaging are intertwined processes. Using cryo-electron microscopy and single particle analysis we determined the asymmetric virion structure of bacteriophage MS2, which includes 178 copies of the coat protein, a single copy of the A-protein and the RNA genome. This reveals that in situ, the viral RNA genome can adopt a defined conformation. The RNA forms a branched network of stem-loops that almost all allocate near the capsid inner surface, while predominantly binding to coat protein dimers that are located in one-half of the capsid. This suggests that genomic RNA is highly involved in genome packaging and virion assembly. PMID:27561669

  18. Asymmetric cryo-EM reconstruction of phage MS2 reveals genome structure in situ

    PubMed Central

    Koning, Roman I; Gomez-Blanco, Josue; Akopjana, Inara; Vargas, Javier; Kazaks, Andris; Tars, Kaspars; Carazo, José María; Koster, Abraham J.

    2016-01-01

    In single-stranded ribonucleic acid (RNA) viruses, virus capsid assembly and genome packaging are intertwined processes. Using cryo-electron microscopy and single particle analysis we determined the asymmetric virion structure of bacteriophage MS2, which includes 178 copies of the coat protein, a single copy of the A-protein and the RNA genome. This reveals that in situ, the viral RNA genome can adopt a defined conformation. The RNA forms a branched network of stem-loops that almost all allocate near the capsid inner surface, while predominantly binding to coat protein dimers that are located in one-half of the capsid. This suggests that genomic RNA is highly involved in genome packaging and virion assembly. PMID:27561669

  19. Genetic variability of mutans streptococci revealed by wide whole-genome sequencing

    PubMed Central

    2013-01-01

    Background Mutans streptococci are a group of bacteria significantly contributing to tooth decay. Their genetic variability is however still not well understood. Results Genomes of 6 clinical S. mutans isolates of different origins, one isolate of S. sobrinus (DSM 20742) and one isolate of S. ratti (DSM 20564) were sequenced and comparatively analyzed. Genome alignment revealed a mosaic-like structure of genome arrangement. Genes related to pathogenicity are found to have high variations among the strains, whereas genes for oxidative stress resistance are well conserved, indicating the importance of this trait in the dental biofilm community. Analysis of genome-scale metabolic networks revealed significant differences in 42 pathways. A striking dissimilarity is the unique presence of two lactate oxidases in S. sobrinus DSM 20742, probably indicating an unusual capability of this strain in producing H2O2 and expanding its ecological niche. In addition, lactate oxidases may form with other enzymes a novel energetic pathway in S. sobrinus DSM 20742 that can remedy its deficiency in citrate utilization pathway. Using 67 S. mutans genomes currently available including the strains sequenced in this study, we estimates the theoretical core genome size of S. mutans, and performed modeling of S. mutans pan-genome by applying different fitting models. An “open” pan-genome was inferred. Conclusions The comparative genome analyses revealed diversities in the mutans streptococci group, especially with respect to the virulence related genes and metabolic pathways. The results are helpful for better understanding the evolution and adaptive mechanisms of these oral pathogen microorganisms and for combating them. PMID:23805886

  20. Space Movie Reveals Shocking Secrets Of The Crab Pulsa

    NASA Astrophysics Data System (ADS)

    2002-09-01

    Just when it seemed like the summer movie season had ended, two of NASA's Great Observatories have produced their own action movie. Multiple observations made over several months with NASA's Chandra X-ray Observatory and the Hubble Space Telescope captured the spectacle of matter and antimatter propelled to near the speed of light by the Crab pulsar, a rapidly rotating neutron star the size of Manhattan. "Through this movie, the Crab Nebula has come to life," said Jeff Hester of Arizona State University in Tempe, lead author of a paper in the September 20th issue of The Astrophysical Journal Letters. "We can see how this awesome cosmic generator actually works." The Crab was first observed by Chinese astronomers in 1054 A.D. and has since become one of the most studied objects in the sky. By combining the power of both Chandra and Hubble, the movie reveals features never seen in still images. By understanding the Crab, astronomers hope to unlock the secrets of how similar objects across the universe are powered. Crab Nebula Composite Image Crab Nebula Composite Image Bright wisps can be seen moving outward at half the speed of light to form an expanding ring that is visible in both X-ray and optical images. These wisps appear to originate from a shock wave that shows up as an inner X-ray ring. This ring consists of about two dozen knots that form, brighten and fade, jitter around, and occasionally undergo outbursts that give rise to expanding clouds of particles, but remain in roughly the same location. "These data leave little doubt that the inner X-ray ring is the location of the shock wave that turns the high-speed wind from the pulsar into extremely energetic particles," said Koji Mori of Penn State University in University Park, a coauthor of the paper. Another dramatic feature of the movie is a turbulent jet that lies perpendicular to the inner and outer rings. Violent internal motions are obvious, as is a slow motion outward into the surrounding nebula of

  1. Genome evolution in diploid and tetraploid Coffea species as revealed by comparative analysis of orthologous genome segments.

    PubMed

    Cenci, Alberto; Combes, Marie-Christine; Lashermes, Philippe

    2012-01-01

    Sequence comparison of orthologous regions enables estimation of the divergence between genomes, analysis of their evolution and detection of particular features of the genomes, such as sequence rearrangements and transposable elements. Despite the economic importance of Coffea species, little genomic information is currently available. Coffea is a relatively young genus that includes more than one hundred diploid species and a single tetraploid species. Three Coffea orthologous regions of 470-900 kb were analyzed and compared: both subgenomes of allotetraploid Coffea arabica (contributed by the diploid species Coffea eugenioides and Coffea canephora) and the genome of diploid C. canephora. Sequence divergence was calculated on global alignments or on coding and non-coding sequences separately. A search for transposable elements detected 43 retrotransposons and 198 transposons in the sequences analyzed. Comparative insertion analysis made it possible to locate 165 TE insertions in the phylogenetic tree of the three genomes/subgenomes. In the tetraploid C. arabica, a homoeologous non-reciprocal transposition (HNRT) was detected and characterized: a 50 kb region of the C. eugenioides derived subgenome replaced the C. canephora derived counterpart. Comparative sequence analysis on three Coffea genomes/subgenomes revealed almost perfect gene synteny, low sequence divergence and a high number of shared transposable elements. Compared to the results of similar analysis in other genera (Aegilops/Triticum and Oryza), Coffea genomes/subgenomes appeared to be dramatically less diverged, which is consistent with the relatively recent radiation of the Coffea genus. Based on nucleotide substitution frequency, the HNRT was dated at 10,000-50,000 years BP, which is also the most recent estimation of the origin of C. arabica. PMID:22086332

  2. Comparative Genomics of the Extreme Acidophile Acidithiobacillus thiooxidans Reveals Intraspecific Divergence and Niche Adaptation

    PubMed Central

    Zhang, Xian; Feng, Xue; Tao, Jiemeng; Ma, Liyuan; Xiao, Yunhua; Liang, Yili; Liu, Xueduan; Yin, Huaqun

    2016-01-01

    Acidithiobacillus thiooxidans known for its ubiquity in diverse acidic and sulfur-bearing environments worldwide was used as the research subject in this study. To explore the genomic fluidity and intraspecific diversity of Acidithiobacillus thiooxidans (A. thiooxidans) species, comparative genomics based on nine draft genomes was performed. Phylogenomic scrutiny provided first insights into the multiple groupings of these strains, suggesting that genetic diversity might be potentially correlated with their geographic distribution as well as geochemical conditions. While these strains shared a large number of common genes, they displayed differences in gene content. Functional assignment indicated that the core genome was essential for microbial basic activities such as energy acquisition and uptake of nutrients, whereas the accessory genome was thought to be involved in niche adaptation. Comprehensive analysis of their predicted central metabolism revealed that few differences were observed among these strains. Further analyses showed evidences of relevance between environmental conditions and genomic diversification. Furthermore, a diverse pool of mobile genetic elements including insertion sequences and genomic islands in all A. thiooxidans strains probably demonstrated the frequent genetic flow (such as lateral gene transfer) in the extremely acidic environments. From another perspective, these elements might endow A. thiooxidans species with capacities to withstand the chemical constraints of their natural habitats. Taken together, our findings bring some valuable data to better understand the genomic diversity and econiche adaptation within A. thiooxidans strains. PMID:27548157

  3. Comparative Genomics of the Extreme Acidophile Acidithiobacillus thiooxidans Reveals Intraspecific Divergence and Niche Adaptation.

    PubMed

    Zhang, Xian; Feng, Xue; Tao, Jiemeng; Ma, Liyuan; Xiao, Yunhua; Liang, Yili; Liu, Xueduan; Yin, Huaqun

    2016-01-01

    Acidithiobacillus thiooxidans known for its ubiquity in diverse acidic and sulfur-bearing environments worldwide was used as the research subject in this study. To explore the genomic fluidity and intraspecific diversity of Acidithiobacillus thiooxidans (A. thiooxidans) species, comparative genomics based on nine draft genomes was performed. Phylogenomic scrutiny provided first insights into the multiple groupings of these strains, suggesting that genetic diversity might be potentially correlated with their geographic distribution as well as geochemical conditions. While these strains shared a large number of common genes, they displayed differences in gene content. Functional assignment indicated that the core genome was essential for microbial basic activities such as energy acquisition and uptake of nutrients, whereas the accessory genome was thought to be involved in niche adaptation. Comprehensive analysis of their predicted central metabolism revealed that few differences were observed among these strains. Further analyses showed evidences of relevance between environmental conditions and genomic diversification. Furthermore, a diverse pool of mobile genetic elements including insertion sequences and genomic islands in all A. thiooxidans strains probably demonstrated the frequent genetic flow (such as lateral gene transfer) in the extremely acidic environments. From another perspective, these elements might endow A. thiooxidans species with capacities to withstand the chemical constraints of their natural habitats. Taken together, our findings bring some valuable data to better understand the genomic diversity and econiche adaptation within A. thiooxidans strains. PMID:27548157

  4. Identification of novel RNA secondary structures within the hepatitis C virus genome reveals a cooperative involvement in genome packaging

    PubMed Central

    Stewart, H.; Bingham, R.J.; White, S. J.; Dykeman, E. C.; Zothner, C.; Tuplin, A. K.; Stockley, P. G.; Twarock, R.; Harris, M.

    2016-01-01

    The specific packaging of the hepatitis C virus (HCV) genome is hypothesised to be driven by Core-RNA interactions. To identify the regions of the viral genome involved in this process, we used SELEX (systematic evolution of ligands by exponential enrichment) to identify RNA aptamers which bind specifically to Core in vitro. Comparison of these aptamers to multiple HCV genomes revealed the presence of a conserved terminal loop motif within short RNA stem-loop structures. We postulated that interactions of these motifs, as well as sub-motifs which were present in HCV genomes at statistically significant levels, with the Core protein may drive virion assembly. We mutated 8 of these predicted motifs within the HCV infectious molecular clone JFH-1, thereby producing a range of mutant viruses predicted to possess altered RNA secondary structures. RNA replication and viral titre were unaltered in viruses possessing only one mutated structure. However, infectivity titres were decreased in viruses possessing a higher number of mutated regions. This work thus identified multiple novel RNA motifs which appear to contribute to genome packaging. We suggest that these structures act as cooperative packaging signals to drive specific RNA encapsidation during HCV assembly. PMID:26972799

  5. Whole-genome sequencing reveals small genomic regions of introgression in an introduced crater lake population of threespine stickleback.

    PubMed

    Yoshida, Kohta; Miyagi, Ryutaro; Mori, Seiichi; Takahashi, Aya; Makino, Takashi; Toyoda, Atsushi; Fujiyama, Asao; Kitano, Jun

    2016-04-01

    Invasive species pose a major threat to biological diversity. Although introduced populations often experience population bottlenecks, some invasive species are thought to be originated from hybridization between multiple populations or species, which can contribute to the maintenance of high genetic diversity. Recent advances in genome sequencing enable us to trace the evolutionary history of invasive species even at whole-genome level and may help to identify the history of past hybridization that may be overlooked by traditional marker-based analysis. Here, we conducted whole-genome sequencing of eight threespine stickleback (Gasterosteus aculeatus) individuals, four from a recently introduced crater lake population and four of the putative source population. We found that both populations have several small genomic regions with high genetic diversity, which resulted from introgression from a closely related species (Gasterosteus nipponicus). The sizes of the regions were too small to be detected with traditional marker-based analysis or even some reduced-representation sequencing methods. Further amplicon sequencing revealed linkage disequilibrium around an introgression site, which suggests the possibility of selective sweep at the introgression site. Thus, interspecies introgression might predate introduction and increase genetic variation in the source population. Whole-genome sequencing of even a small number of individuals can therefore provide higher resolution inference of history of introduced populations. PMID:27069575

  6. Comparative genomic de-convolution of the cotton genome revealed a decaploid ancestor and widespread chromosomal fractionation.

    PubMed

    Wang, Xiyin; Guo, Hui; Wang, Jinpeng; Lei, Tianyu; Liu, Tao; Wang, Zhenyi; Li, Yuxian; Lee, Tae-Ho; Li, Jingping; Tang, Haibao; Jin, Dianchuan; Paterson, Andrew H

    2016-02-01

    The 'apparently' simple genomes of many angiosperms mask complex evolutionary histories. The reference genome sequence for cotton (Gossypium spp.) revealed a ploidy change of a complexity unprecedented to date, indeed that could not be distinguished as to its exact dosage. Herein, by developing several comparative, computational and statistical approaches, we revealed a 5× multiplication in the cotton lineage of an ancestral genome common to cotton and cacao, and proposed evolutionary models to show how such a decaploid ancestor formed. The c. 70% gene loss necessary to bring the ancestral decaploid to its current gene count appears to fit an approximate geometrical model; that is, although many genes may be lost by single-gene deletion events, some may be lost in groups of consecutive genes. Gene loss following cotton decaploidy has largely just reduced gene copy numbers of some homologous groups. We designed a novel approach to deconvolute layers of chromosome homology, providing definitive information on gene orthology and paralogy across broad evolutionary distances, both of fundamental value and serving as an important platform to support further studies in and beyond cotton and genomics communities. PMID:26756535

  7. Dynamics of oscillatory phenotypes in S. cerevisiae reveal a network of genome-wide transcriptional oscillators

    PubMed Central

    Chin, Shwe L.; Marcus, Ian M.; Klevecz, Robert R.; Li, Caroline M.

    2012-01-01

    Genetic and environmental factors are well-studied influences on phenotype; however, time is a variable that is rarely considered when studying changes in cellular phenotype. Time-resolved microarray data revealed genome-wide transcriptional oscillation in a yeast continuous culture system with ~2 and ~4 h periods. We mapped the global patterns of transcriptional oscillations into a 3D map to represent different cellular phenotypes of redox cycles. This map shows the dynamic nature of gene expression in that transcripts are ordered and coupled to each other through time and concentration space. Although cells differed in oscillation periods, transcripts involved in certain processes were conserved in a deterministic way. When oscillation period lengthened, the peak to trough ratio of transcripts increased and the fraction of cells in the unbudded (G0/G1) phase of the cell division cycle increased. Decreasing the glucose level in the culture media was one way to increase the redox cycle, possibly from changes in metabolic flux. The period may be responding to lower glucose levels by increasing the fraction of cells in G1 and reducing S-phase gating so that cells can spend more time in catabolic processes. Our results support that gene transcripts are coordinated with metabolic functions and the cell division cycle. PMID:22289124

  8. Different genome-specific chromosome stabilities in synthetic Brassica allohexaploids revealed by wide crosses with Orychophragmus

    PubMed Central

    Ge, Xian-Hong; Wang, Jing; Li, Zai-Yun

    2009-01-01

    Background and Aims In sexual hybrids between cultivated Brassica species and another crucifer, Orychophragmus violaceus (2n = 24), parental genome separation during mitosis and meiosis is under genetic control but this phenomenon varies depending upon the Brassica species. To further investigate the mechanisms involved in parental genome separation, complex hybrids between synthetic Brassica allohexaploids (2n = 54, AABBCC) from three sources and O. violaceus were obtained and characterized. Methods Genomic in situ hybridization, amplified fragment length polymorphism (AFLP) and single-strand conformation polymorphism (SSCP) were used to explore chromosomal/genomic components and rRNA gene expression of the complex hybrids and their progenies. Key Results Complex hybrids with variable fertility exhibited phenotypes that were different from the female allohexaploids and expressed some traits from O. violaceus. These hybrids were mixoploids (2n = 34–46) and retained partial complements of allohexaploids, including whole chromosomes of the A and B genomes and some of the C genome but no intact O. violaceus chromosomes; AFLP bands specific for O. violaceus, novel for two parents and absent in hexaploids were detected. The complex hybrids produced progenies with chromosomes/genomic complements biased to B. juncea (2n = 36, AABB) and novel B. juncea lines with two genomes of different origins. The expression of rRNA genes from B. nigra was revealed in all allohexaploids and complex hybrids, showing that the hierarchy of nucleolar dominance (B. nigra, BB > B. rapa, AA > B. oleracea, CC) in Brassica allotetraploids was still valid in these plants. Conclusions The chromosomes of three genomes in these synthetic Brassica allohexaploids showed different genome-specific stabilities (B > A > C) under induction of alien chromosome elimination in crosses with O. violaceus, which was possibly affected by nucleolar dominance. PMID:19403626

  9. Adaptations to a subterranean environment and longevity revealed by the analysis of mole rat genomes

    PubMed Central

    Fang, Xiaodong; Seim, Inge; Huang, Zhiyong; Gerashchenko, Maxim V.; Xiong, Zhiqiang; Turanov, Anton A.; Zhu, Yabing; Lobanov, Alexei V.; Fan, Dingding; Yim, Sun Hee; Yao, Xiaoming; Ma, Siming; Yang, Lan; Lee, Sang-Goo; Kim, Eun Bae; Bronson, Roderick T.; Šumbera, Radim; Buffenstein, Rochelle; Zhou, Xin; Krogh, Anders; Park, Thomas J.; Zhang, Guojie; Wang, Jun; Gladyshev, Vadim N.

    2014-01-01

    SUMMARY Subterranean mammals spend their lives in dark, unventilated environments rich in carbon dioxide and ammonia, and low in oxygen. Many of these animals are also long-lived and exhibit reduced aging-associated diseases, such as neurodegenerative disorders and cancer. We sequenced the genome of the Damaraland mole rat (DMR, Fukomys damarensis) and improved the genome assembly of the naked mole rat (NMR, Heterocephalus glaber). Comparative genome analysis, along with transcriptomes of related subterranean rodents, reveal candidate molecular adaptations for subterranean life and longevity, including a divergent insulin peptide, expression of oxygen-carrying globins in the brain, prevention of high CO2-induced pain perception, and enhanced ammonia detoxification. Juxtaposition of the genomes of DMR and other more conventional animals with the genome of NMR revealed several truly exceptional NMR features: unusual thermogenesis, aberrant melatonin system, pain insensitivity, and novel processing of 28S rRNA. Together, the new genomes and transcriptomes extend our understanding of subterranean adaptations, stress resistance and longevity. PMID:25176646

  10. Retroviral enhancer detection insertions in zebrafish combined with comparative genomics reveal genomic regulatory blocks - a fundamental feature of vertebrate genomes

    PubMed Central

    Kikuta, Hiroshi; Fredman, David; Rinkwitz, Silke; Lenhard, Boris; Becker, Thomas S

    2007-01-01

    A large-scale enhancer detection screen was performed in the zebrafish using a retroviral vector carrying a basal promoter and a fluorescent protein reporter cassette. Analysis of insertional hotspots uncovered areas around developmental regulatory genes in which an insertion results in the same global expression pattern, irrespective of exact position. These areas coincide with vertebrate chromosomal segments containing identical gene order; a phenomenon known as conserved synteny and thought to be a vestige of evolution. Genomic comparative studies have found large numbers of highly conserved noncoding elements (HCNEs) spanning these and other loci. HCNEs are thought to act as transcriptional enhancers based on the finding that many of those that have been tested direct tissue specific expression in transient or transgenic assays. Although gene order in hox and other gene clusters has long been known to be conserved because of shared regulatory sequences or overlapping transcriptional units, the chromosomal areas found through insertional hotspots contain only one or a few developmental regulatory genes as well as phylogenetically unrelated genes. We have termed these regions genomic regulatory blocks (GRBs), and show that they underlie the phenomenon of conserved synteny through all sequenced vertebrate genomes. After teleost whole genome duplication, a subset of GRBs were retained in two copies, underwent degenerative changes compared with tetrapod loci that exist as single copy, and that therefore can be viewed as representing the ancestral form. We discuss these findings in light of evolution of vertebrate chromosomal architecture and the identification of human disease mutations. PMID:18047696

  11. Genome size diversity in angiosperms and its influence on gene space.

    PubMed

    Dodsworth, Steven; Leitch, Andrew R; Leitch, Ilia J

    2015-12-01

    Genome size varies c. 2400-fold in angiosperms (flowering plants), although the range of genome size is skewed towards small genomes, with a mean genome size of 1C=5.7Gb. One of the most crucial factors governing genome size in angiosperms is the relative amount and activity of repetitive elements. Recently, there have been new insights into how these repeats, previously discarded as 'junk' DNA, can have a significant impact on gene space (i.e. the part of the genome comprising all the genes and gene-related DNA). Here we review these new findings and explore in what ways genome size itself plays a role in influencing how repeats impact genome dynamics and gene space, including gene expression. PMID:26605684

  12. The Population Genomics of Sunflowers and Genomic Determinants of Protein Evolution Revealed by RNAseq

    PubMed Central

    Renaut, Sébastien; Grassa, Christopher J.; Moyers, Brook T.; Kane, Nolan C.; Rieseberg, Loren H.

    2012-01-01

    Few studies have investigated the causes of evolutionary rate variation among plant nuclear genes, especially in recently diverged species still capable of hybridizing in the wild. The recent advent of Next Generation Sequencing (NGS) permits investigation of genome wide rates of protein evolution and the role of selection in generating and maintaining divergence. Here, we use individual whole-transcriptome sequencing (RNAseq) to refine our understanding of the population genomics of wild species of sunflowers (Helianthus spp.) and the factors that affect rates of protein evolution. We aligned 35 GB of transcriptome sequencing data and identified 433,257 polymorphic sites (SNPs) in a reference transcriptome comprising 16,312 genes. Using SNP markers, we identified strong population clustering largely corresponding to the three species analyzed here (Helianthus annuus, H. petiolaris, H. debilis), with one distinct early generation hybrid. Then, we calculated the proportions of adaptive substitution fixed by selection (alpha) and identified gene ontology categories with elevated values of alpha. The “response to biotic stimulus” category had the highest mean alpha across the three interspecific comparisons, implying that natural selection imposed by other organisms plays an important role in driving protein evolution in wild sunflowers. Finally, we examined the relationship between protein evolution (dN/dS ratio) and several genomic factors predicted to co-vary with protein evolution (gene expression level, divergence and specificity, genetic divergence [FST], and nucleotide diversity pi). We find that variation in rates of protein divergence was correlated with gene expression level and specificity, consistent with results from a broad range of taxa and timescales. This would in turn imply that these factors govern protein evolution both at a microevolutionary and macroevolutionary timescale. Our results contribute to a general understanding of the determinants

  13. Linkage Mapping Reveals Strong Chiasma Interference in Sockeye Salmon: Implications for Interpreting Genomic Data

    PubMed Central

    Limborg, Morten T.; Waples, Ryan K.; Allendorf, Fred W.; Seeb, James E.

    2015-01-01

    Meiotic recombination is fundamental for generating new genetic variation and for securing proper disjunction. Further, recombination plays an essential role during the rediploidization process of polyploid-origin genomes because crossovers between pairs of homeologous chromosomes retain duplicated regions. A better understanding of how recombination affects genome evolution is crucial for interpreting genomic data; unfortunately, current knowledge mainly originates from a few model species. Salmonid fishes provide a valuable system for studying the effects of recombination in nonmodel species. Salmonid females generally produce thousands of embryos, providing large families for conducting inheritance studies. Further, salmonid genomes are currently rediploidizing after a whole genome duplication and can serve as models for studying the role of homeologous crossovers on genome evolution. Here, we present a detailed interrogation of recombination patterns in sockeye salmon (Oncorhynchus nerka). First, we use RAD sequencing of haploid and diploid gynogenetic families to construct a dense linkage map that includes paralogous loci and location of centromeres. We find a nonrandom distribution of paralogs that mainly cluster in extended regions distally located on 11 different chromosomes, consistent with ongoing homeologous recombination in these regions. We also estimate the strength of interference across each chromosome; results reveal strong interference and crossovers are mostly limited to one per arm. Interference was further shown to continue across centromeres, but metacentric chromosomes generally had at least one crossover on each arm. We discuss the relevance of these findings for both mapping and population genomic studies. PMID:26384769

  14. The Complete Genome Sequences, Unique Mutational Spectra, and Developmental Potency of Adult Neurons Revealed by Cloning.

    PubMed

    Hazen, Jennifer L; Faust, Gregory G; Rodriguez, Alberto R; Ferguson, William C; Shumilina, Svetlana; Clark, Royden A; Boland, Michael J; Martin, Greg; Chubukov, Pavel; Tsunemoto, Rachel K; Torkamani, Ali; Kupriyanov, Sergey; Hall, Ira M; Baldwin, Kristin K

    2016-03-16

    Somatic mutation in neurons is linked to neurologic disease and implicated in cell-type diversification. However, the origin, extent, and patterns of genomic mutation in neurons remain unknown. We established a nuclear transfer method to clonally amplify the genomes of neurons from adult mice for whole-genome sequencing. Comprehensive mutation detection and independent validation revealed that individual neurons harbor ∼100 unique mutations from all classes but lack recurrent rearrangements. Most neurons contain at least one gene-disrupting mutation and rare (0-2) mobile element insertions. The frequency and gene bias of neuronal mutations differ from other lineages, potentially due to novel mechanisms governing postmitotic mutation. Fertile mice were cloned from several neurons, establishing the compatibility of mutated adult neuronal genomes with reprogramming to pluripotency and development. PMID:26948891

  15. Baiji genomes reveal low genetic variability and new insights into secondary aquatic adaptations.

    PubMed

    Zhou, Xuming; Sun, Fengming; Xu, Shixia; Fan, Guangyi; Zhu, Kangli; Liu, Xin; Chen, Yuan; Shi, Chengcheng; Yang, Yunxia; Huang, Zhiyong; Chen, Jing; Hou, Haolong; Guo, Xuejiang; Chen, Wenbin; Chen, Yuefeng; Wang, Xiaohong; Lv, Tian; Yang, Dan; Zhou, Jiajian; Huang, Bangqing; Wang, Zhengfei; Zhao, Wei; Tian, Ran; Xiong, Zhiqiang; Xu, Junxiao; Liang, Xinming; Chen, Bingyao; Liu, Weiqing; Wang, Junyi; Pan, Shengkai; Fang, Xiaodong; Li, Ming; Wei, Fuwen; Xu, Xun; Zhou, Kaiya; Wang, Jun; Yang, Guang

    2013-01-01

    The baiji, or Yangtze River dolphin (Lipotes vexillifer), is a flagship species for the conservation of aquatic animals and ecosystems in the Yangtze River of China; however, this species has now been recognized as functionally extinct. Here we report a high-quality draft genome and three re-sequenced genomes of L. vexillifer using Illumina short-read sequencing technology. Comparative genomic analyses reveal that cetaceans have a slow molecular clock and molecular adaptations to their aquatic lifestyle. We also find a significantly lower number of heterozygous single nucleotide polymorphisms in the baiji compared to all other mammalian genomes reported thus far. A reconstruction of the demographic history of the baiji indicates that a bottleneck occurred near the end of the last deglaciation, a time coinciding with a rapid decrease in temperature and the rise of eustatic sea level. PMID:24169659

  16. Whole genome comparison of a large collection of mycobacteriophages reveals a continuum of phage genetic diversity.

    PubMed

    Pope, Welkin H; Bowman, Charles A; Russell, Daniel A; Jacobs-Sera, Deborah; Asai, David J; Cresawn, Steven G; Jacobs, William R; Hendrix, Roger W; Lawrence, Jeffrey G; Hatfull, Graham F

    2015-01-01

    The bacteriophage population is large, dynamic, ancient, and genetically diverse. Limited genomic information shows that phage genomes are mosaic, and the genetic architecture of phage populations remains ill-defined. To understand the population structure of phages infecting a single host strain, we isolated, sequenced, and compared 627 phages of Mycobacterium smegmatis. Their genetic diversity is considerable, and there are 28 distinct genomic types (clusters) with related nucleotide sequences. However, amino acid sequence comparisons show pervasive genomic mosaicism, and quantification of inter-cluster and intra-cluster relatedness reveals a continuum of genetic diversity, albeit with uneven representation of different phages. Furthermore, rarefaction analysis shows that the mycobacteriophage population is not closed, and there is a constant influx of genes from other sources. Phage isolation and analysis was performed by a large consortium of academic institutions, illustrating the substantial benefits of a disseminated, structured program involving large numbers of freshman undergraduates in scientific discovery. PMID:25919952

  17. Gekko japonicus genome reveals evolution of adhesive toe pads and tail regeneration.

    PubMed

    Liu, Yan; Zhou, Qian; Wang, Yongjun; Luo, Longhai; Yang, Jian; Yang, Linfeng; Liu, Mei; Li, Yingrui; Qian, Tianmei; Zheng, Yuan; Li, Meiyuan; Li, Jiang; Gu, Yun; Han, Zujing; Xu, Man; Wang, Yingjie; Zhu, Changlai; Yu, Bin; Yang, Yumin; Ding, Fei; Jiang, Jianping; Yang, Huanming; Gu, Xiaosong

    2015-01-01

    Reptiles are the most morphologically and physiologically diverse tetrapods, and have undergone 300 million years of adaptive evolution. Within the reptilian tetrapods, geckos possess several interesting features, including the ability to regenerate autotomized tails and to climb on smooth surfaces. Here we sequence the genome of Gekko japonicus (Schlegel's Japanese Gecko) and investigate genetic elements related to its physiology. We obtain a draft G. japonicus genome sequence of 2.55 Gb and annotated 22,487 genes. Comparative genomic analysis reveals specific gene family expansions or reductions that are associated with the formation of adhesive setae, nocturnal vision and tail regeneration, as well as the diversification of olfactory sensation. The obtained genomic data provide robust genetic evidence of adaptive evolution in reptiles. PMID:26598231

  18. Baiji genomes reveal low genetic variability and new insights into secondary aquatic adaptations

    PubMed Central

    Zhou, Xuming; Sun, Fengming; Xu, Shixia; Fan, Guangyi; Zhu, Kangli; Liu, Xin; Chen, Yuan; Shi, Chengcheng; Yang, Yunxia; Huang, Zhiyong; Chen, Jing; Hou, Haolong; Guo, Xuejiang; Chen, Wenbin; Chen, Yuefeng; Wang, Xiaohong; Lv, Tian; Yang, Dan; Zhou, Jiajian; Huang, Bangqing; Wang, Zhengfei; Zhao, Wei; Tian, Ran; Xiong, Zhiqiang; Xu, Junxiao; Liang, Xinming; Chen, Bingyao; Liu, Weiqing; Wang, Junyi; Pan, Shengkai; Fang, Xiaodong; Li, Ming; Wei, Fuwen; Xu, Xun; Zhou, Kaiya; Wang, Jun; Yang, Guang

    2013-01-01

    The baiji, or Yangtze River dolphin (Lipotes vexillifer), is a flagship species for the conservation of aquatic animals and ecosystems in the Yangtze River of China; however, this species has now been recognized as functionally extinct. Here we report a high-quality draft genome and three re-sequenced genomes of L. vexillifer using Illumina short-read sequencing technology. Comparative genomic analyses reveal that cetaceans have a slow molecular clock and molecular adaptations to their aquatic lifestyle. We also find a significantly lower number of heterozygous single nucleotide polymorphisms in the baiji compared to all other mammalian genomes reported thus far. A reconstruction of the demographic history of the baiji indicates that a bottleneck occurred near the end of the last deglaciation, a time coinciding with a rapid decrease in temperature and the rise of eustatic sea level. PMID:24169659

  19. Comparative genomics of Bifidobacterium animalis subsp. lactis reveals a strict monophyletic bifidobacterial taxon.

    PubMed

    Milani, Christian; Duranti, Sabrina; Lugli, Gabriele Andrea; Bottacini, Francesca; Strati, Francesco; Arioli, Stefania; Foroni, Elena; Turroni, Francesca; van Sinderen, Douwe; Ventura, Marco

    2013-07-01

    Strains of Bifidobacterium animalis subsp. lactis are extensively exploited by the food industry as health-promoting bacteria, although the genetic variability of members belonging to this taxon has so far not received much scientific attention. In this article, we describe the complete genetic makeup of the B. animalis subsp. lactis Bl12 genome and discuss the genetic relatedness of this strain with other sequenced strains belonging to this taxon. Moreover, a detailed comparative genomic analysis of B. animalis subsp. lactis genomes was performed, which revealed a closely related and isogenic nature of all currently available B. animalis subsp. lactis strains, thus strongly suggesting a closed pan-genome structure of this bacterial group. PMID:23645200

  20. Gekko japonicus genome reveals evolution of adhesive toe pads and tail regeneration

    PubMed Central

    Liu, Yan; Zhou, Qian; Wang, Yongjun; Luo, Longhai; Yang, Jian; Yang, Linfeng; Liu, Mei; Li, Yingrui; Qian, Tianmei; Zheng, Yuan; Li, Meiyuan; Li, Jiang; Gu, Yun; Han, Zujing; Xu, Man; Wang, Yingjie; Zhu, Changlai; Yu, Bin; Yang, Yumin; Ding, Fei; Jiang, Jianping; Yang, Huanming; Gu, Xiaosong

    2015-01-01

    Reptiles are the most morphologically and physiologically diverse tetrapods, and have undergone 300 million years of adaptive evolution. Within the reptilian tetrapods, geckos possess several interesting features, including the ability to regenerate autotomized tails and to climb on smooth surfaces. Here we sequence the genome of Gekko japonicus (Schlegel's Japanese Gecko) and investigate genetic elements related to its physiology. We obtain a draft G. japonicus genome sequence of 2.55 Gb and annotated 22,487 genes. Comparative genomic analysis reveals specific gene family expansions or reductions that are associated with the formation of adhesive setae, nocturnal vision and tail regeneration, as well as the diversification of olfactory sensation. The obtained genomic data provide robust genetic evidence of adaptive evolution in reptiles. PMID:26598231

  1. Genomes of three tomato pathogens within the Ralstonia solanacearum species complex reveal significant evolutionary divergence

    PubMed Central

    2010-01-01

    Background The Ralstonia solanacearum species complex includes thousands of strains pathogenic to an unusually wide range of plant species. These globally dispersed and heterogeneous strains cause bacterial wilt diseases, which have major socio-economic impacts. Pathogenicity is an ancestral trait in R. solanacearum and strains with high genetic variation can be subdivided into four phylotypes, correlating to isolates from Asia (phylotype I), the Americas (phylotype IIA and IIB), Africa (phylotype III) and Indonesia (phylotype IV). Comparison of genome sequences strains representative of this phylogenetic diversity can help determine which traits allow this bacterium to be such a pathogen of so many different plant species and how the bacteria survive in many different habitats. Results The genomes of three tomato bacterial wilt pathogens, CFBP2957 (phy. IIA), CMR15 (phy. III) and PSI07 (phy. IV) were sequenced and manually annotated. These genomes were compared with those of three previously sequenced R. solanacearum strains: GMI1000 (tomato, phy. I), IPO1609 (potato, phy. IIB), and Molk2 (banana, phy. IIB). The major genomic features (size, G+C content, number of genes) were conserved across all of the six sequenced strains. Despite relatively high genetic distances (calculated from average nucleotide identity) and many genomic rearrangements, more than 60% of the genes of the megaplasmid and 70% of those on the chromosome are syntenic. The three new genomic sequences revealed the presence of several previously unknown traits, probably acquired by horizontal transfers, within the genomes of R. solanacearum, including a type IV secretion system, a rhi-type anti-mitotic toxin and two small plasmids. Genes involved in virulence appear to be evolving at a faster rate than the genome as a whole. Conclusions Comparative analysis of genome sequences and gene content confirmed the differentiation of R. solanacearum species complex strains into four phylotypes. Genetic

  2. Genome comparisons reveal a dominant mechanism of chromosome number reduction in grasses and accelerated genome evolution in Triticeae

    PubMed Central

    Luo, M. C.; Deal, K. R.; Akhunov, E. D.; Akhunova, A. R.; Anderson, O. D.; Anderson, J. A.; Blake, N.; Clegg, M. T.; Coleman-Derr, D.; Conley, E. J.; Crossman, C. C.; Dubcovsky, J.; Gill, B. S.; Gu, Y. Q.; Hadam, J.; Heo, H. Y.; Huo, N.; Lazo, G.; Ma, Y.; Matthews, D. E.; McGuire, P. E.; Morrell, P. L.; Qualset, C. O.; Renfro, J.; Tabanao, D.; Talbert, L. E.; Tian, C.; Toleno, D. M.; Warburton, M. L.; You, F. M.; Zhang, W.; Dvorak, J.

    2009-01-01

    Single-nucleotide polymorphism was used in the construction of an expressed sequence tag map of Aegilops tauschii, the diploid source of the wheat D genome. Comparisons of the map with the rice and sorghum genome sequences revealed 50 inversions and translocations; 2, 8, and 40 were assigned respectively to the rice, sorghum, and Ae. tauschii lineages, showing greatly accelerated genome evolution in the large Triticeae genomes. The reduction of the basic chromosome number from 12 to 7 in the Triticeae has taken place by a process during which an entire chromosome is inserted by its telomeres into a break in the centromeric region of another chromosome. The original centromere–telomere polarity of the chromosome arms is maintained in the new chromosome. An intrachromosomal telomere–telomere fusion resulting in a pericentric translocation of a chromosome segment or an entire arm accompanied or preceded the chromosome insertion in some instances. Insertional dysploidy has been recorded in three grass subfamilies and appears to be the dominant mechanism of basic chromosome number reduction in grasses. A total of 64% and 66% of Ae. tauschii genes were syntenic with sorghum and rice genes, respectively. Synteny was reduced in the vicinity of the termini of modern Ae. tauschii chromosomes but not in the vicinity of the ancient termini embedded in the Ae. tauschii chromosomes, suggesting that the dependence of synteny erosion on gene location along the centromere–telomere axis either evolved recently in the Triticeae phylogenetic lineage or its evolution was recently accelerated. PMID:19717446

  3. Draft Genome Sequence of Arthrobacter crystallopoietes Strain BAB-32, Revealing Genes for Bioremediation

    PubMed Central

    Joshi, M. N.; Pandit, A. S.; Sharma, A.; Pandya, R. V.; Desai, S. M.; Saxena, A. K.

    2013-01-01

    Arthrobacter crystallopoietes strain BAB-32, a Gram-positive obligate aerobic actinobacterium having potential application in bioremediation and bioreduction of a few metals, was isolated from rhizosphere soil of Gandhinagar, Gujarat, India. The draft genome (4.3 Mb) of the strain revealed a few vital gene clusters involved in the metabolism of aromatic compounds, zinc, and sulfur. PMID:23833141

  4. Draft Genome Sequence of Arthrobacter crystallopoietes Strain BAB-32, Revealing Genes for Bioremediation.

    PubMed

    Joshi, M N; Pandit, A S; Sharma, A; Pandya, R V; Desai, S M; Saxena, A K; Bagatharia, S B

    2013-01-01

    Arthrobacter crystallopoietes strain BAB-32, a Gram-positive obligate aerobic actinobacterium having potential application in bioremediation and bioreduction of a few metals, was isolated from rhizosphere soil of Gandhinagar, Gujarat, India. The draft genome (4.3 Mb) of the strain revealed a few vital gene clusters involved in the metabolism of aromatic compounds, zinc, and sulfur. PMID:23833141

  5. Imaging mass spectrometry and genome mining reveal highly antifungal virulence factor of mushroom soft rot pathogen.

    PubMed

    Graupner, Katharina; Scherlach, Kirstin; Bretschneider, Tom; Lackner, Gerald; Roth, Martin; Gross, Harald; Hertweck, Christian

    2012-12-21

    Caught in the act: imaging mass spectrometry of a button mushroom infected with the soft rot pathogen Janthinobacterium agaricidamnosum in conjunction with genome mining revealed jagaricin as a highly antifungal virulence factor that is not produced under standard cultivation conditions. The structure of jagaricin was rigorously elucidated by a combination of physicochemical analyses, chemical derivatization, and bioinformatics. PMID:23161559

  6. Genome sequence of the necrotrophic plant pathogen Pythium ultimum reveals original pathogenicity mechanisms and effector repertoire.

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The P. ultimum DAOM BR144 (=CBS 805.95 = ATCC200006) genome (42.8 Mb) encodes 15,290 genes, and has extensive sequence similarity and synteny with related Phytophthora spp., including the potato late blight pathogen Phytophthora infestans. Whole transcriptome sequencing revealed expression of 86 % o...

  7. Comparative genomic analysis of clinical and environmental Vibrio vulnificus isolates revealed biotype 3 evolutionary relationships

    PubMed Central

    Koton, Yael; Gordon, Michal; Chalifa-Caspi, Vered; Bisharat, Naiel

    2015-01-01

    In 1996 a common-source outbreak of severe soft tissue and bloodstream infections erupted among Israeli fish farmers and fish consumers due to changes in fish marketing policies. The causative pathogen was a new strain of Vibrio vulnificus, named biotype 3, which displayed a unique biochemical and genotypic profile. Initial observations suggested that the pathogen erupted as a result of genetic recombination between two distinct populations. We applied a whole genome shotgun sequencing approach using several V. vulnificus strains from Israel in order to study the pan genome of V. vulnificus and determine the phylogenetic relationship of biotype 3 with existing populations. The core genome of V. vulnificus based on 16 draft and complete genomes consisted of 3068 genes, representing between 59 and 78% of the whole genome of 16 strains. The accessory genome varied in size from 781 to 2044 kbp. Phylogenetic analysis based on whole, core, and accessory genomes displayed similar clustering patterns with two main clusters, clinical (C) and environmental (E), all biotype 3 strains formed a distinct group within the E cluster. Annotation of accessory genomic regions found in biotype 3 strains and absent from the core genome yielded 1732 genes, of which the vast majority encoded hypothetical proteins, phage-related proteins, and mobile element proteins. A total of 1916 proteins (including 713 hypothetical proteins) were present in all human pathogenic strains (both biotype 3 and non-biotype 3) and absent from the environmental strains. Clustering analysis of the non-hypothetical proteins revealed 148 protein clusters shared by all human pathogenic strains; these included transcriptional regulators, arylsulfatases, methyl-accepting chemotaxis proteins, acetyltransferases, GGDEF family proteins, transposases, type IV secretory system (T4SS) proteins, and integrases. Our study showed that V. vulnificus biotype 3 evolved from environmental populations and formed a genetically

  8. The Methanosarcina barkeri genome: comparative analysis withMethanosarcina acetivorans and Methanosarcina mazei reveals extensiverearrangement within methanosarcinal genomes

    SciTech Connect

    Maeder, Dennis L.; Anderson, Iain; Brettin, Thomas S.; Bruce,David C.; Gilna, Paul; Han, Cliff S.; Lapidus, Alla; Metcalf, William W.; Saunders, Elizabeth; Tapia, Roxanne; Sowers, Kevin R.

    2006-05-19

    We report here a comparative analysis of the genome sequence of Methanosarcina barkeri with those of Methanosarcina acetivorans and Methanosarcina mazei. All three genomes share a conserved double origin of replication and many gene clusters. M. barkeri is distinguished by having an organization that is well conserved with respect to the other Methanosarcinae in the region proximal to the origin of replication with interspecies gene similarities as high as 95%. However it is disordered and marked by increased transposase frequency and decreased gene synteny and gene density in the proximal semi-genome. Of the 3680 open reading frames in M. barkeri, 678 had paralogs with better than 80% similarity to both M. acetivorans and M. mazei while 128 nonhypothetical orfs were unique (non-paralogous) amongst these species including a complete formate dehydrogenase operon, two genes required for N-acetylmuramic acid synthesis, a 14 gene gas vesicle cluster and a bacterial P450-specific ferredoxin reductase cluster not previously observed or characterized in this genus. A cryptic 36 kbp plasmid sequence was detected in M. barkeri that contains an orc1 gene flanked by a presumptive origin of replication consisting of 38 tandem repeats of a 143 nt motif. Three-way comparison of these genomes reveals differing mechanisms for the accrual of changes. Elongation of the large M. acetivorans is the result of multiple gene-scale insertions and duplications uniformly distributed in that genome, while M. barkeri is characterized by localized inversions associated with the loss of gene content. In contrast, the relatively short M. mazei most closely approximates the ancestral organizational state.

  9. Comparison of space flight and heavy ion radiation induced genomic/epigenomic mutations in rice (Oryza sativa)

    NASA Astrophysics Data System (ADS)

    Shi, Jinming; Lu, Weihong; Sun, Yeqing

    2014-04-01

    Rice seeds, after space flight and low dose heavy ion radiation treatment were cultured on ground. Leaves of the mature plants were obtained for examination of genomic/epigenomic mutations by using amplified fragment length polymorphism (AFLP) and methylation sensitive amplification polymorphism (MSAP) method, respectively. The mutation sites were identified by fragment recovery and sequencing. The heritability of the mutations was detected in the next generation. Results showed that both space flight and low dose heavy ion radiation can induce significant alterations on rice genome and epigenome (P < 0.05). For both genetic and epigenetic assays, while there was no significant difference in mutation rates and their ability to be inherited to the next generation, the site of mutations differed between the space flight and radiation treated groups. More than 50% of the mutation sites were shared by two radiation treated groups, radiated with different LET value and dose, while only about 20% of the mutation sites were shared by space flight group and radiation treated group. Moreover, in space flight group, we found that DNA methylation changes were more prone to occur on CNG sequence than CG sequence. Sequencing results proved that both space flight and heavy ion radiation induced mutations were widely spread on rice genome including coding region and repeated region. Our study described and compared the characters of space flight and low dose heavy ion radiation induced genomic/epigenomic mutations. Our data revealed the mechanisms of application of space environment for mutagenesis and crop breeding. Furthermore, this work implicated that the nature of mutations induced under space flight conditions may involve factors beyond ion radiation.

  10. Genome sequencing reveals complex secondary metabolome in the marine actinomycete Salinispora tropica

    PubMed Central

    Udwary, Daniel W.; Zeigler, Lisa; Asolkar, Ratnakar N.; Singan, Vasanth; Lapidus, Alla; Fenical, William; Jensen, Paul R.; Moore, Bradley S.

    2007-01-01

    Recent fermentation studies have identified actinomycetes of the marine-dwelling genus Salinispora as prolific natural product producers. To further evaluate their biosynthetic potential, we sequenced the 5,183,331-bp S. tropica CNB-440 circular genome and analyzed all identifiable secondary natural product gene clusters. Our analysis shows that S. tropica dedicates a large percentage of its genome (≈9.9%) to natural product assembly, which is greater than previous Streptomyces genome sequences as well as other natural product-producing actinomycetes. The S. tropica genome features polyketide synthase systems of every known formally classified family, nonribosomal peptide synthetases, and several hybrid clusters. Although a few clusters appear to encode molecules previously identified in Streptomyces species, the majority of the 17 biosynthetic loci are novel. Specific chemical information about putative and observed natural product molecules is presented and discussed. In addition, our bioinformatic analysis not only was critical for the structure elucidation of the polyene macrolactam salinilactam A, but its structural analysis aided the genome assembly of the highly repetitive slm loci. This study firmly establishes the genus Salinispora as a rich source of drug-like molecules and importantly reveals the powerful interplay between genomic analysis and traditional natural product isolation studies. PMID:17563368

  11. Whole-Genome Sequencing Reveals Genetic Variation in the Asian House Rat

    PubMed Central

    Teng, Huajing; Zhang, Yaohua; Shi, Chengmin; Mao, Fengbiao; Hou, Lingling; Guo, Hongling; Sun, Zhongsheng; Zhang, Jianxu

    2016-01-01

    Whole-genome sequencing of wild-derived rat species can provide novel genomic resources, which may help decipher the genetics underlying complex phenotypes. As a notorious pest, reservoir of human pathogens, and colonizer, the Asian house rat, Rattus tanezumi, is successfully adapted to its habitat. However, little is known regarding genetic variation in this species. In this study, we identified over 41,000,000 single-nucleotide polymorphisms, plus insertions and deletions, through whole-genome sequencing and bioinformatics analyses. Moreover, we identified over 12,000 structural variants, including 143 chromosomal inversions. Further functional analyses revealed several fixed nonsense mutations associated with infection and immunity-related adaptations, and a number of fixed missense mutations that may be related to anticoagulant resistance. A genome-wide scan for loci under selection identified various genes related to neural activity. Our whole-genome sequencing data provide a genomic resource for future genetic studies of the Asian house rat species and have the potential to facilitate understanding of the molecular adaptations of rats to their ecological niches. PMID:27172215

  12. Whole-Genome Sequencing Reveals Genetic Variation in the Asian House Rat.

    PubMed

    Teng, Huajing; Zhang, Yaohua; Shi, Chengmin; Mao, Fengbiao; Hou, Lingling; Guo, Hongling; Sun, Zhongsheng; Zhang, Jianxu

    2016-01-01

    Whole-genome sequencing of wild-derived rat species can provide novel genomic resources, which may help decipher the genetics underlying complex phenotypes. As a notorious pest, reservoir of human pathogens, and colonizer, the Asian house rat, Rattus tanezumi, is successfully adapted to its habitat. However, little is known regarding genetic variation in this species. In this study, we identified over 41,000,000 single-nucleotide polymorphisms, plus insertions and deletions, through whole-genome sequencing and bioinformatics analyses. Moreover, we identified over 12,000 structural variants, including 143 chromosomal inversions. Further functional analyses revealed several fixed nonsense mutations associated with infection and immunity-related adaptations, and a number of fixed missense mutations that may be related to anticoagulant resistance. A genome-wide scan for loci under selection identified various genes related to neural activity. Our whole-genome sequencing data provide a genomic resource for future genetic studies of the Asian house rat species and have the potential to facilitate understanding of the molecular adaptations of rats to their ecological niches. PMID:27172215

  13. Multiple genome sequences reveal adaptations of a phototrophic bacterium to sediment microenvironments.

    SciTech Connect

    Oda, Yasuhiro; Larimer, Frank W; Chain, Patrick S. G.; Malfatti, Stephanie; Shin, Maria V; Vergez, Lisa; Hauser, Loren John; Land, Miriam L; Braatsch, Stephan; Beatty, Thomas; Pelletier, Dale A; Schaefer, Amy L; Harwood, Caroline S

    2008-11-01

    The bacterial genus Rhodopseudomonas is comprised of photosynthetic bacteria found widely distributed in aquatic sediments. Members of the genus catalyze hydrogen gas production, carbon dioxide sequestration, and biomass turnover. The genome sequence of Rhodopseudomonas palustris CGA009 revealed a surprising richness of metabolic versatility that would seem to explain its ability to live in a heterogeneous environment like sediment. However, there is considerable genotypic diversity among Rhodopseudomonas isolates. Here we report the complete genome sequences of four additional members of the genus isolated from a restricted geographical area. The sequences confirm that the isolates belong to a coherent taxonomic unit, but they also have significant differences. Whole genome alignments show that the circular chromosomes of the isolates consist of a collinear backbone with a moderate number of genomic rearrangements that impact local gene order and orientation. There are 3,319 genes, 70% of the genes in each genome, shared by four or more strains. Between 10% and 18% of the genes in each genome are strain specific. Some of these genes suggest specialized physiological traits, which we verified experimentally, that include expanded light harvesting, oxygen respiration, and nitrogen fixation capabilities, as well as anaerobic fermentation. Strain-specific adaptations include traits that may be useful in bioenergy applications. This work suggests that against a backdrop of metabolic versatility that is a defining characteristic of Rhodopseudomonas, different ecotypes have evolved to take advantage of physical and chemical conditions in sediment microenvironments that are too small for human observation.

  14. Population Genomics Reveals Chromosome-Scale Heterogeneous Evolution in a Protoploid Yeast

    PubMed Central

    Friedrich, Anne; Jung, Paul; Reisser, Cyrielle; Fischer, Gilles; Schacherer, Joseph

    2015-01-01

    Yeast species represent an ideal model system for population genomic studies but large-scale polymorphism surveys have only been reported for species of the Saccharomyces genus so far. Hence, little is known about intraspecific diversity and evolution in yeast. To obtain a new insight into the evolutionary forces shaping natural populations, we sequenced the genomes of an expansive worldwide collection of isolates from a species distantly related to Saccharomyces cerevisiae: Lachancea kluyveri (formerly S. kluyveri). We identified 6.5 million single nucleotide polymorphisms and showed that a large introgression event of 1 Mb of GC-rich sequence in the chromosomal arm probably occurred in the last common ancestor of all L. kluyveri strains. Our population genomic data clearly revealed that this 1-Mb region underwent a molecular evolution pattern very different from the rest of the genome. It is characterized by a higher recombination rate, with a dramatically elevated A:T → G:C substitution rate, which is the signature of an increased GC-biased gene conversion. In addition, the predicted base composition at equilibrium demonstrates that the chromosome-scale compositional heterogeneity will persist after the genome has reached mutational equilibrium. Altogether, the data presented herein clearly show that distinct recombination and substitution regimes can coexist and lead to different evolutionary patterns within a single genome. PMID:25349286

  15. Population genomics reveals chromosome-scale heterogeneous evolution in a protoploid yeast.

    PubMed

    Friedrich, Anne; Jung, Paul; Reisser, Cyrielle; Fischer, Gilles; Schacherer, Joseph

    2015-01-01

    Yeast species represent an ideal model system for population genomic studies but large-scale polymorphism surveys have only been reported for species of the Saccharomyces genus so far. Hence, little is known about intraspecific diversity and evolution in yeast. To obtain a new insight into the evolutionary forces shaping natural populations, we sequenced the genomes of an expansive worldwide collection of isolates from a species distantly related to Saccharomyces cerevisiae: Lachancea kluyveri (formerly S. kluyveri). We identified 6.5 million single nucleotide polymorphisms and showed that a large introgression event of 1 Mb of GC-rich sequence in the chromosomal arm probably occurred in the last common ancestor of all L. kluyveri strains. Our population genomic data clearly revealed that this 1-Mb region underwent a molecular evolution pattern very different from the rest of the genome. It is characterized by a higher recombination rate, with a dramatically elevated A:T → G:C substitution rate, which is the signature of an increased GC-biased gene conversion. In addition, the predicted base composition at equilibrium demonstrates that the chromosome-scale compositional heterogeneity will persist after the genome has reached mutational equilibrium. Altogether, the data presented herein clearly show that distinct recombination and substitution regimes can coexist and lead to different evolutionary patterns within a single genome. PMID:25349286

  16. Comparative mapping in the Poaceae family reveals translocations in the complex polyploid genome of sugarcane

    PubMed Central

    2014-01-01

    Background The understanding of sugarcane genetics has lagged behind that of other members of the Poaceae family such as wheat, rice, barley and sorghum mainly due to the complexity, size and polyploidization of the genome. We have used the genetic map of a sugarcane cultivar to generate a consensus genetic map to increase genome coverage for comparison to the sorghum genome. We have utilized the recently developed sugarcane DArT array to increase the marker density within the genetic map. The sequence of these DArT markers plus SNP and EST-SSR markers was then used to form a bridge to the sorghum genomic sequence by BLAST alignment to start to unravel the complex genomic architecture of sugarcane. Results Comparative mapping revealed that certain sugarcane chromosomes show greater levels of synteny to sorghum than others. On a macrosyntenic level a good collinearity was observed between sugarcane and sorghum for 4 of the 8 homology groups (HGs). These 4 HGs were syntenic to four sorghum chromosomes with from 98% to 100% of these chromosomes covered by these linked markers. Four major chromosome rearrangements were identified between the other four sugarcane HGs and sorghum, two of which were condensations of chromosomes reducing the basic chromosome number of sugarcane from x = 10 to x = 8. This macro level of synteny was transferred to other members within the Poaceae family such as maize to uncover the important evolutionary relationships that exist between sugarcane and these species. Conclusions Comparative mapping of sugarcane to the sorghum genome has revealed new information on the genome structure of sugarcane which will help guide identification of important genes for use in sugarcane breeding. Furthermore of the four major chromosome rearrangements identified in this study, three were common to maize providing some evidence that chromosome reduction from a common paleo-ancestor of both maize and sugarcane was driven by the same translocation

  17. Comparison of the complete genome sequence of two closely related isolates of ‘Candidatus Phytoplasma australiense’ reveals genome plasticity

    PubMed Central

    2013-01-01

    Background ‘Candidatus Phytoplasma australiense’ is associated with at least nine diseases in Australia and New Zealand. The impact of this phytoplasma is considerable, both economically and environmentally. The genome of a NZ isolate was sequenced in an effort to understand its pathogenicity and ecology. Comparison with a closely related Australian isolate enabled us to examine mechanisms of genomic rearrangement. Results The complete genome sequence of a strawberry lethal yellows (SLY) isolate of ‘Candidatus Phytoplasma australiense’ was determined. It is a circular genome of 959,779 base pairs with 1126 predicted open reading frames. Despite being 80 kbp larger than another ‘Ca. Phytoplasma australiense’ isolate PAa, the variation between housekeeping genes was generally less than 1% at a nucleotide level. The difference in size between the two isolates was largely due to the number and size of potential mobile units (PMUs), which contributed to some changes in gene order. Comparison of the genomes of the two isolates revealed that the highly conserved 5′ UTR of a putative DNA-directed RNA polymerase seems to be associated with insertion and rearrangement events. Two types of PMUs have been identified on the basis of the order of three to four conserved genes, with both PMUs appearing to have been present in the last common ancestor of ‘Ca. Phytoplasma asteris’ and ‘Ca. Phytoplasma australiense’. Comparison with other phytoplasma genomes showed that modification methylases were, in general, species-specific. A putative methylase (xorIIM) found in ‘Ca. Phytoplasma australiense’ appeared to have no analogue in any other firmicute, and we believe has been introduced by way of lateral gene transfer. A putative retrostransposon (ltrA) analogous to that found in OY-M was present in both isolates, although all examples in PAa appear to be fragments. Comparative analysis identified highly conserved 5′ and 3′ UTR regions of ltrA, which may

  18. A functional genomics strategy that uses metabolome data to reveal the phenotype of silent mutations.

    PubMed

    Raamsdonk, L M; Teusink, B; Broadhurst, D; Zhang, N; Hayes, A; Walsh, M C; Berden, J A; Brindle, K M; Kell, D B; Rowland, J J; Westerhoff, H V; van Dam, K; Oliver, S G

    2001-01-01

    A large proportion of the 6,000 genes present in the genome of Saccharomyces cerevisiae, and of those sequenced in other organisms, encode proteins of unknown function. Many of these genes are "silent, " that is, they show no overt phenotype, in terms of growth rate or other fluxes, when they are deleted from the genome. We demonstrate how the intracellular concentrations of metabolites can reveal phenotypes for proteins active in metabolic regulation. Quantification of the change of several metabolite concentrations relative to the concentration change of one selected metabolite can reveal the site of action, in the metabolic network, of a silent gene. In the same way, comprehensive analyses of metabolite concentrations in mutants, providing "metabolic snapshots," can reveal functions when snapshots from strains deleted for unstudied genes are compared to those deleted for known genes. This approach to functional analysis, using comparative metabolomics, we call FANCY-an abbreviation for functional analysis by co-responses in yeast. PMID:11135551

  19. Integrated syntenic and phylogenomic analyses reveal an ancient genome duplication in monocots.

    PubMed

    Jiao, Yuannian; Li, Jingping; Tang, Haibao; Paterson, Andrew H

    2014-07-01

    Unraveling widespread polyploidy events throughout plant evolution is a necessity for inferring the impacts of whole-genome duplication (WGD) on speciation, functional innovations, and to guide identification of true orthologs in divergent taxa. Here, we employed an integrated syntenic and phylogenomic analyses to reveal an ancient WGD that shaped the genomes of all commelinid monocots, including grasses, bromeliads, bananas (Musa acuminata), ginger, palms, and other plants of fundamental, agricultural, and/or horticultural interest. First, comprehensive phylogenomic analyses revealed 1421 putative gene families that retained ancient duplication shared by Musa (Zingiberales) and grass (Poales) genomes, indicating an ancient WGD in monocots. Intergenomic synteny blocks of Musa and Oryza were investigated, and 30 blocks were shown to be duplicated before Musa-Oryza divergence an estimated 120 to 150 million years ago. Synteny comparisons of four monocot (rice [Oryza sativa], sorghum [Sorghum bicolor], banana, and oil palm [Elaeis guineensis]) and two eudicot (grape [Vitis vinifera] and sacred lotus [Nelumbo nucifera]) genomes also support this additional WGD in monocots, herein called Tau (τ). Integrating synteny and phylogenomic comparisons achieves better resolution of ancient polyploidy events than either approach individually, a principle that is exemplified in the disambiguation of a WGD series of rho (ρ)-sigma (σ)-tau (τ) in the grass lineages that echoes the alpha (α)-beta (β)-gamma (γ) series previously revealed in the Arabidopsis thaliana lineage. PMID:25082857

  20. Integrated Syntenic and Phylogenomic Analyses Reveal an Ancient Genome Duplication in Monocots[W

    PubMed Central

    Jiao, Yuannian; Li, Jingping; Tang, Haibao; Paterson, Andrew H.

    2014-01-01

    Unraveling widespread polyploidy events throughout plant evolution is a necessity for inferring the impacts of whole-genome duplication (WGD) on speciation, functional innovations, and to guide identification of true orthologs in divergent taxa. Here, we employed an integrated syntenic and phylogenomic analyses to reveal an ancient WGD that shaped the genomes of all commelinid monocots, including grasses, bromeliads, bananas (Musa acuminata), ginger, palms, and other plants of fundamental, agricultural, and/or horticultural interest. First, comprehensive phylogenomic analyses revealed 1421 putative gene families that retained ancient duplication shared by Musa (Zingiberales) and grass (Poales) genomes, indicating an ancient WGD in monocots. Intergenomic synteny blocks of Musa and Oryza were investigated, and 30 blocks were shown to be duplicated before Musa-Oryza divergence an estimated 120 to 150 million years ago. Synteny comparisons of four monocot (rice [Oryza sativa], sorghum [Sorghum bicolor], banana, and oil palm [Elaeis guineensis]) and two eudicot (grape [Vitis vinifera] and sacred lotus [Nelumbo nucifera]) genomes also support this additional WGD in monocots, herein called Tau (τ). Integrating synteny and phylogenomic comparisons achieves better resolution of ancient polyploidy events than either approach individually, a principle that is exemplified in the disambiguation of a WGD series of rho (ρ)-sigma (σ)-tau (τ) in the grass lineages that echoes the alpha (α)-beta (β)-gamma (γ) series previously revealed in the Arabidopsis thaliana lineage. PMID:25082857

  1. De novo sequence assembly of Albugo candida reveals a small genome relative to other biotrophic oomycetes

    PubMed Central

    2011-01-01

    Background Albugo candida is a biotrophic oomycete that parasitizes various species of Brassicaceae, causing a disease (white blister rust) with remarkable convergence in behaviour to unrelated rusts of basidiomycete fungi. Results A recent genome analysis of the oomycete Hyaloperonospora arabidopsidis suggests that a reduction in the number of genes encoding secreted pathogenicity proteins, enzymes for assimilation of inorganic nitrogen and sulphur represent a genomic signature for the evolution of obligate biotrophy. Here, we report a draft reference genome of a major crop pathogen Albugo candida (another obligate biotrophic oomycete) with an estimated genome of 45.3 Mb. This is very similar to the genome size of a necrotrophic oomycete Pythium ultimum (43 Mb) but less than half that of H. arabidopsidis (99 Mb). Sequencing of A. candida transcripts from infected host tissue and zoosporangia combined with genome-wide annotation revealed 15,824 predicted genes. Most of the predicted genes lack significant similarity with sequences from other oomycetes. Most intriguingly, A. candida appears to have a much smaller repertoire of pathogenicity-related proteins than H. arabidopsidis including genes that encode RXLR effector proteins, CRINKLER-like genes, and elicitins. Necrosis and Ethylene inducing Peptides were not detected in the genome of A. candida. Putative orthologs of tat-C, a component of the twin arginine translocase system, were identified from multiple oomycete genera along with proteins containing putative tat-secretion signal peptides. Conclusion Albugo candida has a comparatively small genome amongst oomycetes, retains motility of sporangial inoculum, and harbours a much smaller repertoire of candidate effectors than was recently reported for H. arabidopsidis. This minimal gene repertoire could indicate a lack of expansion, rather than a reduction, in the number of genes that signify the evolution of biotrophy in oomycetes. PMID:21995639

  2. Bacterial DNA Sifted from the Trichoplax adhaerens (Animalia: Placozoa) Genome Project Reveals a Putative Rickettsial Endosymbiont

    PubMed Central

    Driscoll, Timothy; Gillespie, Joseph J.; Nordberg, Eric K.; Azad, Abdu F.; Sobral, Bruno W.

    2013-01-01

    Eukaryotic genome sequencing projects often yield bacterial DNA sequences, data typically considered as microbial contamination. However, these sequences may also indicate either symbiont genes or lateral gene transfer (LGT) to host genomes. These bacterial sequences can provide clues about eukaryote–microbe interactions. Here, we used the genome of the primitive animal Trichoplax adhaerens (Metazoa: Placozoa), which is known to harbor an uncharacterized Gram-negative endosymbiont, to search for the presence of bacterial DNA sequences. Bioinformatic and phylogenomic analyses of extracted data from the genome assembly (181 bacterial coding sequences [CDS]) and trace read archive (16S rDNA) revealed a dominant proteobacterial profile strongly skewed to Rickettsiales (Alphaproteobacteria) genomes. By way of phylogenetic analysis of 16S rDNA and 113 proteins conserved across proteobacterial genomes, as well as identification of 27 rickettsial signature genes, we propose a Rickettsiales endosymbiont of T. adhaerens (RETA). The majority (93%) of the identified bacterial CDS belongs to small scaffolds containing prokaryotic-like genes; however, 12 CDS were identified on large scaffolds comprised of eukaryotic-like genes, suggesting that T. adhaerens might have recently acquired bacterial genes. These putative LGTs may coincide with the placozoan’s aquatic niche and symbiosis with RETA. This work underscores the rich, and relatively untapped, resource of eukaryotic genome projects for harboring data pertinent to host–microbial interactions. The nature of unknown (or poorly characterized) bacterial species may only emerge via analysis of host genome sequencing projects, particularly if these species are resistant to cell culturing, as are many obligate intracellular microbes. Our work provides methodological insight for such an approach. PMID:23475938

  3. The complex hybrid origins of the root knot nematodes revealed through comparative genomics

    PubMed Central

    Kumar, Sujai; Koutsovoulos, Georgios; Blaxter, Mark L.

    2014-01-01

    Root knot nematodes (RKN) can infect most of the world’s agricultural crop species and are among the most important of all plant pathogens. As yet however we have little understanding of their origins or the genomic basis of their extreme polyphagy. The most damaging pathogens reproduce by obligatory mitotic parthenogenesis and it has been suggested that these species originated from interspecific hybridizations between unknown parental taxa. We have sequenced the genome of the diploid meiotic parthenogen Meloidogyne floridensis, and use a comparative genomic approach to test the hypothesis that this species was involved in the hybrid origin of the tropical mitotic parthenogen Meloidogyne incognita. Phylogenomic analysis of gene families from M. floridensis, M. incognita and an outgroup species Meloidogyne hapla was carried out to trace the evolutionary history of these species’ genomes, and we demonstrate that M. floridensis was one of the parental species in the hybrid origins of M. incognita. Analysis of the M. floridensis genome itself revealed many gene loci present in divergent copies, as they are in M. incognita, indicating that it too had a hybrid origin. The triploid M. incognita is shown to be a complex double-hybrid between M. floridensis and a third, unidentified, parent. The agriculturally important RKN have very complex origins involving the mixing of several parental genomes by hybridization and their extreme polyphagy and success in agricultural environments may be related to this hybridization, producing transgressive variation on which natural selection can act. It is now clear that studying RKN variation via individual marker loci may fail due to the species’ convoluted origins, and multi-species population genomics is essential to understand the hybrid diversity and adaptive variation of this important species complex. This comparative genomic analysis provides a compelling example of the importance and complexity of hybridization in

  4. Whole genome sequencing revealed host adaptation-focused genomic plasticity of pathogenic Leptospira

    PubMed Central

    Xu, Yinghua; Zhu, Yongzhang; Wang, Yuezhu; Chang, Yung-Fu; Zhang, Ying; Jiang, Xiugao; Zhuang, Xuran; Zhu, Yongqiang; Zhang, Jinlong; Zeng, Lingbing; Yang, Minjun; Li, Shijun; Wang, Shengyue; Ye, Qiang; Xin, Xiaofang; Zhao, Guoping; Zheng, Huajun; Guo, Xiaokui; Wang, Junzhi

    2016-01-01

    Leptospirosis, caused by pathogenic Leptospira spp., has recently been recognized as an emerging infectious disease worldwide. Despite its severity and global importance, knowledge about the molecular pathogenesis and virulence evolution of Leptospira spp. remains limited. Here we sequenced and analyzed 102 isolates representing global sources. A high genomic variability were observed among different Leptospira species, which was attributed to massive gene gain and loss events allowing for adaptation to specific niche conditions and changing host environments. Horizontal gene transfer and gene duplication allowed the stepwise acquisition of virulence factors in pathogenic Leptospira evolved from a recent common ancestor. More importantly, the abundant expansion of specific virulence-related protein families, such as metalloproteases-associated paralogs, were exclusively identified in pathogenic species, reflecting the importance of these protein families in the pathogenesis of leptospirosis. Our observations also indicated that positive selection played a crucial role on this bacteria adaptation to hosts. These novel findings may lead to greater understanding of the global diversity and virulence evolution of Leptospira spp. PMID:26833181

  5. Whole genome sequencing revealed host adaptation-focused genomic plasticity of pathogenic Leptospira.

    PubMed

    Xu, Yinghua; Zhu, Yongzhang; Wang, Yuezhu; Chang, Yung-Fu; Zhang, Ying; Jiang, Xiugao; Zhuang, Xuran; Zhu, Yongqiang; Zhang, Jinlong; Zeng, Lingbing; Yang, Minjun; Li, Shijun; Wang, Shengyue; Ye, Qiang; Xin, Xiaofang; Zhao, Guoping; Zheng, Huajun; Guo, Xiaokui; Wang, Junzhi

    2016-01-01

    Leptospirosis, caused by pathogenic Leptospira spp., has recently been recognized as an emerging infectious disease worldwide. Despite its severity and global importance, knowledge about the molecular pathogenesis and virulence evolution of Leptospira spp. remains limited. Here we sequenced and analyzed 102 isolates representing global sources. A high genomic variability were observed among different Leptospira species, which was attributed to massive gene gain and loss events allowing for adaptation to specific niche conditions and changing host environments. Horizontal gene transfer and gene duplication allowed the stepwise acquisition of virulence factors in pathogenic Leptospira evolved from a recent common ancestor. More importantly, the abundant expansion of specific virulence-related protein families, such as metalloproteases-associated paralogs, were exclusively identified in pathogenic species, reflecting the importance of these protein families in the pathogenesis of leptospirosis. Our observations also indicated that positive selection played a crucial role on this bacteria adaptation to hosts. These novel findings may lead to greater understanding of the global diversity and virulence evolution of Leptospira spp. PMID:26833181

  6. The Physcomitrella genome reveals evolutionary insights into the conquest of land by plants

    SciTech Connect

    Rensing, Stefan A.; Lang, Daniel; Zimmer, Andreas D.; Terry, Astrid; Salamov, Asaf; Shapiro, Harris; Nishiyama, Tomaoki; Perroud, Pierre-Francois; Lindquist, Erika A.; Kamisugi, Yasuko; Tanahashi, Takako; Sakakibara, Keiko; Fujita, Tomomichi; Oishi, Kazuko; Shin, Tadasu; Kuroki, Yoko; Toyoda, Atsushi; Suzuki, Yutaka; Hashimoto, Shin-ichi; Yamaguchi, Kazuo; Sugano, Sumio; Kohara, Yuji; Fujiyama, Asao; Anterola, Aldwin; Aoki, Setsuyuki; Ashton, Neil; Barbazuk, W. Brad; Barker, Elizabeth; Bennetzen, Jeffrey L.; Blankenship, Robert; Cho, Sung Hyun; Dutcher, Susan K.; Estelle, Mark; Fawcett, Jeffrey A.; Gundlach, Heidrum; Hanada, Kousuke; Melkozernov, Alexander; Murata, Takashi; Nelson, David R.; Pils, Birgit; Prigge, Michael; Reiss, Bernd; Renner, Tanya; Rombauts, Stephane; Rushton, Paul J.; Sanderfoot, Anton; Schween, Gabriele; Shiu, Shin-Han; Stueber, Kurt; Theodoulou, Frederica L.; Tu, Hank; Van de Peer, Yves; Verrier, Paul J.; Waters, Elizabeth; Wood, Andrew; Yang, Lixing; Cove, David; Cuming, Andrew C.; Hasebe, Mitsayasu; Lucas, Susan; Mishler, Brent D.; Reski, Ralf; Grigoriev, Igor V.; Quatrano, Rakph S.; Boore, Jeffrey L.

    2007-09-18

    We report the draft genome sequence of the model moss Physcomitrella patens and compare its features with those of flowering plants, from which it is separated by more than 400 million years, and unicellular aquatic algae. This comparison reveals genomic changes concomitant with the evolutionary movement to land, including a general increase in gene family complexity; loss of genes associated with aquatic environments (e.g., flagellar arms); acquisition of genes for tolerating terrestrial stresses (e.g., variation in temperature and water availability); and the development of the auxin and abscisic acid signaling pathways for coordinating multicellular growth and dehydration response. The Physcomitrella genome provides a resource for phylogenetic inferences about gene function and for experimental analysis of plant processes through this plant's unique facility for reverse genetics.

  7. Mutational Strand Asymmetries in Cancer Genomes Reveal Mechanisms of DNA Damage and Repair.

    PubMed

    Haradhvala, Nicholas J; Polak, Paz; Stojanov, Petar; Covington, Kyle R; Shinbrot, Eve; Hess, Julian M; Rheinbay, Esther; Kim, Jaegil; Maruvka, Yosef E; Braunstein, Lior Z; Kamburov, Atanas; Hanawalt, Philip C; Wheeler, David A; Koren, Amnon; Lawrence, Michael S; Getz, Gad

    2016-01-28

    Mutational processes constantly shape the somatic genome, leading to immunity, aging, cancer, and other diseases. When cancer is the outcome, we are afforded a glimpse into these processes by the clonal expansion of the malignant cell. Here, we characterize a less explored layer of the mutational landscape of cancer: mutational asymmetries between the two DNA strands. Analyzing whole-genome sequences of 590 tumors from 14 different cancer types, we reveal widespread asymmetries across mutagenic processes, with transcriptional ("T-class") asymmetry dominating UV-, smoking-, and liver-cancer-associated mutations and replicative ("R-class") asymmetry dominating POLE-, APOBEC-, and MSI-associated mutations. We report a striking phenomenon of transcription-coupled damage (TCD) on the non-transcribed DNA strand and provide evidence that APOBEC mutagenesis occurs on the lagging-strand template during DNA replication. As more genomes are sequenced, studying and classifying their asymmetries will illuminate the underlying biological mechanisms of DNA damage and repair. PMID:26806129

  8. Genome sequencing reveals fine scale diversification and reticulation history during speciation in Sus

    PubMed Central

    2013-01-01

    Background Elucidating the process of speciation requires an in-depth understanding of the evolutionary history of the species in question. Studies that rely upon a limited number of genetic loci do not always reveal actual evolutionary history, and often confuse inferences related to phylogeny and speciation. Whole-genome data, however, can overcome this issue by providing a nearly unbiased window into the patterns and processes of speciation. In order to reveal the complexity of the speciation process, we sequenced and analyzed the genomes of 10 wild pigs, representing morphologically or geographically well-defined species and subspecies of the genus Sus from insular and mainland Southeast Asia, and one African common warthog. Results Our data highlight the importance of past cyclical climatic fluctuations in facilitating the dispersal and isolation of populations, thus leading to the diversification of suids in one of the most species-rich regions of the world. Moreover, admixture analyses revealed extensive, intra- and inter-specific gene-flow that explains previous conflicting results obtained from a limited number of loci. We show that these multiple episodes of gene-flow resulted from both natural and human-mediated dispersal. Conclusions Our results demonstrate the importance of past climatic fluctuations and human mediated translocations in driving and complicating the process of speciation in island Southeast Asia. This case study demonstrates that genomics is a powerful tool to decipher the evolutionary history of a genus, and reveals the complexity of the process of speciation. PMID:24070215

  9. Transcriptional profiling in response to terminal drought stress reveals differential responses along the wheat genome

    PubMed Central

    Aprile, Alessio; Mastrangelo, Anna M; De Leonardis, Anna M; Galiba, Gabor; Roncaglia, Enrica; Ferrari, Francesco; De Bellis, Luigi; Turchi, Luana; Giuliano, Giovanni; Cattivelli, Luigi

    2009-01-01

    Background Water stress during grain filling has a marked effect on grain yield, leading to a reduced endosperm cell number and thus sink capacity to accumulate dry matter. The bread wheat cultivar Chinese Spring (CS), a Chinese Spring terminal deletion line (CS_5AL-10) and the durum wheat cultivar Creso were subjected to transcriptional profiling after exposure to mild and severe drought stress at the grain filling stage to find evidences of differential stress responses associated to different wheat genome regions. Results The transcriptome analysis of Creso, CS and its deletion line revealed 8,552 non redundant probe sets with different expression levels, mainly due to the comparisons between the two species. The drought treatments modified the expression of 3,056 probe sets. Besides a set of genes showing a similar drought response in Creso and CS, cluster analysis revealed several drought response features that can be associated to the different genomic structure of Creso, CS and CS_5AL-10. Some drought-related genes were expressed at lower level (or not expressed) in Creso (which lacks the D genome) or in the CS_5AL-10 deletion line compared to CS. The chromosome location of a set of these genes was confirmed by PCR-based mapping on the D genome (or the 5AL-10 region). Many clusters were characterized by different level of expression in Creso, CS and CS_AL-10, suggesting that the different genome organization of the three genotypes may affect plant adaptation to stress. Clusters with similar expression trend were grouped and functional classified to mine the biological mean of their activation or repression. Genes involved in ABA, proline, glycine-betaine and sorbitol pathways were found up-regulated by drought stress. Furthermore, the enhanced expression of a set of transposons and retrotransposons was detected in CS_5AL-10. Conclusion Bread and durum wheat genotypes were characterized by a different physiological reaction to water stress and by a

  10. Analysis of the Mitochondrial Genome in Hypomyces aurantius Reveals a Novel Twintron Complex in Fungi

    PubMed Central

    Deng, Youjin; Zhang, Qihui; Ming, Ray; Lin, Longji; Lin, Xiangzhi; Lin, Yiying; Li, Xiao; Xie, Baogui; Wen, Zhiqiang

    2016-01-01

    Hypomyces aurantius is a mycoparasite that causes cobweb disease, a most serious disease of cultivated mushrooms. Intra-species identification is vital for disease control, however the lack of genomic data makes development of molecular markers challenging. Small size, high copy number, and high mutation rate of fungal mitochondrial genome makes it a good candidate for intra and inter species differentiation. In this study, the mitochondrial genome of H. H.a0001 was determined from genomic DNA using Illumina sequencing. The roughly 72 kb genome shows all major features found in other Hypocreales: 14 common protein genes, large and small subunit rRNAs genes and 27 tRNAs genes. Gene arrangement comparison showed conserved gene orders in Hypocreales mitochondria are relatively conserved, with the exception of Acremonium chrysogenum and Acremonium implicatum. Mitochondrial genome comparison also revealed that intron length primarily contributes to mitogenome size variation. Seventeen introns were detected in six conserved genes: five in cox1, four in rnl, three in cob, two each in atp6 and cox3, and one in cox2. Four introns were found to contain two introns or open reading frames: cox3-i2 is a twintron containing two group IA type introns; cox2-i1 is a group IB intron encoding two homing endonucleases; and cox1-i4 and cox1-i3 both contain two open reading frame (ORFs). Analyses combining secondary intronic structures, insertion sites, and similarities of homing endonuclease genes reveal two group IA introns arranged side by side within cox3-i2. Mitochondrial data for H. aurantius provides the basis for further studies relating to population genetics and species identification. PMID:27376282

  11. Analysis of the Mitochondrial Genome in Hypomyces aurantius Reveals a Novel Twintron Complex in Fungi.

    PubMed

    Deng, Youjin; Zhang, Qihui; Ming, Ray; Lin, Longji; Lin, Xiangzhi; Lin, Yiying; Li, Xiao; Xie, Baogui; Wen, Zhiqiang

    2016-01-01

    Hypomyces aurantius is a mycoparasite that causes cobweb disease, a most serious disease of cultivated mushrooms. Intra-species identification is vital for disease control, however the lack of genomic data makes development of molecular markers challenging. Small size, high copy number, and high mutation rate of fungal mitochondrial genome makes it a good candidate for intra and inter species differentiation. In this study, the mitochondrial genome of H. H.a0001 was determined from genomic DNA using Illumina sequencing. The roughly 72 kb genome shows all major features found in other Hypocreales: 14 common protein genes, large and small subunit rRNAs genes and 27 tRNAs genes. Gene arrangement comparison showed conserved gene orders in Hypocreales mitochondria are relatively conserved, with the exception of Acremonium chrysogenum and Acremonium implicatum. Mitochondrial genome comparison also revealed that intron length primarily contributes to mitogenome size variation. Seventeen introns were detected in six conserved genes: five in cox1, four in rnl, three in cob, two each in atp6 and cox3, and one in cox2. Four introns were found to contain two introns or open reading frames: cox3-i2 is a twintron containing two group IA type introns; cox2-i1 is a group IB intron encoding two homing endonucleases; and cox1-i4 and cox1-i3 both contain two open reading frame (ORFs). Analyses combining secondary intronic structures, insertion sites, and similarities of homing endonuclease genes reveal two group IA introns arranged side by side within cox3-i2. Mitochondrial data for H. aurantius provides the basis for further studies relating to population genetics and species identification. PMID:27376282

  12. Comparison of 26 sphingomonad genomes reveals diverse environmental adaptations and biodegradative capabilities.

    PubMed

    Aylward, Frank O; McDonald, Bradon R; Adams, Sandra M; Valenzuela, Alejandra; Schmidt, Rebeccah A; Goodwin, Lynne A; Woyke, Tanja; Currie, Cameron R; Suen, Garret; Poulsen, Michael

    2013-06-01

    Sphingomonads comprise a physiologically versatile group within the Alphaproteobacteria that includes strains of interest for biotechnology, human health, and environmental nutrient cycling. In this study, we compared 26 sphingomonad genome sequences to gain insight into their ecology, metabolic versatility, and environmental adaptations. Our multilocus phylogenetic and average amino acid identity (AAI) analyses confirm that Sphingomonas, Sphingobium, Sphingopyxis, and Novosphingobium are well-resolved monophyletic groups with the exception of Sphingomonas sp. strain SKA58, which we propose belongs to the genus Sphingobium. Our pan-genomic analysis of sphingomonads reveals numerous species-specific open reading frames (ORFs) but few signatures of genus-specific cores. The organization and coding potential of the sphingomonad genomes appear to be highly variable, and plasmid-mediated gene transfer and chromosome-plasmid recombination, together with prophage- and transposon-mediated rearrangements, appear to play prominent roles in the genome evolution of this group. We find that many of the sphingomonad genomes encode numerous oxygenases and glycoside hydrolases, which are likely responsible for their ability to degrade various recalcitrant aromatic compounds and polysaccharides, respectively. Many of these enzymes are encoded on megaplasmids, suggesting that they may be readily transferred between species. We also identified enzymes putatively used for the catabolism of sulfonate and nitroaromatic compounds in many of the genomes, suggesting that plant-based compounds or chemical contaminants may be sources of nitrogen and sulfur. Many of these sphingomonads appear to be adapted to oligotrophic environments, but several contain genomic features indicative of host associations. Our work provides a basis for understanding the ecological strategies employed by sphingomonads and their role in environmental nutrient cycling. PMID:23563954

  13. Adaptation of an abundant Roseobacter RCA organism to pelagic systems revealed by genomic and transcriptomic analyses.

    PubMed

    Voget, Sonja; Wemheuer, Bernd; Brinkhoff, Thorsten; Vollmers, John; Dietrich, Sascha; Giebel, Helge-Ansgar; Beardsley, Christine; Sardemann, Carla; Bakenhus, Insa; Billerbeck, Sara; Daniel, Rolf; Simon, Meinhard

    2015-02-01

    The RCA (Roseobacter clade affiliated) cluster, with an internal 16S rRNA gene sequence similarity of >98%, is the largest cluster of the marine Roseobacter clade and most abundant in temperate to (sub)polar oceans, constituting up to 35% of total bacterioplankton. The genome analysis of the first described species of the RCA cluster, Planktomarina temperata RCA23, revealed that this phylogenetic lineage is deeply branching within the Roseobacter clade. It shares not >65.7% of homologous genes with any other organism of this clade. The genome is the smallest of all closed genomes of the Roseobacter clade, exhibits various features of genome streamlining and encompasses genes for aerobic anoxygenic photosynthesis (AAP) and CO oxidation. In order to assess the biogeochemical significance of the RCA cluster we investigated a phytoplankton spring bloom in the North Sea. This cluster constituted 5.1% of the total, but 10-31% (mean 18.5%) of the active bacterioplankton. A metatranscriptomic analysis showed that the genome of P. temperata RCA23 was transcribed to 94% in the bloom with some variations during day and night. The genome of P. temperata RCA23 was also retrieved to 84% from metagenomic data sets from a Norwegian fjord and to 82% from stations of the Global Ocean Sampling expedition in the northwestern Atlantic. In this region, up to 6.5% of the total reads mapped on the genome of P. temperata RCA23. This abundant taxon appears to be a major player in ocean biogeochemistry. PMID:25083934

  14. Comparative Genomics Analyses Reveal Extensive Chromosome Colinearity and Novel Quantitative Trait Loci in Eucalyptus.

    PubMed

    Li, Fagen; Zhou, Changpin; Weng, Qijie; Li, Mei; Yu, Xiaoli; Guo, Yong; Wang, Yu; Zhang, Xiaohong; Gan, Siming

    2015-01-01

    Dense genetic maps, along with quantitative trait loci (QTLs) detected on such maps, are powerful tools for genomics and molecular breeding studies. In the important woody genus Eucalyptus, the recent release of E. grandis genome sequence allows for sequence-based genomic comparison and searching for positional candidate genes within QTL regions. Here, dense genetic maps were constructed for E. urophylla and E. tereticornis using genomic simple sequence repeats (SSR), expressed sequence tag (EST) derived SSR, EST-derived cleaved amplified polymorphic sequence (EST-CAPS), and diversity arrays technology (DArT) markers. The E. urophylla and E. tereticornis maps comprised 700 and 585 markers across 11 linkage groups, totaling at 1,208.2 and 1,241.4 cM in length, respectively. Extensive synteny and colinearity were observed as compared to three earlier DArT-based eucalypt maps (two maps with E. grandis × E. urophylla and one map of E. globulus) and with the E. grandis genome sequence. Fifty-three QTLs for growth (10-56 months of age) and wood density (56 months) were identified in 22 discrete regions on both maps, in which only one colocalizaiton was found between growth and wood density. Novel QTLs were revealed as compared with those previously detected on DArT-based maps for similar ages in Eucalyptus. Eleven to 585 positional candidate genes were obained for a 56-month-old QTL through aligning QTL confidence interval with the E. grandis genome. These results will assist in comparative genomics studies, targeted gene characterization, and marker-assisted selection in Eucalyptus and the related taxa. PMID:26695430

  15. Comparative Genomics Analyses Reveal Extensive Chromosome Colinearity and Novel Quantitative Trait Loci in Eucalyptus

    PubMed Central

    Weng, Qijie; Li, Mei; Yu, Xiaoli; Guo, Yong; Wang, Yu; Zhang, Xiaohong; Gan, Siming

    2015-01-01

    Dense genetic maps, along with quantitative trait loci (QTLs) detected on such maps, are powerful tools for genomics and molecular breeding studies. In the important woody genus Eucalyptus, the recent release of E. grandis genome sequence allows for sequence-based genomic comparison and searching for positional candidate genes within QTL regions. Here, dense genetic maps were constructed for E. urophylla and E. tereticornis using genomic simple sequence repeats (SSR), expressed sequence tag (EST) derived SSR, EST-derived cleaved amplified polymorphic sequence (EST-CAPS), and diversity arrays technology (DArT) markers. The E. urophylla and E. tereticornis maps comprised 700 and 585 markers across 11 linkage groups, totaling at 1,208.2 and 1,241.4 cM in length, respectively. Extensive synteny and colinearity were observed as compared to three earlier DArT-based eucalypt maps (two maps with E. grandis × E. urophylla and one map of E. globulus) and with the E. grandis genome sequence. Fifty-three QTLs for growth (10–56 months of age) and wood density (56 months) were identified in 22 discrete regions on both maps, in which only one colocalizaiton was found between growth and wood density. Novel QTLs were revealed as compared with those previously detected on DArT-based maps for similar ages in Eucalyptus. Eleven to 585 positional candidate genes were obained for a 56-month-old QTL through aligning QTL confidence interval with the E. grandis genome. These results will assist in comparative genomics studies, targeted gene characterization, and marker-assisted selection in Eucalyptus and the related taxa. PMID:26695430

  16. Development and application of a novel genome-wide SNP array reveals domestication history in soybean

    PubMed Central

    Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue

    2016-01-01

    Domestication of soybeans occurred under the intense human-directed selections aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied with a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean. PMID:26856884

  17. Genomic and physiological analysis reveals versatile metabolic capacity of deep-sea Photobacterium phosphoreum ANT-2200.

    PubMed

    Zhang, Sheng-Da; Santini, Claire-Lise; Zhang, Wei-Jia; Barbe, Valérie; Mangenot, Sophie; Guyomar, Charlotte; Garel, Marc; Chen, Hai-Tao; Li, Xue-Gong; Yin, Qun-Jian; Zhao, Yuan; Armengaud, Jean; Gaillard, Jean-Charles; Martini, Séverine; Pradel, Nathalie; Vidaud, Claude; Alberto, François; Médigue, Claudine; Tamburini, Christian; Wu, Long-Fei

    2016-05-01

    Bacteria of the genus Photobacterium thrive worldwide in oceans and show substantial eco-physiological diversity including free-living, symbiotic and piezophilic life styles. Genomic characteristics underlying this variability across species are poorly understood. Here we carried out genomic and physiological analysis of Photobacterium phosphoreum strain ANT-2200, the first deep-sea luminous bacterium of which the genome has been sequenced. Using optical mapping we updated the genomic data and reassembled it into two chromosomes and a large plasmid. Genomic analysis revealed a versatile energy metabolic potential and physiological analysis confirmed its growth capacity by deriving energy from fermentation of glucose or maltose, by respiration with formate as electron donor and trimethlyamine N-oxide (TMAO), nitrate or fumarate as electron acceptors, or by chemo-organo-heterotrophic growth in rich media. Despite that it was isolated at a site with saturated dissolved oxygen, the ANT-2200 strain possesses four gene clusters coding for typical anaerobic enzymes, the TMAO reductases. Elevated hydrostatic pressure enhances the TMAO reductase activity, mainly due to the increase of isoenzyme TorA1. The high copy number of the TMAO reductase isoenzymes and pressure-enhanced activity might imply a strategy developed by bacteria to adapt to deep-sea habitats where the instant TMAO availability may increase with depth. PMID:27039108

  18. Genome-sequence analysis of Acinetobacter johnsonii MB44 reveals potential nematode-virulent factors.

    PubMed

    Tian, Shijing; Ali, Muhammad; Xie, Li; Li, Lin

    2016-01-01

    Acinetobacter johnsonii is generally recognized as a nonpathogenic bacterium although it is often found in hospital environments. However, a newly identified isolate of this species from a frost-plant-tissue sample, namely, A. johnsonii MB44, showed significant nematicidal activity against the model organism Caenorhabditis elegans. To expand our understanding of this bacterial species, we generated a draft genome sequence of MB44 and analyzed its genomic features related to nematicidal attributes. The 3.36 Mb long genome contains 3636 predicted protein-coding genes and 95 RNA genes (including 14 rRNA genes), with a G + C content of 41.37 %. Genomic analysis of the prediction of nematicidal proteins using the software MP3 revealed a total of 108 potential virulence proteins. Some of these proteins were homologous to the known virulent proteins identified from Acinetobacter baumannii, a pathogenic species of the genus Acinetobacter. These virulent proteins included the outer membrane protein A, the phospholipase D, and penicillin-binding protein 7/8. Moreover, one siderophore biosynthesis gene cluster and one capsular polysaccharide gene cluster, which were predicted to be important virulence factors for C. elegans, were identified in the MB44 genome. The current study demonstrated that A. johnsonii MB44, with its nematicidal activity, could be an opportunistic pathogen to animals. PMID:27429894

  19. De Novo Sequences of Haloquadratum walsbyi from Lake Tyrrell, Australia, Reveal a Variable Genomic Landscape

    PubMed Central

    Tully, Benjamin J.; Emerson, Joanne B.; Andrade, Karen; Brocks, Jochen J.; Allen, Eric E.; Banfield, Jillian F.; Heidelberg, Karla B.

    2015-01-01

    Hypersaline systems near salt saturation levels represent an extreme environment, in which organisms grow and survive near the limits of life. One of the abundant members of the microbial communities in hypersaline systems is the square archaeon, Haloquadratum walsbyi. Utilizing a short-read metagenome from Lake Tyrrell, a hypersaline ecosystem in Victoria, Australia, we performed a comparative genomic analysis of H. walsbyi to better understand the extent of variation between strains/subspecies. Results revealed that previously isolated strains/subspecies do not fully describe the complete repertoire of the genomic landscape present in H. walsbyi. Rearrangements, insertions, and deletions were observed for the Lake Tyrrell derived Haloquadratum genomes and were supported by environmental de novo sequences, including shifts in the dominant genomic landscape of the two most abundant strains. Analysis pertaining to halomucins indicated that homologs for this large protein are not a feature common for all species of Haloquadratum. Further, we analyzed ATP-binding cassette transporters (ABC-type transporters) for evidence of niche partitioning between different strains/subspecies. We were able to identify unique and variable transporter subunits from all five genomes analyzed and the de novo environmental sequences, suggesting that differences in nutrient and carbon source acquisition may play a role in maintaining distinct strains/subspecies. PMID:25709557

  20. Development and application of a novel genome-wide SNP array reveals domestication history in soybean.

    PubMed

    Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue

    2016-01-01

    Domestication of soybeans occurred under the intense human-directed selections aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied with a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean. PMID:26856884

  1. A comparison of the Caulobacter NA1000 and K31 genomes reveals extensive genome rearrangements and differences in metabolic potential

    PubMed Central

    Ash, Kurt; Brown, Theta; Watford, Tynetta; Scott, LaTia E.; Stephens, Craig; Ely, Bert

    2014-01-01

    The genus Caulobacter is found in a variety of habitats and is known for its ability to thrive in low-nutrient conditions. K31 is a novel Caulobacter isolate that has the ability to tolerate copper and chlorophenols, and can grow at 4°C with a doubling time of 40 h. K31 contains a 5.5 Mb chromosome that codes for more than 5500 proteins and two large plasmids (234 and 178 kb) that code for 438 additional proteins. A comparison of the K31 and the Caulobacter crescentus NA1000 genomes revealed extensive rearrangements of gene order, suggesting that the genomes had been randomly scrambled. However, a careful analysis revealed that the distance from the origin of replication was conserved for the majority of the genes and that many of the rearrangements involved inversions that included the origin of replication. On a finer scale, numerous small indels were observed. K31 proteins involved in essential functions shared 80–95% amino acid sequence identity with their C. crescentus homologues, while other homologue pairs tended to have lower levels of identity. In addition, the K31 chromosome contains more than 1600 genes with no homologue in NA1000. PMID:25274120

  2. Comparative Genomics and Transcriptomics Analyses Reveal Divergent Lifestyle Features of Nematode Endoparasitic Fungus Hirsutella minnesotensis

    PubMed Central

    Lai, Yiling; Liu, Keke; Zhang, Xinyu; Zhang, Xiaoling; Li, Kuan; Wang, Niuniu; Shu, Chi; Wu, Yunpeng; Wang, Chengshu; Bushley, Kathryn E.; Xiang, Meichun; Liu, Xingzhong

    2014-01-01

    Hirsutella minnesotensis [Ophiocordycipitaceae (Hypocreales, Ascomycota)] is a dominant endoparasitic fungus by using conidia that adhere to and penetrate the secondary stage juveniles of soybean cyst nematode. Its genome was de novo sequenced and compared with five entomopathogenic fungi in the Hypocreales and three nematode-trapping fungi in the Orbiliales (Ascomycota). The genome of H. minnesotensis is 51.4 Mb and encodes 12,702 genes enriched with transposable elements up to 32%. Phylogenomic analysis revealed that H. minnesotensis was diverged from entomopathogenic fungi in Hypocreales. Genome of H. minnesotensis is similar to those of entomopathogenic fungi to have fewer genes encoding lectins for adhesion and glycoside hydrolases for cellulose degradation, but is different from those of nematode-trapping fungi to possess more genes for protein degradation, signal transduction, and secondary metabolism. Those results indicate that H. minnesotensis has evolved different mechanism for nematode endoparasitism compared with nematode-trapping fungi. Transcriptomics analyses for the time-scale parasitism revealed the upregulations of lectins, secreted proteases and the genes for biosynthesis of secondary metabolites that could be putatively involved in host surface adhesion, cuticle degradation, and host manipulation. Genome and transcriptome analyses provided comprehensive understanding of the evolution and lifestyle of nematode endoparasitism. PMID:25359922

  3. Exploration of sequence space as the basis of viral RNA genome segmentation

    PubMed Central

    Moreno, Elena; Ojosnegros, Samuel; García-Arriaza, Juan; Escarmís, Cristina; Domingo, Esteban; Perales, Celia

    2014-01-01

    The mechanisms of viral RNA genome segmentation are unknown. On extensive passage of foot-and-mouth disease virus in baby hamster kidney-21 cells, the virus accumulated multiple point mutations and underwent a transition akin to genome segmentation. The standard single RNA genome molecule was replaced by genomes harboring internal in-frame deletions affecting the L- or capsid-coding region. These genomes were infectious and killed cells by complementation. Here we show that the point mutations in the nonstructural protein-coding region (P2, P3) that accumulated in the standard genome before segmentation increased the relative fitness of the segmented version relative to the standard genome. Fitness increase was documented by intracellular expression of virus-coded proteins and infectious progeny production by RNAs with the internal deletions placed in the sequence context of the parental and evolved genome. The complementation activity involved several viral proteins, one of them being the leader proteinase L. Thus, a history of genetic drift with accumulation of point mutations was needed to allow a major variation in the structure of a viral genome. Thus, exploration of sequence space by a viral genome (in this case an unsegmented RNA) can reach a point of the space in which a totally different genome structure (in this case, a segmented RNA) is favored over the form that performed the exploration. PMID:24757055

  4. Comparative Genomics Reveals Insight into Virulence Strategies of Plant Pathogenic Oomycetes

    PubMed Central

    Adhikari, Bishwo N.; Hamilton, John P.; Zerillo, Marcelo M.; Tisserat, Ned; Lévesque, C. André; Buell, C. Robin

    2013-01-01

    The kingdom Stramenopile includes diatoms, brown algae, and oomycetes. Plant pathogenic oomycetes, including Phytophthora, Pythium and downy mildew species, cause devastating diseases on a wide range of host species and have a significant impact on agriculture. Here, we report comparative analyses on the genomes of thirteen straminipilous species, including eleven plant pathogenic oomycetes, to explore common features linked to their pathogenic lifestyle. We report the sequencing, assembly, and annotation of six Pythium genomes and comparison with other stramenopiles including photosynthetic diatoms, and other plant pathogenic oomycetes such as Phytophthora species, Hyaloperonospora arabidopsidis, and Pythium ultimum var. ultimum. Novel features of the oomycete genomes include an expansion of genes encoding secreted effectors and plant cell wall degrading enzymes in Phytophthora species and an over-representation of genes involved in proteolytic degradation and signal transduction in Pythium species. A complete lack of classical RxLR effectors was observed in the seven surveyed Pythium genomes along with an overall reduction of pathogenesis-related gene families in H. arabidopsidis. Comparative analyses revealed fewer genes encoding enzymes involved in carbohydrate metabolism in Pythium species and H. arabidopsidis as compared to Phytophthora species, suggesting variation in virulence mechanisms within plant pathogenic oomycete species. Shared features between the oomycetes and diatoms revealed common mechanisms of intracellular signaling and transportation. Our analyses demonstrate the value of comparative genome analyses for exploring the evolution of pathogenesis and survival mechanisms in the oomycetes. The comparative analyses of seven Pythium species with the closely related oomycetes, Phytophthora species and H. arabidopsidis, and distantly related diatoms provide insight into genes that underlie virulence. PMID:24124466

  5. Genome sequence of the necrotrophic plant pathogen Pythium ultimum reveals original pathogenicity mechanisms and effector repertoire

    PubMed Central

    2010-01-01

    Background Pythium ultimum is a ubiquitous oomycete plant pathogen responsible for a variety of diseases on a broad range of crop and ornamental species. Results The P. ultimum genome (42.8 Mb) encodes 15,290 genes and has extensive sequence similarity and synteny with related Phytophthora species, including the potato blight pathogen Phytophthora infestans. Whole transcriptome sequencing revealed expression of 86% of genes, with detectable differential expression of suites of genes under abiotic stress and in the presence of a host. The predicted proteome includes a large repertoire of proteins involved in plant pathogen interactions, although, surprisingly, the P. ultimum genome does not encode any classical RXLR effectors and relatively few Crinkler genes in comparison to related phytopathogenic oomycetes. A lower number of enzymes involved in carbohydrate metabolism were present compared to Phytophthora species, with the notable absence of cutinases, suggesting a significant difference in virulence mechanisms between P. ultimum and more host-specific oomycete species. Although we observed a high degree of orthology with Phytophthora genomes, there were novel features of the P. ultimum proteome, including an expansion of genes involved in proteolysis and genes unique to Pythium. We identified a small gene family of cadherins, proteins involved in cell adhesion, the first report of these in a genome outside the metazoans. Conclusions Access to the P. ultimum genome has revealed not only core pathogenic mechanisms within the oomycetes but also lineage-specific genes associated with the alternative virulence and lifestyles found within the pythiaceous lineages compared to the Peronosporaceae. PMID:20626842

  6. Comparative genomics reveals insight into virulence strategies of plant pathogenic oomycetes.

    PubMed

    Adhikari, Bishwo N; Hamilton, John P; Zerillo, Marcelo M; Tisserat, Ned; Lévesque, C André; Buell, C Robin

    2013-01-01

    The kingdom Stramenopile includes diatoms, brown algae, and oomycetes. Plant pathogenic oomycetes, including Phytophthora, Pythium and downy mildew species, cause devastating diseases on a wide range of host species and have a significant impact on agriculture. Here, we report comparative analyses on the genomes of thirteen straminipilous species, including eleven plant pathogenic oomycetes, to explore common features linked to their pathogenic lifestyle. We report the sequencing, assembly, and annotation of six Pythium genomes and comparison with other stramenopiles including photosynthetic diatoms, and other plant pathogenic oomycetes such as Phytophthora species, Hyaloperonospora arabidopsidis, and Pythium ultimum var. ultimum. Novel features of the oomycete genomes include an expansion of genes encoding secreted effectors and plant cell wall degrading enzymes in Phytophthora species and an over-representation of genes involved in proteolytic degradation and signal transduction in Pythium species. A complete lack of classical RxLR effectors was observed in the seven surveyed Pythium genomes along with an overall reduction of pathogenesis-related gene families in H. arabidopsidis. Comparative analyses revealed fewer genes encoding enzymes involved in carbohydrate metabolism in Pythium species and H. arabidopsidis as compared to Phytophthora species, suggesting variation in virulence mechanisms within plant pathogenic oomycete species. Shared features between the oomycetes and diatoms revealed common mechanisms of intracellular signaling and transportation. Our analyses demonstrate the value of comparative genome analyses for exploring the evolution of pathogenesis and survival mechanisms in the oomycetes. The comparative analyses of seven Pythium species with the closely related oomycetes, Phytophthora species and H. arabidopsidis, and distantly related diatoms provide insight into genes that underlie virulence. PMID:24124466

  7. Genomic analysis reveals Lactobacillus sanfranciscensis as stable element in traditional sourdoughs

    PubMed Central

    2011-01-01

    Sourdough has played a significant role in human nutrition and culture for thousands of years and is still of eminent importance for human diet and the bakery industry. Lactobacillus sanfranciscensis is the predominant key bacterium in traditionally fermented sourdoughs. The genome of L. sanfranciscensis TMW 1.1304 isolated from an industrial sourdough fermentation was sequenced with a combined Sanger/454-pyrosequencing approach followed by gap closing by walking on fosmids. The sequencing data revealed a circular chromosomal sequence of 1,298,316 bp and two additional plasmids, pLS1 and pLS2, with sizes of 58,739 bp and 18,715 bp, which are predicted to encode 1,437, 63 and 19 orfs, respectively. The overall GC content of the chromosome is 34.71%. Several specific features appear to contribute to the ability of L. sanfranciscensis to outcompete other bacteria in the fermentation. L. sanfranciscensis contains the smallest genome within the lactobacilli and the highest density of ribosomal RNA operons per Mbp genome among all known genomes of free-living bacteria, which is important for the rapid growth characteristics of the organism. A high frequency of gene inactivation and elimination indicates a process of reductive evolution. The biosynthetic capacity for amino acids scarcely availably in cereals and exopolysaccharides reveal the molecular basis for an autochtonous sourdough organism with potential for further exploitation in functional foods. The presence of two CRISPR/cas loci versus a high number of transposable elements suggests recalcitrance to gene intrusion and high intrinsic genome plasticity. PMID:21995419

  8. Whole-Genome Sequencing Reveals Diverse Models of Structural Variations in Esophageal Squamous Cell Carcinoma

    PubMed Central

    Cheng, Caixia; Zhou, Yong; Li, Hongyi; Xiong, Teng; Li, Shuaicheng; Bi, Yanghui; Kong, Pengzhou; Wang, Fang; Cui, Heyang; Li, Yaoping; Fang, Xiaodong; Yan, Ting; Li, Yike; Wang, Juan; Yang, Bin; Zhang, Ling; Jia, Zhiwu; Song, Bin; Hu, Xiaoling; Yang, Jie; Qiu, Haile; Zhang, Gehong; Liu, Jing; Xu, Enwei; Shi, Ruyi; Zhang, Yanyan; Liu, Haiyan; He, Chanting; Zhao, Zhenxiang; Qian, Yu; Rong, Ruizhou; Han, Zhiwei; Zhang, Yanlin; Luo, Wen; Wang, Jiaqian; Peng, Shaoliang; Yang, Xukui; Li, Xiangchun; Li, Lin; Fang, Hu; Liu, Xingmin; Ma, Li; Chen, Yunqing; Guo, Shiping; Chen, Xing; Xi, Yanfeng; Li, Guodong; Liang, Jianfang; Yang, Xiaofeng; Guo, Jiansheng; Jia, JunMei; Li, Qingshan; Cheng, Xiaolong; Zhan, Qimin; Cui, Yongping

    2016-01-01

    Comprehensive identification of somatic structural variations (SVs) and understanding their mutational mechanisms in cancer might contribute to understanding biological differences and help to identify new therapeutic targets. Unfortunately, characterization of complex SVs across the whole genome and the mutational mechanisms underlying esophageal squamous cell carcinoma (ESCC) is largely unclear. To define a comprehensive catalog of somatic SVs, affected target genes, and their underlying mechanisms in ESCC, we re-analyzed whole-genome sequencing (WGS) data from 31 ESCCs using Meerkat algorithm to predict somatic SVs and Patchwork to determine copy-number changes. We found deletions and translocations with NHEJ and alt-EJ signature as the dominant SV types, and 16% of deletions were complex deletions. SVs frequently led to disruption of cancer-associated genes (e.g., CDKN2A and NOTCH1) with different mutational mechanisms. Moreover, chromothripsis, kataegis, and breakage-fusion-bridge (BFB) were identified as contributing to locally mis-arranged chromosomes that occurred in 55% of ESCCs. These genomic catastrophes led to amplification of oncogene through chromothripsis-derived double-minute chromosome formation (e.g., FGFR1 and LETM2) or BFB-affected chromosomes (e.g., CCND1, EGFR, ERBB2, MMPs, and MYC), with approximately 30% of ESCCs harboring BFB-derived CCND1 amplification. Furthermore, analyses of copy-number alterations reveal high frequency of whole-genome duplication (WGD) and recurrent focal amplification of CDCA7 that might act as a potential oncogene in ESCC. Our findings reveal molecular defects such as chromothripsis and BFB in malignant transformation of ESCCs and demonstrate diverse models of SVs-derived target genes in ESCCs. These genome-wide SV profiles and their underlying mechanisms provide preventive, diagnostic, and therapeutic implications for ESCCs. PMID:26833333

  9. Whole-Genome Sequencing Reveals Diverse Models of Structural Variations in Esophageal Squamous Cell Carcinoma.

    PubMed

    Cheng, Caixia; Zhou, Yong; Li, Hongyi; Xiong, Teng; Li, Shuaicheng; Bi, Yanghui; Kong, Pengzhou; Wang, Fang; Cui, Heyang; Li, Yaoping; Fang, Xiaodong; Yan, Ting; Li, Yike; Wang, Juan; Yang, Bin; Zhang, Ling; Jia, Zhiwu; Song, Bin; Hu, Xiaoling; Yang, Jie; Qiu, Haile; Zhang, Gehong; Liu, Jing; Xu, Enwei; Shi, Ruyi; Zhang, Yanyan; Liu, Haiyan; He, Chanting; Zhao, Zhenxiang; Qian, Yu; Rong, Ruizhou; Han, Zhiwei; Zhang, Yanlin; Luo, Wen; Wang, Jiaqian; Peng, Shaoliang; Yang, Xukui; Li, Xiangchun; Li, Lin; Fang, Hu; Liu, Xingmin; Ma, Li; Chen, Yunqing; Guo, Shiping; Chen, Xing; Xi, Yanfeng; Li, Guodong; Liang, Jianfang; Yang, Xiaofeng; Guo, Jiansheng; Jia, JunMei; Li, Qingshan; Cheng, Xiaolong; Zhan, Qimin; Cui, Yongping

    2016-02-01

    Comprehensive identification of somatic structural variations (SVs) and understanding their mutational mechanisms in cancer might contribute to understanding biological differences and help to identify new therapeutic targets. Unfortunately, characterization of complex SVs across the whole genome and the mutational mechanisms underlying esophageal squamous cell carcinoma (ESCC) is largely unclear. To define a comprehensive catalog of somatic SVs, affected target genes, and their underlying mechanisms in ESCC, we re-analyzed whole-genome sequencing (WGS) data from 31 ESCCs using Meerkat algorithm to predict somatic SVs and Patchwork to determine copy-number changes. We found deletions and translocations with NHEJ and alt-EJ signature as the dominant SV types, and 16% of deletions were complex deletions. SVs frequently led to disruption of cancer-associated genes (e.g., CDKN2A and NOTCH1) with different mutational mechanisms. Moreover, chromothripsis, kataegis, and breakage-fusion-bridge (BFB) were identified as contributing to locally mis-arranged chromosomes that occurred in 55% of ESCCs. These genomic catastrophes led to amplification of oncogene through chromothripsis-derived double-minute chromosome formation (e.g., FGFR1 and LETM2) or BFB-affected chromosomes (e.g., CCND1, EGFR, ERBB2, MMPs, and MYC), with approximately 30% of ESCCs harboring BFB-derived CCND1 amplification. Furthermore, analyses of copy-number alterations reveal high frequency of whole-genome duplication (WGD) and recurrent focal amplification of CDCA7 that might act as a potential oncogene in ESCC. Our findings reveal molecular defects such as chromothripsis and BFB in malignant transformation of ESCCs and demonstrate diverse models of SVs-derived target genes in ESCCs. These genome-wide SV profiles and their underlying mechanisms provide preventive, diagnostic, and therapeutic implications for ESCCs. PMID:26833333

  10. Partial sequencing of the bottle gourd genome reveals markers useful for phylogenetic analysis and breeding

    PubMed Central

    2011-01-01

    Background Bottle gourd [Lagenaria siceraria (Mol.) Standl.] is an important cucurbit crop worldwide. Archaeological research indicates that bottle gourd was domesticated more than 10,000 years ago, making it one of the earliest plants cultivated by man. In spite of its widespread importance and long history of cultivation almost nothing has been known about the genome of this species thus far. Results We report here the partial sequencing of bottle gourd genome using the 454 GS-FLX Titanium sequencing platform. A total of 150,253 sequence reads, which were assembled into 3,994 contigs and 82,522 singletons were generated. The total length of the non-redundant singletons/assemblies is 32 Mb, theoretically covering ~ 10% of the bottle gourd genome. Functional annotation of the sequences revealed a broad range of functional types, covering all the three top-level ontologies. Comparison of the gene sequences between bottle gourd and the model cucurbit cucumber (Cucumis sativus) revealed a 90% sequence similarity on average. Using the sequence information, 4395 microsatellite-containing sequences were identified and 400 SSR markers were developed, of which 94% amplified bands of anticipated sizes. Transferability of these markers to four other cucurbit species showed obvious decline with increasing phylogenetic distance. From analyzing polymorphisms of a subset of 14 SSR markers assayed on 44 representative China bottle gourd varieties/landraces, a principal coordinates (PCo) analysis output and a UPGMA-based dendrogram were constructed. Bottle gourd accessions tended to group by fruit shape rather than geographic origin, although in certain subclades the lines from the same or close origin did tend to cluster. Conclusions This work provides an initial basis for genome characterization, gene isolation and comparative genomics analysis in bottle gourd. The SSR markers developed would facilitate marker assisted breeding schemes for efficient introduction of desired

  11. Draft Genome Sequences of Ralstonia pickettii Strains SSH4 and CW2, Isolated from Space Equipment

    PubMed Central

    Monsieurs, Pieter; Mijnendonckx, Kristel; Provoost, Ann; Venkateswaran, Kasthuri; Ott, C. Mark; Leys, Natalie

    2014-01-01

    Ralstonia pickettii SSH4 and CW2 were isolated from space equipment. Here, we report their draft genome sequences with the aim of gaining insight into their potential to adapt to these environments. PMID:25189592

  12. Draft Genome Sequences of Ralstonia pickettii Strains SSH4 and CW2, Isolated from Space Equipment.

    PubMed

    Monsieurs, Pieter; Mijnendonckx, Kristel; Provoost, Ann; Venkateswaran, Kasthuri; Ott, C Mark; Leys, Natalie; Van Houdt, Rob

    2014-01-01

    Ralstonia pickettii SSH4 and CW2 were isolated from space equipment. Here, we report their draft genome sequences with the aim of gaining insight into their potential to adapt to these environments. PMID:25189592

  13. Parallel and Space-Efficient Construction of Burrows-Wheeler Transform and Suffix Array for Big Genome Data.

    PubMed

    Liu, Yongchao; Hankeln, Thomas; Schmidt, Bertil

    2016-01-01

    Next-generation sequencing technologies have led to the sequencing of more and more genomes, propelling related research into the era of big data. In this paper, we present ParaBWT, a parallelized Burrows-Wheeler transform (BWT) and suffix array construction algorithm for big genome data. In ParaBWT, we have investigated a progressive construction approach to constructing the BWT of single genome sequences in linear space complexity, but with a small constant factor. This approach has been further parallelized using multi-threading based on a master-slave coprocessing model. After gaining the BWT, the suffix array is constructed in a memory-efficient manner. The performance of ParaBWT has been evaluated using two sequences generated from two human genome assemblies: the Ensembl Homo sapiens assembly and the human reference genome. Our performance comparison to FMD-index and Bwt-disk reveals that on 12 CPU cores, ParaBWT runs up to 2.2× faster than FMD-index and up to 99.0× faster than Bwt-disk. BWT construction algorithms for very long genomic sequences are time consuming and (due to their incremental nature) inherently difficult to parallelize. Thus, their parallelization is challenging and even relatively small speedups like the ones of our method over FMD-index are of high importance to research. ParaBWT is written in C++, and is freely available at http://parabwt.sourceforge.net. PMID:27295644

  14. Genomic DNA Methylation Analyses Reveal the Distinct Profiles in Castor Bean Seeds with Persistent Endosperms.

    PubMed

    Xu, Wei; Yang, Tianquan; Dong, Xue; Li, De-Zhu; Liu, Aizhong

    2016-06-01

    Investigations of genomic DNA methylation in seeds have been restricted to a few model plants. The endosperm genomic DNA hypomethylation has been identified in angiosperm, but it is difficult to dissect the mechanism of how this hypomethylation is established and maintained because endosperm is ephemeral and disappears with seed development in most dicots. Castor bean (Ricinus communis), unlike Arabidopsis (Arabidopsis thaliana), endosperm is persistent throughout seed development, providing an excellent model in which to dissect the mechanism of endosperm genomic hypomethylation in dicots. We characterized the DNA methylation-related genes encoding DNA methyltransferases and demethylases and analyzed their expression profiles in different tissues. We examined genomic methylation including CG, CHG, and CHH contexts in endosperm and embryo tissues using bisulfite sequencing and revealed that the CHH methylation extent in endosperm and embryo was, unexpectedly, substantially higher than in previously studied plants, irrespective of the CHH percentage in their genomes. In particular, we found that the endosperm exhibited a global reduction in CG and CHG methylation extents relative to the embryo, markedly switching global gene expression. However, CHH methylation occurring in endosperm did not exhibit a significant reduction. Combining with the expression of 24-nucleotide small interfering RNAs (siRNAs) mapped within transposable element (TE) regions and genes involved in the RNA-directed DNA methylation pathway, we demonstrate that the 24-nucleotide siRNAs played a critical role in maintaining CHH methylation and repressing the activation of TEs in persistent endosperm development. This study discovered a novel genomic DNA methylation pattern and proposes the potential mechanism occurring in dicot seeds with persistent endosperm. PMID:27208275

  15. Metagenome sequence of Elaphomyces granulatus from sporocarp tissue reveals Ascomycota ectomycorrhizal fingerprints of genome expansion and a Proteobacteria-rich microbiome.

    PubMed

    Quandt, C Alisha; Kohler, Annegret; Hesse, Cedar N; Sharpton, Thomas J; Martin, Francis; Spatafora, Joseph W

    2015-08-01

    Many obligate symbiotic fungi are difficult to maintain in culture, and there is a growing need for alternative approaches to obtaining tissue and subsequent genomic assemblies from such species. In this study, the genome of Elaphomyces granulatus was sequenced from sporocarp tissue. The genome assembly remains on many contigs, but gene space is estimated to be mostly complete. Phylogenetic analyses revealed that the Elaphomyces lineage is most closely related to Talaromyces and Trichocomaceae s.s. The genome of E. granulatus is reduced in carbohydrate-active enzymes, despite a large expansion in genome size, both of which are consistent with what is seen in Tuber melanosporum, the other sequenced ectomycorrhizal ascomycete. A large number of transposable elements are predicted in the E. granulatus genome, especially Gypsy-like long terminal repeats, and there has also been an expansion in helicases. The metagenome is a complex community dominated by bacteria in Bradyrhizobiaceae, and there is evidence to suggest that the community may be reduced in functional capacity as estimated by KEGG pathways. Through the sequencing of sporocarp tissue, this study has provided insights into Elaphomyces phylogenetics, genomics, metagenomics and the evolution of the ectomycorrhizal association. PMID:25753751

  16. Unusual Light in Dark Space Revealed by Los Alamos, NASA

    ScienceCinema

    Smidt, Joseph

    2015-01-05

    By looking at the dark spaces between visible galaxies and stars the NASA/JPL CIBER sounding rocket experiment has produced data that could redefine what constitutes a galaxy. CIBER, the Cosmic Infrared Background Experiment, is designed to understand the physics going on between visible stars and galaxies. The relatively small, sub-orbital rocket unloads a camera that snaps pictures of the night sky in near-infrared wavelengths, between 1.2 and 1.6 millionth of a meter. Scientists take the data and remove all the known visible stars and galaxies and quantify what is left.

  17. Unusual Light in Dark Space Revealed by Los Alamos, NASA

    SciTech Connect

    Smidt, Joseph

    2014-11-07

    By looking at the dark spaces between visible galaxies and stars the NASA/JPL CIBER sounding rocket experiment has produced data that could redefine what constitutes a galaxy. CIBER, the Cosmic Infrared Background Experiment, is designed to understand the physics going on between visible stars and galaxies. The relatively small, sub-orbital rocket unloads a camera that snaps pictures of the night sky in near-infrared wavelengths, between 1.2 and 1.6 millionth of a meter. Scientists take the data and remove all the known visible stars and galaxies and quantify what is left.

  18. Exploration of the Chemical Space of Public Genomic Databases

    EPA Science Inventory

    The current project aims to chemically index the content of public genomic databases to make these data accessible in relation to other publicly available, chemically-indexed toxicological information.

  19. DEFINING THE CHEMICAL SPACE OF PUBLIC GENOMIC DATA.

    EPA Science Inventory

    The pharmaceutical industry has demonstrated success in integrating of chemogenomic knowledge into predictive toxicological models, due in part to industry's access to large amounts of proprietary and commercial reference genomic data sets.

  20. Advances in the translational genomics of neuroblastoma: From improving risk stratification and revealing novel biology to identifying actionable genomic alterations.

    PubMed

    Bosse, Kristopher R; Maris, John M

    2016-01-01

    Neuroblastoma is an embryonal malignancy that commonly affects young children and is remarkably heterogenous in its malignant potential. Recently, the genetic basis of neuroblastoma has come into focus and not only has catalyzed a more comprehensive understanding of neuroblastoma tumorigenesis but also has revealed novel oncogenic vulnerabilities that are being therapeutically leveraged. Neuroblastoma is a model pediatric solid tumor in its use of recurrent genomic alterations, such as high-level MYCN (v-myc avian myelocytomatosis viral oncogene neuroblastoma-derived homolog) amplification, for risk stratification. Given the relative paucity of recurrent, activating, somatic point mutations or gene fusions in primary neuroblastoma tumors studied at initial diagnosis, innovative treatment approaches beyond small molecules targeting mutated or dysregulated kinases will be required moving forward to achieve noticeable improvements in overall patient survival. However, the clonally acquired, oncogenic aberrations in relapsed neuroblastomas are currently being defined and may offer an opportunity to improve patient outcomes with molecularly targeted therapy directed toward aberrantly regulated pathways in relapsed disease. This review summarizes the current state of knowledge about neuroblastoma genetics and genomics, highlighting the improved prognostication and potential therapeutic opportunities that have arisen from recent advances in understanding germline predisposition, recurrent segmental chromosomal alterations, somatic point mutations and translocations, and clonal evolution in relapsed neuroblastoma. PMID:26539795

  1. A GENOME-WIDE LINKAGE AND ASSOCIATION SCAN REVEALS NOVEL LOCI FOR AUTISM

    PubMed Central

    Weiss, Lauren A.; Arking, Dan E.

    2009-01-01

    Summary Although autism is a highly heritable neurodevelopmental disorder, attempts to identify specific susceptibility genes have thus far met with limited success 1. Genome-wide association studies (GWAS) using half a million or more markers, particularly those with very large sample sizes achieved through meta-analysis, have shown great success in mapping genes for other complex genetic traits (http://www.genome.gov/26525384). Consequently, we initiated a linkage and association mapping study using half a million genome-wide SNPs in a common set of 1,031 multiplex autism families (1,553 affected offspring). We identified regions of suggestive and significant linkage on chromosomes 6q27 and 20p13, respectively. Initial analysis did not yield genome-wide significant associations; however, genotyping of top hits in additional families revealed a SNP on chromosome 5p15 (between SEMA5A and TAS2R1) that was significantly associated with autism (P = 2 × 10−7). We also demonstrated that expression of SEMA5A is reduced in brains from autistic patients, further implicating SEMA5A as an autism susceptibility gene. The linkage regions reported here provide targets for rare variation screening while the discovery of a single novel association demonstrates the action of common variants. PMID:19812673

  2. Unique Features of a Japanese ‘Candidatus Liberibacter asiaticus’ Strain Revealed by Whole Genome Sequencing

    PubMed Central

    Katoh, Hiroshi; Miyata, Shin-ichi; Inoue, Hiromitsu; Iwanami, Toru

    2014-01-01

    Citrus greening (huanglongbing) is the most destructive disease of citrus worldwide. It is spread by citrus psyllids and is associated with phloem-limited bacteria of three species of α-Proteobacteria, namely, ‘Candidatus Liberibacter asiaticus’, ‘Ca. L. americanus’, and ‘Ca. L. africanus’. Recent findings suggested that some Japanese strains lack the bacteriophage-type DNA polymerase region (DNA pol), in contrast to the Floridian psy62 strain. The whole genome sequence of the pol-negative ‘Ca. L. asiaticus’ Japanese isolate Ishi-1 was determined by metagenomic analysis of DNA extracted from ‘Ca. L. asiaticus’-infected psyllids and leaf midribs. The 1.19-Mb genome has an average 36.32% GC content. Annotation revealed 13 operons encoding rRNA and 44 tRNA genes, but no typical bacterial pathogenesis-related genes were located within the genome, similar to the Floridian psy62 and Chinese gxpsy. In contrast to other ‘Ca. L. asiaticus’ strains, the genome of the Japanese Ishi-1 strain lacks a prophage-related region. PMID:25180586

  3. Genomic Signatures Reveal New Evidences for Selection of Important Traits in Domestic Cattle

    PubMed Central

    Xu, Lingyang; Bickhart, Derek M.; Cole, John B.; Schroeder, Steven G.; Song, Jiuzhou; Tassell, Curtis P. Van; Sonstegard, Tad S.; Liu, George E.

    2015-01-01

    We investigated diverse genomic selections using high-density single nucleotide polymorphism data of five distinct cattle breeds. Based on allele frequency differences, we detected hundreds of candidate regions under positive selection across Holstein, Angus, Charolais, Brahman, and N'Dama. In addition to well-known genes such as KIT, MC1R, ASIP, GHR, LCORL, NCAPG, WIF1, and ABCA12, we found evidence for a variety of novel and less-known genes under selection in cattle, such as LAP3, SAR1B, LRIG3, FGF5, and NUDCD3. Selective sweeps near LAP3 were then validated by next-generation sequencing. Genome-wide association analysis involving 26,362 Holsteins confirmed that LAP3 and SAR1B were related to milk production traits, suggesting that our candidate regions were likely functional. In addition, haplotype network analyses further revealed distinct selective pressures and evolution patterns across these five cattle breeds. Our results provided a glimpse into diverse genomic selection during cattle domestication, breed formation, and recent genetic improvement. These findings will facilitate genome-assisted breeding to improve animal production and health. PMID:25431480

  4. 'Candidatus Competibacter'-lineage genomes retrieved from metagenomes reveal functional metabolic diversity.

    PubMed

    McIlroy, Simon J; Albertsen, Mads; Andresen, Eva K; Saunders, Aaron M; Kristiansen, Rikke; Stokholm-Bjerregaard, Mikkel; Nielsen, Kåre L; Nielsen, Per H

    2014-03-01

    The glycogen-accumulating organism (GAO) 'Candidatus Competibacter' (Competibacter) uses aerobically stored glycogen to enable anaerobic carbon uptake, which is subsequently stored as polyhydroxyalkanoates (PHAs). This biphasic metabolism is key for the Competibacter to survive under the cyclic anaerobic-'feast': aerobic-'famine' regime of enhanced biological phosphorus removal (EBPR) wastewater treatment systems. As they do not contribute to phosphorus (P) removal, but compete for resources with the polyphosphate-accumulating organisms (PAO), thought responsible for P removal, their proliferation theoretically reduces the EBPR capacity. In this study, two complete genomes from Competibacter were obtained from laboratory-scale enrichment reactors through metagenomics. Phylogenetic analysis identified the two genomes, 'Candidatus Competibacter denitrificans' and 'Candidatus Contendobacter odensis', as being affiliated with Competibacter-lineage subgroups 1 and 5, respectively. Both have genes for glycogen and PHA cycling and for the metabolism of volatile fatty acids. Marked differences were found in their potential for the Embden-Meyerhof-Parnas and Entner-Doudoroff glycolytic pathways, as well as for denitrification, nitrogen fixation, fermentation, trehalose synthesis and utilisation of glucose and lactate. Genetic comparison of P metabolism pathways with sequenced PAOs revealed the absence of the Pit phosphate transporter in the Competibacter-lineage genomes--identifying a key metabolic difference with the PAO physiology. These genomes are the first from any GAO organism and provide new insights into the complex interaction and niche competition between PAOs and GAOs in EBPR systems. PMID:24173461

  5. Comparative genomics Lactobacillus reuteri from sourdough reveals adaptation of an intestinal symbiont to food fermentations.

    PubMed

    Zheng, Jinshui; Zhao, Xin; Lin, Xiaoxi B; Gänzle, Michael

    2015-01-01

    Lactobacillus reuteri is a dominant member of intestinal microbiota of vertebrates, and occurs in food fermentations. The stable presence of L. reuteri in sourdough provides the opportunity to study the adaptation of vertebrate symbionts to an extra-intestinal habitat. This study evaluated this adaptation by comparative genomics of 16 strains of L. reuteri. A core genome phylogenetic tree grouped L. reuteri into 5 clusters corresponding to the host-adapted lineages. The topology of a gene content tree, which includes accessory genes, differed from the core genome phylogenetic tree, suggesting that the differentiation of L. reuteri is shaped by gene loss or acquisition. About 10% of the core genome (124 core genes) were under positive selection. In lineage III sourdough isolates, 177 genes were under positive selection, mainly related to energy conversion and carbohydrate metabolism. The analysis of the competitiveness of L. reuteri in sourdough revealed that the competitivess of sourdough isolates was equal or higher when compared to rodent isolates. This study provides new insights into the adaptation of L. reuteri to food and intestinal habitats, suggesting that these two habitats exert different selective pressure related to growth rate and energy (carbohydrate) metabolism. PMID:26658825

  6. Comparative genomics Lactobacillus reuteri from sourdough reveals adaptation of an intestinal symbiont to food fermentations

    PubMed Central

    Zheng, Jinshui; Zhao, Xin; Lin, Xiaoxi B.; Gänzle, Michael

    2015-01-01

    Lactobacillus reuteri is a dominant member of intestinal microbiota of vertebrates, and occurs in food fermentations. The stable presence of L. reuteri in sourdough provides the opportunity to study the adaptation of vertebrate symbionts to an extra-intestinal habitat. This study evaluated this adaptation by comparative genomics of 16 strains of L. reuteri. A core genome phylogenetic tree grouped L. reuteri into 5 clusters corresponding to the host-adapted lineages. The topology of a gene content tree, which includes accessory genes, differed from the core genome phylogenetic tree, suggesting that the differentiation of L. reuteri is shaped by gene loss or acquisition. About 10% of the core genome (124 core genes) were under positive selection. In lineage III sourdough isolates, 177 genes were under positive selection, mainly related to energy conversion and carbohydrate metabolism. The analysis of the competitiveness of L. reuteri in sourdough revealed that the competitivess of sourdough isolates was equal or higher when compared to rodent isolates. This study provides new insights into the adaptation of L. reuteri to food and intestinal habitats, suggesting that these two habitats exert different selective pressure related to growth rate and energy (carbohydrate) metabolism. PMID:26658825

  7. A korarchaeal genome reveals insights into the evolution of the Archaea

    SciTech Connect

    Anderson, Iain J; Elkins, James G.; Podar, Mircea; Graham, David E.; Makarova, Kira S.; Wolf, Yuri; Randau, Lennart; Hedlund, Brian P.; Brochier-Armanet, Celine; Kunin, Victor; Anderson, Iain; Lapidus, Alla; Goltsman, Eugene; Barry, Kerrie; Koonin, Eugene V.; Hugenholtz, Phil; Kyrpides, Nikos; Wanner, Gerhard; Richardson, Paul; Keller, Martin; Stetter, Karl O.

    2008-06-05

    The candidate division Korarchaeota comprises a group of uncultivated microorganisms that, by their small subunit rRNA phylogeny, may have diverged early from the major archaeal phyla Crenarchaeota and Euryarchaeota. Here, we report the initial characterization of a member of the Korarchaeota with the proposed name,"Candidatus Korarchaeum cryptofilum," which exhibits an ultrathin filamentous morphology. To investigate possible ancestral relationships between deep-branching Korarchaeota and other phyla, we used whole-genome shotgun sequencing to construct a complete composite korarchaeal genome from enriched cells. The genome was assembled into a single contig 1.59 Mb in length with a G + C content of 49percent. Of the 1,617 predicted protein-coding genes, 1,382 (85percent) could be assigned to a revised set of archaeal Clusters of Orthologous Groups (COGs). The predicted gene functions suggest that the organism relies on a simple mode of peptide fermentation for carbon and energy and lacks the ability to synthesize de novo purines, CoA, and several other cofactors. Phylogenetic analyses based on conserved single genes and concatenated protein sequences positioned the korarchaeote as a deep archaeal lineage with an apparent affinity to the Crenarchaeota. However, the predicted gene content revealed that several conserved cellular systems, such as cell division, DNA replication, and tRNA maturation, resemble the counterparts in the Euryarchaeota. In light of the known composition of archaeal genomes, the Korarchaeota might have retained a set of cellular features that represents the ancestral archaeal form.

  8. A Korarchael Genome Reveals Insights into the Evolution of the Archaea

    SciTech Connect

    Lapidus, Alla; Elkins, James G.; Podar, Mircea; Graham, David E.; Makarova, Kira S.; Wolf, Yuri; Randau, Lennart; Hedlund, Brian P.; Brochier-Armanet, Celine; Kunin, Victor; Anderson, Iain; Lapidus, Alla; Goltsman, Eugene; Barry, Kerrie; Koonin, Eugene V.; Hugenholtz, Phil; Kyrpides, Nikos; Wanner, Gerhard; Richardson, Paul; Keller, Martin; Stetter, Karl O.

    2008-01-07

    The candidate division Korarchaeota comprises a group of uncultivated microorganisms that, by their small subunit rRNA phylogeny, may have diverged early from the major archaeal phyla Crenarchaeota and Euryarchaeota. Here, we report the initial characterization of a member of the Korarchaeota with the proposed name, ?Candidatus Korarchaeum cryptofilum,? which exhibits an ultrathin filamentous morphology. To investigate possible ancestral relationships between deep-branching Korarchaeota and other phyla, we used whole-genome shotgun sequencing to construct a complete composite korarchaeal genome from enriched cells. The genome was assembled into a single contig 1.59 Mb in length with a G + C content of 49percent. Of the 1,617 predicted protein-coding genes, 1,382 (85percent) could be assigned to a revised set of archaeal Clusters of Orthologous Groups (COGs). The predicted gene functions suggest that the organism relies on a simple mode of peptide fermentation for carbon and energy and lacks the ability to synthesize de novo purines, CoA, and several other cofactors. Phylogenetic analyses based on conserved single genes and concatenated protein sequences positioned the korarchaeote as a deep archaeal lineage with an apparent affinity to the Crenarchaeota. However, the predicted gene content revealed that several conserved cellular systems, such as cell division, DNA replication, and tRNA maturation, resemble the counterparts in the Euryarchaeota. In light of the known composition of archaeal genomes, the Korarchaeota might have retained a set of cellular features that represents the ancestral archaeal form.

  9. Genomic signatures reveal new evidences for selection of important traits in domestic cattle.

    PubMed

    Xu, Lingyang; Bickhart, Derek M; Cole, John B; Schroeder, Steven G; Song, Jiuzhou; Tassell, Curtis P Van; Sonstegard, Tad S; Liu, George E

    2015-03-01

    We investigated diverse genomic selections using high-density single nucleotide polymorphism data of five distinct cattle breeds. Based on allele frequency differences, we detected hundreds of candidate regions under positive selection across Holstein, Angus, Charolais, Brahman, and N'Dama. In addition to well-known genes such as KIT, MC1R, ASIP, GHR, LCORL, NCAPG, WIF1, and ABCA12, we found evidence for a variety of novel and less-known genes under selection in cattle, such as LAP3, SAR1B, LRIG3, FGF5, and NUDCD3. Selective sweeps near LAP3 were then validated by next-generation sequencing. Genome-wide association analysis involving 26,362 Holsteins confirmed that LAP3 and SAR1B were related to milk production traits, suggesting that our candidate regions were likely functional. In addition, haplotype network analyses further revealed distinct selective pressures and evolution patterns across these five cattle breeds. Our results provided a glimpse into diverse genomic selection during cattle domestication, breed formation, and recent genetic improvement. These findings will facilitate genome-assisted breeding to improve animal production and health. PMID:25431480

  10. Infer Metagenomic Abundance and Reveal Homologous Genomes Based on the Structure of Taxonomy Tree.

    PubMed

    Qiu, Yu-Qing; Tian, Xue; Zhang, Shihua

    2015-01-01

    Metagenomic research uses sequencing technologies to investigate the genetic biodiversity of microbiomes presented in various ecosystems or animal tissues. The composition of a microbial community is highly associated with the environment in which the organisms exist. As large amount of sequencing short reads of microorganism genomes obtained, accurately estimating the abundance of microorganisms within a metagenomic sample is becoming an increasing challenge in bioinformatics. In this paper, we describe a hierarchical taxonomy tree-based mixture model (HTTMM) for estimating the abundance of taxon within a microbial community by incorporating the structure of the taxonomy tree. In this model, genome-specific short reads and homologous short reads among genomes can be distinguished and represented by leaf and intermediate nodes in the taxonomy tree, respectively. We adopt an expectation-maximization algorithm to solve this model. Using simulated and real-world data, we demonstrate that the proposed method is superior to both flat mixture model and lowest common ancestry-based methods. Moreover, this model can reveal previously unaddressed homologous genomes. PMID:26451823

  11. Whole genome comparison of a large collection of mycobacteriophages reveals a continuum of phage genetic diversity

    PubMed Central

    Pope, Welkin H; Bowman, Charles A; Russell, Daniel A; Jacobs-Sera, Deborah; Asai, David J; Cresawn, Steven G; Jacobs, William R; Hendrix, Roger W; Lawrence, Jeffrey G; Hatfull, Graham F; Abbazia, Patrick; Ababio, Amma; Adam, Naazneen

    2015-01-01

    The bacteriophage population is large, dynamic, ancient, and genetically diverse. Limited genomic information shows that phage genomes are mosaic, and the genetic architecture of phage populations remains ill-defined. To understand the population structure of phages infecting a single host strain, we isolated, sequenced, and compared 627 phages of Mycobacterium smegmatis. Their genetic diversity is considerable, and there are 28 distinct genomic types (clusters) with related nucleotide sequences. However, amino acid sequence comparisons show pervasive genomic mosaicism, and quantification of inter-cluster and intra-cluster relatedness reveals a continuum of genetic diversity, albeit with uneven representation of different phages. Furthermore, rarefaction analysis shows that the mycobacteriophage population is not closed, and there is a constant influx of genes from other sources. Phage isolation and analysis was performed by a large consortium of academic institutions, illustrating the substantial benefits of a disseminated, structured program involving large numbers of freshman undergraduates in scientific discovery. DOI: http://dx.doi.org/10.7554/eLife.06416.001 PMID:25919952

  12. Single Nucleus Genome Sequencing Reveals High Similarity among Nuclei of an Endomycorrhizal Fungus

    PubMed Central

    Zhang, Zhonghua; Ivanov, Sergey; Saunders, Diane G. O.; Mu, Desheng; Pang, Erli; Cao, Huifen; Cha, Hwangho; Lin, Tao; Zhou, Qian; Shang, Yi; Li, Ying; Sharma, Trupti; van Velzen, Robin; de Ruijter, Norbert; Aanen, Duur K.; Win, Joe; Kamoun, Sophien; Bisseling, Ton; Geurts, René; Huang, Sanwen

    2014-01-01

    Nuclei of arbuscular endomycorrhizal fungi have been described as highly diverse due to their asexual nature and absence of a single cell stage with only one nucleus. This has raised fundamental questions concerning speciation, selection and transmission of the genetic make-up to next generations. Although this concept has become textbook knowledge, it is only based on studying a few loci, including 45S rDNA. To provide a more comprehensive insight into the genetic makeup of arbuscular endomycorrhizal fungi, we applied de novo genome sequencing of individual nuclei of Rhizophagus irregularis. This revealed a surprisingly low level of polymorphism between nuclei. In contrast, within a nucleus, the 45S rDNA repeat unit turned out to be highly diverged. This finding demystifies a long-lasting hypothesis on the complex genetic makeup of arbuscular endomycorrhizal fungi. Subsequent genome assembly resulted in the first draft reference genome sequence of an arbuscular endomycorrhizal fungus. Its length is 141 Mbps, representing over 27,000 protein-coding gene models. We used the genomic sequence to reinvestigate the phylogenetic relationships of Rhizophagus irregularis with other fungal phyla. This unambiguously demonstrated that Glomeromycota are more closely related to Mucoromycotina than to its postulated sister Dikarya. PMID:24415955

  13. Comparative Analysis of 35 Basidiomycete Genomes Reveals Diversity and Uniqueness of the Phylum

    SciTech Connect

    Riley, Robert; Salamov, Asaf; Otillar, Robert; Fagnan, Kirsten; Boussau, Bastien; Brown, Daren; Henrissat, Bernard; Levasseur, Anthony; Held, Benjamin; Nagy, Laszlo; Floudas, Dimitris; Morin, Emmanuelle; Manning, Gerard; Baker, Scott; Martin, Francis; Blanchette, Robert; Hibbett, David; Grigoriev, Igor V.

    2013-03-11

    Fungi of the phylum Basidiomycota (basidiomycetes), make up some 37percent of the described fungi, and are important in forestry, agriculture, medicine, and bioenergy. This diverse phylum includes symbionts, pathogens, and saprobes including wood decaying fungi. To better understand the diversity of this phylum we compared the genomes of 35 basidiomycete fungi including 6 newly sequenced genomes. The genomes of basidiomycetes span extremes of genome size, gene number, and repeat content. A phylogenetic tree of Basidiomycota was generated using the Phyldog software, which uses all available protein sequence data to simultaneously infer gene and species trees. Analysis of core genes reveals that some 48percent of basidiomycete proteins are unique to the phylum with nearly half of those (22percent) comprising proteins found in only one organism. Phylogenetic patterns of plant biomass-degrading genes suggest a continuum rather than a sharp dichotomy between the white rot and brown rot modes of wood decay among the members of Agaricomycotina subphylum. There is a correlation of the profile of certain gene families to nutritional mode in Agaricomycotina. Based on phylogenetically-informed PCA analysis of such profiles, we predict that that Botryobasidium botryosum and Jaapia argillacea have properties similar to white rot species, although neither has liginolytic class II fungal peroxidases. Furthermore, we find that both fungi exhibit wood decay with white rot-like characteristics in growth assays. Analysis of the rate of discovery of proteins with no or few homologs suggests the high value of continued sequencing of basidiomycete fungi.

  14. Comparative genomic analysis of Lactobacillus plantarum ZJ316 reveals its genetic adaptation and potential probiotic profiles* #

    PubMed Central

    Li, Ping; Li, Xuan; Gu, Qing; Lou, Xiu-yu; Zhang, Xiao-mei; Song, Da-feng; Zhang, Chen

    2016-01-01

    Objective: In previous studies, Lactobacillus plantarum ZJ316 showed probiotic properties, such as antimicrobial activity against various pathogens and the capacity to significantly improve pig growth and pork quality. The purpose of this study was to reveal the genes potentially related to its genetic adaptation and probiotic profiles based on comparative genomic analysis. Methods: The genome sequence of L. plantarum ZJ316 was compared with those of eight L. plantarum strains deposited in GenBank. BLASTN, Mauve, and MUMmer programs were used for genome alignment and comparison. CRISPRFinder was applied for searching the clustered regularly interspaced short palindromic repeats (CRISPRs). Results: We identified genes that encode proteins related to genetic adaptation and probiotic profiles, including carbohydrate transport and metabolism, proteolytic enzyme systems and amino acid biosynthesis, CRISPR adaptive immunity, stress responses, bile salt resistance, ability to adhere to the host intestinal wall, exopolysaccharide (EPS) biosynthesis, and bacteriocin biosynthesis. Conclusions: Comparative characterization of the L. plantarum ZJ316 genome provided the genetic basis for further elucidating the functional mechanisms of its probiotic properties. ZJ316 could be considered a potential probiotic candidate. PMID:27487802

  15. A map of rice genome variation reveals the origin of cultivated rice.

    PubMed

    Huang, Xuehui; Kurata, Nori; Wei, Xinghua; Wang, Zi-Xuan; Wang, Ahong; Zhao, Qiang; Zhao, Yan; Liu, Kunyan; Lu, Hengyun; Li, Wenjun; Guo, Yunli; Lu, Yiqi; Zhou, Congcong; Fan, Danlin; Weng, Qijun; Zhu, Chuanrang; Huang, Tao; Zhang, Lei; Wang, Yongchun; Feng, Lei; Furuumi, Hiroyasu; Kubo, Takahiko; Miyabayashi, Toshie; Yuan, Xiaoping; Xu, Qun; Dong, Guojun; Zhan, Qilin; Li, Canyang; Fujiyama, Asao; Toyoda, Atsushi; Lu, Tingting; Feng, Qi; Qian, Qian; Li, Jiayang; Han, Bin

    2012-10-25

    Crop domestications are long-term selection experiments that have greatly advanced human civilization. The domestication of cultivated rice (Oryza sativa L.) ranks as one of the most important developments in history. However, its origins and domestication processes are controversial and have long been debated. Here we generate genome sequences from 446 geographically diverse accessions of the wild rice species Oryza rufipogon, the immediate ancestral progenitor of cultivated rice, and from 1,083 cultivated indica and japonica varieties to construct a comprehensive map of rice genome variation. In the search for signatures of selection, we identify 55 selective sweeps that have occurred during domestication. In-depth analyses of the domestication sweeps and genome-wide patterns reveal that Oryza sativa japonica rice was first domesticated from a specific population of O. rufipogon around the middle area of the Pearl River in southern China, and that Oryza sativa indica rice was subsequently developed from crosses between japonica rice and local wild rice as the initial cultivars spread into South East and South Asia. The domestication-associated traits are analysed through high-resolution genetic mapping. This study provides an important resource for rice breeding and an effective genomics approach for crop domestication research. PMID:23034647

  16. Evolution and phylogeny of the mud shrimps (Crustacea: Decapoda) revealed from complete mitochondrial genomes

    PubMed Central

    2012-01-01

    Background The evolutionary history and relationships of the mud shrimps (Crustacea: Decapoda: Gebiidea and Axiidea) are contentious, with previous attempts revealing mixed results. The mud shrimps were once classified in the infraorder Thalassinidea. Recent molecular phylogenetic analyses, however, suggest separation of the group into two individual infraorders, Gebiidea and Axiidea. Mitochondrial (mt) genome sequence and structure can be especially powerful in resolving higher systematic relationships that may offer new insights into the phylogeny of the mud shrimps and the other decapod infraorders, and test the hypothesis of dividing the mud shrimps into two infraorders. Results We present the complete mitochondrial genome sequences of five mud shrimps, Austinogebia edulis, Upogebia major, Thalassina kelanang (Gebiidea), Nihonotrypaea thermophilus and Neaxius glyptocercus (Axiidea). All five genomes encode a standard set of 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes and a putative control region. Except for T. kelanang, mud shrimp mitochondrial genomes exhibited rearrangements and novel patterns compared to the pancrustacean ground pattern. Each of the two Gebiidea species (A. edulis and U. major) and two Axiidea species (N. glyptocercus and N. thermophiles) share unique gene order specific to their infraorders and analyses further suggest these two derived gene orders have evolved independently. Phylogenetic analyses based on the concatenated nucleotide and amino acid sequences of 13 protein-coding genes indicate the possible polyphyly of mud shrimps, supporting the division of the group into two infraorders. However, the infraordinal relationships among the Gebiidea and Axiidea, and other reptants are poorly resolved. The inclusion of mt genome from more taxa, in particular the reptant infraorders Polychelida and Glypheidea is required in further analysis. Conclusions Phylogenetic analyses on the mt genome sequences and the

  17. Comparative genomic analysis reveals a distant liver enhancer upstream of the COUP-TFII gene

    SciTech Connect

    Baroukh, Nadine; Ahituv, Nadav; Chang, Jessie; Shoukry, Malak; Afzal, Veena; Rubin, Edward M.; Pennacchio, Len A.

    2004-08-20

    COUP-TFII is a central nuclear hormone receptor that tightly regulates the expression of numerous target lipid metabolism genes in vertebrates. However, it remains unclear how COUP-TFII itself is transcriptionally controlled since studies with its promoter and upstream region fail to recapitulate the genes liver expression. In an attempt to identify liver enhancers in the vicinity of COUP-TFII, we employed a comparative genomic approach. Initial comparisons between humans and mice of the 3,470kb gene poor region surrounding COUP-TFII revealed 2,023 conserved non-coding elements. To prioritize a subset of these elements for functional studies, we performed further genomic comparisons with the orthologous pufferfish (Fugu rubripes) locus and uncovered two anciently conserved non-coding sequences (CNS) upstream of COUP-TFII (CNS-62kb and CNS-66kb). Testing these two elements using reporter constructs in liver (HepG2) cells revealed that CNS-66kb, but not CNS-62kb, yielded robust in vitro enhancer activity. In addition, an in vivo reporter assay using naked DNA transfer with CNS-66kb linked to luciferase displayed strong reproducible liver expression in adult mice, further supporting its role as a liver enhancer. Together, these studies further support the utility of comparative genomics to uncover gene regulatory sequences based on evolutionary conservation and provide the substrates to better understand the regulation and expression of COUP-TFII.

  18. Correction: Comparative analysis of fungal genomes reveals different plant cell wall degrading capacity in fungi

    PubMed Central

    2014-01-01

    Abstract The version of this article published in BMC Genomics 2013, 14: 274, contains 9 unpublished genomes (Botryobasidium botryosum, Gymnopus luxurians, Hypholoma sublateritium, Jaapia argillacea, Hebeloma cylindrosporum, Conidiobolus coronatus, Laccaria amethystina, Paxillus involutus, and P. rubicundulus) downloaded from JGI website. In this correction, we removed these genomes after discussion with editors and data producers whom we should have contacted before downloading these genomes. Removing these data did not alter the principle results and conclusions of our original work. The relevant Figures 1, 2, 3, 4 and 6; and Table 1 have been revised. Additional files 1, 3, 4, and 5 were also revised. We would like to apologize for any confusion or inconvenience this may have caused. Background Fungi produce a variety of carbohydrate activity enzymes (CAZymes) for the degradation of plant polysaccharide materials to facilitate infection and/or gain nutrition. Identifying and comparing CAZymes from fungi with different nutritional modes or infection mechanisms may provide information for better understanding of their life styles and infection models. To date, over hundreds of fungal genomes are publicly available. However, a systematic comparative analysis of fungal CAZymes across the entire fungal kingdom has not been reported. Results In this study, we systemically identified glycoside hydrolases (GHs), polysaccharide lyases (PLs), carbohydrate esterases (CEs), and glycosyltransferases (GTs) as well as carbohydrate-binding modules (CBMs) in the predicted proteomes of 94 representative fungi from Ascomycota, Basidiomycota, Chytridiomycota, and Zygomycota. Comparative analysis of these CAZymes that play major roles in plant polysaccharide degradation revealed that fungi exhibit tremendous diversity in the number and variety of CAZymes. Among them, some families of GHs and CEs are the most prevalent CAZymes that are distributed in all of the fungi analyzed

  19. Ancient mitochondrial genome reveals trace of prehistoric migration in the east Pamir by pastoralists.

    PubMed

    Ning, Chao; Gao, Shizhu; Deng, Boping; Zheng, Hongxiang; Wei, Dong; Lv, Haoze; Li, Hongjie; Song, Li; Wu, Yong; Zhou, Hui; Cui, Yinqiu

    2016-02-01

    The complete mitochondrial genome of one 700-year-old individual found in Tashkurgan, Xinjiang was target enriched and sequenced in order to shed light on the population history of Tashkurgan and determine the phylogenetic relationship of haplogroup U5a. The ancient sample was assigned to a subclade of haplogroup U5a2a1, which is defined by two rare and stable transversions at 16114A and 13928C. Phylogenetic analysis shows a distribution pattern for U5a2a that is indicative of an origin in the Volga-Ural region and exhibits a clear eastward geographical expansion that correlates with the pastoral culture also entering the Eurasian steppe. The haplogroup U5a2a present in the ancient Tashkurgan individual reveals prehistoric migration in the East Pamir by pastoralists. This study shows that studying an ancient mitochondrial genome is a useful approach for studying the evolutionary process and population history of Eastern Pamir. PMID:26511065

  20. The Chlamydomonas Genome Reveals the Evolution of Key Animal and Plant Functions

    PubMed Central

    Merchant, Sabeeha S.; Prochnik, Simon E.; Vallon, Olivier; Harris, Elizabeth H.; Karpowicz, Steven J.; Witman, George B.; Terry, Astrid; Salamov, Asaf; Fritz-Laylin, Lillian K.; Maréchal-Drouard, Laurence; Marshall, Wallace F.; Qu, Liang-Hu; Nelson, David R.; Sanderfoot, Anton A.; Spalding, Martin H.; Kapitonov, Vladimir V.; Ren, Qinghu; Ferris, Patrick; Lindquist, Erika; Shapiro, Harris; Lucas, Susan M.; Grimwood, Jane; Schmutz, Jeremy; Cardol, Pierre; Cerutti, Heriberto; Chanfreau, Guillaume; Chen, Chun-Long; Cognat, Valérie; Croft, Martin T.; Dent, Rachel; Dutcher, Susan; Fernández, Emilio; Ferris, Patrick; Fukuzawa, Hideya; González-Ballester, David; González-Halphen, Diego; Hallmann, Armin; Hanikenne, Marc; Hippler, Michael; Inwood, William; Jabbari, Kamel; Kalanon, Ming; Kuras, Richard; Lefebvre, Paul A.; Lemaire, Stéphane D.; Lobanov, Alexey V.; Lohr, Martin; Manuell, Andrea; Meier, Iris; Mets, Laurens; Mittag, Maria; Mittelmeier, Telsa; Moroney, James V.; Moseley, Jeffrey; Napoli, Carolyn; Nedelcu, Aurora M.; Niyogi, Krishna; Novoselov, Sergey V.; Paulsen, Ian T.; Pazour, Greg; Purton, Saul; Ral, Jean-Philippe; Riaño-Pachón, Diego Mauricio; Riekhof, Wayne; Rymarquis, Linda; Schroda, Michael; Stern, David; Umen, James; Willows, Robert; Wilson, Nedra; Zimmer, Sara Lana; Allmer, Jens; Balk, Janneke; Bisova, Katerina; Chen, Chong-Jian; Elias, Marek; Gendler, Karla; Hauser, Charles; Lamb, Mary Rose; Ledford, Heidi; Long, Joanne C.; Minagawa, Jun; Page, M. Dudley; Pan, Junmin; Pootakham, Wirulda; Roje, Sanja; Rose, Annkatrin; Stahlberg, Eric; Terauchi, Aimee M.; Yang, Pinfen; Ball, Steven; Bowler, Chris; Dieckmann, Carol L.; Gladyshev, Vadim N.; Green, Pamela; Jorgensen, Richard; Mayfield, Stephen; Mueller-Roeber, Bernd; Rajamani, Sathish; Sayre, Richard T.; Brokstein, Peter; Dubchak, Inna; Goodstein, David; Hornick, Leila; Huang, Y. Wayne; Jhaveri, Jinal; Luo, Yigong; Martínez, Diego; Ngau, Wing Chi Abby; Otillar, Bobby; Poliakov, Alexander; Porter, Aaron; Szajkowski, Lukasz; Werner, Gregory; Zhou, Kemin; Grigoriev, Igor V.; Rokhsar, Daniel S.; Grossman, Arthur R.

    2010-01-01

    Chlamydomonas reinhardtii is a unicellular green alga whose lineage diverged from land plants over 1 billion years ago. It is a model system for studying chloroplast-based photosynthesis, as well as the structure, assembly, and function of eukaryotic flagella (cilia), which were inherited from the common ancestor of plants and animals, but lost in land plants. We sequenced the ∼120-megabase nuclear genome of Chlamydomonas and performed comparative phylogenomic analyses, identifying genes encoding uncharacterized proteins that are likely associated with the function and biogenesis of chloroplasts or eukaryotic flagella. Analyses of the Chlamydomonas genome advance our understanding of the ancestral eukaryotic cell, reveal previously unknown genes associated with photosynthetic and flagellar functions, and establish links between ciliopathy and the composition and function of flagella. PMID:17932292

  1. Oil Accumulation by the Oleaginous Diatom Fistulifera solaris as Revealed by the Genome and Transcriptome

    PubMed Central

    Veluchamy, Alaguraj; Tanaka, Michihiro; Abida, Heni; Maréchal, Eric; Bowler, Chris; Muto, Masaki; Sunaga, Yoshihiko; Tanaka, Masayoshi; Taniguchi, Takeaki; Fukuda, Yorikane; Nemoto, Michiko; Matsumoto, Mitsufumi; Wong, Pui Shan; Aburatani, Sachiyo; Fujibuchi, Wataru

    2015-01-01

    Oleaginous photosynthetic organisms such as microalgae are promising sources for biofuel production through the generation of carbon-neutral sustainable energy. However, the metabolic mechanisms driving high-rate lipid production in these oleaginous organisms remain unclear, thus impeding efforts to improve productivity through genetic modifications. We analyzed the genome and transcriptome of the oleaginous diatom Fistulifera solaris JPCC DA0580. Next-generation sequencing technology provided evidence of an allodiploid genome structure, suggesting unorthodox molecular evolutionary and genetic regulatory systems for reinforcing metabolic efficiencies. Although major metabolic pathways were shared with nonoleaginous diatoms, transcriptome analysis revealed unique expression patterns, such as concomitant upregulation of fatty acid/triacylglycerol biosynthesis and fatty acid degradation (β-oxidation) in concert with ATP production. This peculiar pattern of gene expression may account for the simultaneous growth and oil accumulation phenotype and may inspire novel biofuel production technology based on this oleaginous microalga. PMID:25634988

  2. The Chlamydomonas Genome Reveals the Evolution of Key Animal and Plant Functions

    SciTech Connect

    Merchant, Sabeeha S

    2007-04-09

    Chlamydomonas reinhardtii is a unicellular green alga whose lineage diverged from land plants over 1 billion years ago. It is a model system for studying chloroplast-based photosynthesis, as well as the structure, assembly, and function of eukaryotic flagella (cilia), which were inherited from the common ancestor of plants and animals, but lost in land plants. We sequenced the 120-megabase nuclear genome of Chlamydomonas and performed comparative phylogenomic analyses, identifying genes encoding uncharacterized proteins that are likely associated with the function and biogenesis of chloroplasts or eukaryotic flagella. Analyses of the Chlamydomonas genome advance our understanding of the ancestral eukaryotic cell, reveal previously unknown genes associated with photosynthetic and flagellar functions, and establish links between ciliopathy and the composition and function of flagella.

  3. Bifidobacterium asteroides PRL2011 Genome Analysis Reveals Clues for Colonization of the Insect Gut

    PubMed Central

    Bottacini, Francesca; Milani, Christian; Turroni, Francesca; Sánchez, Borja; Foroni, Elena; Duranti, Sabrina; Serafini, Fausta; Viappiani, Alice; Strati, Francesco; Ferrarini, Alberto; Delledonne, Massimo; Henrissat, Bernard; Coutinho, Pedro; Fitzgerald, Gerald F.; Margolles, Abelardo; van Sinderen, Douwe; Ventura, Marco

    2012-01-01

    Bifidobacteria are known as anaerobic/microaerophilic and fermentative microorganisms, which commonly inhabit the gastrointestinal tract of various animals and insects. Analysis of the 2,167,301 bp genome of Bifidobacterium asteroides PRL2011, a strain isolated from the hindgut of Apis mellifera var. ligustica, commonly known as the honey bee, revealed its predicted capability for respiratory metabolism. Conservation of the latter gene clusters in various B. asteroides strains enforces the notion that respiration is a common metabolic feature of this ancient bifidobacterial species, which has been lost in currently known mammal-derived Bifidobacterium species. In fact, phylogenomic based analyses suggested an ancient origin of B. asteroides and indicates it as an ancestor of the genus Bifidobacterium. Furthermore, the B. asteroides PRL2011 genome encodes various enzymes for coping with toxic products that arise as a result of oxygen-mediated respiration. PMID:23028506

  4. Whole-genome sequence comparisons reveal the evolution of Vibrio cholerae O1.

    PubMed

    Kim, Eun Jin; Lee, Chan Hee; Nair, G Balakrish; Kim, Dong Wook

    2015-08-01

    The analysis of the whole-genome sequences of Vibrio cholerae strains from previous and current cholera pandemics has demonstrated that genomic changes and alterations in phage CTX (particularly in the gene encoding the B subunit of cholera toxin) were major features in the evolution of V. cholerae. Recent studies have revealed the genetic mechanisms in these bacteria by which new variants of V. cholerae are generated from type-specific strains; these mechanisms suggest that certain strains are selected by environmental or human factors over time. By understanding the mechanisms and driving forces of historical and current changes in the V. cholerae population, it would be possible to predict the direction of such changes and the evolution of new variants; this has implications for the battle against cholera. PMID:25913612

  5. The complete genome sequence of Chromobacterium violaceum reveals remarkable and exploitable bacterial adaptability

    PubMed Central

    2003-01-01

    Chromobacterium violaceum is one of millions of species of free-living microorganisms that populate the soil and water in the extant areas of tropical biodiversity around the world. Its complete genome sequence reveals (i) extensive alternative pathways for energy generation, (ii) ≈500 ORFs for transport-related proteins, (iii) complex and extensive systems for stress adaptation and motility, and (iv) widespread utilization of quorum sensing for control of inducible systems, all of which underpin the versatility and adaptability of the organism. The genome also contains extensive but incomplete arrays of ORFs coding for proteins associated with mammalian pathogenicity, possibly involved in the occasional but often fatal cases of human C. violaceum infection. There is, in addition, a series of previously unknown but important enzymes and secondary metabolites including paraquat-inducible proteins, drug and heavy-metal-resistance proteins, multiple chitinases, and proteins for the detoxification of xenobiotics that may have biotechnological applications. PMID:14500782

  6. Genomes of cryptic chimpanzee Plasmodium species reveal key evolutionary events leading to human malaria

    PubMed Central

    Sundararaman, Sesh A.; Plenderleith, Lindsey J.; Liu, Weimin; Loy, Dorothy E.; Learn, Gerald H.; Li, Yingying; Shaw, Katharina S.; Ayouba, Ahidjo; Peeters, Martine; Speede, Sheri; Shaw, George M.; Bushman, Frederic D.; Brisson, Dustin; Rayner, Julian C.; Sharp, Paul M.; Hahn, Beatrice H.

    2016-01-01

    African apes harbour at least six Plasmodium species of the subgenus Laverania, one of which gave rise to human Plasmodium falciparum. Here we use a selective amplification strategy to sequence the genome of chimpanzee parasites classified as Plasmodium reichenowi and Plasmodium gaboni based on the subgenomic fragments. Genome-wide analyses show that these parasites indeed represent distinct species, with no evidence of cross-species mating. Both P. reichenowi and P. gaboni are 10-fold more diverse than P. falciparum, indicating a very recent origin of the human parasite. We also find a remarkable Laverania-specific expansion of a multigene family involved in erythrocyte remodelling, and show that a short region on chromosome 4, which encodes two essential invasion genes, was horizontally transferred into a recent P. falciparum ancestor. Our results validate the selective amplification strategy for characterizing cryptic pathogen species, and reveal evolutionary events that likely predisposed the precursor of P. falciparum to colonize humans. PMID:27002652

  7. Genomic analysis of hybrid rice varieties reveals numerous superior alleles that contribute to heterosis

    PubMed Central

    Huang, Xuehui; Yang, Shihua; Gong, Junyi; Zhao, Yan; Feng, Qi; Gong, Hao; Li, Wenjun; Zhan, Qilin; Cheng, Benyi; Xia, Junhui; Chen, Neng; Hao, Zhongna; Liu, Kunyan; Zhu, Chuanrang; Huang, Tao; Zhao, Qiang; Zhang, Lei; Fan, Danlin; Zhou, Congcong; Lu, Yiqi; Weng, Qijun; Wang, Zi-Xuan; Li, Jiayang; Han, Bin

    2015-01-01

    Exploitation of heterosis is one of the most important applications of genetics in agriculture. However, the genetic mechanisms of heterosis are only partly understood, and a global view of heterosis from a representative number of hybrid combinations is lacking. Here we develop an integrated genomic approach to construct a genome map for 1,495 elite hybrid rice varieties and their inbred parental lines. We investigate 38 agronomic traits and identify 130 associated loci. In-depth analyses of the effects of heterozygous genotypes reveal that there are only a few loci with strong overdominance effects in hybrids, but a strong correlation is observed between the yield and the number of superior alleles. While most parental inbred lines have only a small number of superior alleles, high-yielding hybrid varieties have several. We conclude that the accumulation of numerous rare superior alleles with positive dominance is an important contributor to the heterotic phenomena. PMID:25651972

  8. Genomes of cryptic chimpanzee Plasmodium species reveal key evolutionary events leading to human malaria.

    PubMed

    Sundararaman, Sesh A; Plenderleith, Lindsey J; Liu, Weimin; Loy, Dorothy E; Learn, Gerald H; Li, Yingying; Shaw, Katharina S; Ayouba, Ahidjo; Peeters, Martine; Speede, Sheri; Shaw, George M; Bushman, Frederic D; Brisson, Dustin; Rayner, Julian C; Sharp, Paul M; Hahn, Beatrice H

    2016-01-01

    African apes harbour at least six Plasmodium species of the subgenus Laverania, one of which gave rise to human Plasmodium falciparum. Here we use a selective amplification strategy to sequence the genome of chimpanzee parasites classified as Plasmodium reichenowi and Plasmodium gaboni based on the subgenomic fragments. Genome-wide analyses show that these parasites indeed represent distinct species, with no evidence of cross-species mating. Both P. reichenowi and P. gaboni are 10-fold more diverse than P. falciparum, indicating a very recent origin of the human parasite. We also find a remarkable Laverania-specific expansion of a multigene family involved in erythrocyte remodelling, and show that a short region on chromosome 4, which encodes two essential invasion genes, was horizontally transferred into a recent P. falciparum ancestor. Our results validate the selective amplification strategy for characterizing cryptic pathogen species, and reveal evolutionary events that likely predisposed the precursor of P. falciparum to colonize humans. PMID:27002652

  9. Conditional Epistatic Interaction Maps Reveal Global Functional Rewiring of Genome Integrity Pathways in Escherichia coli.

    PubMed

    Kumar, Ashwani; Beloglazova, Natalia; Bundalovic-Torma, Cedoljub; Phanse, Sadhna; Deineko, Viktor; Gagarinova, Alla; Musso, Gabriel; Vlasblom, James; Lemak, Sofia; Hooshyar, Mohsen; Minic, Zoran; Wagih, Omar; Mosca, Roberto; Aloy, Patrick; Golshani, Ashkan; Parkinson, John; Emili, Andrew; Yakunin, Alexander F; Babu, Mohan

    2016-01-26

    As antibiotic resistance is increasingly becoming a public health concern, an improved understanding of the bacterial DNA damage response (DDR), which is commonly targeted by antibiotics, could be of tremendous therapeutic value. Although the genetic components of the bacterial DDR have been studied extensively in isolation, how the underlying biological pathways interact functionally remains unclear. Here, we address this by performing systematic, unbiased, quantitative synthetic genetic interaction (GI) screens and uncover widespread changes in the GI network of the entire genomic integrity apparatus of Escherichia coli under standard and DNA-damaging growth conditions. The GI patterns of untreated cultures implicated two previously uncharacterized proteins (YhbQ and YqgF) as nucleases, whereas reorganization of the GI network after DNA damage revealed DDR roles for both annotated and uncharacterized genes. Analyses of pan-bacterial conservation patterns suggest that DDR mechanisms and functional relationships are near universal, highlighting a modular and highly adaptive genomic stress response. PMID:26774489

  10. Genome mining of ascomycetous fungi reveals their genetic potential for ergot alkaloid production.

    PubMed

    Gerhards, Nina; Matuschek, Marco; Wallwey, Christiane; Li, Shu-Ming

    2015-06-01

    Ergot alkaloids are important as mycotoxins or as drugs. Naturally occurring ergot alkaloids as well as their semisynthetic derivatives have been used as pharmaceuticals in modern medicine for decades. We identified 196 putative ergot alkaloid biosynthetic genes belonging to at least 31 putative gene clusters in 31 fungal species by genome mining of the 360 available genome sequences of ascomycetous fungi with known proteins. Detailed analysis showed that these fungi belong to the families Aspergillaceae, Clavicipitaceae, Arthrodermataceae, Helotiaceae and Thermoascaceae. Within the identified families, only a small number of taxa are represented. Literature search revealed a large diversity of ergot alkaloid structures in different fungi of the phylum Ascomycota. However, ergot alkaloid accumulation was only observed in 15 of the sequenced species. Therefore, this study provides genetic basis for further study on ergot alkaloid production in the sequenced strains. PMID:25796201

  11. Effect of long real space flight on the whole genome mRNA expression properties in medaka Oryzias latipes

    NASA Astrophysics Data System (ADS)

    Kozlova, Olga; Gusev, Oleg; Levinskikh, Margarita; Sychev, Vladimir; Poddubko, Svetlana

    The current study is addressed to the complex analysis of whole genome mRNA expression profile and properties of splicing variants formation in different organs of medaka fish exposed to prolonged space flight in the frame of joint Russia-Japan research program “Aquarium-AQH”. The fish were kept in the AQH joint-aquariums system in October-December 2013, followed by fixation in RNA-preserving buffers and freezing during the space flight. The samples we returned to the Earth frozen in March 2013 and mRNAs from four fish were sequenced in organ-specific manner using HiSeq Illumina sequencing platform. The ground group fish treated in the same way was used as a control. The comparison between the groups revealed space group-specific specific mRNA expression pattern. More than 50 genes (including several types of myosins) were down-regulated in the space group. Moreover, we found an evidence for formation of space group-specific splicing variants of mRNA. Taking together, the data suggest that in spite of aquatic environment, space flight-associated factors have a strong effect on the activity of fish genome. This work was supported in part by subsidy of the Russian Government to support the Program of competitive growth of Kazan Federal University among world class academic centres and universities.

  12. Evidence of codon usage in the nearest neighbor spacing distribution of bases in bacterial genomes

    NASA Astrophysics Data System (ADS)

    Higareda, M. F.; Geiger, O.; Mendoza, L.; Méndez-Sánchez, R. A.

    2012-02-01

    Statistical analysis of whole genomic sequences usually assumes a homogeneous nucleotide density throughout the genome, an assumption that has been proved incorrect for several organisms since the nucleotide density is only locally homogeneous. To avoid giving a single numerical value to this variable property, we propose the use of spectral statistics, which characterizes the density of nucleotides as a function of its position in the genome. We show that the cumulative density of bases in bacterial genomes can be separated into an average (or secular) plus a fluctuating part. Bacterial genomes can be divided into two groups according to the qualitative description of their secular part: linear and piecewise linear. These two groups of genomes show different properties when their nucleotide spacing distribution is studied. In order to analyze genomes having a variable nucleotide density, statistically, the use of unfolding is necessary, i.e., to get a separation between the secular part and the fluctuations. The unfolding allows an adequate comparison with the statistical properties of other genomes. With this methodology, four genomes were analyzed Burkholderia, Bacillus, Clostridium and Corynebacterium. Interestingly, the nearest neighbor spacing distributions or detrended distance distributions are very similar for species within the same genus but they are very different for species from different genera. This difference can be attributed to the difference in the codon usage.

  13. Comparative Genome Analysis Reveals Metabolic Versatility and Environmental Adaptations of Sulfobacillus thermosulfidooxidans Strain ST

    PubMed Central

    Guo, Xue; Yin, Huaqun; Liang, Yili; Hu, Qi; Zhou, Xishu; Xiao, Yunhua; Ma, Liyuan; Zhang, Xian; Qiu, Guanzhou; Liu, Xueduan

    2014-01-01

    The genus Sulfobacillus is a cohort of mildly thermophilic or thermotolerant acidophiles within the phylum Firmicutes and requires extremely acidic environments and hypersalinity for optimal growth. However, our understanding of them is still preliminary partly because few genome sequences are available. Here, the draft genome of Sulfobacillus thermosulfidooxidans strain ST was deciphered to obtain a comprehensive insight into the genetic content and to understand the cellular mechanisms necessary for its survival. Furthermore, the expressions of key genes related with iron and sulfur oxidation were verified by semi-quantitative RT-PCR analysis. The draft genome sequence of Sulfobacillus thermosulfidooxidans strain ST, which encodes 3225 predicted coding genes on a total length of 3,333,554 bp and a 48.35% G+C, revealed the high degree of heterogeneity with other Sulfobacillus species. The presence of numerous transposases, genomic islands and complete CRISPR/Cas defence systems testifies to its dynamic evolution consistent with the genome heterogeneity. As expected, S. thermosulfidooxidans encodes a suit of conserved enzymes required for the oxidation of inorganic sulfur compounds (ISCs). The model of sulfur oxidation in S. thermosulfidooxidans was proposed, which showed some different characteristics from the sulfur oxidation of Gram-negative A. ferrooxidans. Sulfur oxygenase reductase and heterodisulfide reductase were suggested to play important roles in the sulfur oxidation. Although the iron oxidation ability was observed, some key proteins cannot be identified in S. thermosulfidooxidans. Unexpectedly, a predicted sulfocyanin is proposed to transfer electrons in the iron oxidation. Furthermore, its carbon metabolism is rather flexible, can perform the transformation of pentose through the oxidative and non-oxidative pentose phosphate pathways and has the ability to take up small organic compounds. It encodes a multitude of heavy metal resistance systems to

  14. Complete genomes reveal signatures of demographic and genetic declines in the woolly mammoth

    PubMed Central

    Palkopoulou, Eleftheria; Mallick, Swapan; Skoglund, Pontus; Enk, Jacob; Rohland, Nadin; Li, Heng; Omrak, Ayça; Vartanyan, Sergey; Poinar, Hendrik; Götherström, Anders; Reich, David; Dalén, Love

    2015-01-01

    Summary The processes leading up to species extinctions are typically characterized by prolonged declines in population size and geographic distribution, followed by a phase in which populations are very small and may be subject to intrinsic threats, including loss of genetic diversity and inbreeding [1]. However, whether such genetic factors have had an impact on species prior to their extinction is unclear [2, 3]; examining this would require a detailed reconstruction of a species’ demographic history as well as changes in genome-wide diversity leading up to its extinction. Here, we present high-quality complete genome sequences from two woolly mammoths (Mammuthus primigenius). The first mammoth was sequenced at 17.1-fold coverage, and dates to ~4,300 years before present, constituting one of the last surviving individuals on Wrangel Island. The second mammoth, sequenced at 11.2-fold coverage, was obtained from a ~44,800 year old specimen from the Late Pleistocene population in northeastern Siberia. The demographic trajectories inferred from the two genomes are qualitatively similar and reveal a population bottleneck during the Middle or Early Pleistocene, and a more recent severe decline in the ancestors of the Wrangel mammoth at the end of the last glaciation. A comparison of the two genomes shows that the Wrangel mammoth has a 20% reduction in heterozygosity as well as a 28-fold increase in the fraction of the genome that is comprised of runs of homozygosity. We conclude that the population on Wrangel Island, which was the last surviving woolly mammoth population, was subject to reduced genetic diversity shortly before it became extinct. PMID:25913407

  15. Whole mitochondrial genome sequencing of domestic horses reveals incorporation of extensive wild horse diversity during domestication

    PubMed Central

    2011-01-01

    Background DNA target enrichment by micro-array capture combined with high throughput sequencing technologies provides the possibility to obtain large amounts of sequence data (e.g. whole mitochondrial DNA genomes) from multiple individuals at relatively low costs. Previously, whole mitochondrial genome data for domestic horses (Equus caballus) were limited to only a few specimens and only short parts of the mtDNA genome (especially the hypervariable region) were investigated for larger sample sets. Results In this study we investigated whole mitochondrial genomes of 59 domestic horses from 44 breeds and a single Przewalski horse (Equus przewalski) using a recently described multiplex micro-array capture approach. We found 473 variable positions within the domestic horses, 292 of which are parsimony-informative, providing a well resolved phylogenetic tree. Our divergence time estimate suggests that the mitochondrial genomes of modern horse breeds shared a common ancestor around 93,000 years ago and no later than 38,000 years ago. A Bayesian skyline plot (BSP) reveals a significant population expansion beginning 6,000-8,000 years ago with an ongoing exponential growth until the present, similar to other domestic animal species. Our data further suggest that a large sample of wild horse diversity was incorporated into the domestic population; specifically, at least 46 of the mtDNA lineages observed in domestic horses (73%) already existed before the beginning of domestication about 5,000 years ago. Conclusions Our study provides a window into the maternal origins of extant domestic horses and confirms that modern domestic breeds present a wide sample of the mtDNA diversity found in ancestral, now extinct, wild horse populations. The data obtained allow us to detect a population expansion event coinciding with the beginning of domestication and to estimate both the minimum number of female horses incorporated into the domestic gene pool and the time depth of the

  16. Complete genomes reveal signatures of demographic and genetic declines in the woolly mammoth.

    PubMed

    Palkopoulou, Eleftheria; Mallick, Swapan; Skoglund, Pontus; Enk, Jacob; Rohland, Nadin; Li, Heng; Omrak, Ayça; Vartanyan, Sergey; Poinar, Hendrik; Götherström, Anders; Reich, David; Dalén, Love

    2015-05-18

    The processes leading up to species extinctions are typically characterized by prolonged declines in population size and geographic distribution, followed by a phase in which populations are very small and may be subject to intrinsic threats, including loss of genetic diversity and inbreeding. However, whether such genetic factors have had an impact on species prior to their extinction is unclear; examining this would require a detailed reconstruction of a species' demographic history as well as changes in genome-wide diversity leading up to its extinction. Here, we present high-quality complete genome sequences from two woolly mammoths (Mammuthus primigenius). The first mammoth was sequenced at 17.1-fold coverage and dates to ∼4,300 years before present, representing one of the last surviving individuals on Wrangel Island. The second mammoth, sequenced at 11.2-fold coverage, was obtained from an ∼44,800-year-old specimen from the Late Pleistocene population in northeastern Siberia. The demographic trajectories inferred from the two genomes are qualitatively similar and reveal a population bottleneck during the Middle or Early Pleistocene, and a more recent severe decline in the ancestors of the Wrangel mammoth at the end of the last glaciation. A comparison of the two genomes shows that the Wrangel mammoth has a 20% reduction in heterozygosity as well as a 28-fold increase in the fraction of the genome that comprises runs of homozygosity. We conclude that the population on Wrangel Island, which was the last surviving woolly mammoth population, was subject to reduced genetic diversity shortly before it became extinct. PMID:25913407

  17. The Complete Genome Sequence of Fibrobacter succinogenes S85 Reveals a Cellulolytic and Metabolic Specialist

    PubMed Central

    Suen, Garret; Weimer, Paul J.; Stevenson, David M.; Aylward, Frank O.; Boyum, Julie; Deneke, Jan; Drinkwater, Colleen; Ivanova, Natalia N.; Mikhailova, Natalia; Chertkov, Olga; Goodwin, Lynne A.; Currie, Cameron R.; Mead, David; Brumm, Phillip J.

    2011-01-01

    Fibrobacter succinogenes is an important member of the rumen microbial community that converts plant biomass into nutrients usable by its host. This bacterium, which is also one of only two cultivated species in its phylum, is an efficient and prolific degrader of cellulose. Specifically, it has a particularly high activity against crystalline cellulose that requires close physical contact with this substrate. However, unlike other known cellulolytic microbes, it does not degrade cellulose using a cellulosome or by producing high extracellular titers of cellulase enzymes. To better understand the biology of F. succinogenes, we sequenced the genome of the type strain S85 to completion. A total of 3,085 open reading frames were predicted from its 3.84 Mbp genome. Analysis of sequences predicted to encode for carbohydrate-degrading enzymes revealed an unusually high number of genes that were classified into 49 different families of glycoside hydrolases, carbohydrate binding modules (CBMs), carbohydrate esterases, and polysaccharide lyases. Of the 31 identified cellulases, none contain CBMs in families 1, 2, and 3, typically associated with crystalline cellulose degradation. Polysaccharide hydrolysis and utilization assays showed that F. succinogenes was able to hydrolyze a number of polysaccharides, but could only utilize the hydrolytic products of cellulose. This suggests that F. succinogenes uses its array of hemicellulose-degrading enzymes to remove hemicelluloses to gain access to cellulose. This is reflected in its genome, as F. succinogenes lacks many of the genes necessary to transport and metabolize the hydrolytic products of non-cellulose polysaccharides. The F. succinogenes genome reveals a bacterium that specializes in cellulose as its sole energy source, and provides insight into a novel strategy for cellulose degradation. PMID:21526192

  18. Analysis of virus genomes from glacial environments reveals novel virus groups with unusual host interactions

    PubMed Central

    Bellas, Christopher M.; Anesio, Alexandre M.; Barker, Gary

    2015-01-01

    Microbial communities in glacial ecosystems are diverse, active, and subjected to strong viral pressures and infection rates. In this study we analyse putative virus genomes assembled from three dsDNA viromes from cryoconite hole ecosystems of Svalbard and the Greenland Ice Sheet to assess the potential hosts and functional role viruses play in these habitats. We assembled 208 million reads from the virus-size fraction and developed a procedure to select genuine virus scaffolds from cellular contamination. Our curated virus library contained 546 scaffolds up to 230 Kb in length, 54 of which were circular virus consensus genomes. Analysis of virus marker genes revealed a wide range of viruses had been assembled, including bacteriophages, cyanophages, nucleocytoplasmic large DNA viruses and a virophage, with putative hosts identified as Cyanobacteria, Alphaproteobacteria, Gammaproteobacteria, Actinobacteria, Firmicutes, eukaryotic algae and amoebae. Whole genome comparisons revealed the majority of circular genome scaffolds (CGS) formed 12 novel groups, two of which contained multiple phage members with plasmid-like properties, including a group of phage-plasmids possessing plasmid-like partition genes and toxin-antitoxin addiction modules to ensure their replication and a satellite phage-plasmid group. Surprisingly we also assembled a phage that not only encoded plasmid partition genes, but a clustered regularly interspaced short palindromic repeat (CRISPR)/Cas adaptive bacterial immune system. One of the spacers was an exact match for another phage in our virome, indicating that in a novel use of the system, the lysogen was potentially capable of conferring immunity on its bacterial host against other phage. Together these results suggest that highly novel and diverse groups of viruses are present in glacial environments, some of which utilize very unusual life strategies and genes to control their replication and maintain a long-term relationship with their hosts

  19. Comparative Whole-Genome Analysis of Clinical Isolates Reveals Characteristic Architecture of Mycobacterium tuberculosis Pangenome

    PubMed Central

    Periwal, Vinita; Patowary, Ashok; Vellarikkal, Shamsudheen Karuthedath; Gupta, Anju; Singh, Meghna; Mittal, Ashish; Jeyapaul, Shamini; Chauhan, Rajendra Kumar; Singh, Ajay Vir; Singh, Pravin Kumar; Garg, Parul; Katoch, Viswa Mohan; Katoch, Kiran; Chauhan, Devendra Singh; Sivasubbu, Sridhar; Scaria, Vinod

    2015-01-01

    The tubercle complex consists of closely related mycobacterium species which appear to be variants of a single species. Comparative genome analysis of different strains could provide useful clues and insights into the genetic diversity of the species. We integrated genome assemblies of 96 strains from Mycobacterium tuberculosis complex (MTBC), which included 8 Indian clinical isolates sequenced and assembled in this study, to understand its pangenome architecture. We predicted genes for all the 96 strains and clustered their respective CDSs into homologous gene clusters (HGCs) to reveal a hard-core, soft-core and accessory genome component of MTBC. The hard-core (HGCs shared amongst 100% of the strains) was comprised of 2,066 gene clusters whereas the soft-core (HGCs shared amongst at least 95% of the strains) comprised of 3,374 gene clusters. The change in the core and accessory genome components when observed as a function of their size revealed that MTBC has an open pangenome. We identified 74 HGCs that were absent from reference strains H37Rv and H37Ra but were present in most of clinical isolates. We report PCR validation on 9 candidate genes depicting 7 genes completely absent from H37Rv and H37Ra whereas 2 genes shared partial homology with them accounting to probable insertion and deletion events. The pangenome approach is a promising tool for studying strain specific genetic differences occurring within species. We also suggest that since selecting appropriate target genes for typing purposes requires the expected target gene be present in all isolates being typed, therefore estimating the core-component of the species becomes a subject of prime importance. PMID:25853708

  20. In Depth Characterization of Repetitive DNA in 23 Plant Genomes Reveals Sources of Genome Size Variation in the Legume Tribe Fabeae

    PubMed Central

    Macas, Jiří; Novák, Petr; Pellicer, Jaume; Čížková, Jana; Koblížková, Andrea; Neumann, Pavel; Fuková, Iva; Doležel, Jaroslav; Kelly, Laura J.; Leitch, Ilia J.

    2015-01-01

    The differential accumulation and elimination of repetitive DNA are key drivers of genome size variation in flowering plants, yet there have been few studies which have analysed how different types of repeats in related species contribute to genome size evolution within a phylogenetic context. This question is addressed here by conducting large-scale comparative analysis of repeats in 23 species from four genera of the monophyletic legume tribe Fabeae, representing a 7.6-fold variation in genome size. Phylogenetic analysis and genome size reconstruction revealed that this diversity arose from genome size expansions and contractions in different lineages during the evolution of Fabeae. Employing a combination of low-pass genome sequencing with novel bioinformatic approaches resulted in identification and quantification of repeats making up 55–83% of the investigated genomes. In turn, this enabled an analysis of how each major repeat type contributed to the genome size variation encountered. Differential accumulation of repetitive DNA was found to account for 85% of the genome size differences between the species, and most (57%) of this variation was found to be driven by a single lineage of Ty3/gypsy LTR-retrotransposons, the Ogre elements. Although the amounts of several other lineages of LTR-retrotransposons and the total amount of satellite DNA were also positively correlated with genome size, their contributions to genome size variation were much smaller (up to 6%). Repeat analysis within a phylogenetic framework also revealed profound differences in the extent of sequence conservation between different repeat types across Fabeae. In addition to these findings, the study has provided a proof of concept for the approach combining recent developments in sequencing and bioinformatics to perform comparative analyses of repetitive DNAs in a large number of non-model species without the need to assemble their genomes. PMID:26606051

  1. Exonic remnants of whole-genome duplication reveal cis-regulatory function of coding exons

    PubMed Central

    Dong, Xianjun; Navratilova, Pavla; Fredman, David; Drivenes, Øyvind; Becker, Thomas S.; Lenhard, Boris

    2010-01-01

    Using a comparative genomics approach to reconstruct the fate of genomic regulatory blocks (GRBs) and identify exonic remnants that have survived the disappearance of their host genes after whole-genome duplication (WGD) in teleosts, we discover a set of 38 candidate cis-regulatory coding exons (RCEs) with predicted target genes. These elements demonstrate evolutionary separation of overlapping protein-coding and regulatory information after WGD in teleosts. We present evidence that the corresponding mammalian exons are still under both coding and non-coding selection pressure, are more conserved than other protein coding exons in the host gene and several control sets, and share key characteristics with highly conserved non-coding elements in the same regions. Their dual function is corroborated by existing experimental data. Additionally, we show examples of human exon remnants stemming from the vertebrate 2R WGD. Our findings suggest that long-range cis-regulatory inputs for developmental genes are not limited to non-coding regions, but can also overlap the coding sequence of unrelated genes. Thus, exonic regulatory elements in GRBs might be functionally equivalent to those in non-coding regions, calling for a re-evaluation of the sequence space in which to look for long-range regulatory elements and experimentally test their activity. PMID:19969543

  2. Scanning the landscape of genome architecture of non-O1 and non-O139 Vibrio cholerae by whole genome mapping reveals extensive population genetic diversity

    DOE PAGESBeta

    Chapman, Carol; Henry, Matthew; Bishop-Lilly, Kimberly A.; Awosika, Joy; Briska, Adam; Ptashkin, Ryan N.; Wagner, Trevor; Rajanna, Chythanya; Tsang, Hsinyi; Johnson, Shannon L.; et al

    2015-03-20

    Historically, cholera outbreaks have been linked to V. cholerae O1 serogroup strains or its derivatives of the O37 and O139 serogroups. A genomic study on the 2010 Haiti cholera outbreak strains highlighted the putative role of non O1/non-O139 V. cholerae in causing cholera and the lack of genomic sequences of such strains from around the world. Here we address these gaps by scanning a global collection of V. cholerae strains as a first step towards understanding the population genetic diversity and epidemic potential of non O1/non-O139 strains. Whole Genome Mapping (Optical Mapping) based bar coding produces a high resolution, orderedmore » restriction map, depicting a complete view of the unique chromosomal architecture of an organism. To assess the genomic diversity of non-O1/non-O139 V. cholerae, we applied a Whole Genome Mapping strategy on a well-defined and geographically and temporally diverse strain collection, the Sakazaki serogroup type strains. Whole Genome Map data on 91 of the 206 serogroup type strains support the hypothesis that V. cholerae has an unprecedented genetic and genomic structural diversity. Interestingly, we discovered chromosomal fusions in two unusual strains that possess a single chromosome instead of the two chromosomes usually found in V. cholerae. We also found pervasive chromosomal rearrangements such as duplications and indels in many strains. The majority of Vibrio genome sequences currently in public databases are unfinished draft sequences. The Whole Genome Mapping approach presented here enables rapid screening of large strain collections to capture genomic complexities that would not have been otherwise revealed by unfinished draft genome sequencing and thus aids in assembling and finishing draft sequences of complex genomes. Furthermore, Whole Genome Mapping allows for prediction of novel V. cholerae non-O1/non-O139 strains that may have the potential to cause future cholera outbreaks.« less

  3. Scanning the Landscape of Genome Architecture of Non-O1 and Non-O139 Vibrio cholerae by Whole Genome Mapping Reveals Extensive Population Genetic Diversity

    PubMed Central

    Awosika, Joy; Briska, Adam; Ptashkin, Ryan N.; Wagner, Trevor; Rajanna, Chythanya; Tsang, Hsinyi; Johnson, Shannon L.; Mokashi, Vishwesh P.; Chain, Patrick S. G.; Sozhamannan, Shanmuga

    2015-01-01

    Historically, cholera outbreaks have been linked to V. cholerae O1 serogroup strains or its derivatives of the O37 and O139 serogroups. A genomic study on the 2010 Haiti cholera outbreak strains highlighted the putative role of non O1/non-O139 V. cholerae in causing cholera and the lack of genomic sequences of such strains from around the world. Here we address these gaps by scanning a global collection of V. cholerae strains as a first step towards understanding the population genetic diversity and epidemic potential of non O1/non-O139 strains. Whole Genome Mapping (Optical Mapping) based bar coding produces a high resolution, ordered restriction map, depicting a complete view of the unique chromosomal architecture of an organism. To assess the genomic diversity of non-O1/non-O139 V. cholerae, we applied a Whole Genome Mapping strategy on a well-defined and geographically and temporally diverse strain collection, the Sakazaki serogroup type strains. Whole Genome Map data on 91 of the 206 serogroup type strains support the hypothesis that V. cholerae has an unprecedented genetic and genomic structural diversity. Interestingly, we discovered chromosomal fusions in two unusual strains that possess a single chromosome instead of the two chromosomes usually found in V. cholerae. We also found pervasive chromosomal rearrangements such as duplications and indels in many strains. The majority of Vibrio genome sequences currently in public databases are unfinished draft sequences. The Whole Genome Mapping approach presented here enables rapid screening of large strain collections to capture genomic complexities that would not have been otherwise revealed by unfinished draft genome sequencing and thus aids in assembling and finishing draft sequences of complex genomes. Furthermore, Whole Genome Mapping allows for prediction of novel V. cholerae non-O1/non-O139 strains that may have the potential to cause future cholera outbreaks. PMID:25794000

  4. Comparative genomics of four closely related Clostridium perfringens bacteriophages reveals variable rates of evolution within a core genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background: Biotechnological uses of bacteriophage gene products as alternatives to conventional antibiotics will require a thorough understanding of their genomic context. We sequenced and analyzed the genomes of four closely related phages isolated from Clostridium perfringens, an important agricu...

  5. Reconstruction of the ancestral plastid genome in Geraniaceae reveals a correlation between genome rearrangements, repeats, and nucleotide substitution rates.

    PubMed

    Weng, Mao-Lun; Blazier, John C; Govindu, Madhumita; Jansen, Robert K

    2014-03-01

    Geraniaceae plastid genomes are highly rearranged, and each of the four genera already sequenced in the family has a distinct genome organization. This study reports plastid genome sequences of six additional species, Francoa sonchifolia, Melianthus villosus, and Viviania marifolia from Geraniales, and Pelargonium alternans, California macrophylla, and Hypseocharis bilobata from Geraniaceae. These genome sequences, combined with previously published species, provide sufficient taxon sampling to reconstruct the ancestral plastid genome organization of Geraniaceae and the rearrangements unique to each genus. The ancestral plastid genome of Geraniaceae has a 4 kb inversion and a reduced, Pelargonium-like small single copy region. Our ancestral genome reconstruction suggests that a few minor rearrangements occurred in the stem branch of Geraniaceae followed by independent rearrangements in each genus. The genomic comparison demonstrates that a series of inverted repeat boundary shifts and inversions played a major role in shaping genome organization in the family. The distribution of repeats is strongly associated with breakpoints in the rearranged genomes, and the proportion and the number of large repeats (>20 bp and >60 bp) are significantly correlated with the degree of genome rearrangements. Increases in the degree of plastid genome rearrangements are correlated with the acceleration in nonsynonymous substitution rates (dN) but not with synonymous substitution rates (dS). Possible mechanisms that might contribute to this correlation, including DNA repair system and selection, are discussed. PMID:24336877

  6. Space stress and genome shock in developing plant cells

    NASA Technical Reports Server (NTRS)

    Krikorian, A. D.

    1996-01-01

    In the present paper I review symptoms of stress at the level of the nucleus in cells of plants grown in space under nonoptimized conditions. It remains to be disclosed to what extent gravity "unloading" in the space environment directly contributes to the low mitotic index and the chromosomal anomalies and damage that is frequently, but not invariably, demonstrable in space-grown plants. Evaluation of the available facts indicates that indirect effects play a major role and that there is a significant biological component to the susceptibility to stress damage equation as well. Much remains to be learned on how to provide strictly controlled, optimal environments for plant growth in space. Only after optimized controls become possible will one be able to attribute any observed space effects to lowered gravity or to other significant but more indirect effects of the space environment.

  7. Bioactivity-guided genome mining reveals the lomaiviticin biosynthetic gene cluster in Salinispora tropica

    PubMed Central

    Kersten, Roland D.; Lane, Amy L.; Nett, Markus; Richter, Taylor K. S.; Duggan, Brendan M.; Dorrestein, Pieter C.

    2013-01-01

    The use of genome sequences has become routine in guiding the discovery and identification of microbial natural products and their biosynthetic pathways. In silico prediction of molecular features, such as metabolic building blocks, physico-chemical properties or biological functions, from orphan gene clusters has opened up the characterization of many new chemo- and genotypes in genome mining approaches. Here, we guided our genome mining of two predicted enediyne pathways in Salinispora tropica CNB-440 by a DNA interference bioassay to isolate DNA-targeting enediyne polyketides. An organic extract of S. tropica showed DNA-interference activity that surprisingly was not abolished in genetic mutants of the targeted enediyne pathways, ST_pks1 and spo. Instead we showed that the product of the orphan type II polyketide synthase pathway, ST_pks2, is solely responsible for the DNA-interfering activity of the parent strain. Subsequent comparative metabolic profiling revealed the lomaiviticins, glycosylated diazofluorene polyketides, as the ST_pks2 products. This study marks the first report of the 59 open reading frame lomaiviticin gene cluster (lom) and supports the biochemical logic of their dimeric construction via a pathway related to the kinamycin monomer. PMID:23649992

  8. The genome of the seagrass Zostera marina reveals angiosperm adaptation to the sea.

    PubMed

    Olsen, Jeanine L; Rouzé, Pierre; Verhelst, Bram; Lin, Yao-Cheng; Bayer, Till; Collen, Jonas; Dattolo, Emanuela; De Paoli, Emanuele; Dittami, Simon; Maumus, Florian; Michel, Gurvan; Kersting, Anna; Lauritano, Chiara; Lohaus, Rolf; Töpel, Mats; Tonon, Thierry; Vanneste, Kevin; Amirebrahimi, Mojgan; Brakel, Janina; Boström, Christoffer; Chovatia, Mansi; Grimwood, Jane; Jenkins, Jerry W; Jueterbock, Alexander; Mraz, Amy; Stam, Wytze T; Tice, Hope; Bornberg-Bauer, Erich; Green, Pamela J; Pearson, Gareth A; Procaccini, Gabriele; Duarte, Carlos M; Schmutz, Jeremy; Reusch, Thorsten B H; Van de Peer, Yves

    2016-02-18

    Seagrasses colonized the sea on at least three independent occasions to form the basis of one of the most productive and widespread coastal ecosystems on the planet. Here we report the genome of Zostera marina (L.), the first, to our knowledge, marine angiosperm to be fully sequenced. This reveals unique insights into the genomic losses and gains involved in achieving the structural and physiological adaptations required for its marine lifestyle, arguably the most severe habitat shift ever accomplished by flowering plants. Key angiosperm innovations that were lost include the entire repertoire of stomatal genes, genes involved in the synthesis of terpenoids and ethylene signalling, and genes for ultraviolet protection and phytochromes for far-red sensing. Seagrasses have also regained functions enabling them to adjust to full salinity. Their cell walls contain all of the polysaccharides typical of land plants, but also contain polyanionic, low-methylated pectins and sulfated galactans, a feature shared with the cell walls of all macroalgae and that is important for ion homoeostasis, nutrient uptake and O2/CO2 exchange through leaf epidermal cells. The Z. marina genome resource will markedly advance a wide range of functional ecological studies from adaptation of marine ecosystems under climate warming, to unravelling the mechanisms of osmoregulation under high salinities that may further inform our understanding of the evolution of salt tolerance in crop plants. PMID:26814964

  9. Breeding signatures of rice improvement revealed by a genomic variation map from a large germplasm collection

    PubMed Central

    Xie, Weibo; Wang, Gongwei; Yuan, Meng; Yao, Wen; Lyu, Kai; Zhao, Hu; Yang, Meng; Li, Pingbo; Zhang, Xing; Yuan, Jing; Wang, Quanxiu; Liu, Fang; Dong, Huaxia; Zhang, Lejing; Li, Xinglei; Meng, Xiangzhou; Zhang, Wan; Xiong, Lizhong; He, Yuqing; Wang, Shiping; Yu, Sibin; Xu, Caiguo; Luo, Jie; Li, Xianghua; Xiao, Jinghua; Lian, Xingming; Zhang, Qifa

    2015-01-01

    Intensive rice breeding over the past 50 y has dramatically increased productivity especially in the indica subspecies, but our knowledge of the genomic changes associated with such improvement has been limited. In this study, we analyzed low-coverage sequencing data of 1,479 rice accessions from 73 countries, including landraces and modern cultivars. We identified two major subpopulations, indica I (IndI) and indica II (IndII), in the indica subspecies, which corresponded to the two putative heterotic groups resulting from independent breeding efforts. We detected 200 regions spanning 7.8% of the rice genome that had been differentially selected between IndI and IndII, and thus referred to as breeding signatures. These regions included large numbers of known functional genes and loci associated with important agronomic traits revealed by genome-wide association studies. Grain yield was positively correlated with the number of breeding signatures in a variety, suggesting that the number of breeding signatures in a line may be useful for predicting agronomic potential and the selected loci may provide targets for rice improvement. PMID:26358652

  10. The draft genome of Tibetan hulless barley reveals adaptive patterns to the high stressful Tibetan Plateau

    PubMed Central

    Zeng, Xingquan; Long, Hai; Wang, Zhuo; Zhao, Shancen; Tang, Yawei; Huang, Zhiyong; Wang, Yulin; Xu, Qijun; Mao, Likai; Deng, Guangbing; Yao, Xiaoming; Li, Xiangfeng; Bai, Lijun; Yuan, Hongjun; Pan, Zhifen; Liu, Renjian; Chen, Xin; WangMu, QiMei; Chen, Ming; Yu, Lili; Liang, Junjun; DunZhu, DaWa; Zheng, Yuan; Yu, Shuiyang; LuoBu, ZhaXi; Guang, Xuanmin; Li, Jiang; Deng, Cao; Hu, Wushu; Chen, Chunhai; TaBa, XiongNu; Gao, Liyun; Lv, Xiaodan; Abu, Yuval Ben; Fang, Xiaodong; Nevo, Eviatar; Yu, Maoqun; Wang, Jun; Tashi, Nyima

    2015-01-01

    The Tibetan hulless barley (Hordeum vulgare L. var. nudum), also called “Qingke” in Chinese and “Ne” in Tibetan, is the staple food for Tibetans and an important livestock feed in the Tibetan Plateau. The diploid nature and adaptation to diverse environments of the highland give it unique resources for genetic research and crop improvement. Here we produced a 3.89-Gb draft assembly of Tibetan hulless barley with 36,151 predicted protein-coding genes. Comparative analyses revealed the divergence times and synteny between barley and other representative Poaceae genomes. The expansion of the gene family related to stress responses was found in Tibetan hulless barley. Resequencing of 10 barley accessions uncovered high levels of genetic variation in Tibetan wild barley and genetic divergence between Tibetan and non-Tibetan barley genomes. Selective sweep analyses demonstrate adaptive correlations of genes under selection with extensive environmental variables. Our results not only construct a genomic framework for crop improvement but also provide evolutionary insights of highland adaptation of Tibetan hulless barley. PMID:25583503