Sample records for high-quality draft sequence

  1. Draft versus finished sequence data for DNA and protein diagnostic signature development

    PubMed Central

    Gardner, Shea N.; Lam, Marisa W.; Smith, Jason R.; Torres, Clinton L.; Slezak, Tom R.

    2005-01-01

    Sequencing pathogen genomes is costly, demanding careful allocation of limited sequencing resources. We built a computational Sequencing Analysis Pipeline (SAP) to guide decisions regarding the amount of genomic sequencing necessary to develop high-quality diagnostic DNA and protein signatures. SAP uses simulations to estimate the number of target genomes and close phylogenetic relatives (near neighbors or NNs) to sequence. We use SAP to assess whether draft data are sufficient or finished sequencing is required using Marburg and variola virus sequences. Simulations indicate that intermediate to high-quality draft with error rates of 10−3–10−5 (∼8× coverage) of target organisms is suitable for DNA signature prediction. Low-quality draft with error rates of ∼1% (3× to 6× coverage) of target isolates is inadequate for DNA signature prediction, although low-quality draft of NNs is sufficient, as long as the target genomes are of high quality. For protein signature prediction, sequencing errors in target genomes substantially reduce the detection of amino acid sequence conservation, even if the draft is of high quality. In summary, high-quality draft of target and low-quality draft of NNs appears to be a cost-effective investment for DNA signature prediction, but may lead to underestimation of predicted protein signatures. PMID:16243783

  2. First High-Quality Draft Genome Sequence of Pasteurella multocida Sequence Type 128 Isolated from Infected Bone.

    PubMed

    Kavousi, Niloofar; Eng, Wilhelm Wei Han; Lee, Yin Peng; Tan, Lian Huat; Thuraisingham, Ravindran; Yule, Catherine M; Gan, Han Ming

    2016-03-03

    We report here the first high-quality draft genome sequence of Pasteurella multocida sequence type 128, which was isolated from the infected finger bone of an adult female who was bitten by a domestic dog. The draft genome will be a valuable addition to the scarce genomic resources available for P. multocida. Copyright © 2016 Kavousi et al.

  3. Permanent Improved High-Quality Draft Genome Sequence of Nocardia casuarinae Strain BMG51109, an Endophyte of Actinorhizal Root Nodules of Casuarina glauca

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ghodhbane-Gtari, Faten; Beauchemin, Nicholas; Louati, Moussa

    Here, we report the first genome sequence of a Nocardia plant endophyte, N. casuarinae strain BMG51109, isolated from Casuarina glauca root nodules. The improved high-quality draft genome sequence contains 8,787,999 bp with a 68.90% GC content and 7,307 predicted protein-coding genes.

  4. Permanent Improved High-Quality Draft Genome Sequence of Nocardia casuarinae Strain BMG51109, an Endophyte of Actinorhizal Root Nodules of Casuarina glauca

    DOE PAGES

    Ghodhbane-Gtari, Faten; Beauchemin, Nicholas; Louati, Moussa; ...

    2016-08-04

    Here, we report the first genome sequence of a Nocardia plant endophyte, N. casuarinae strain BMG51109, isolated from Casuarina glauca root nodules. The improved high-quality draft genome sequence contains 8,787,999 bp with a 68.90% GC content and 7,307 predicted protein-coding genes.

  5. High-quality permanent draft genome sequence of Bradyrhizobium sp. Th.b2, a microsymbiont of Amphicarpaea bracteata collected in Johnson City, New York

    DOE PAGES

    Tian, Rui; Parker, Matthew; Seshadri, Rekha; ...

    2015-05-16

    Bradyrhizobium sp. Th.b2 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen-fixing root nodule of Amphicarpaea bracteata collected in Johnson City, New York. Here we describe the features of Bradyrhizobium sp. Th.b2, together with high-quality permanent draft genome sequence information and annotation. The 10,118,060 high-quality draft genome is arranged in 266 scaffolds of 274 contigs, contains 9,809 protein-coding genes and 108 RNA-only encoding genes. In conclusion, this rhizobial genome was sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.

  6. High-quality permanent draft genome sequence of Bradyrhizobium sp. Th.b2, a microsymbiont of Amphicarpaea bracteata collected in Johnson City, New York

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tian, Rui; Parker, Matthew; Seshadri, Rekha

    Bradyrhizobium sp. Th.b2 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen-fixing root nodule of Amphicarpaea bracteata collected in Johnson City, New York. Here we describe the features of Bradyrhizobium sp. Th.b2, together with high-quality permanent draft genome sequence information and annotation. The 10,118,060 high-quality draft genome is arranged in 266 scaffolds of 274 contigs, contains 9,809 protein-coding genes and 108 RNA-only encoding genes. In conclusion, this rhizobial genome was sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.

  7. High-quality permanent draft genome sequence of Bradyrhizobium sp. Tv2a.2, a microsymbiont of Tachigali versicolor discovered in Barro Colorado Island of Panama

    DOE PAGES

    Tian, Rui; Parker, Matthew; Seshadri, Rekha; ...

    2015-05-17

    Bradyrhizobiumsp. Tv2a.2 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen-fixing root nodule of Tachigali versicolor collected in Barro Colorado Island of Panama. Here we describe the features of Bradyrhizobiumsp. Tv2a.2, together with high-quality permanent draft genome sequence information and annotation. The 8,496,279 bp high-quality draft genome is arranged in 87 scaffolds of 87 contigs, contains 8,109 protein-coding genes and 72 RNA-only encoding genes. In conclusion, this rhizobial genome was sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.

  8. Venturia carpophila draft genome sequence

    USDA-ARS?s Scientific Manuscript database

    Venturia carpophila causes peach scab, a disease that renders peach fruit unmarketable. We report a high-quality draft genome sequence (36.9 Mb) of V. carpophila from an isolate collected from a peach tree in central Georgia in the United States. The genome sequence described will be a useful resour...

  9. High-Quality Draft Genome Sequences of Four Lignocellulose-Degrading Bacteria Isolated from Puerto Rican Forest Soil: Gordonia sp., Paenibacillus sp., Variovorax sp., and Vogesella sp.

    DOE PAGES

    Woo, Hannah L.; DeAngelis, Kristen M.; Teshima, Hazuki; ...

    2017-05-04

    In this paper, we report the high-quality draft genome sequences of four phylogenetically diverse lignocellulose-degrading bacteria isolated from tropical soil ( Gordonia sp., Paenibacillus sp., Variovorax sp., and Vogesella sp.) to elucidate the genetic basis of their ability to degrade lignocellulose. These isolates may provide novel enzymes for biofuel production.

  10. High-Quality Draft Genome Sequences of Four Lignocellulose-Degrading Bacteria Isolated from Puerto Rican Forest Soil: Gordonia sp., Paenibacillus sp., Variovorax sp., and Vogesella sp.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Woo, Hannah L.; DeAngelis, Kristen M.; Teshima, Hazuki

    In this paper, we report the high-quality draft genome sequences of four phylogenetically diverse lignocellulose-degrading bacteria isolated from tropical soil ( Gordonia sp., Paenibacillus sp., Variovorax sp., and Vogesella sp.) to elucidate the genetic basis of their ability to degrade lignocellulose. These isolates may provide novel enzymes for biofuel production.

  11. Resequencing of the common marmoset genome improves genome assemblies and gene-coding sequence analysis.

    PubMed

    Sato, Kengo; Kuroki, Yoko; Kumita, Wakako; Fujiyama, Asao; Toyoda, Atsushi; Kawai, Jun; Iriki, Atsushi; Sasaki, Erika; Okano, Hideyuki; Sakakibara, Yasubumi

    2015-11-20

    The first draft of the common marmoset (Callithrix jacchus) genome was published by the Marmoset Genome Sequencing and Analysis Consortium. The draft was based on whole-genome shotgun sequencing, and the current assembly version is Callithrix_jacches-3.2.1, but there still exist 187,214 undetermined gap regions and supercontigs and relatively short contigs that are unmapped to chromosomes in the draft genome. We performed resequencing and assembly of the genome of common marmoset by deep sequencing with high-throughput sequencing technology. Several different sequence runs using Illumina sequencing platforms were executed, and 181 Gbp of high-quality bases including mate-pairs with long insert lengths of 3, 8, 20, and 40 Kbp were obtained, that is, approximately 60× coverage. The resequencing significantly improved the MGSAC draft genome sequence. The N50 of the contigs, which is a statistical measure used to evaluate assembly quality, doubled. As a result, 51% of the contigs (total length: 299 Mbp) that were unmapped to chromosomes in the MGSAC draft were merged with chromosomal contigs, and the improved genome sequence helped to detect 5,288 new genes that are homologous to human cDNAs and the gaps in 5,187 transcripts of the Ensembl gene annotations were completely filled.

  12. Improved High-Quality Draft Genome Sequence and Annotation of Burkholderia contaminans LMG 23361T.

    PubMed

    Jung, Ji Young; Ahn, Youngbeom; Kweon, Ohgew; LiPuma, John J; Hussong, David; Marasa, Bernard S; Cerniglia, Carl E

    2017-04-20

    Burkholderia contaminans LMG 23361 is the type strain of the species isolated from the milk of a dairy sheep with mastitis. Some pharmaceutical products contain disinfectants such as benzalkonium chloride (BZK) and previously we reported that B. contaminans LMG 23361 T possesses the ability to inactivate BZK with high biodegradation rates. Here, we report an improved high-quality draft genome sequence of this strain. Copyright © 2017 Jung et al.

  13. High-Quality Draft Genome Sequence of Candida apicola NRRL Y-50540

    PubMed Central

    Vega-Alvarado, Leticia; Gómez-Angulo, Jorge; Escalante-García, Zazil; Grande, Ricardo; Gschaedler-Mathis, Anne; Amaya-Delgado, Lorena

    2015-01-01

    Candida apicola, a highly osmotolerant ascomycetes yeast, produces sophorolipids (biosurfactants), membrane fatty acids, and enzymes of biotechnological interest. The genome obtained has a high-quality draft for this species and can be used as a reference to perform further analyses, such as differential gene expression in yeast from Candida genera. PMID:26067948

  14. Draft genome sequence of Venturia carpophila, the causal agent of peach scab

    USDA-ARS?s Scientific Manuscript database

    Venturia carpophila causes peach scab, a disease that renders peach fruit unmarketable. We report a high-quality draft genome sequence (36.9 Mb) of V. carpophila from an isolate collected from a peach tree in central Georgia in the United States. The genome sequence described will be a useful resour...

  15. High-Quality Draft Genome Sequence of Babesia divergens, the Etiological Agent of Cattle and Human Babesiosis

    PubMed Central

    Cuesta, Isabel; González, Luis M.; Estrada, Karel; Grande, Ricardo; Zaballos, Ángel; Lobo, Cheryl A.; Barrera, Jorge

    2014-01-01

    Babesia divergens causes significant morbidity and mortality in cattle and splenectomized or immunocompromised individuals. Here, we present a 10.7-Mb high-quality draft genome of this parasite close to chromosome resolution that will enable comparative genome analyses and synteny studies among related parasites. PMID:25395649

  16. Use of low-coverage, large-insert, short-read data for rapid and accurate generation of enhanced-quality draft Pseudomonas genome sequences.

    PubMed

    O'Brien, Heath E; Gong, Yunchen; Fung, Pauline; Wang, Pauline W; Guttman, David S

    2011-01-01

    Next-generation genomic technology has both greatly accelerated the pace of genome research as well as increased our reliance on draft genome sequences. While groups such as the Genomics Standards Consortium have made strong efforts to promote genome standards there is a still a general lack of uniformity among published draft genomes, leading to challenges for downstream comparative analyses. This lack of uniformity is a particular problem when using standard draft genomes that frequently have large numbers of low-quality sequencing tracts. Here we present a proposal for an "enhanced-quality draft" genome that identifies at least 95% of the coding sequences, thereby effectively providing a full accounting of the genic component of the genome. Enhanced-quality draft genomes are easily attainable through a combination of small- and large-insert next-generation, paired-end sequencing. We illustrate the generation of an enhanced-quality draft genome by re-sequencing the plant pathogenic bacterium Pseudomonas syringae pv. phaseolicola 1448A (Pph 1448A), which has a published, closed genome sequence of 5.93 Mbp. We use a combination of Illumina paired-end and mate-pair sequencing, and surprisingly find that de novo assemblies with 100x paired-end coverage and mate-pair sequencing with as low as low as 2-5x coverage are substantially better than assemblies based on higher coverage. The rapid and low-cost generation of large numbers of enhanced-quality draft genome sequences will be of particular value for microbial diagnostics and biosecurity, which rely on precise discrimination of potentially dangerous clones from closely related benign strains.

  17. Draft Genome sequence of Frankia sp. Strain QA3, a nitrogen-fixing actinobacterium isolated from the root nodule of Alnus nitida

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sen, Arnab; Beauchemin, Nicholas; Bruce, David

    Members of actinomycete genus Frankia form a nitrogen-fixing symbiosis with 8 different families of actinorhizal plants. We report a high-quality draft genome sequence for Frankia sp. stain QA3, a nitrogen-fixing actinobacterium isolated from root nodules of Alnus nitida.

  18. The fast changing landscape of sequencing technologies and their impact on microbial genome assemblies and annotation.

    PubMed

    Mavromatis, Konstantinos; Land, Miriam L; Brettin, Thomas S; Quest, Daniel J; Copeland, Alex; Clum, Alicia; Goodwin, Lynne; Woyke, Tanja; Lapidus, Alla; Klenk, Hans Peter; Cottingham, Robert W; Kyrpides, Nikos C

    2012-01-01

    The emergence of next generation sequencing (NGS) has provided the means for rapid and high throughput sequencing and data generation at low cost, while concomitantly creating a new set of challenges. The number of available assembled microbial genomes continues to grow rapidly and their quality reflects the quality of the sequencing technology used, but also of the analysis software employed for assembly and annotation. In this work, we have explored the quality of the microbial draft genomes across various sequencing technologies. We have compared the draft and finished assemblies of 133 microbial genomes sequenced at the Department of Energy-Joint Genome Institute and finished at the Los Alamos National Laboratory using a variety of combinations of sequencing technologies, reflecting the transition of the institute from Sanger-based sequencing platforms to NGS platforms. The quality of the public assemblies and of the associated gene annotations was evaluated using various metrics. Results obtained with the different sequencing technologies, as well as their effects on downstream processes, were analyzed. Our results demonstrate that the Illumina HiSeq 2000 sequencing system, the primary sequencing technology currently used for de novo genome sequencing and assembly at JGI, has various advantages in terms of total sequence throughput and cost, but it also introduces challenges for the downstream analyses. In all cases assembly results although on average are of high quality, need to be viewed critically and consider sources of errors in them prior to analysis. These data follow the evolution of microbial sequencing and downstream processing at the JGI from draft genome sequences with large gaps corresponding to missing genes of significant biological role to assemblies with multiple small gaps (Illumina) and finally to assemblies that generate almost complete genomes (Illumina+PacBio).

  19. High-quality permanent draft genome sequence of the Parapiptadenia rigida-nodulating Cupriavidus sp. strain UYPR2.512

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    De Meyer, Sofie E.; Fabiano, Elena; Tian, Rui

    Cupriavidus sp. strain UYPR2.512 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from a root nodule of Parapiptadenia rigida grown in soils from a native forest of Uruguay. Here we describe the features of Cupriavidus sp. strain UYPR2.512, together with sequence and annotation. We find the 7,858,949 bp high-quality permanent draft genome is arranged in 365 scaffolds of 369 contigs, contains 7,411 protein-coding genes and 76 RNA-only encoding genes, and is part of the GEBA-RNB project proposal.

  20. High-quality permanent draft genome sequence of the Parapiptadenia rigida-nodulating Cupriavidus sp. strain UYPR2.512

    DOE PAGES

    De Meyer, Sofie E.; Fabiano, Elena; Tian, Rui; ...

    2015-04-11

    Cupriavidus sp. strain UYPR2.512 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from a root nodule of Parapiptadenia rigida grown in soils from a native forest of Uruguay. Here we describe the features of Cupriavidus sp. strain UYPR2.512, together with sequence and annotation. We find the 7,858,949 bp high-quality permanent draft genome is arranged in 365 scaffolds of 369 contigs, contains 7,411 protein-coding genes and 76 RNA-only encoding genes, and is part of the GEBA-RNB project proposal.

  1. Improved High-Quality Draft Genome Sequence of the Eurypsychrophile Rhodotorula sp. JG1b, Isolated from Permafrost in the Hyperarid Upper-Elevation McMurdo Dry Valleys, Antarctica

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Goordial, Jacqueline; Raymond-Bouchard, Isabelle; Riley, Robert

    Here, we report the draft genome sequence of Rhodotorula sp. strain JG1b, a yeast that was isolated from ice-cemented permafrost in the upper-elevation McMurdo Dry Valleys, Antarctica. The sequenced genome size is 19.39 Mb, consisting of 156 scaffolds and containing a total of 5,625 predicted genes. This is the first known cold-adapted Rhodotorula sp. sequenced to date.

  2. Improved High-Quality Draft Genome Sequence of the Eurypsychrophile Rhodotorula sp. JG1b, Isolated from Permafrost in the Hyperarid Upper-Elevation McMurdo Dry Valleys, Antarctica

    DOE PAGES

    Goordial, Jacqueline; Raymond-Bouchard, Isabelle; Riley, Robert; ...

    2016-03-17

    Here, we report the draft genome sequence of Rhodotorula sp. strain JG1b, a yeast that was isolated from ice-cemented permafrost in the upper-elevation McMurdo Dry Valleys, Antarctica. The sequenced genome size is 19.39 Mb, consisting of 156 scaffolds and containing a total of 5,625 predicted genes. This is the first known cold-adapted Rhodotorula sp. sequenced to date.

  3. Exploiting long read sequencing technologies to establish high quality highly contiguous pig reference genome assemblies

    USDA-ARS?s Scientific Manuscript database

    The current pig reference genome sequence (Sscrofa10.2) was established using Sanger sequencing and following the clone-by-clone hierarchical shotgun sequencing approach used in the public human genome project. However, as sequence coverage was low (4-6x) the resulting assembly was only of draft qua...

  4. Draft Genome Sequence of Telmatospirillum siberiense 26-4b1, an Acidotolerant Peatland Alphaproteobacterium Potentially Involved in Sulfur Cycling

    PubMed Central

    Schreck, Katharina; Herbold, Craig W.; Daims, Holger; Wagner, Michael; Loy, Alexander

    2018-01-01

    ABSTRACT The facultative anaerobic chemoorganoheterotrophic alphaproteobacterium Telmatospirillum siberiense 26-4b1 was isolated from a Siberian peatland. We report here a 6.20-Mbp near-complete high-quality draft genome sequence of T. siberiense that reveals expected and novel metabolic potential for the genus Telmatospirillum, including genes for sulfur oxidation. PMID:29371357

  5. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tian, Rui; Parker, Matthew; Seshadri, Rekha

    Bradyrhizobiumsp. Tv2a.2 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen-fixing root nodule of Tachigali versicolor collected in Barro Colorado Island of Panama. Here we describe the features of Bradyrhizobiumsp. Tv2a.2, together with high-quality permanent draft genome sequence information and annotation. The 8,496,279 bp high-quality draft genome is arranged in 87 scaffolds of 87 contigs, contains 8,109 protein-coding genes and 72 RNA-only encoding genes. In conclusion, this rhizobial genome was sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.

  6. Draft genome sequence of the marine bacterium Streptomyces griseoaurantiacus M045, which produces novel manumycin-type antibiotics with a pABA core component.

    PubMed

    Li, Fuchao; Jiang, Peng; Zheng, Huajun; Wang, Shengyue; Zhao, Guoping; Qin, Song; Liu, Zhaopu

    2011-07-01

    Streptomyces griseoaurantiacus M045, isolated from marine sediment, produces manumycin and chinikomycin antibiotics. Here we present a high-quality draft genome sequence of S. griseoaurantiacus M045, the first marine Streptomyces species to be sequenced and annotated. The genome encodes several gene clusters for biosynthesis of secondary metabolites and has provided insight into genomic islands linking secondary metabolism to functional adaptation in marine S. griseoaurantiacus M045.

  7. Draft Genome Sequence of Streptomyces clavuligerus NRRL 3585, a Producer of Diverse Secondary Metabolites▿

    PubMed Central

    Song, Ju Yeon; Jeong, Haeyoung; Yu, Dong Su; Fischbach, Michael A.; Park, Hong-Seog; Kim, Jae Jong; Seo, Jeong-Sun; Jensen, Susan E.; Oh, Tae Kwang; Lee, Kye Joon; Kim, Jihyun F.

    2010-01-01

    Streptomyces clavuligerus is an important industrial strain that produces a number of antibiotics, including clavulanic acid and cephamycin C. A high-quality draft genome sequence of the S. clavuligerus NRRL 3585 strain was produced by employing a hybrid approach that involved Sanger sequencing, Roche/454 pyrosequencing, optical mapping, and partial finishing. Its genome, comprising four linear replicons, one chromosome, and four plasmids, carries numerous sets of genes involved in the biosynthesis of secondary metabolites, including a variety of antibiotics. PMID:20889745

  8. Draft Genome Sequence of Telmatospirillum siberiense 26-4b1, an Acidotolerant Peatland Alphaproteobacterium Potentially Involved in Sulfur Cycling.

    PubMed

    Hausmann, Bela; Pjevac, Petra; Schreck, Katharina; Herbold, Craig W; Daims, Holger; Wagner, Michael; Loy, Alexander

    2018-01-25

    The facultative anaerobic chemoorganoheterotrophic alphaproteobacterium Telmatospirillum siberiense 26-4b1 was isolated from a Siberian peatland. We report here a 6.20-Mbp near-complete high-quality draft genome sequence of T. siberiense that reveals expected and novel metabolic potential for the genus Telmatospirillum , including genes for sulfur oxidation. Copyright © 2018 Hausmann et al.

  9. MBGD update 2015: microbial genome database for flexible ortholog analysis utilizing a diverse set of genomic data.

    PubMed

    Uchiyama, Ikuo; Mihara, Motohiro; Nishide, Hiroyo; Chiba, Hirokazu

    2015-01-01

    The microbial genome database for comparative analysis (MBGD) (available at http://mbgd.genome.ad.jp/) is a comprehensive ortholog database for flexible comparative analysis of microbial genomes, where the users are allowed to create an ortholog table among any specified set of organisms. Because of the rapid increase in microbial genome data owing to the next-generation sequencing technology, it becomes increasingly challenging to maintain high-quality orthology relationships while allowing the users to incorporate the latest genomic data available into an analysis. Because many of the recently accumulating genomic data are draft genome sequences for which some complete genome sequences of the same or closely related species are available, MBGD now stores draft genome data and allows the users to incorporate them into a user-specific ortholog database using the MyMBGD functionality. In this function, draft genome data are incorporated into an existing ortholog table created only from the complete genome data in an incremental manner to prevent low-quality draft data from affecting clustering results. In addition, to provide high-quality orthology relationships, the standard ortholog table containing all the representative genomes, which is first created by the rapid classification program DomClust, is now refined using DomRefine, a recently developed program for improving domain-level clustering using multiple sequence alignment information. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  10. Draft Genome Sequence of thermoalkaliphilic Caldalkalibacillus thermarum strain TA2.A1 Reveals Molecular Adaptations to Extreme pH and Temperature

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kalamorz, Falk; Keis, Stefanie; Stanton, Jo-Ann

    The genes and molecular machines that allow for a thermoalkaliphilic lifestyle have not been defined. To address this goal, we report on the improved high-quality draft genome sequence of Caldalkalibacillus thermarum strain TA2.A1, an obligately aerobic bacterium that grows optimally at pH 9.5 and 65 to 70 C on a wide variety of carbon and energy sources.

  11. Strategies for optimizing BioNano and Dovetail explored through a second reference quality assembly for the legume model, Medicago truncatula.

    PubMed

    Moll, Karen M; Zhou, Peng; Ramaraj, Thiruvarangan; Fajardo, Diego; Devitt, Nicholas P; Sadowsky, Michael J; Stupar, Robert M; Tiffin, Peter; Miller, Jason R; Young, Nevin D; Silverstein, Kevin A T; Mudge, Joann

    2017-08-04

    Third generation sequencing technologies, with sequencing reads in the tens- of kilo-bases, facilitate genome assembly by spanning ambiguous regions and improving continuity. This has been critical for plant genomes, which are difficult to assemble due to high repeat content, gene family expansions, segmental and tandem duplications, and polyploidy. Recently, high-throughput mapping and scaffolding strategies have further improved continuity. Together, these long-range technologies enable quality draft assemblies of complex genomes in a cost-effective and timely manner. Here, we present high quality genome assemblies of the model legume plant, Medicago truncatula (R108) using PacBio, Dovetail Chicago (hereafter, Dovetail) and BioNano technologies. To test these technologies for plant genome assembly, we generated five assemblies using all possible combinations and ordering of these three technologies in the R108 assembly. While the BioNano and Dovetail joins overlapped, they also showed complementary gains in continuity and join numbers. Both technologies spanned repetitive regions that PacBio alone was unable to bridge. Combining technologies, particularly Dovetail followed by BioNano, resulted in notable improvements compared to Dovetail or BioNano alone. A combination of PacBio, Dovetail, and BioNano was used to generate a high quality draft assembly of R108, a M. truncatula accession widely used in studies of functional genomics. As a test for the usefulness of the resulting genome sequence, the new R108 assembly was used to pinpoint breakpoints and characterize flanking sequence of a previously identified translocation between chromosomes 4 and 8, identifying more than 22.7 Mb of novel sequence not present in the earlier A17 reference assembly. Adding Dovetail followed by BioNano data yielded complementary improvements in continuity over the original PacBio assembly. This strategy proved efficient and cost-effective for developing a quality draft assembly compared to traditional reference assemblies.

  12. High-quality permanent draft genome sequence of the Lebeckia - nodulating Burkholderia dilworthii strain WSM3556T

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    De Meyer, Sofie E.; Tian, Rui; Seshadri, Rekha

    Burkholderia dilworthii strain WSM3556T is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective N2-fixing root nodule of Lebeckia ambigua collected near Grotto Bay Nature Reserve, in the Western Cape of South Africa, in October 2004. This plant persists in infertile and deep sandy soils with acidic pH, and is therefore an ideal candidate for a perennial based agriculture system in Western Australia. WSM3556T thus represents a potential inoculant quality strain for L. ambigua for which we describe the general features, together with genome sequence and annotation. Lastly, the 7,679,067 bp high-quality permanent draft genome is arrangedmore » in 140 scaffolds of 141 contigs, contains 7,059 protein-coding genes and 64 RNA-only encoding genes, and is part of the GEBA-RNB project proposal.« less

  13. High-quality permanent draft genome sequence of the Lebeckia ambigua-nodulating Burkholderia sp. strain WSM4176

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    De Meyer, Sofie E.; Tian, Rui; Seshadri, Rekha

    We report that Burkholderia sp. strain WSM4176 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective N2-fixing root nodule of Lebeckia ambigua collected in Nieuwoudtville, Western Cape of South Africa, in October 2007. This plant persists in infertile, acidic and deep sandy soils, and is therefore an ideal candidate for a perennial based agriculture system in Western Australia. Here we describe the features of Burkholderia sp. strain WSM4176, which represents a potential inoculant quality strain for L. ambigua, together with sequence and annotation. The 9,065,247 bp high-quality-draft genome is arranged in 13 scaffolds of 65 contigs,more » contains 8369 protein-coding genes and 128 RNA-only encoding genes, and is part of the GEBA-RNB project proposal (Project ID 882).« less

  14. High-quality permanent draft genome sequence of the Lebeckia - nodulating Burkholderia dilworthii strain WSM3556T

    DOE PAGES

    De Meyer, Sofie E.; Tian, Rui; Seshadri, Rekha; ...

    2015-09-19

    Burkholderia dilworthii strain WSM3556T is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective N2-fixing root nodule of Lebeckia ambigua collected near Grotto Bay Nature Reserve, in the Western Cape of South Africa, in October 2004. This plant persists in infertile and deep sandy soils with acidic pH, and is therefore an ideal candidate for a perennial based agriculture system in Western Australia. WSM3556T thus represents a potential inoculant quality strain for L. ambigua for which we describe the general features, together with genome sequence and annotation. Lastly, the 7,679,067 bp high-quality permanent draft genome is arrangedmore » in 140 scaffolds of 141 contigs, contains 7,059 protein-coding genes and 64 RNA-only encoding genes, and is part of the GEBA-RNB project proposal.« less

  15. High-quality permanent draft genome sequence of the Lebeckia ambigua-nodulating Burkholderia sp. strain WSM4176

    DOE PAGES

    De Meyer, Sofie E.; Tian, Rui; Seshadri, Rekha; ...

    2015-10-16

    We report that Burkholderia sp. strain WSM4176 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective N2-fixing root nodule of Lebeckia ambigua collected in Nieuwoudtville, Western Cape of South Africa, in October 2007. This plant persists in infertile, acidic and deep sandy soils, and is therefore an ideal candidate for a perennial based agriculture system in Western Australia. Here we describe the features of Burkholderia sp. strain WSM4176, which represents a potential inoculant quality strain for L. ambigua, together with sequence and annotation. The 9,065,247 bp high-quality-draft genome is arranged in 13 scaffolds of 65 contigs,more » contains 8369 protein-coding genes and 128 RNA-only encoding genes, and is part of the GEBA-RNB project proposal (Project ID 882).« less

  16. High-quality genome of the peach scab pathogen, Venturia carpophila

    USDA-ARS?s Scientific Manuscript database

    Venturia carpophila causes peach scab, a disease that renders peach (Prunus persica) fruit unmarketable. We report a high-quality draft genome (36.9 Mb) of V. carpophila from an isolate collected from a peach tree in central Georgia. The genome was sequenced by MiSeq using an Illumina paired-end lib...

  17. High-quality draft genome sequence of Kocuria marina SO9-6, an actinobacterium isolated from a copper mine

    PubMed Central

    Castro, Daniel B.A.; Pereira, Letícia Bianca; Silva, Marcus Vinícius M. e; Silva, Bárbara P. da; Palermo, Bruna Rafaella Z.; Carlos, Camila; Belgini, Daiane R.B.; Limache, Elmer Erasmo G.; Lacerda, Gileno V. Jr; Nery, Mariana B.P.; Gomes, Milene B.; Souza, Salatiel S. de; Silva, Thiago M. da; Rodrigues, Viviane D.; Paulino, Luciana C.; Vicentini, Renato; Ferraz, Lúcio F.C.; Ottoboni, Laura M.M.

    2015-01-01

    An actinobacterial strain, designated SO9-6, was isolated from a copper iron sulfide mineral. The organism is Gram-positive, facultatively anaerobic, and coccoid. Chemotaxonomic and phylogenetic properties were consistent with its classification in the genus Kocuria. Here, we report the first draft genome sequence of Kocuria marina SO9-6 under accession JROM00000000 (http://www.ncbi.nlm.nih.gov/nuccore/725823918), which provides insights for heavy metal bioremediation and production of compounds of biotechnological interest. PMID:26484219

  18. Augmenting Chinese hamster genome assembly by identifying regions of high confidence.

    PubMed

    Vishwanathan, Nandita; Bandyopadhyay, Arpan A; Fu, Hsu-Yuan; Sharma, Mohit; Johnson, Kathryn C; Mudge, Joann; Ramaraj, Thiruvarangan; Onsongo, Getiria; Silverstein, Kevin A T; Jacob, Nitya M; Le, Huong; Karypis, George; Hu, Wei-Shou

    2016-09-01

    Chinese hamster Ovary (CHO) cell lines are the dominant industrial workhorses for therapeutic recombinant protein production. The availability of genome sequence of Chinese hamster and CHO cells will spur further genome and RNA sequencing of producing cell lines. However, the mammalian genomes assembled using shot-gun sequencing data still contain regions of uncertain quality due to assembly errors. Identifying high confidence regions in the assembled genome will facilitate its use for cell engineering and genome engineering. We assembled two independent drafts of Chinese hamster genome by de novo assembly from shotgun sequencing reads and by re-scaffolding and gap-filling the draft genome from NCBI for improved scaffold lengths and gap fractions. We then used the two independent assemblies to identify high confidence regions using two different approaches. First, the two independent assemblies were compared at the sequence level to identify their consensus regions as "high confidence regions" which accounts for at least 78 % of the assembled genome. Further, a genome wide comparison of the Chinese hamster scaffolds with mouse chromosomes revealed scaffolds with large blocks of collinearity, which were also compiled as high-quality scaffolds. Genome scale collinearity was complemented with EST based synteny which also revealed conserved gene order compared to mouse. As cell line sequencing becomes more commonly practiced, the approaches reported here are useful for assessing the quality of assembly and potentially facilitate the engineering of cell lines. Copyright © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  19. High-Quality Draft Genome Sequence of Thermocrinis jamiesonii GBS1 T Isolated from Great Boiling Spring, Nevada

    DOE PAGES

    Ganji, Rakesh; Murugapiran, Senthil K.; Ong, John C.; ...

    2016-10-20

    The draft genome of Thermocrinis jamiesonii GBS1 T is 1,315,625 bp in 10 contigs and encodes 1,463 predicted genes. The presence of sox genes and various glycoside hydrolases and the absence of uptake NiFe hydrogenases ( hyaB) are consistent with a requirement for thiosulfate and suggest the ability to use carbohydrate polymers.

  20. A manually annotated Actinidia chinensis var. chinensis (kiwifruit) genome highlights the challenges associated with draft genomes and gene prediction in plants.

    PubMed

    Pilkington, Sarah M; Crowhurst, Ross; Hilario, Elena; Nardozza, Simona; Fraser, Lena; Peng, Yongyan; Gunaseelan, Kularajathevan; Simpson, Robert; Tahir, Jibran; Deroles, Simon C; Templeton, Kerry; Luo, Zhiwei; Davy, Marcus; Cheng, Canhong; McNeilage, Mark; Scaglione, Davide; Liu, Yifei; Zhang, Qiong; Datson, Paul; De Silva, Nihal; Gardiner, Susan E; Bassett, Heather; Chagné, David; McCallum, John; Dzierzon, Helge; Deng, Cecilia; Wang, Yen-Yi; Barron, Lorna; Manako, Kelvina; Bowen, Judith; Foster, Toshi M; Erridge, Zoe A; Tiffin, Heather; Waite, Chethi N; Davies, Kevin M; Grierson, Ella P; Laing, William A; Kirk, Rebecca; Chen, Xiuyin; Wood, Marion; Montefiori, Mirco; Brummell, David A; Schwinn, Kathy E; Catanach, Andrew; Fullerton, Christina; Li, Dawei; Meiyalaghan, Sathiyamoorthy; Nieuwenhuizen, Niels; Read, Nicola; Prakash, Roneel; Hunter, Don; Zhang, Huaibi; McKenzie, Marian; Knäbel, Mareike; Harris, Alastair; Allan, Andrew C; Gleave, Andrew; Chen, Angela; Janssen, Bart J; Plunkett, Blue; Ampomah-Dwamena, Charles; Voogd, Charlotte; Leif, Davin; Lafferty, Declan; Souleyre, Edwige J F; Varkonyi-Gasic, Erika; Gambi, Francesco; Hanley, Jenny; Yao, Jia-Long; Cheung, Joey; David, Karine M; Warren, Ben; Marsh, Ken; Snowden, Kimberley C; Lin-Wang, Kui; Brian, Lara; Martinez-Sanchez, Marcela; Wang, Mindy; Ileperuma, Nadeesha; Macnee, Nikolai; Campin, Robert; McAtee, Peter; Drummond, Revel S M; Espley, Richard V; Ireland, Hilary S; Wu, Rongmei; Atkinson, Ross G; Karunairetnam, Sakuntala; Bulley, Sean; Chunkath, Shayhan; Hanley, Zac; Storey, Roy; Thrimawithana, Amali H; Thomson, Susan; David, Charles; Testolin, Raffaele; Huang, Hongwen; Hellens, Roger P; Schaffer, Robert J

    2018-04-16

    Most published genome sequences are drafts, and most are dominated by computational gene prediction. Draft genomes typically incorporate considerable sequence data that are not assigned to chromosomes, and predicted genes without quality confidence measures. The current Actinidia chinensis (kiwifruit) 'Hongyang' draft genome has 164 Mb of sequences unassigned to pseudo-chromosomes, and omissions have been identified in the gene models. A second genome of an A. chinensis (genotype Red5) was fully sequenced. This new sequence resulted in a 554.0 Mb assembly with all but 6 Mb assigned to pseudo-chromosomes. Pseudo-chromosomal comparisons showed a considerable number of translocation events have occurred following a whole genome duplication (WGD) event some consistent with centromeric Robertsonian-like translocations. RNA sequencing data from 12 tissues and ab initio analysis informed a genome-wide manual annotation, using the WebApollo tool. In total, 33,044 gene loci represented by 33,123 isoforms were identified, named and tagged for quality of evidential support. Of these 3114 (9.4%) were identical to a protein within 'Hongyang' The Kiwifruit Information Resource (KIR v2). Some proportion of the differences will be varietal polymorphisms. However, as most computationally predicted Red5 models required manual re-annotation this proportion is expected to be small. The quality of the new gene models was tested by fully sequencing 550 cloned 'Hort16A' cDNAs and comparing with the predicted protein models for Red5 and both the original 'Hongyang' assembly and the revised annotation from KIR v2. Only 48.9% and 63.5% of the cDNAs had a match with 90% identity or better to the original and revised 'Hongyang' annotation, respectively, compared with 90.9% to the Red5 models. Our study highlights the need to take a cautious approach to draft genomes and computationally predicted genes. Our use of the manual annotation tool WebApollo facilitated manual checking and correction of gene models enabling improvement of computational prediction. This utility was especially relevant for certain types of gene families such as the EXPANSIN like genes. Finally, this high quality gene set will supply the kiwifruit and general plant community with a new tool for genomics and other comparative analysis.

  1. High-quality permanent draft genome sequence of Bradyrhizobium sp. strain WSM1743 - an effective microsymbiont of an Indigofera sp. growing in Australia

    DOE PAGES

    Eshraghi, Leila; De Meyer, Sofie E.; Tian, Rui; ...

    2015-10-26

    Bradyrhizobium sp. strain WSM1743 is an aerobic, motile, Gram-negative, non-spore-forming rod that can exist as a soil saprophyte or as a legume microsymbiont of an Indigofera sp. WSM1743 was isolated from a nodule recovered from the roots of an Indigofera sp. growing 20 km north of Carnarvon in Australia. It is slow growing, tolerates up to 1 % NaCl and is capable of growth at 37 °C. Here we describe the features of Bradyrhizobium sp. strain WSM1743, together with genome sequence information and its annotation. Finally, the 8,341,956 bp high-quality permanent draft genome is arranged into 163 scaffolds and 167more » contigs, contains 7908 protein-coding genes and 75 RNA-only encoding genes and was sequenced as part of the Root Nodule Bacteria chapter of the Genomic Encyclopedia of Bacteria and Archaea project.« less

  2. High-quality permanent draft genome sequence of Bradyrhizobium sp. strain WSM1743 - an effective microsymbiont of an Indigofera sp. growing in Australia

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Eshraghi, Leila; De Meyer, Sofie E.; Tian, Rui

    Bradyrhizobium sp. strain WSM1743 is an aerobic, motile, Gram-negative, non-spore-forming rod that can exist as a soil saprophyte or as a legume microsymbiont of an Indigofera sp. WSM1743 was isolated from a nodule recovered from the roots of an Indigofera sp. growing 20 km north of Carnarvon in Australia. It is slow growing, tolerates up to 1 % NaCl and is capable of growth at 37 °C. Here we describe the features of Bradyrhizobium sp. strain WSM1743, together with genome sequence information and its annotation. Finally, the 8,341,956 bp high-quality permanent draft genome is arranged into 163 scaffolds and 167more » contigs, contains 7908 protein-coding genes and 75 RNA-only encoding genes and was sequenced as part of the Root Nodule Bacteria chapter of the Genomic Encyclopedia of Bacteria and Archaea project.« less

  3. High-quality permanent draft genome sequence of the Parapiptadenia rigida-nodulating Burkholderia sp. strain UYPR1.413

    DOE PAGES

    De Meyer, Sofie E.; Fabiano, Elena; Tian, Rui; ...

    2015-06-04

    We report that Burkholderia sp. strain UYPR1.413 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from a root nodule of Parapiptadenia rigida collected at the Angico plantation, Mandiyu, Uruguay, in December 2006. A survey of symbionts of P. rigida in Uruguay demonstrated that this species is nodulated predominantly by Burkholderia microsymbionts. Moreover, Burkholderia sp. strain UYPR1.413 is a highly efficient nitrogen fixing symbiont with this host. Currently, the only other sequenced isolate to fix with this host is Cupriavidus sp. UYPR2.512. Therefore, Burkholderia sp. strain UYPR1.413 was selected for sequencing on the basis of its environmental and agriculturalmore » relevance to issues in global carbon cycling, alternative energy production, and biogeochemical importance, and is part of the GEBA-RNB project. Here we describe the features of Burkholderia sp. strain UYPR1.413, together with sequence and annotation. The 10,373,764 bp high-quality permanent draft genome is arranged in 336 scaffolds of 342 contigs, contains 9759 protein-coding genes and 77 RNA-only encoding genes.« less

  4. High-quality permanent draft genome sequence of the Parapiptadenia rigida-nodulating Burkholderia sp. strain UYPR1.413

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    De Meyer, Sofie E.; Fabiano, Elena; Tian, Rui

    We report that Burkholderia sp. strain UYPR1.413 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from a root nodule of Parapiptadenia rigida collected at the Angico plantation, Mandiyu, Uruguay, in December 2006. A survey of symbionts of P. rigida in Uruguay demonstrated that this species is nodulated predominantly by Burkholderia microsymbionts. Moreover, Burkholderia sp. strain UYPR1.413 is a highly efficient nitrogen fixing symbiont with this host. Currently, the only other sequenced isolate to fix with this host is Cupriavidus sp. UYPR2.512. Therefore, Burkholderia sp. strain UYPR1.413 was selected for sequencing on the basis of its environmental and agriculturalmore » relevance to issues in global carbon cycling, alternative energy production, and biogeochemical importance, and is part of the GEBA-RNB project. Here we describe the features of Burkholderia sp. strain UYPR1.413, together with sequence and annotation. The 10,373,764 bp high-quality permanent draft genome is arranged in 336 scaffolds of 342 contigs, contains 9759 protein-coding genes and 77 RNA-only encoding genes.« less

  5. High-quality permanent draft genome sequence of Ensifer meliloti strain 4H41, an effective salt- and drought-tolerant microsymbiont of Phaseolus vulgaris

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mhamdi, Ridha; Ardley, Julie; Tian, Rui

    We report that Ensifer meliloti 4H41 is an aerobic, motile, Gram-negative, non-spore-forming rod that can exist as a soil saprophyte or as a legume microsymbiont of common bean (Phaseolus vulgaris). Strain 4H41 was isolated in 2002 from root nodules of P. vulgaris grown in South Tunisia from the oasis of Rjim-Maatoug. Strain 4H41 is salt- and drought-tolerant and highly effective at fixing nitrogen with P. vulgaris. Here we describe the features of E. meliloti 4H41, together with genome sequence information and its annotation. The 6,795,637 bp high-quality permanent draft genome is arranged into 47 scaffolds of 47 contigs containing 6,350more » protein-coding genes and 72 RNA-only encoding genes, and is one of the rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project proposal.« less

  6. High-quality permanent draft genome sequence of Rhizobium sullae strain WSM1592; a Hedysarum coronarium microsymbiont from Sassari, Italy

    DOE PAGES

    Yates, Ron; Howieson, John; De Meyer, Sofie E.; ...

    2015-07-24

    Rhizobium sullae strain WSM1592 is an aerobic, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen (N2) fixing root nodule formed on the short-lived perennial legume Hedysarum coronarium (also known as Sulla coronaria or Sulla). WSM1592 was isolated from a nodule recovered from H. coronarium roots located in Ottava, bordering Sassari, Sardinia in 1995. WSM1592 is highly effective at fixing nitrogen with H. coronarium, and is currently the commercial Sulla inoculant strain in Australia. Here we describe the features of R. sullae strain WSM1592, together with genome sequence information and its annotation. The 7,530,820 bp high-quality permanent draft genomemore » is arranged into 118 scaffolds of 118 contigs containing 7.453 protein-coding genes and 73 RNA-only encoding genes. In conclusion, this rhizobial genome is sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.« less

  7. High-quality permanent draft genome sequence of Ensifer meliloti strain 4H41, an effective salt- and drought-tolerant microsymbiont of Phaseolus vulgaris

    DOE PAGES

    Mhamdi, Ridha; Ardley, Julie; Tian, Rui; ...

    2015-07-02

    We report that Ensifer meliloti 4H41 is an aerobic, motile, Gram-negative, non-spore-forming rod that can exist as a soil saprophyte or as a legume microsymbiont of common bean (Phaseolus vulgaris). Strain 4H41 was isolated in 2002 from root nodules of P. vulgaris grown in South Tunisia from the oasis of Rjim-Maatoug. Strain 4H41 is salt- and drought-tolerant and highly effective at fixing nitrogen with P. vulgaris. Here we describe the features of E. meliloti 4H41, together with genome sequence information and its annotation. The 6,795,637 bp high-quality permanent draft genome is arranged into 47 scaffolds of 47 contigs containing 6,350more » protein-coding genes and 72 RNA-only encoding genes, and is one of the rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project proposal.« less

  8. A post-assembly genome-improvement toolkit (PAGIT) to obtain annotated genomes from contigs.

    PubMed

    Swain, Martin T; Tsai, Isheng J; Assefa, Samual A; Newbold, Chris; Berriman, Matthew; Otto, Thomas D

    2012-06-07

    Genome projects now produce draft assemblies within weeks owing to advanced high-throughput sequencing technologies. For milestone projects such as Escherichia coli or Homo sapiens, teams of scientists were employed to manually curate and finish these genomes to a high standard. Nowadays, this is not feasible for most projects, and the quality of genomes is generally of a much lower standard. This protocol describes software (PAGIT) that is used to improve the quality of draft genomes. It offers flexible functionality to close gaps in scaffolds, correct base errors in the consensus sequence and exploit reference genomes (if available) in order to improve scaffolding and generating annotations. The protocol is most accessible for bacterial and small eukaryotic genomes (up to 300 Mb), such as pathogenic bacteria, malaria and parasitic worms. Applying PAGIT to an E. coli assembly takes ∼24 h: it doubles the average contig size and annotates over 4,300 gene models.

  9. High-quality permanent draft genome sequence of Rhizobium leguminosarum bv. viciae strain GB30; an effective microsymbiont of Pisum sativum growing in Poland

    DOE PAGES

    Mazur, Andrzej; De Meyer, Sofie E.; Tian, Rui; ...

    2015-07-16

    We report that Rhizobium leguminosarum bv. viciae GB30 is an aerobic, motile, Gram-negative, non-spore-forming rod that can exist as a soil saprophyte or as a legume microsymbiont of Pisum sativum. GB30 was isolated in Poland from a nodule recovered from the roots of Pisum sativum growing at Janow. GB30 is also an effective microsymbiont of the annual forage legumes vetch and pea. Here we describe the features of R. leguminosarum bv. viciae strain GB30, together with sequence and annotation. The 7,468,464 bp high-quality permanent draft genome is arranged in 78 scaffolds of 78 contigs containing 7,227 protein-coding genes and 75more » RNA-only encoding genes, and is part of the GEBA-RNB project proposal.« less

  10. High-quality permanent draft genome sequence of Rhizobium leguminosarum bv. viciae strain GB30; an effective microsymbiont of Pisum sativum growing in Poland

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mazur, Andrzej; De Meyer, Sofie E.; Tian, Rui

    We report that Rhizobium leguminosarum bv. viciae GB30 is an aerobic, motile, Gram-negative, non-spore-forming rod that can exist as a soil saprophyte or as a legume microsymbiont of Pisum sativum. GB30 was isolated in Poland from a nodule recovered from the roots of Pisum sativum growing at Janow. GB30 is also an effective microsymbiont of the annual forage legumes vetch and pea. Here we describe the features of R. leguminosarum bv. viciae strain GB30, together with sequence and annotation. The 7,468,464 bp high-quality permanent draft genome is arranged in 78 scaffolds of 78 contigs containing 7,227 protein-coding genes and 75more » RNA-only encoding genes, and is part of the GEBA-RNB project proposal.« less

  11. ABACAS: algorithm-based automatic contiguation of assembled sequences

    PubMed Central

    Assefa, Samuel; Keane, Thomas M.; Otto, Thomas D.; Newbold, Chris; Berriman, Matthew

    2009-01-01

    Summary: Due to the availability of new sequencing technologies, we are now increasingly interested in sequencing closely related strains of existing finished genomes. Recently a number of de novo and mapping-based assemblers have been developed to produce high quality draft genomes from new sequencing technology reads. New tools are necessary to take contigs from a draft assembly through to a fully contiguated genome sequence. ABACAS is intended as a tool to rapidly contiguate (align, order, orientate), visualize and design primers to close gaps on shotgun assembled contigs based on a reference sequence. The input to ABACAS is a set of contigs which will be aligned to the reference genome, ordered and orientated, visualized in the ACT comparative browser, and optimal primer sequences are automatically generated. Availability and Implementation: ABACAS is implemented in Perl and is freely available for download from http://abacas.sourceforge.net Contact: sa4@sanger.ac.uk PMID:19497936

  12. Draft genome of the medaka fish: a comprehensive resource for medaka developmental genetics and vertebrate evolutionary biology.

    PubMed

    Takeda, Hiroyuki

    2008-06-01

    The medaka Oryzias latipes is a small egg-laying freshwater teleost, and has become an excellent model system for developmental genetics and evolutionary biology. The medaka genome is relatively small in size, approximately 800 Mb, and the genome sequencing project was recently completed by Japanese research groups, providing a high-quality draft genome sequence of the inbred Hd-rR strain of medaka. In this review, I present an overview of the medaka genome project including genome resources, followed by specific findings obtained with the medaka draft genome. In particular, I focus on the analysis that was done by taking advantage of the medaka system, such as the sex chromosome differentiation and the regional history of medaka species using single nucleotide polymorphisms as genomic markers.

  13. Positional bias in variant calls against draft reference assemblies.

    PubMed

    Briskine, Roman V; Shimizu, Kentaro K

    2017-03-28

    Whole genome resequencing projects may implement variant calling using draft reference genomes assembled de novo from short-read libraries. Despite lower quality of such assemblies, they allowed researchers to extend a wide range of population genetic and genome-wide association analyses to non-model species. As the variant calling pipelines are complex and involve many software packages, it is important to understand inherent biases and limitations at each step of the analysis. In this article, we report a positional bias present in variant calling performed against draft reference assemblies constructed from de Bruijn or string overlap graphs. We assessed how frequently variants appeared at each position counted from ends of a contig or scaffold sequence, and discovered unexpectedly high number of variants at the positions related to the length of either k-mers or reads used for the assembly. We detected the bias in both publicly available draft assemblies from Assemblathon 2 competition as well as in the assemblies we generated from our simulated short-read data. Simulations confirmed that the bias causing variants are predominantly false positives induced by reads from spatially distant repeated sequences. The bias is particularly strong in contig assemblies. Scaffolding does not eliminate the bias but tends to mitigate it because of the changes in variants' relative positions and alterations in read alignments. The bias can be effectively reduced by filtering out the variants that reside in repetitive elements. Draft genome sequences generated by several popular assemblers appear to be susceptible to the positional bias potentially affecting many resequencing projects in non-model species. The bias is inherent to the assembly algorithms and arises from their particular handling of repeated sequences. It is recommended to reduce the bias by filtering especially if higher-quality genome assembly cannot be achieved. Our findings can help other researchers to improve the quality of their variant data sets and reduce artefactual findings in downstream analyses.

  14. High quality draft genome sequence of the moderately halophilic bacterium Pontibacillus yanchengensis Y32(T) and comparison among Pontibacillus genomes.

    PubMed

    Huang, Jing; Qiao, Zi Xu; Tang, Jing Wei; Wang, Gejiao

    2015-01-01

    Pontibacillus yanchengensis Y32(T) is an aerobic, motile, Gram-positive, endospore-forming, and moderately halophilic bacterium isolated from a salt field. In this study, we describe the features of P. yanchengensis strain Y32(T) together with a comparison with other four Pontibacillus genomes. The 4,281,464 bp high-quality-draft genome of strain Y32(T) is arranged into 153 contigs containing 3,965 protein-coding genes and 77 RNA encoding genes. The genome of strain Y32(T) possesses many genes related to its halophilic character, flagellar assembly and chemotaxis to support its survival in a salt-rich environment.

  15. High-quality draft genome sequence of Effusibacillus lacus strain skLN1T, facultative anaerobic spore-former isolated from freshwater lake sediment.

    PubMed

    Watanabe, Miho; Tokizawa, Riho; Kojima, Hisaya; Fukui, Manabu

    2017-01-01

    10.1601/nm.25721 strain skLN1 T is the type strain of the type species in the genus 10.1601/nm.25720 which is the one of the genera in the family 10.1601/nm.5070 within the phylum 10.1601/nm.3874. 10.1601/nm.25721 strain skLN1 T is a Gram-positive, spore-forming thermophilic neutrophile isolated from freshwater lake sediment. Here, we present the draft genome sequence of strain skLN1 T , which consists of 3,902,380 bp with a G + C content of 50.38%.

  16. De novo genome assembly of the red silk cotton tree (Bombax ceiba).

    PubMed

    Gao, Yong; Wang, Haibo; Liu, Chao; Chu, Honglong; Dai, Dongqin; Song, Shengnan; Yu, Long; Han, Lihong; Fu, Yi; Tian, Bin; Tang, Lizhou

    2018-05-01

    Bombax ceiba L. (the red silk cotton tree) is a large deciduous tree that is distributed in tropical and sub-tropical Asia as well as northern Australia. It has great economic and ecological importance, with several applications in industry and traditional medicine in many Asian countries. To facilitate further utilization of this plant resource, we present here the draft genome sequence for B. ceiba. We assembled a relatively intact genome of B. ceiba by using PacBio single-molecule sequencing and BioNano optical mapping technologies. The final draft genome is approximately 895 Mb long, with contig and scaffold N50 sizes of 1.0 Mb and 2.06 Mb, respectively. The high-quality draft genome assembly of B. ceiba will be a valuable resource enabling further genetic improvement and more effective use of this tree species.

  17. Genome Sequence of Enterohemorrhagic Escherichia coli NCCP15658

    PubMed Central

    Song, Ju Yeon; Yoo, Ran Hee; Jang, Song Yee; Seong, Won-Keun; Kim, Seon-Young; Jeong, Haeyoung; Kang, Sung Gyun; Kim, Byung Kwon; Kwon, Soon-Kyeong; Lee, Choong Hoon; Yu, Dong Su; Park, Mi-Sun

    2012-01-01

    Enterohemorrhagic Escherichia coli causes severe food-borne disease in the guts of humans and animals. Here, we report the high-quality draft genome sequence of E. coli NCCP15658 isolated from a patient in the Republic of Korea. Its genome size was determined to be 5.46 Mb, and its genomic features, including genes encoding virulence factors, were analyzed. PMID:22740673

  18. Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum.

    PubMed

    VanBuren, Robert; Bryant, Doug; Edger, Patrick P; Tang, Haibao; Burgess, Diane; Challabathula, Dinakar; Spittle, Kristi; Hall, Richard; Gu, Jenny; Lyons, Eric; Freeling, Michael; Bartels, Dorothea; Ten Hallers, Boudewijn; Hastie, Alex; Michael, Todd P; Mockler, Todd C

    2015-11-26

    Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetium genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a 'near-complete' draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. The Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.

  19. High-quality draft genome sequence of Ensifer meliloti Mlalz-1, a microsymbiont of Medicago laciniata (L.) miller collected in Lanzarote, Canary Islands, Spain.

    PubMed

    Osman, Wan Adnawani Meor; van Berkum, Peter; León-Barrios, Milagros; Velázquez, Encarna; Elia, Patrick; Tian, Rui; Ardley, Julie; Gollagher, Margaret; Seshadri, Rekha; Reddy, T B K; Ivanova, Natalia; Woyke, Tanja; Pati, Amrita; Markowitz, Victor; Baeshen, Mohamed N; Baeshen, Naseebh Nabeeh; Kyrpides, Nikos; Reeve, Wayne

    2017-01-01

    10.1601/nm.1335 Mlalz-1 (INSDC = ATZD00000000) is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen-fixing nodule of Medicago laciniata (L.) Miller from a soil sample collected near the town of Guatiza on the island of Lanzarote, the Canary Islands, Spain. This strain nodulates and forms an effective symbiosis with the highly specific host M. laciniata . This rhizobial genome was sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) sequencing project. Here the features of 10.1601/nm.1335 Mlalz-1 are described, together with high-quality permanent draft genome sequence information and annotation. The 6,664,116 bp high-quality draft genome is arranged in 99 scaffolds of 100 contigs, containing 6314 protein-coding genes and 74 RNA-only encoding genes. Strain Mlalz-1 is closely related to 10.1601/nm.1335 10.1601/strainfinder?urlappend=%3Fid%3DIAM+12611 T , 10.1601/nm.1334 A 321 T and 10.1601/nm.17831 10.1601/strainfinder?urlappend=%3Fid%3DORS+1407 T , based on 16S rRNA gene sequences. gANI values of ≥98.1% support the classification of strain Mlalz-1 as 10.1601/nm.1335. Nodulation of M. laciniata requires a specific nodC allele, and the nodC gene of strain Mlalz-1 shares ≥98% sequence identity with nodC of M. laciniata -nodulating 10.1601/nm.1328 strains, but ≤93% with nodC of 10.1601/nm.1328 strains that nodulate other Medicago species. Strain Mlalz-1 is unique among sequenced 10.1601/nm.1335 strains in possessing genes encoding components of a T2SS and in having two versions of the adaptive acid tolerance response lpiA-acvB operon. In 10.1601/nm.1334 strain 10.1601/strainfinder?urlappend=%3Fid%3DWSM+419, lpiA is essential for enhancing survival in lethal acid conditions. The second copy of the lpiA-acvB operon of strain Mlalz-1 has highest sequence identity (> 96%) with that of 10.1601/nm.1334 strains, which suggests genetic recombination between strain Mlalz-1 and 10.1601/nm.1334 and the horizontal gene transfer of lpiA-acvB .

  20. Evaluation and validation of de novo and hybrid assembly techniques to derive high quality genome sequences

    DOE PAGES

    Utturkar, Sagar M.; Klingeman, Dawn Marie; Land, Miriam L.; ...

    2014-06-14

    Our motivation with this work was to assess the potential of different types of sequence data combined with de novo and hybrid assembly approaches to improve existing draft genome sequences. Our results show Illumina, 454 and PacBio sequencing technologies were used to generate de novo and hybrid genome assemblies for four different bacteria, which were assessed for quality using summary statistics (e.g. number of contigs, N50) and in silico evaluation tools. Differences in predictions of multiple copies of rDNA operons for each respective bacterium were evaluated by PCR and Sanger sequencing, and then the validated results were applied as anmore » additional criterion to rank assemblies. In general, assemblies using longer PacBio reads were better able to resolve repetitive regions. In this study, the combination of Illumina and PacBio sequence data assembled through the ALLPATHS-LG algorithm gave the best summary statistics and most accurate rDNA operon number predictions. This study will aid others looking to improve existing draft genome assemblies. As to availability and implementation–all assembly tools except CLC Genomics Workbench are freely available under GNU General Public License.« less

  1. Genome sequence of the phylogenetically isolated spirochete Leptonema illini type strain (3055T)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Huntemann, Marcel; Stackebrandt, Erko; Held, Brittany

    2013-01-01

    Leptonema illini Hovind-Hougen 1979 is the type species of the genus Leptonema, family Leptospiraceae, phylum Spirochaetes. Organisms of this family have a Gram-negative-like cell enve- lope consisting of a cytoplasmic membrane and an outer membrane. The peptidoglycan layer is as- sociated with the cytoplasmic rather than the outer membrane. The two flagella of members of Leptospiraceae extend from the cytoplasmic membrane at the ends of the bacteria into the periplasmic space and are necessary for their motility. Here we describe the features of the L. illini type strain, together with the complete genome sequence, and annotation. This is the firstmore » genome sequence (finished at the level of Improved High Quality Draft) to be reported from of a member of the genus Leptonema and a representative of the third genus of the family Leptospiraceae for which complete or draft genome sequences are now available. The three scaffolds of the 4,522,760 bp draft genome sequence reported here, and its 4,230 protein-coding and 47 RNA genes are part of the Ge- nomic Encyclopedia of Bacteria and Archaea project.« less

  2. Phanerochaete chrysosporium genomics

    Treesearch

    Luis F. Larrondo; Rafael Vicuna; Dan Cullen

    2005-01-01

    A high quality draft genome sequence has been generated for the lignocellulose-degrading basidiomycete Phanerochaete chrysosporium (Martinez et al. 2004). Analysis of the genome in the context of previously established genetics and physiology is presented. Transposable elements and their potential relationship to genes involved in lignin degradation are systematically...

  3. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Isanapong, Jantiya; Goodwin, Lynne A.; Bruce, David

    Microbial communities in the termite hindgut are essential for degrading plant material. We present the high-quality draft genome sequence of the Opitutaceae bacterium strain TAV1, the first member of the phylum Verrucomicrobia to be isolated from wood-feeding termites. The genomic analysis reveals genes coding for lignocellulosic degradation and nitrogen fixation.

  4. Genome Sequence of an Oligohaline Hyperthermophilic Archaeon, Thermococcus zilligii AN1, Isolated from a Terrestrial Geothermal Freshwater Spring

    PubMed Central

    Kim, Byung Kwon; Lee, Seong Hyuk; Kim, Seon-Young; Jeong, Haeyoung; Kwon, Soon-Kyeong; Lee, Choong Hoon; Song, Ju Yeon; Yu, Dong Su

    2012-01-01

    Thermococcus zilligii, a thermophilic anaerobe in freshwater, is useful for physiological research and biotechnological applications. Here we report the high-quality draft genome sequence of T. zilligii AN1T. The genome contains a number of genes for an immune system and adaptation to a microbial biomass-rich environment as well as hydrogenase genes. PMID:22740682

  5. High-quality permanent draft genome sequence of the Mimosa asperata - nodulating Cupriavidus sp. strain AMP6

    DOE PAGES

    De Meyer, Sofie E.; Parker, Matthew; Van Berkum, Peter; ...

    2015-10-16

    Cupriavidus sp. strain AMP6 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from a root nodule of Mimosa asperata collected in Santa Ana National Wildlife Refuge, Texas, in 2005. Mimosa asperata is the only legume described so far to exclusively associates with Cupriavidus symbionts. Furthermore, strain AMP6 represents an early-diverging lineage within the symbiotic Cupriavidus group and has the capacity to develop an effective nitrogen-fixing symbiosis with three other species of Mimosa. Here, we describe the genome of Cupriavidus sp. strain AMP6 which enables comparative analyses of symbiotic trait evolution in this genus; the general features, together withmore » sequence and annotation are further discussed. Finally, the 7,579,563 bp high-quality permanent draft genome is arranged in 260 scaffolds of 262 contigs, contains 7,033 protein-coding genes and 97 RNA-only encoding genes, and is part of the GEBA-RNB project proposal.« less

  6. High-quality permanent draft genome sequence of the Mimosa asperata - nodulating Cupriavidus sp. strain AMP6

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    De Meyer, Sofie E.; Parker, Matthew; Van Berkum, Peter

    Cupriavidus sp. strain AMP6 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from a root nodule of Mimosa asperata collected in Santa Ana National Wildlife Refuge, Texas, in 2005. Mimosa asperata is the only legume described so far to exclusively associates with Cupriavidus symbionts. Furthermore, strain AMP6 represents an early-diverging lineage within the symbiotic Cupriavidus group and has the capacity to develop an effective nitrogen-fixing symbiosis with three other species of Mimosa. Here, we describe the genome of Cupriavidus sp. strain AMP6 which enables comparative analyses of symbiotic trait evolution in this genus; the general features, together withmore » sequence and annotation are further discussed. Finally, the 7,579,563 bp high-quality permanent draft genome is arranged in 260 scaffolds of 262 contigs, contains 7,033 protein-coding genes and 97 RNA-only encoding genes, and is part of the GEBA-RNB project proposal.« less

  7. High-quality draft genome sequence of Ensifer meliloti Mlalz-1, a microsymbiont of Medicago laciniata (L.) miller collected in Lanzarote, Canary Islands, Spain

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Osman, Wan Adnawani Meor; van Berkum, Peter; León-Barrios, Milagros

    Ensifer meliloti Mlalz-1 (INSDC = ATZD00000000) is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen-fixing nodule of Medicago laciniata (L.) Miller from a soil sample collected near the town of Guatiza on the island of Lanzarote, the Canary Islands, Spain. This strain nodulates and forms an effective symbiosis with the highly specific host M. laciniata. This rhizobial genome was sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) sequencing project. Here in this paper, the features of E. meliloti Mlalz-1 are described, together with high-qualitymore » permanent draft genome sequence information and annotation. The 6,664,116 bp high-quality draft genome is arranged in 99 scaffolds of 100 contigs, containing 6314 protein-coding genes and 74 RNA-only encoding genes. Strain Mlalz-1 is closely related to Ensifer meliloti IAM 12611 T, Ensifer medicae A 321T and Ensifer numidicus ORS 1407 T, based on 16S rRNA gene sequences. gANI values of ≥98.1% support the classification of strain Mlalz-1 as E. meliloti . Nodulation of M. laciniata requires a specific nodC allele, and the nodC gene of strain Mlalz-1 shares ≥98% sequence identity with nodC of M. laciniata-nodulating Ensifer strains, but ≤93% with nodC of Ensifer strains that nodulate other Medicago species. Strain Mlalz-1 is unique among sequenced E. meliloti strains in possessing genes encoding components of a T2SS and in having two versions of the adaptive acid tolerance response lpiA-acvB operon. In E. medicae strain WSM419, lpiA is essential for enhancing survival in lethal acid conditions. The second copy of the lpiA-acvB operon of strain Mlalz-1 has highest sequence identity (> 96%) with that of E. medicae strains, which suggests genetic recombination between strain Mlalz-1 and E. medicae and the horizontal gene transfer of lpiA-acvB.« less

  8. High-quality draft genome sequence of Ensifer meliloti Mlalz-1, a microsymbiont of Medicago laciniata (L.) miller collected in Lanzarote, Canary Islands, Spain

    DOE PAGES

    Osman, Wan Adnawani Meor; van Berkum, Peter; León-Barrios, Milagros; ...

    2017-09-25

    Ensifer meliloti Mlalz-1 (INSDC = ATZD00000000) is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen-fixing nodule of Medicago laciniata (L.) Miller from a soil sample collected near the town of Guatiza on the island of Lanzarote, the Canary Islands, Spain. This strain nodulates and forms an effective symbiosis with the highly specific host M. laciniata. This rhizobial genome was sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) sequencing project. Here in this paper, the features of E. meliloti Mlalz-1 are described, together with high-qualitymore » permanent draft genome sequence information and annotation. The 6,664,116 bp high-quality draft genome is arranged in 99 scaffolds of 100 contigs, containing 6314 protein-coding genes and 74 RNA-only encoding genes. Strain Mlalz-1 is closely related to Ensifer meliloti IAM 12611 T, Ensifer medicae A 321T and Ensifer numidicus ORS 1407 T, based on 16S rRNA gene sequences. gANI values of ≥98.1% support the classification of strain Mlalz-1 as E. meliloti . Nodulation of M. laciniata requires a specific nodC allele, and the nodC gene of strain Mlalz-1 shares ≥98% sequence identity with nodC of M. laciniata-nodulating Ensifer strains, but ≤93% with nodC of Ensifer strains that nodulate other Medicago species. Strain Mlalz-1 is unique among sequenced E. meliloti strains in possessing genes encoding components of a T2SS and in having two versions of the adaptive acid tolerance response lpiA-acvB operon. In E. medicae strain WSM419, lpiA is essential for enhancing survival in lethal acid conditions. The second copy of the lpiA-acvB operon of strain Mlalz-1 has highest sequence identity (> 96%) with that of E. medicae strains, which suggests genetic recombination between strain Mlalz-1 and E. medicae and the horizontal gene transfer of lpiA-acvB.« less

  9. High-quality draft genome sequence of Rhizobium mesoamericanum strain STM6155, a Mimosa pudica microsymbiont from New Caledonia

    DOE PAGES

    Klonowska, Agnieszka; López-López, Aline; Moulin, Lionel; ...

    2017-01-17

    Rhizobium mesoamericanum STM6155 (INSCD=ATYY01000000) is an aerobic, motile, Gram-negative, non-spore-forming rod that can exist as a soil saprophyte or as an effective nitrogen fixing microsymbiont of the legume Mimosa pudica L.. STM6155 was isolated in 2009 from a nodule of the trap host M. pudica grown in nickel-rich soil collected near Mont Dore, New Caledonia. R. mesoamericanum STM6155 was selected as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) genome sequencing project. Here we describe the symbiotic properties of R. mesoamericanum STM6155, together with its genome sequence information and annotation. Themore » 6,927,906bp high-quality draft genome is arranged into 147 scaffolds of 152 contigs containing 6855 protein-coding genes and 71 RNA-only encoding genes. Strain STM6155 forms an ANI clique (ID 2435) with the sequenced R. mesoamericanum strain STM3625, and the nodulation genes are highly conserved in these strains and the type strain of Rhizobium grahamii CCGE501 T . Within the STM6155 genome, we have identified a chr chromate efflux gene cluster of six genes arranged into two putative operons and we postulate that this cluster is important for the survival of STM6155 in ultramafic soils containing high concentrations of chromate.« less

  10. High-quality draft genome sequence of Rhizobium mesoamericanum strain STM6155, a Mimosa pudica microsymbiont from New Caledonia

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Klonowska, Agnieszka; López-López, Aline; Moulin, Lionel

    Rhizobium mesoamericanum STM6155 (INSCD=ATYY01000000) is an aerobic, motile, Gram-negative, non-spore-forming rod that can exist as a soil saprophyte or as an effective nitrogen fixing microsymbiont of the legume Mimosa pudica L.. STM6155 was isolated in 2009 from a nodule of the trap host M. pudica grown in nickel-rich soil collected near Mont Dore, New Caledonia. R. mesoamericanum STM6155 was selected as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) genome sequencing project. Here we describe the symbiotic properties of R. mesoamericanum STM6155, together with its genome sequence information and annotation. Themore » 6,927,906bp high-quality draft genome is arranged into 147 scaffolds of 152 contigs containing 6855 protein-coding genes and 71 RNA-only encoding genes. Strain STM6155 forms an ANI clique (ID 2435) with the sequenced R. mesoamericanum strain STM3625, and the nodulation genes are highly conserved in these strains and the type strain of Rhizobium grahamii CCGE501 T . Within the STM6155 genome, we have identified a chr chromate efflux gene cluster of six genes arranged into two putative operons and we postulate that this cluster is important for the survival of STM6155 in ultramafic soils containing high concentrations of chromate.« less

  11. A high-coverage draft genome of the mycalesine butterfly Bicyclus anynana.

    PubMed

    Nowell, Reuben W; Elsworth, Ben; Oostra, Vicencio; Zwaan, Bas J; Wheat, Christopher W; Saastamoinen, Marjo; Saccheri, Ilik J; Van't Hof, Arjen E; Wasik, Bethany R; Connahs, Heidi; Aslam, Muhammad L; Kumar, Sujai; Challis, Richard J; Monteiro, Antónia; Brakefield, Paul M; Blaxter, Mark

    2017-07-01

    The mycalesine butterfly Bicyclus anynana, the "Squinting bush brown," is a model organism in the study of lepidopteran ecology, development, and evolution. Here, we present a draft genome sequence for B. anynana to serve as a genomics resource for current and future studies of this important model species. Seven libraries with insert sizes ranging from 350 bp to 20 kb were constructed using DNA from an inbred female and sequenced using both Illumina and PacBio technology; 128 Gb of raw Illumina data was filtered to 124 Gb and assembled to a final size of 475 Mb (∼×260 assembly coverage). Contigs were scaffolded using mate-pair, transcriptome, and PacBio data into 10 800 sequences with an N50 of 638 kb (longest scaffold 5 Mb). The genome is comprised of 26% repetitive elements and encodes a total of 22 642 predicted protein-coding genes. Recovery of a BUSCO set of core metazoan genes was almost complete (98%). Overall, these metrics compare well with other recently published lepidopteran genomes. We report a high-quality draft genome sequence for Bicyclus anynana. The genome assembly and annotated gene models are available at LepBase (http://ensembl.lepbase.org/index.html). © The Authors 2017. Published by Oxford University Press.

  12. A high-coverage draft genome of the mycalesine butterfly Bicyclus anynana

    PubMed Central

    Elsworth, Ben; Oostra, Vicencio; Zwaan, Bas J.; Wheat, Christopher W.; Saastamoinen, Marjo; Saccheri, Ilik J.; van’t Hof, Arjen E.; Wasik, Bethany R.; Connahs, Heidi; Aslam, Muhammad L.; Kumar, Sujai; Challis, Richard J.; Monteiro, Antónia; Brakefield, Paul M.

    2017-01-01

    Abstract The mycalesine butterfly Bicyclus anynana, the “Squinting bush brown,” is a model organism in the study of lepidopteran ecology, development, and evolution. Here, we present a draft genome sequence for B. anynana to serve as a genomics resource for current and future studies of this important model species. Seven libraries with insert sizes ranging from 350 bp to 20 kb were constructed using DNA from an inbred female and sequenced using both Illumina and PacBio technology; 128 Gb of raw Illumina data was filtered to 124 Gb and assembled to a final size of 475 Mb (∼×260 assembly coverage). Contigs were scaffolded using mate-pair, transcriptome, and PacBio data into 10 800 sequences with an N50 of 638 kb (longest scaffold 5 Mb). The genome is comprised of 26% repetitive elements and encodes a total of 22 642 predicted protein-coding genes. Recovery of a BUSCO set of core metazoan genes was almost complete (98%). Overall, these metrics compare well with other recently published lepidopteran genomes. We report a high-quality draft genome sequence for Bicyclus anynana. The genome assembly and annotated gene models are available at LepBase (http://ensembl.lepbase.org/index.html). PMID:28486658

  13. Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    VanBuren, Robert; Bryant, Doug; Edger, Patrick P.

    Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly1. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetiummore » genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a ‘near-complete’ draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. As a result, the Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.« less

  14. Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum

    DOE PAGES

    VanBuren, Robert; Bryant, Doug; Edger, Patrick P.; ...

    2015-11-11

    Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly1. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetiummore » genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a ‘near-complete’ draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. As a result, the Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.« less

  15. High-quality permanent draft genome sequence of Ensifer sp. PC2, isolated from a nitrogen-fixing root nodule of the legume tree (Khejri) native to the Thar Desert of India

    DOE PAGES

    Gehlot, Hukam Singh; Ardley, Julie; Tak, Nisha; ...

    2016-06-23

    Ensifer sp. PC2 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from a nitrogen-fixing nodule of the tree legume P. cineraria (L.) Druce (Khejri), which is a keystone species that grows in arid and semi-arid regions of the Indian Thar desert. Strain PC2 exists as a dominant saprophyte in alkaline soils of Western Rajasthan. It is fast growing, well-adapted to arid conditions and is able to form an effective symbiosis with several annual crop legumes as well as species of mimosoid trees and shrubs. Here we describe the features of Ensifer sp. PC2, together with genome sequence informationmore » and its annotation. The 8,458,965 bp high-quality permanent draft genome is arranged into 171 scaffolds of 171 contigs containing 8,344 protein-coding genes and 139 RNA-only encoding genes, and is one of the rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project proposal.« less

  16. High-quality permanent draft genome sequence of Ensifer sp. PC2, isolated from a nitrogen-fixing root nodule of the legume tree (Khejri) native to the Thar Desert of India

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gehlot, Hukam Singh; Ardley, Julie; Tak, Nisha

    Ensifer sp. PC2 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from a nitrogen-fixing nodule of the tree legume P. cineraria (L.) Druce (Khejri), which is a keystone species that grows in arid and semi-arid regions of the Indian Thar desert. Strain PC2 exists as a dominant saprophyte in alkaline soils of Western Rajasthan. It is fast growing, well-adapted to arid conditions and is able to form an effective symbiosis with several annual crop legumes as well as species of mimosoid trees and shrubs. Here we describe the features of Ensifer sp. PC2, together with genome sequence informationmore » and its annotation. The 8,458,965 bp high-quality permanent draft genome is arranged into 171 scaffolds of 171 contigs containing 8,344 protein-coding genes and 139 RNA-only encoding genes, and is one of the rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project proposal.« less

  17. Genomic Diversity of Biocontrol Strains of Pseudomonas spp. Isolated from Aerial or Root Surfaces of Plants

    USDA-ARS?s Scientific Manuscript database

    The striking ecological, metabolic, and biochemical diversity of Pseudomonas has intrigued microbiologists for many decades. To explore the genomic diversity of biocontrol strains of Pseudomonas spp., we derived high quality draft sequences of seven strains known to suppress plant disease. The str...

  18. A High Quality Draft Consensus Sequence of the Genome of a Heterozygous Grapevine Variety

    PubMed Central

    Cartwright, Dustin A.; Cestaro, Alessandro; Pruss, Dmitry; Pindo, Massimo; FitzGerald, Lisa M.; Vezzulli, Silvia; Reid, Julia; Malacarne, Giulia; Iliev, Diana; Coppola, Giuseppina; Wardell, Bryan; Micheletti, Diego; Macalma, Teresita; Facci, Marco; Mitchell, Jeff T.; Perazzolli, Michele; Eldredge, Glenn; Gatto, Pamela; Oyzerski, Rozan; Moretto, Marco; Gutin, Natalia; Stefanini, Marco; Chen, Yang; Segala, Cinzia; Davenport, Christine; Demattè, Lorenzo; Mraz, Amy; Battilana, Juri; Stormo, Keith; Costa, Fabrizio; Tao, Quanzhou; Si-Ammour, Azeddine; Harkins, Tim; Lackey, Angie; Perbost, Clotilde; Taillon, Bruce; Stella, Alessandra; Solovyev, Victor; Fawcett, Jeffrey A.; Sterck, Lieven; Vandepoele, Klaas; Grando, Stella M.; Toppo, Stefano; Moser, Claudio; Lanchbury, Jerry; Bogden, Robert; Skolnick, Mark; Sgaramella, Vittorio; Bhatnagar, Satish K.; Fontana, Paolo; Gutin, Alexander; Van de Peer, Yves; Salamini, Francesco; Viola, Roberto

    2007-01-01

    Background Worldwide, grapes and their derived products have a large market. The cultivated grape species Vitis vinifera has potential to become a model for fruit trees genetics. Like many plant species, it is highly heterozygous, which is an additional challenge to modern whole genome shotgun sequencing. In this paper a high quality draft genome sequence of a cultivated clone of V. vinifera Pinot Noir is presented. Principal Findings We estimate the genome size of V. vinifera to be 504.6 Mb. Genomic sequences corresponding to 477.1 Mb were assembled in 2,093 metacontigs and 435.1 Mb were anchored to the 19 linkage groups (LGs). The number of predicted genes is 29,585, of which 96.1% were assigned to LGs. This assembly of the grape genome provides candidate genes implicated in traits relevant to grapevine cultivation, such as those influencing wine quality, via secondary metabolites, and those connected with the extreme susceptibility of grape to pathogens. Single nucleotide polymorphism (SNP) distribution was consistent with a diffuse haplotype structure across the genome. Of around 2,000,000 SNPs, 1,751,176 were mapped to chromosomes and one or more of them were identified in 86.7% of anchored genes. The relative age of grape duplicated genes was estimated and this made possible to reveal a relatively recent Vitis-specific large scale duplication event concerning at least 10 chromosomes (duplication not reported before). Conclusions Sanger shotgun sequencing and highly efficient sequencing by synthesis (SBS), together with dedicated assembly programs, resolved a complex heterozygous genome. A consensus sequence of the genome and a set of mapped marker loci were generated. Homologous chromosomes of Pinot Noir differ by 11.2% of their DNA (hemizygous DNA plus chromosomal gaps). SNP markers are offered as a tool with the potential of introducing a new era in the molecular breeding of grape. PMID:18094749

  19. Nuclear, Chloroplast, and Mitochondrial Genome Sequences of the Prospective Microalgal Biofuel Strain Picochlorum soloecismus

    DOE PAGES

    Gonzalez-Esquer, C. Raul; Twary, Scott N.; Hovde, Blake T.; ...

    2018-01-25

    Picochlorum soloecismus is a halotolerant, fast-growing, and moderate-lipid-producing microalga that is being evaluated as a renewable feedstock for biofuel production. Herein, we report on an improved high-quality draft assembly and annotation for the nuclear, chloroplast, and mitochondrial genomes of P. soloecismus DOE 101.

  20. Genome sequence of the dark pink pigmented Listia bainesii microsymbiont Methylobacterium sp. WSM2598

    PubMed Central

    2014-01-01

    Strains of a pink-pigmented Methylobacterium sp. are effective nitrogen- (N2) fixing microsymbionts of species of the African crotalarioid genus Listia. Strain WSM2598 is an aerobic, motile, Gram-negative, non-spore-forming rod isolated in 2002 from a Listia bainesii root nodule collected at Estcourt Research Station in South Africa. Here we describe the features of Methylobacterium sp. WSM2598, together with information and annotation of a high-quality draft genome sequence. The 7,669,765 bp draft genome is arranged in 5 scaffolds of 83 contigs, contains 7,236 protein-coding genes and 18 RNA-only encoding genes. This rhizobial genome is one of 100 sequenced as part of the DOE Joint Genome Institute 2010 G enomic E ncyclopedia for B acteria and A rchaea- R oot N odule B acteria (GEBA-RNB) project. PMID:25780498

  1. Genome sequence of the dark pink pigmented Listia bainesii microsymbiont Methylobacterium sp. WSM2598.

    PubMed

    Ardley, Julie; Tian, Rui; Howieson, John; Yates, Ron; Bräu, Lambert; Han, James; Lobos, Elizabeth; Huntemann, Marcel; Chen, Amy; Mavromatis, Konstantinos; Markowitz, Victor; Ivanova, Natalia; Pati, Amrita; Goodwin, Lynne; Woyke, Tanja; Kyrpides, Nikos; Reeve, Wayne

    2014-01-01

    Strains of a pink-pigmented Methylobacterium sp. are effective nitrogen- (N2) fixing microsymbionts of species of the African crotalarioid genus Listia. Strain WSM2598 is an aerobic, motile, Gram-negative, non-spore-forming rod isolated in 2002 from a Listia bainesii root nodule collected at Estcourt Research Station in South Africa. Here we describe the features of Methylobacterium sp. WSM2598, together with information and annotation of a high-quality draft genome sequence. The 7,669,765 bp draft genome is arranged in 5 scaffolds of 83 contigs, contains 7,236 protein-coding genes and 18 RNA-only encoding genes. This rhizobial genome is one of 100 sequenced as part of the DOE Joint Genome Institute 2010 G enomic E ncyclopedia for B acteria and A rchaea- R oot N odule B acteria (GEBA-RNB) project.

  2. Draft genome and reference transcriptomic resources for the urticating pine defoliator Thaumetopoea pityocampa (Lepidoptera: Notodontidae).

    PubMed

    Gschloessl, B; Dorkeld, F; Berges, H; Beydon, G; Bouchez, O; Branco, M; Bretaudeau, A; Burban, C; Dubois, E; Gauthier, P; Lhuillier, E; Nichols, J; Nidelet, S; Rocha, S; Sauné, L; Streiff, R; Gautier, M; Kerdelhué, C

    2018-05-01

    The pine processionary moth Thaumetopoea pityocampa (Lepidoptera: Notodontidae) is the main pine defoliator in the Mediterranean region. Its urticating larvae cause severe human and animal health concerns in the invaded areas. This species shows a high phenotypic variability for various traits, such as phenology, fecundity and tolerance to extreme temperatures. This study presents the construction and analysis of extensive genomic and transcriptomic resources, which are an obligate prerequisite to understand their underlying genetic architecture. Using a well-studied population from Portugal with peculiar phenological characteristics, the karyotype was first determined and a first draft genome of 537 Mb total length was assembled into 68,292 scaffolds (N50 = 164 kb). From this genome assembly, 29,415 coding genes were predicted. To circumvent some limitations for fine-scale physical mapping of genomic regions of interest, a 3X coverage BAC library was also developed. In particular, 11 BACs from this library were individually sequenced to assess the assembly quality. Additionally, de novo transcriptomic resources were generated from various developmental stages sequenced with HiSeq and MiSeq Illumina technologies. The reads were de novo assembled into 62,376 and 63,175 transcripts, respectively. Then, a robust subset of the genome-predicted coding genes, the de novo transcriptome assemblies and previously published 454/Sanger data were clustered to obtain a high-quality and comprehensive reference transcriptome consisting of 29,701 bona fide unigenes. These sequences covered 99% of the cegma and 88% of the busco highly conserved eukaryotic genes and 84% of the busco arthropod gene set. Moreover, 90% of these transcripts could be localized on the draft genome. The described information is available via a genome annotation portal (http://bipaa.genouest.org/sp/thaumetopoea_pityocampa/). © 2018 John Wiley & Sons Ltd.

  3. Genome sequencing of bacteria: sequencing, de novo assembly and rapid analysis using open source tools.

    PubMed

    Kisand, Veljo; Lettieri, Teresa

    2013-04-01

    De novo genome sequencing of previously uncharacterized microorganisms has the potential to open up new frontiers in microbial genomics by providing insight into both functional capabilities and biodiversity. Until recently, Roche 454 pyrosequencing was the NGS method of choice for de novo assembly because it generates hundreds of thousands of long reads (<450 bps), which are presumed to aid in the analysis of uncharacterized genomes. The array of tools for processing NGS data are increasingly free and open source and are often adopted for both their high quality and role in promoting academic freedom. The error rate of pyrosequencing the Alcanivorax borkumensis genome was such that thousands of insertions and deletions were artificially introduced into the finished genome. Despite a high coverage (~30 fold), it did not allow the reference genome to be fully mapped. Reads from regions with errors had low quality, low coverage, or were missing. The main defect of the reference mapping was the introduction of artificial indels into contigs through lower than 100% consensus and distracting gene calling due to artificial stop codons. No assembler was able to perform de novo assembly comparable to reference mapping. Automated annotation tools performed similarly on reference mapped and de novo draft genomes, and annotated most CDSs in the de novo assembled draft genomes. Free and open source software (FOSS) tools for assembly and annotation of NGS data are being developed rapidly to provide accurate results with less computational effort. Usability is not high priority and these tools currently do not allow the data to be processed without manual intervention. Despite this, genome assemblers now readily assemble medium short reads into long contigs (>97-98% genome coverage). A notable gap in pyrosequencing technology is the quality of base pair calling and conflicting base pairs between single reads at the same nucleotide position. Regardless, using draft whole genomes that are not finished and remain fragmented into tens of contigs allows one to characterize unknown bacteria with modest effort.

  4. Draft genome sequence of the silver pomfret fish, Pampus argenteus.

    PubMed

    AlMomin, Sabah; Kumar, Vinod; Al-Amad, Sami; Al-Hussaini, Mohsen; Dashti, Talal; Al-Enezi, Khaznah; Akbar, Abrar

    2016-01-01

    Silver pomfret, Pampus argenteus, is a fish species from coastal waters. Despite its high commercial value, this edible fish has not been sequenced. Hence, its genetic and genomic studies have been limited. We report the first draft genome sequence of the silver pomfret obtained using a Next Generation Sequencing (NGS) technology. We assembled 38.7 Gb of nucleotides into scaffolds of 350 Mb with N50 of about 1.5 kb, using high quality paired end reads. These scaffolds represent 63.7% of the estimated silver pomfret genome length. The newly sequenced and assembled genome has 11.06% repetitive DNA regions, and this percentage is comparable to that of the tilapia genome. The genome analysis predicted 16 322 genes. About 91% of these genes showed homology with known proteins. Many gene clusters were annotated to protein and fatty-acid metabolism pathways that may be important in the context of the meat texture and immune system developmental processes. The reference genome can pave the way for the identification of many other genomic features that could improve breeding and population-management strategies, and it can also help characterize the genetic diversity of P. argenteus.

  5. Permanent draft genome sequence of the gliding predator Saprospira grandis strain Sa g1 (= HR1)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mavromatis, K; Chertkov, Olga; Lapidus, Alla L.

    2012-01-01

    Saprospira grandis Gross et al. 1911 is a member of the Saprospiraceae, a family in the class 'Sphingobacteria' that remains poorly characterized at the genomic level. The species is known for preying on other marine bacteria via 'ixotrophy'. S. grandis strain Sa g1 was isolated from decaying crab carapace in France and was selected for genome sequencing because of its isolated location in the tree of life. Only one type strain genome has been published so far from the Saprospiraceae, while the sequence of strain Sa g1 represents the second genome to be published from a non-type strain of S.more » grandis. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 4,495,250 bp long Improved-High-Quality draft of the genome with its 3,536 protein-coding and 62 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.« less

  6. High-quality permanent draft genome sequence of Ensifer medicae strain WSM244, a microsymbiont isolated from Medicago polymorpha growing in alkaline soil

    DOE PAGES

    Ardley, Julie; Tian, Rui; O’Hara, Graham; ...

    2015-12-01

    We report that Ensifer medicae WSM244 is an aerobic, motile, Gram-negative, non-spore-forming rod that can exist as a soil saprophyte or as a legume microsymbiont of Medicago species. WSM244 was isolated in 1979 from a nodule recovered from the roots of the annual Medicago polymorpha L. growing in alkaline soil (pH 8.0) in Tel Afer, Iraq. WSM244 is the only acid-sensitive E. medicae strain that has been sequenced to date. It is effective at fixing nitrogen with M. polymorpha L., as well as with more alkaline-adapted Medicago spp. such as M. littoralis Loisel., M. scutellata (L.) Mill., M. tornata (L.)more » Mill. and M. truncatula Gaertn. This strain is also effective with the perennial M. sativa L. Here we describe the features of E. medicae WSM244, together with genome sequence information and its annotation. The 6,650,282 bp high-quality permanent draft genome is arranged into 91 scaffolds of 91 contigs containing 6,427 protein-coding genes and 68 RNA-only encoding genes, and is one of the rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project proposal.« less

  7. High-quality permanent draft genome sequence of Ensifer medicae strain WSM244, a microsymbiont isolated from Medicago polymorpha growing in alkaline soil

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ardley, Julie; Tian, Rui; O’Hara, Graham

    We report that Ensifer medicae WSM244 is an aerobic, motile, Gram-negative, non-spore-forming rod that can exist as a soil saprophyte or as a legume microsymbiont of Medicago species. WSM244 was isolated in 1979 from a nodule recovered from the roots of the annual Medicago polymorpha L. growing in alkaline soil (pH 8.0) in Tel Afer, Iraq. WSM244 is the only acid-sensitive E. medicae strain that has been sequenced to date. It is effective at fixing nitrogen with M. polymorpha L., as well as with more alkaline-adapted Medicago spp. such as M. littoralis Loisel., M. scutellata (L.) Mill., M. tornata (L.)more » Mill. and M. truncatula Gaertn. This strain is also effective with the perennial M. sativa L. Here we describe the features of E. medicae WSM244, together with genome sequence information and its annotation. The 6,650,282 bp high-quality permanent draft genome is arranged into 91 scaffolds of 91 contigs containing 6,427 protein-coding genes and 68 RNA-only encoding genes, and is one of the rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project proposal.« less

  8. Quality scores for 32,000 genomes

    DOE PAGES

    Land, Miriam L.; Hyatt, Doug; Jun, Se-Ran; ...

    2014-12-08

    More than 80% of the microbial genomes in GenBank are of ‘draft’ quality (12,553 draft vs. 2,679 finished, as of October, 2013). In this study, we have examined all the microbial DNA sequences available for complete, draft, and Sequence Read Archive genomes in GenBank as well as three other major public databases, and assigned quality scores for more than 30,000 prokaryotic genome sequences. Scores were assigned using four categories: the completeness of the assembly, the presence of full-length rRNA genes, tRNA composition and the presence of a set of 102 conserved genes in prokaryotes. Most (~88%) of the genomes hadmore » quality scores of 0.8 or better and can be safely used for standard comparative genomics analysis. We compared genomes across factors that may influence the score. We found that although sequencing depth coverage of over 100x did not ensure a better score, sequencing read length was a better indicator of sequencing quality. With few exceptions, most of the 30,000 genomes have nearly all the 102 essential genes. The score can be used to set thresholds for screening data when analyzing “all published genomes” and reference data is either not available or not applicable. The scores highlighted organisms for which commonly used tools do not perform well. This information can be used to improve tools and to serve a broad group of users as more diverse organisms are sequenced. Finally and unexpectedly, the comparison of predicted tRNAs across 15,000 high quality genomes showed that anticodons beginning with an ‘A’ (codons ending with a ‘U’) are almost non-existent, with the exception of one arginine codon (CGU); this has been noted previously in the literature for a few genomes, but not with the depth found here.« less

  9. Quality scores for 32,000 genomes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Land, Miriam L.; Hyatt, Doug; Jun, Se-Ran

    More than 80% of the microbial genomes in GenBank are of ‘draft’ quality (12,553 draft vs. 2,679 finished, as of October, 2013). In this study, we have examined all the microbial DNA sequences available for complete, draft, and Sequence Read Archive genomes in GenBank as well as three other major public databases, and assigned quality scores for more than 30,000 prokaryotic genome sequences. Scores were assigned using four categories: the completeness of the assembly, the presence of full-length rRNA genes, tRNA composition and the presence of a set of 102 conserved genes in prokaryotes. Most (~88%) of the genomes hadmore » quality scores of 0.8 or better and can be safely used for standard comparative genomics analysis. We compared genomes across factors that may influence the score. We found that although sequencing depth coverage of over 100x did not ensure a better score, sequencing read length was a better indicator of sequencing quality. With few exceptions, most of the 30,000 genomes have nearly all the 102 essential genes. The score can be used to set thresholds for screening data when analyzing “all published genomes” and reference data is either not available or not applicable. The scores highlighted organisms for which commonly used tools do not perform well. This information can be used to improve tools and to serve a broad group of users as more diverse organisms are sequenced. Finally and unexpectedly, the comparison of predicted tRNAs across 15,000 high quality genomes showed that anticodons beginning with an ‘A’ (codons ending with a ‘U’) are almost non-existent, with the exception of one arginine codon (CGU); this has been noted previously in the literature for a few genomes, but not with the depth found here.« less

  10. SSPACE-LongRead: scaffolding bacterial draft genomes using long read sequence information

    PubMed Central

    2014-01-01

    Background The recent introduction of the Pacific Biosciences RS single molecule sequencing technology has opened new doors to scaffolding genome assemblies in a cost-effective manner. The long read sequence information is promised to enhance the quality of incomplete and inaccurate draft assemblies constructed from Next Generation Sequencing (NGS) data. Results Here we propose a novel hybrid assembly methodology that aims to scaffold pre-assembled contigs in an iterative manner using PacBio RS long read information as a backbone. On a test set comprising six bacterial draft genomes, assembled using either a single Illumina MiSeq or Roche 454 library, we show that even a 50× coverage of uncorrected PacBio RS long reads is sufficient to drastically reduce the number of contigs. Comparisons to the AHA scaffolder indicate our strategy is better capable of producing (nearly) complete bacterial genomes. Conclusions The current work describes our SSPACE-LongRead software which is designed to upgrade incomplete draft genomes using single molecule sequences. We conclude that the recent advances of the PacBio sequencing technology and chemistry, in combination with the limited computational resources required to run our program, allow to scaffold genomes in a fast and reliable manner. PMID:24950923

  11. Draft Genome Sequence of Komagataeibacter rhaeticus Strain AF1, a High Producer of Cellulose, Isolated from Kombucha Tea.

    PubMed

    Dos Santos, Renato Augusto Corrêa; Berretta, Andresa A; Barud, Hernane da Silva; Ribeiro, Sidney José Lima; González-García, Laura Natalia; Zucchi, Tiago Domingues; Goldman, Gustavo H; Riaño-Pachón, Diego M

    2014-07-24

    Here, we present the draft genome sequence of Komagatabaeicter rhaeticus strain AF1, which was isolated from Kombucha tea and is capable of producing high levels of cellulose. Copyright © 2014 dos Santos et al.

  12. The draft genome of MD-2 pineapple using hybrid error correction of long reads

    PubMed Central

    Redwan, Raimi M.; Saidin, Akzam; Kumar, S. Vijay

    2016-01-01

    The introduction of the elite pineapple variety, MD-2, has caused a significant market shift in the pineapple industry. Better productivity, overall increased in fruit quality and taste, resilience to chilled storage and resistance to internal browning are among the key advantages of the MD-2 as compared with its previous predecessor, the Smooth Cayenne. Here, we present the genome sequence of the MD-2 pineapple (Ananas comosus (L.) Merr.) by using the hybrid sequencing technology from two highly reputable platforms, i.e. the PacBio long sequencing reads and the accurate Illumina short reads. Our draft genome achieved 99.6% genome coverage with 27,017 predicted protein-coding genes while 45.21% of the genome was identified as repetitive elements. Furthermore, differential expression of ripening RNASeq library of pineapple fruits revealed ethylene-related transcripts, believed to be involved in regulating the process of non-climacteric pineapple fruit ripening. The MD-2 pineapple draft genome serves as an example of how a complex heterozygous genome is amenable to whole genome sequencing by using a hybrid technology that is both economical and accurate. The genome will make genomic applications more feasible as a medium to understand complex biological processes specific to pineapple. PMID:27374615

  13. The draft genome sequence and annotation of the desert woodrat Neotoma lepida.

    PubMed

    Campbell, Michael; Oakeson, Kelly F; Yandell, Mark; Halpert, James R; Dearing, Denise

    2016-09-01

    We present the de novo draft genome sequence for a vertebrate mammalian herbivore, the desert woodrat (Neotoma lepida). This species is of ecological and evolutionary interest with respect to ingestion, microbial detoxification and hepatic metabolism of toxic plant secondary compounds from the highly toxic creosote bush (Larrea tridentata) and the juniper shrub (Juniperus monosperma). The draft genome sequence and annotation have been deposited at GenBank under the accession LZPO01000000.

  14. Genome Sequence of the Thermotolerant Yeast Kluyveromyces marxianus var. marxianus KCTC 17555

    PubMed Central

    Jeong, Haeyoung; Lee, Dae-Hee; Kim, Sun Hong; Kim, Hyun-Jin; Lee, Kyusang; Song, Ju Yeon; Kim, Byung Kwon; Sung, Bong Hyun; Sohn, Jung Hoon; Koo, Hyun Min

    2012-01-01

    Kluyveromyces marxianus is a thermotolerant yeast that has been explored for potential use in biotechnological applications, such as production of biofuels, single-cell proteins, enzymes, and other heterologous proteins. Here, we present the high-quality draft of the 10.9-Mb genome of K. marxianus var. marxianus KCTC 17555 (= CBS 6556 = ATCC 26548). PMID:23193140

  15. Draft genome of the Peruvian scallop Argopecten purpuratus.

    PubMed

    Li, Chao; Liu, Xiao; Liu, Bo; Ma, Bin; Liu, Fengqiao; Liu, Guilong; Shi, Qiong; Wang, Chunde

    2018-04-01

    The Peruvian scallop, Argopecten purpuratus, is mainly cultured in southern Chile and Peru was introduced into China in the last century. Unlike other Argopecten scallops, the Peruvian scallop normally has a long life span of up to 7 to 10 years. Therefore, researchers have been using it to develop hybrid vigor. Here, we performed whole genome sequencing, assembly, and gene annotation of the Peruvian scallop, with an important aim to develop genomic resources for genetic breeding in scallops. A total of 463.19-Gb raw DNA reads were sequenced. A draft genome assembly of 724.78 Mb was generated (accounting for 81.87% of the estimated genome size of 885.29 Mb), with a contig N50 size of 80.11 kb and a scaffold N50 size of 1.02 Mb. Repeat sequences were calculated to reach 33.74% of the whole genome, and 26,256 protein-coding genes and 3,057 noncoding RNAs were predicted from the assembly. We generated a high-quality draft genome assembly of the Peruvian scallop, which will provide a solid resource for further genetic breeding and for the analysis of the evolutionary history of this economically important scallop.

  16. Draft genome of the Northern snakehead, Channa argus.

    PubMed

    Xu, Jian; Bian, Chao; Chen, Kunci; Liu, Guiming; Jiang, Yanliang; Luo, Qing; You, Xinxin; Peng, Wenzhu; Li, Jia; Huang, Yu; Yi, Yunhai; Dong, Chuanju; Deng, Hua; Zhang, Songhao; Zhang, Hanyuan; Shi, Qiong; Xu, Peng

    2017-04-01

    The Northern snakehead (Channa argus), a member of the Channidae family of the Perciformes, is an economically important freshwater fish native to East Asia. In North America, it has become notorious as an intentionally released invasive species. Its ability to breathe air with gills and migrate short distances over land makes it a good model for bimodal breath research. Therefore, recent research has focused on the identification of relevant candidate genes. Here, we performed whole genome sequencing of C. argus to construct its draft genome, aiming to offer useful information for further functional studies and identification of target genes related to its unusual facultative air breathing. Findings: We assembled the C. argus genome with a total of 140.3 Gb of raw reads, which were sequenced using the Illumina HiSeq2000 platform. The final draft genome assembly was approximately 615.3 Mb, with a contig N50 of 81.4 kb and scaffold N50 of 4.5 Mb. The identified repeat sequences account for 18.9% of the whole genome. The 19 877 protein-coding genes were predicted from the genome assembly, with an average of 10.5 exons per gene. Conclusion: We generated a high-quality draft genome of C. argus, which will provide a valuable genetic resource for further biomedical investigations of this economically important teleost fish. © The Author 2017. Published by Oxford University Press.

  17. Genome Sequence of an Ammonia-Oxidizing Soil Archaeon, “Candidatus Nitrosoarchaeum koreensis” MY1

    PubMed Central

    Kim, Byung Kwon; Jung, Man-Young; Yu, Dong Su; Park, Soo-Je; Oh, Tae Kwang; Rhee, Sung-Keun; Kim, Jihyun F.

    2011-01-01

    Ammonia-oxidizing archaea are ubiquitous microorganisms which play important roles in global nitrogen and carbon cycle on earth. Here we present the high-quality draft genome sequence of an ammonia-oxidizing archaeon, “Candidatus Nitrosopumilus koreensis” MY1, that dominated an enrichment culture of a soil sample from the rhizosphere. Its genome contains genes for survival in the rhizosphere environment as well as those for carbon fixation and ammonium oxidation to nitrite. PMID:21914867

  18. High-Quality draft genome sequence of the Lotus spp. microsymbiont Mesorhizobium loti strain CJ3Sym

    DOE PAGES

    Reeve, Wayne; Sullivan, John; Ronson, Clive; ...

    2015-08-14

    Mesorhizobium loti strain CJ3Sym was isolated in 1998 following transfer of the integrative and conjugative element ICE Ml Sym R7A , also known as the R7A symbiosis island, in a laboratory mating from the donor M. loti strain R7A to a nonsymbiotic recipient Mesorhizobium strain CJ3. Strain CJ3 was originally isolated from a field site in the Rocklands range in New Zealand in 1994. CJ3Sym is an aerobic, Gram-negative, non-spore-forming rod. This report reveals the genome of M. loti strain CJ3Sym currently comprises 70 scaffolds totaling 7,563,725 bp. In conclusion, the high-quality draft genome is arranged in 70 scaffolds ofmore » 71 contigs, contains 7,331 protein-coding genes and 70 RNA-only encoding genes, and is part of the GEBA-RNB project proposal.« less

  19. High-Quality draft genome sequence of the Lotus spp. microsymbiont Mesorhizobium loti strain CJ3Sym

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Reeve, Wayne; Sullivan, John; Ronson, Clive

    Mesorhizobium loti strain CJ3Sym was isolated in 1998 following transfer of the integrative and conjugative element ICE Ml Sym R7A , also known as the R7A symbiosis island, in a laboratory mating from the donor M. loti strain R7A to a nonsymbiotic recipient Mesorhizobium strain CJ3. Strain CJ3 was originally isolated from a field site in the Rocklands range in New Zealand in 1994. CJ3Sym is an aerobic, Gram-negative, non-spore-forming rod. This report reveals the genome of M. loti strain CJ3Sym currently comprises 70 scaffolds totaling 7,563,725 bp. In conclusion, the high-quality draft genome is arranged in 70 scaffolds ofmore » 71 contigs, contains 7,331 protein-coding genes and 70 RNA-only encoding genes, and is part of the GEBA-RNB project proposal.« less

  20. Draft Genome Sequence of Solibacillus kalamii, Isolated from an Air Filter Aboard the International Space Station.

    PubMed

    Seuylemezian, Arman; Singh, Nitin K; Vaishampayan, Parag; Venkateswaran, Kasthuri

    2017-08-31

    We report here the draft genome of Solibacillus kalamii ISSFR-015, isolated from a high-energy particulate arrestance filter aboard the International Space Station. The draft genome sequence of this strain contains 3,809,180 bp with an estimated G+C content of 38.61%. Copyright © 2017 Seuylemezian et al.

  1. High quality permanent draft genome sequence of Chryseobacterium bovis DSM 19482 T, isolated from raw cow milk

    DOE PAGES

    Laviad-Shitrit, Sivan; Göker, Markus; Huntemann, Marcel; ...

    2017-05-08

    Chryseobacterium bovis DSM 19482 T (Hantsis-Zacharov et al., Int J Syst Evol Microbiol 58:1024-1028, 2008) is a Gram-negative, rod shaped, non-motile, facultative anaerobe, chemoorganotroph bacterium. C. bovis is a member of the Flavobacteriaceae, a family within the phylum Bacteroidetes. It was isolated when psychrotolerant bacterial communities in raw milk and their proteolytic and lipolytic traits were studied. Here we describe the features of this organism, together with the draft genome sequence and annotation. The DNA G + C content is 38.19%. The chromosome length is 3,346,045 bp. It encodes 3236 proteins and 105 RNA genes. The C. bovis genome ismore » part of the Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes study.« less

  2. High quality permanent draft genome sequence of Chryseobacterium bovis DSM 19482 T, isolated from raw cow milk

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Laviad-Shitrit, Sivan; Göker, Markus; Huntemann, Marcel

    Chryseobacterium bovis DSM 19482 T (Hantsis-Zacharov et al., Int J Syst Evol Microbiol 58:1024-1028, 2008) is a Gram-negative, rod shaped, non-motile, facultative anaerobe, chemoorganotroph bacterium. C. bovis is a member of the Flavobacteriaceae, a family within the phylum Bacteroidetes. It was isolated when psychrotolerant bacterial communities in raw milk and their proteolytic and lipolytic traits were studied. Here we describe the features of this organism, together with the draft genome sequence and annotation. The DNA G + C content is 38.19%. The chromosome length is 3,346,045 bp. It encodes 3236 proteins and 105 RNA genes. The C. bovis genome ismore » part of the Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes study.« less

  3. Draft Genome Sequence of Clostridium pasteurianum NRRL B-598, a Potential Butanol or Hydrogen Producer.

    PubMed

    Kolek, Jan; Sedlár, Karel; Provazník, Ivo; Patáková, Petra

    2014-03-20

    We present a draft genome sequence of Clostridium pasteurianum NRRL B-598. This strain ferments saccharides by two-stage acetone-butanol (AB) fermentation, is oxygen tolerant, and has high hydrogen yields.

  4. High-quality permanent draft genome sequence of the Bradyrhizobium elkanii type strain USDA 76T, isolated from Glycine max (L.) Merr

    USDA-ARS?s Scientific Manuscript database

    Bradyrhizobium elkanii USDA 76T (INSCD = ARAG00000000), the type strain for Bradyrhizobium elkanii, is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen-fixing root nodule of Glycine max (L. Merr) grown in the USA. Because of its significance as a ...

  5. High quality draft genome sequences of Pseudomonas fulva DSM 17717 T, Pseudomonas parafulva DSM 17004 T and Pseudomonas cremoricolorata DSM 17059 T type strains

    DOE PAGES

    Peña, Arantxa; Busquets, Antonio; Gomila, Margarita; ...

    2016-09-01

    Pseudomonas has the highest number of species out of any genus of Gram-negative bacteria and is phylogenetically divided into several groups. The Pseudomonas putida phylogenetic branch includes at least 13 species of environmental and industrial interest, plant-associated bacteria, insect pathogens, and even some members that have been found in clinical specimens. In the context of the Genomic Encyclopedia of Bacteria and Archaea project, we present the permanent, high-quality draft genomes of the type strains of 3 taxonomically and ecologically closely related species in the Pseudomonas putida phylogenetic branch: Pseudomonas fulva DSM 17717 T, Pseudomonas parafulva DSM 17004 T and Pseudomonasmore » cremoricolorata DSM 17059T. All three genomes are comparable in size (4.6-4.9Mb), with 4,119-4,459 protein-coding genes. Average nucleotide identity based on BLAST comparisons and digital genome-to-genome distance calculations are in good agreement with experimental DNA-DNA hybridization results. The genome sequences presented here will be very helpful in elucidating the taxonomy, phylogeny and evolution of the Pseudomonas putida species complex.« less

  6. High quality draft genome sequences of Pseudomonas fulva DSM 17717 T, Pseudomonas parafulva DSM 17004 T and Pseudomonas cremoricolorata DSM 17059 T type strains

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Peña, Arantxa; Busquets, Antonio; Gomila, Margarita

    Pseudomonas has the highest number of species out of any genus of Gram-negative bacteria and is phylogenetically divided into several groups. The Pseudomonas putida phylogenetic branch includes at least 13 species of environmental and industrial interest, plant-associated bacteria, insect pathogens, and even some members that have been found in clinical specimens. In the context of the Genomic Encyclopedia of Bacteria and Archaea project, we present the permanent, high-quality draft genomes of the type strains of 3 taxonomically and ecologically closely related species in the Pseudomonas putida phylogenetic branch: Pseudomonas fulva DSM 17717 T, Pseudomonas parafulva DSM 17004 T and Pseudomonasmore » cremoricolorata DSM 17059T. All three genomes are comparable in size (4.6-4.9Mb), with 4,119-4,459 protein-coding genes. Average nucleotide identity based on BLAST comparisons and digital genome-to-genome distance calculations are in good agreement with experimental DNA-DNA hybridization results. The genome sequences presented here will be very helpful in elucidating the taxonomy, phylogeny and evolution of the Pseudomonas putida species complex.« less

  7. Draft Genome Sequences of 20 Salmonella enterica subsp. enterica Serovar Typhimurium Strains Isolated from Swine in Santa Catarina, Brazil.

    PubMed

    Seribelli, Amanda Aparecida; Frazão, Miliane Rodrigues; Gonzales, Júlia Cunha; Cao, Guojie; Leon, Maria Sanchez; Kich, Jalusa Deon; Allard, Marc William; Falcão, Juliana Pfrimer

    2018-04-19

    Salmonellosis is a disease with a high incidence worldwide, and Salmonella enterica subsp. enterica serovar Typhimurium is one of the most clinically important serovars. We report here the draft genome sequences of 20 S. Typhimurium strains isolated from swine in Santa Catarina, Brazil. These draft genomes will improve our understanding of S. Typhimurium in Brazil.

  8. Two low coverage bird genomes and a comparison of reference-guided versus de novo genome assemblies.

    PubMed

    Card, Daren C; Schield, Drew R; Reyes-Velasco, Jacobo; Fujita, Matthew K; Andrew, Audra L; Oyler-McCance, Sara J; Fike, Jennifer A; Tomback, Diana F; Ruggiero, Robert P; Castoe, Todd A

    2014-01-01

    As a greater number and diversity of high-quality vertebrate reference genomes become available, it is increasingly feasible to use these references to guide new draft assemblies for related species. Reference-guided assembly approaches may substantially increase the contiguity and completeness of a new genome using only low levels of genome coverage that might otherwise be insufficient for de novo genome assembly. We used low-coverage (∼3.5-5.5x) Illumina paired-end sequencing to assemble draft genomes of two bird species (the Gunnison Sage-Grouse, Centrocercus minimus, and the Clark's Nutcracker, Nucifraga columbiana). We used these data to estimate de novo genome assemblies and reference-guided assemblies, and compared the information content and completeness of these assemblies by comparing CEGMA gene set representation, repeat element content, simple sequence repeat content, and GC isochore structure among assemblies. Our results demonstrate that even lower-coverage genome sequencing projects are capable of producing informative and useful genomic resources, particularly through the use of reference-guided assemblies.

  9. Two low coverage bird genomes and a comparison of reference-guided versus de novo genome assemblies

    USGS Publications Warehouse

    Card, Daren C.; Schield, Drew R.; Reyes-Velasco, Jacobo; Fujita, Matthre K.; Andrew, Audra L.; Oyler-McCance, Sara J.; Fike, Jennifer A.; Tomback, Diana F.; Ruggiero, Robert P.; Castoe, Todd A.

    2014-01-01

    As a greater number and diversity of high-quality vertebrate reference genomes become available, it is increasingly feasible to use these references to guide new draft assemblies for related species. Reference-guided assembly approaches may substantially increase the contiguity and completeness of a new genome using only low levels of genome coverage that might otherwise be insufficient for de novo genome assembly. We used low-coverage (~3.5–5.5x) Illumina paired-end sequencing to assemble draft genomes of two bird species (the Gunnison Sage-Grouse, Centrocercus minimus, and the Clark's Nutcracker, Nucifraga columbiana). We used these data to estimate de novo genome assemblies and reference-guided assemblies, and compared the information content and completeness of these assemblies by comparing CEGMA gene set representation, repeat element content, simple sequence repeat content, and GC isochore structure among assemblies. Our results demonstrate that even lower-coverage genome sequencing projects are capable of producing informative and useful genomic resources, particularly through the use of reference-guided assemblies.

  10. Draft De Novo Transcriptome of the Rat Kangaroo Potorous tridactylus as a Tool for Cell Biology

    PubMed Central

    Udy, Dylan B.; Voorhies, Mark; Chan, Patricia P.; Lowe, Todd M.; Dumont, Sophie

    2015-01-01

    The rat kangaroo (long-nosed potoroo, Potorous tridactylus) is a marsupial native to Australia. Cultured rat kangaroo kidney epithelial cells (PtK) are commonly used to study cell biological processes. These mammalian cells are large, adherent, and flat, and contain large and few chromosomes—and are thus ideal for imaging intra-cellular dynamics such as those of mitosis. Despite this, neither the rat kangaroo genome nor transcriptome have been sequenced, creating a challenge for probing the molecular basis of these cellular dynamics. Here, we present the sequencing, assembly and annotation of the draft rat kangaroo de novo transcriptome. We sequenced 679 million reads that mapped to 347,323 Trinity transcripts and 20,079 Unigenes. We present statistics emerging from transcriptome-wide analyses, and analyses suggesting that the transcriptome covers full-length sequences of most genes, many with multiple isoforms. We also validate our findings with a proof-of-concept gene knockdown experiment. We expect that this high quality transcriptome will make rat kangaroo cells a more tractable system for linking molecular-scale function and cellular-scale dynamics. PMID:26252667

  11. Draft De Novo Transcriptome of the Rat Kangaroo Potorous tridactylus as a Tool for Cell Biology.

    PubMed

    Udy, Dylan B; Voorhies, Mark; Chan, Patricia P; Lowe, Todd M; Dumont, Sophie

    2015-01-01

    The rat kangaroo (long-nosed potoroo, Potorous tridactylus) is a marsupial native to Australia. Cultured rat kangaroo kidney epithelial cells (PtK) are commonly used to study cell biological processes. These mammalian cells are large, adherent, and flat, and contain large and few chromosomes-and are thus ideal for imaging intra-cellular dynamics such as those of mitosis. Despite this, neither the rat kangaroo genome nor transcriptome have been sequenced, creating a challenge for probing the molecular basis of these cellular dynamics. Here, we present the sequencing, assembly and annotation of the draft rat kangaroo de novo transcriptome. We sequenced 679 million reads that mapped to 347,323 Trinity transcripts and 20,079 Unigenes. We present statistics emerging from transcriptome-wide analyses, and analyses suggesting that the transcriptome covers full-length sequences of most genes, many with multiple isoforms. We also validate our findings with a proof-of-concept gene knockdown experiment. We expect that this high quality transcriptome will make rat kangaroo cells a more tractable system for linking molecular-scale function and cellular-scale dynamics.

  12. Draft genome sequence of Coniochaeta ligniaria NRRL 30616, a lignocellulolytic fungus for bioabatement of inhibitors in plant biomass hydrolysates

    USDA-ARS?s Scientific Manuscript database

    Here, we report the first draft genome sequence (42.38 Mb that contains 13,657 genes) of Coniochaeta ligniaria NRRL30616, an ascomycete with high biotechnological relevance in the bioenergy field given its high potential for bioabatement of toxic furanic compounds in plant biomass hydrolysates and i...

  13. High-quality-draft genome sequence of the fermenting bacterium Anaerobium acetethylicum type strain GluBS11T (DSM 29698)

    DOE PAGES

    Patil, Yogita; Müller, Nicolai; Schink, Bernhard; ...

    2017-02-20

    Anaerobium acetethylicum strain GluBS11 T belongs to the family Lachnospiraceae within the order Clostridiales. It is a Gram-positive, non-motile and strictly anaerobic bacterium isolated from biogas slurry that was originally enriched with gluconate as carbon source (Patil, et al., Int J Syst Evol Microbiol 65:3289-3296, 2015). Here we describe the draft genome sequence of strain GluBS11 T and provide a detailed insight into its physiological and metabolic features. The draft genome sequence generated 4,609,043 bp, distributed among 105 scaffolds assembled using the SPAdes genome assembler method. It comprises in total 4,132 genes, of which 4,008 were predicted to be proteinmore » coding genes, 124 RNA genes and 867 pseudogenes. The content was 43.51 mol %. The annotated genome of strain GluBS11 T contains putative genes coding for the pentose phosphate pathway, the Embden-Meyerhoff-Parnas pathway, the Entner-Doudoroff pathway and the tricarboxylic acid cycle. The genome revealed the presence of most of the necessary genes required for the fermentation of glucose and gluconate to acetate, ethanol, and hydrogen gas. However, a candidate gene for production of formate was not identified.« less

  14. High-quality-draft genome sequence of the fermenting bacterium Anaerobium acetethylicum type strain GluBS11T (DSM 29698)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Patil, Yogita; Müller, Nicolai; Schink, Bernhard

    Anaerobium acetethylicum strain GluBS11 T belongs to the family Lachnospiraceae within the order Clostridiales. It is a Gram-positive, non-motile and strictly anaerobic bacterium isolated from biogas slurry that was originally enriched with gluconate as carbon source (Patil, et al., Int J Syst Evol Microbiol 65:3289-3296, 2015). Here we describe the draft genome sequence of strain GluBS11 T and provide a detailed insight into its physiological and metabolic features. The draft genome sequence generated 4,609,043 bp, distributed among 105 scaffolds assembled using the SPAdes genome assembler method. It comprises in total 4,132 genes, of which 4,008 were predicted to be proteinmore » coding genes, 124 RNA genes and 867 pseudogenes. The content was 43.51 mol %. The annotated genome of strain GluBS11 T contains putative genes coding for the pentose phosphate pathway, the Embden-Meyerhoff-Parnas pathway, the Entner-Doudoroff pathway and the tricarboxylic acid cycle. The genome revealed the presence of most of the necessary genes required for the fermentation of glucose and gluconate to acetate, ethanol, and hydrogen gas. However, a candidate gene for production of formate was not identified.« less

  15. The draft genome sequence of cork oak

    PubMed Central

    Ramos, António Marcos; Usié, Ana; Barbosa, Pedro; Barros, Pedro M.; Capote, Tiago; Chaves, Inês; Simões, Fernanda; Abreu, Isabl; Carrasquinho, Isabel; Faro, Carlos; Guimarães, Joana B.; Mendonça, Diogo; Nóbrega, Filomena; Rodrigues, Leandra; Saibo, Nelson J. M.; Varela, Maria Carolina; Egas, Conceição; Matos, José; Miguel, Célia M.; Oliveira, M. Margarida; Ricardo, Cândido P.; Gonçalves, Sónia

    2018-01-01

    Cork oak (Quercus suber) is native to southwest Europe and northwest Africa where it plays a crucial environmental and economical role. To tackle the cork oak production and industrial challenges, advanced research is imperative but dependent on the availability of a sequenced genome. To address this, we produced the first draft version of the cork oak genome. We followed a de novo assembly strategy based on high-throughput sequence data, which generated a draft genome comprising 23,347 scaffolds and 953.3 Mb in size. A total of 79,752 genes and 83,814 transcripts were predicted, including 33,658 high-confidence genes. An InterPro signature assignment was detected for 69,218 transcripts, which represented 82.6% of the total. Validation studies demonstrated the genome assembly and annotation completeness and highlighted the usefulness of the draft genome for read mapping of high-throughput sequence data generated using different protocols. All data generated is available through the public databases where it was deposited, being therefore ready to use by the academic and industry communities working on cork oak and/or related species. PMID:29786699

  16. The draft genome sequence of cork oak.

    PubMed

    Ramos, António Marcos; Usié, Ana; Barbosa, Pedro; Barros, Pedro M; Capote, Tiago; Chaves, Inês; Simões, Fernanda; Abreu, Isabl; Carrasquinho, Isabel; Faro, Carlos; Guimarães, Joana B; Mendonça, Diogo; Nóbrega, Filomena; Rodrigues, Leandra; Saibo, Nelson J M; Varela, Maria Carolina; Egas, Conceição; Matos, José; Miguel, Célia M; Oliveira, M Margarida; Ricardo, Cândido P; Gonçalves, Sónia

    2018-05-22

    Cork oak (Quercus suber) is native to southwest Europe and northwest Africa where it plays a crucial environmental and economical role. To tackle the cork oak production and industrial challenges, advanced research is imperative but dependent on the availability of a sequenced genome. To address this, we produced the first draft version of the cork oak genome. We followed a de novo assembly strategy based on high-throughput sequence data, which generated a draft genome comprising 23,347 scaffolds and 953.3 Mb in size. A total of 79,752 genes and 83,814 transcripts were predicted, including 33,658 high-confidence genes. An InterPro signature assignment was detected for 69,218 transcripts, which represented 82.6% of the total. Validation studies demonstrated the genome assembly and annotation completeness and highlighted the usefulness of the draft genome for read mapping of high-throughput sequence data generated using different protocols. All data generated is available through the public databases where it was deposited, being therefore ready to use by the academic and industry communities working on cork oak and/or related species.

  17. Community-led comparative genomic and phenotypic analysis of the aquaculture pathogen Pseudomonas baetica a390T sequenced by Ion semiconductor and Nanopore technologies

    PubMed Central

    Beaton, Ainsley; Lood, Cédric; Cunningham-Oakes, Edward; MacFadyen, Alison; Mullins, Alex J; Bestawy, Walid El; Botelho, João; Chevalier, Sylvie; Dalzell, Chloe; Dolan, Stephen K; Faccenda, Alberto; Ghequire, Maarten G K; Higgins, Steven; Kutschera, Alexander; Murray, Jordan; Redway, Martha; Salih, Talal; Smith, Brian A; Smits, Nathan; Thomson, Ryan; Woodcock, Stuart; Cornelis, Pierre; Lavigne, Rob; van Noort, Vera

    2018-01-01

    Abstract Pseudomonas baetica strain a390T is the type strain of this recently described species and here we present its high-contiguity draft genome. To celebrate the 16th International Conference on Pseudomonas, the genome of P. baetica strain a390T was sequenced using a unique combination of Ion Torrent semiconductor and Oxford Nanopore methods as part of a collaborative community-led project. The use of high-quality Ion Torrent sequences with long Nanopore reads gave rapid, high-contiguity and -quality, 16-contig genome sequence. Whole genome phylogenetic analysis places P. baetica within the P. koreensis clade of the P. fluorescens group. Comparison of the main genomic features of P. baetica with a variety of other Pseudomonas spp. suggests that it is a highly adaptable organism, typical of the genus. This strain was originally isolated from the liver of a diseased wedge sole fish, and genotypic and phenotypic analyses show that it is tolerant to osmotic stress and to oxytetracycline. PMID:29579234

  18. A physical map of the bovine genome

    PubMed Central

    Snelling, Warren M; Chiu, Readman; Schein, Jacqueline E; Hobbs, Matthew; Abbey, Colette A; Adelson, David L; Aerts, Jan; Bennett, Gary L; Bosdet, Ian E; Boussaha, Mekki; Brauning, Rudiger; Caetano, Alexandre R; Costa, Marcos M; Crawford, Allan M; Dalrymple, Brian P; Eggen, André; Everts-van der Wind, Annelie; Floriot, Sandrine; Gautier, Mathieu; Gill, Clare A; Green, Ronnie D; Holt, Robert; Jann, Oliver; Jones, Steven JM; Kappes, Steven M; Keele, John W; de Jong, Pieter J; Larkin, Denis M; Lewin, Harris A; McEwan, John C; McKay, Stephanie; Marra, Marco A; Mathewson, Carrie A; Matukumalli, Lakshmi K; Moore, Stephen S; Murdoch, Brenda; Nicholas, Frank W; Osoegawa, Kazutoyo; Roy, Alice; Salih, Hanni; Schibler, Laurent; Schnabel, Robert D; Silveri, Licia; Skow, Loren C; Smith, Timothy PL; Sonstegard, Tad S; Taylor, Jeremy F; Tellam, Ross; Van Tassell, Curtis P; Williams, John L; Womack, James E; Wye, Natasja H; Yang, George; Zhao, Shaying

    2007-01-01

    Background Cattle are important agriculturally and relevant as a model organism. Previously described genetic and radiation hybrid (RH) maps of the bovine genome have been used to identify genomic regions and genes affecting specific traits. Application of these maps to identify influential genetic polymorphisms will be enhanced by integration with each other and with bacterial artificial chromosome (BAC) libraries. The BAC libraries and clone maps are essential for the hybrid clone-by-clone/whole-genome shotgun sequencing approach taken by the bovine genome sequencing project. Results A bovine BAC map was constructed with HindIII restriction digest fragments of 290,797 BAC clones from animals of three different breeds. Comparative mapping of 422,522 BAC end sequences assisted with BAC map ordering and assembly. Genotypes and pedigree from two genetic maps and marker scores from three whole-genome RH panels were consolidated on a 17,254-marker composite map. Sequence similarity allowed integrating the BAC and composite maps with the bovine draft assembly (Btau3.1), establishing a comprehensive resource describing the bovine genome. Agreement between the marker and BAC maps and the draft assembly is high, although discrepancies exist. The composite and BAC maps are more similar than either is to the draft assembly. Conclusion Further refinement of the maps and greater integration into the genome assembly process may contribute to a high quality assembly. The maps provide resources to associate phenotypic variation with underlying genomic variation, and are crucial resources for understanding the biology underpinning this important ruminant species so closely associated with humans. PMID:17697342

  19. High quality permanent draft genome sequence of Phaseolibacter flectens ATCC 12775 T, a plant pathogen of French bean pods

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Aizenberg-Gershtein, Yana; Izhaki, Ido; Lapidus, Alla

    We report that the Phaseolibacter flectens strain ATCC 12775 T (Halpern et al., Int J Syst Evol Microbiol 63:268–273, 2013) is a Gram-negative, rod shaped, motile, aerobic, chemoorganotroph bacterium. Ph. flectens is as a plant-pathogenic bacterium on pods of French bean and was first identified by Johnson (1956) as Pseudomonas flectens. After its phylogenetic position was reexamined, Pseudomonas flectens was transferred to the family Enterobacteriaceae as Phaseolibacter flectens gen. nov., comb. nov. Here we describe the features of this organism, together with the draft genome sequence and annotation. The DNA GC content is 44.34 mol%. The chromosome length is 2,748,442more » bp. It encodes 2,437 proteins and 89 RNA genes. Ph. flectens genome is part of the Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes study.« less

  20. High quality draft genome sequence and analysis of Pontibacter roseus type strain SRC-1T (DSM 17521T) isolated from muddy waters of a drainage system in Chandigarh, India

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mukherjee, Supratim; Lapidus, Alla; Shapiro, Nicole

    2015-01-01

    Pontibacter roseus Suresh et al 2006 is a member of genus Pontibacter family Cytophagaceae, class Cytophagia. While the type species of the genus Pontibacter actiniarum was isolated in 2005 from a marine environment, subsequent species of the same genus have been found in different types of habitats ranging from seawater, sediment, desert soil, rhizosphere, contaminated sites, solar saltern and muddy water. Here we describe the features of Pontibacter roseus strain SRC-1T along with its complete genome sequence and annotation from a culture of DSM 17521T. The 4,581,480 bp long draft genome consists of 12 scaffolds with 4,003 protein-coding and 50more » RNA genes and is a part of Genomic encyclopedia of Type Strains, Phase I: the one thousand microbial genomes (KMG-I) project.« less

  1. High quality permanent draft genome sequence of Phaseolibacter flectens ATCC 12775 T, a plant pathogen of French bean pods

    DOE PAGES

    Aizenberg-Gershtein, Yana; Izhaki, Ido; Lapidus, Alla; ...

    2016-01-13

    We report that the Phaseolibacter flectens strain ATCC 12775 T (Halpern et al., Int J Syst Evol Microbiol 63:268–273, 2013) is a Gram-negative, rod shaped, motile, aerobic, chemoorganotroph bacterium. Ph. flectens is as a plant-pathogenic bacterium on pods of French bean and was first identified by Johnson (1956) as Pseudomonas flectens. After its phylogenetic position was reexamined, Pseudomonas flectens was transferred to the family Enterobacteriaceae as Phaseolibacter flectens gen. nov., comb. nov. Here we describe the features of this organism, together with the draft genome sequence and annotation. The DNA GC content is 44.34 mol%. The chromosome length is 2,748,442more » bp. It encodes 2,437 proteins and 89 RNA genes. Ph. flectens genome is part of the Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes study.« less

  2. High quality draft genome sequence and analysis of Pontibacter roseus type strain SRC-1T (DSM 17521T) isolated from muddy waters of a drainage system in Chandigarh, India

    DOE PAGES

    Mukherjee, Supratim; Lapidus, Alla; Shapiro, Nicole; ...

    2015-02-09

    Pontibacter roseus is a member of genus Pontibacter family Cytophagaceae, class Cytophagia. While the type species of the genus Pontibacter actiniarum was isolated in 2005 from a marine environment, subsequent species of the same genus have been found in different types of habitats ranging from seawater, sediment, desert soil, rhizosphere, contaminated sites, solar saltern and muddy water. Here we describe the features of Pontibacter roseus strain SRC-1 T along with its complete genome sequence and annotation from a culture of DSM 17521 T. In conclusion, the 4,581,480 bp long draft genome consists of 12 scaffolds with 4,003 protein-coding and 50more » RNA genes and is a part of Genomic Encyclopedia of Type Strains: KMG-I project.« less

  3. Whole Genome Complete Resequencing of Bacillus subtilis Natto by Combining Long Reads with High-Quality Short Reads

    PubMed Central

    Kamada, Mayumi; Hase, Sumitaka; Sato, Kengo; Toyoda, Atsushi; Fujiyama, Asao; Sakakibara, Yasubumi

    2014-01-01

    De novo microbial genome sequencing reached a turning point with third-generation sequencing (TGS) platforms, and several microbial genomes have been improved by TGS long reads. Bacillus subtilis natto is closely related to the laboratory standard strain B. subtilis Marburg 168, and it has a function in the production of the traditional Japanese fermented food “natto.” The B. subtilis natto BEST195 genome was previously sequenced with short reads, but it included some incomplete regions. We resequenced the BEST195 genome using a PacBio RS sequencer, and we successfully obtained a complete genome sequence from one scaffold without any gaps, and we also applied Illumina MiSeq short reads to enhance quality. Compared with the previous BEST195 draft genome and Marburg 168 genome, we found that incomplete regions in the previous genome sequence were attributed to GC-bias and repetitive sequences, and we also identified some novel genes that are found only in the new genome. PMID:25329997

  4. The Draft Genome Sequence of a Novel High-Efficient Butanol-Producing Bacterium Clostridium Diolis Strain WST.

    PubMed

    Chen, Chaoyang; Sun, Chongran; Wu, Yi-Rui

    2018-03-21

    A wild-type solventogenic strain Clostridium diolis WST, isolated from mangrove sediments, was characterized to produce high amount of butanol and acetone with negligible level of ethanol and acids from glucose via a unique acetone-butanol (AB) fermentation pathway. Through the genomic sequencing, the assembled draft genome of strain WST is calculated to be 5.85 Mb with a GC content of 29.69% and contains 5263 genes that contribute to the annotation of 5049 protein-coding sequences. Within these annotated genes, the butanol dehydrogenase gene (bdh) was determined to be in a higher amount from strain WST compared to other Clostridial strains, which is positively related to its high-efficient production of butanol. Therefore, we present a draft genome sequence analysis of strain WST in this article that should facilitate to further understand the solventogenic mechanism of this special microorganism.

  5. The Genomics Education Partnership: Successful Integration of Research into Laboratory Classes at a Diverse Group of Undergraduate Institutions

    PubMed Central

    Shaffer, Christopher D.; Alvarez, Consuelo; Bailey, Cheryl; Barnard, Daron; Bhalla, Satish; Chandrasekaran, Chitra; Chandrasekaran, Vidya; Chung, Hui-Min; Dorer, Douglas R.; Du, Chunguang; Eckdahl, Todd T.; Poet, Jeff L.; Frohlich, Donald; Goodman, Anya L.; Gosser, Yuying; Hauser, Charles; Hoopes, Laura L.M.; Johnson, Diana; Jones, Christopher J.; Kaehler, Marian; Kokan, Nighat; Kopp, Olga R.; Kuleck, Gary A.; McNeil, Gerard; Moss, Robert; Myka, Jennifer L.; Nagengast, Alexis; Morris, Robert; Overvoorde, Paul J.; Shoop, Elizabeth; Parrish, Susan; Reed, Kelynne; Regisford, E. Gloria; Revie, Dennis; Rosenwald, Anne G.; Saville, Ken; Schroeder, Stephanie; Shaw, Mary; Skuse, Gary; Smith, Christopher; Smith, Mary; Spana, Eric P.; Spratt, Mary; Stamm, Joyce; Thompson, Jeff S.; Wawersik, Matthew; Wilson, Barbara A.; Youngblom, Jim; Leung, Wilson; Buhler, Jeremy; Mardis, Elaine R.; Lopatto, David

    2010-01-01

    Genomics is not only essential for students to understand biology but also provides unprecedented opportunities for undergraduate research. The goal of the Genomics Education Partnership (GEP), a collaboration between a growing number of colleges and universities around the country and the Department of Biology and Genome Center of Washington University in St. Louis, is to provide such research opportunities. Using a versatile curriculum that has been adapted to many different class settings, GEP undergraduates undertake projects to bring draft-quality genomic sequence up to high quality and/or participate in the annotation of these sequences. GEP undergraduates have improved more than 2 million bases of draft genomic sequence from several species of Drosophila and have produced hundreds of gene models using evidence-based manual annotation. Students appreciate their ability to make a contribution to ongoing research, and report increased independence and a more active learning approach after participation in GEP projects. They show knowledge gains on pre- and postcourse quizzes about genes and genomes and in bioinformatic analysis. Participating faculty also report professional gains, increased access to genomics-related technology, and an overall positive experience. We have found that using a genomics research project as the core of a laboratory course is rewarding for both faculty and students. PMID:20194808

  6. Draft Genome Sequence of the Algicidal Bacterium Mangrovimonas yunxiaonensis Strain LY01

    PubMed Central

    Li, Yi; Zhu, Hong; Li, Chongping; Zhang, Huajun; Chen, Zhangran; Zheng, Wei

    2014-01-01

    Mangrovimonas yunxiaonensis LY01, a novel bacterium isolated from mangrove sediment, showed high algicidal effects on harmful algal blooms of Alexandrium tamarense. Here, we present the first draft genome sequence of this strain to further understanding of the functional genes related to algicidal activity. PMID:25428978

  7. Draft Genome Sequence of Thermotoga maritima A7A Reconstructed from Metagenomic Sequencing Analysis of a Hydrocarbon Reservoir in the Bass Strait, Australia

    PubMed Central

    Sutcliffe, Brodie; Rosewarne, Carly P.; Greenfield, Paul; Li, Dongmei

    2013-01-01

    The draft genome sequence of Thermotoga maritima A7A was obtained from a metagenomic assembly obtained from a high-temperature hydrocarbon reservoir in the Gippsland Basin, Australia. The organism is predicted to be a motile anaerobe with an array of catabolic enzymes for the degradation of numerous carbohydrates. PMID:24009120

  8. Extensive Error in the Number of Genes Inferred from Draft Genome Assemblies

    PubMed Central

    Denton, James F.; Lugo-Martinez, Jose; Tucker, Abraham E.; Schrider, Daniel R.; Warren, Wesley C.; Hahn, Matthew W.

    2014-01-01

    Current sequencing methods produce large amounts of data, but genome assemblies based on these data are often woefully incomplete. These incomplete and error-filled assemblies result in many annotation errors, especially in the number of genes present in a genome. In this paper we investigate the magnitude of the problem, both in terms of total gene number and the number of copies of genes in specific families. To do this, we compare multiple draft assemblies against higher-quality versions of the same genomes, using several new assemblies of the chicken genome based on both traditional and next-generation sequencing technologies, as well as published draft assemblies of chimpanzee. We find that upwards of 40% of all gene families are inferred to have the wrong number of genes in draft assemblies, and that these incorrect assemblies both add and subtract genes. Using simulated genome assemblies of Drosophila melanogaster, we find that the major cause of increased gene numbers in draft genomes is the fragmentation of genes onto multiple individual contigs. Finally, we demonstrate the usefulness of RNA-Seq in improving the gene annotation of draft assemblies, largely by connecting genes that have been fragmented in the assembly process. PMID:25474019

  9. Extensive error in the number of genes inferred from draft genome assemblies.

    PubMed

    Denton, James F; Lugo-Martinez, Jose; Tucker, Abraham E; Schrider, Daniel R; Warren, Wesley C; Hahn, Matthew W

    2014-12-01

    Current sequencing methods produce large amounts of data, but genome assemblies based on these data are often woefully incomplete. These incomplete and error-filled assemblies result in many annotation errors, especially in the number of genes present in a genome. In this paper we investigate the magnitude of the problem, both in terms of total gene number and the number of copies of genes in specific families. To do this, we compare multiple draft assemblies against higher-quality versions of the same genomes, using several new assemblies of the chicken genome based on both traditional and next-generation sequencing technologies, as well as published draft assemblies of chimpanzee. We find that upwards of 40% of all gene families are inferred to have the wrong number of genes in draft assemblies, and that these incorrect assemblies both add and subtract genes. Using simulated genome assemblies of Drosophila melanogaster, we find that the major cause of increased gene numbers in draft genomes is the fragmentation of genes onto multiple individual contigs. Finally, we demonstrate the usefulness of RNA-Seq in improving the gene annotation of draft assemblies, largely by connecting genes that have been fragmented in the assembly process.

  10. Draft Genome Sequence of Mycobacterium chimaera Type Strain Fl-0169

    EPA Science Inventory

    We report the draft genome sequence of the type strain Mycobacterium chimaera Fl-0169T, a member of the Mycobacterium avium complex (MAC). M. chimaera Fl-0169T was isolated from a patient in Italy and is highly similar to strains of M. chimaera isolated in Ireland, though Fl-016...

  11. Draft Genome Sequence of Streptomyces specialis Type Strain GW41-1564 (DSM 41924).

    PubMed

    Loucif, Lotfi; Michelle, Caroline; Terras, Jérôme; Rolain, Jean-Marc; Raoult, Didier; Fournier, Pierre-Edouard

    2017-03-30

    Here, we report the draft genome sequence of Streptomyces specialis type strain GW41-1564, which was isolated from soil. This 5.87-Mb genome exhibits a high G+C content of 72.72% and contains 5,486 protein-coding genes. Copyright © 2017 Loucif et al.

  12. Draft Genome Sequence of the Algicidal Bacterium Mangrovimonas yunxiaonensis Strain LY01.

    PubMed

    Li, Yi; Zhu, Hong; Li, Chongping; Zhang, Huajun; Chen, Zhangran; Zheng, Wei; Xu, Hong; Zheng, Tianling

    2014-11-26

    Mangrovimonas yunxiaonensis LY01, a novel bacterium isolated from mangrove sediment, showed high algicidal effects on harmful algal blooms of Alexandrium tamarense. Here, we present the first draft genome sequence of this strain to further understanding of the functional genes related to algicidal activity. Copyright © 2014 Li et al.

  13. Draft genome of the protandrous Chinese black porgy, Acanthopagrus schlegelii.

    PubMed

    Zhang, Zhiyong; Zhang, Kai; Chen, Shuyin; Zhang, Zhiwei; Zhang, Jinyong; You, Xinxin; Bian, Chao; Xu, Jin; Jia, Chaofeng; Qiang, Jun; Zhu, Fei; Li, Hongxia; Liu, Hailin; Shen, Dehua; Ren, Zhonghong; Chen, Jieming; Li, Jia; Gao, Tianheng; Gu, Ruobo; Xu, Junmin; Shi, Qiong; Xu, Pao

    2018-04-01

    As one of the most popular and valuable commercial marine fishes in China and East Asian countries, the Chinese black porgy (Acanthopagrus schlegelii), also known as the blackhead seabream, has some attractive characteristics such as fast growth rate, good meat quality, resistance to diseases, and excellent adaptability to various environments. Furthermore, the black porgy is a good model for investigating sex changes in fish due to its protandrous hermaphroditism. Here, we obtained a high-quality genome assembly of this interesting teleost species and performed a genomic survey on potential genes associated with the sex-change phenomenon. We generated 175.4 gigabases (Gb) of clean sequence reads using a whole-genome shotgun sequencing strategy. The final genome assembly is approximately 688.1 megabases (Mb), accounting for 93% of the estimated genome size (739.6 Mb). The achieved scaffold N50 is 7.6 Mb, reaching a relatively high level among sequenced fish species. We identified 19 465 protein-coding genes, which had an average transcript length of 17.3 kb. By performing a comparative genomic analysis, we found 3 types of genes potentially associated with sex change, which are useful for studying the genetic basis of the protandrous hermaphroditism. We provide a draft genome assembly of the Chinese black porgy and discuss the potential genetic mechanisms of sex change. These data are also an important resource for studying the biology and for facilitating breeding of this economically important fish.

  14. Draft Genome Sequence of Sporolactobacillus inulinus Strain CASD, an Efficient d-Lactic Acid-Producing Bacterium with High-Concentration Lactate Tolerance Capability

    PubMed Central

    Yu, Bo; Su, Fei; Wang, Limin; Xu, Ke; Zhao, Bo; Xu, Ping

    2011-01-01

    Sporolactobacillus inulinus CASD is an efficient d-lactic acid producer with high optical purity. Here we report for the first time the draft genome sequence of S. inulinus (2,930,096 bp). The large number of annotated two-component system genes makes it possible to explore the mechanism of extraordinary lactate tolerance of S. inulinus CASD. PMID:21952540

  15. Draft genome sequence of Sporolactobacillus inulinus strain CASD, an efficient D-lactic acid-producing bacterium with high-concentration lactate tolerance capability.

    PubMed

    Yu, Bo; Su, Fei; Wang, Limin; Xu, Ke; Zhao, Bo; Xu, Ping

    2011-10-01

    Sporolactobacillus inulinus CASD is an efficient D-lactic acid producer with high optical purity. Here we report for the first time the draft genome sequence of S. inulinus (2,930,096 bp). The large number of annotated two-component system genes makes it possible to explore the mechanism of extraordinary lactate tolerance of S. inulinus CASD.

  16. Two Low Coverage Bird Genomes and a Comparison of Reference-Guided versus De Novo Genome Assemblies

    PubMed Central

    Card, Daren C.; Schield, Drew R.; Reyes-Velasco, Jacobo; Fujita, Matthew K.; Andrew, Audra L.; Oyler-McCance, Sara J.; Fike, Jennifer A.; Tomback, Diana F.; Ruggiero, Robert P.; Castoe, Todd A.

    2014-01-01

    As a greater number and diversity of high-quality vertebrate reference genomes become available, it is increasingly feasible to use these references to guide new draft assemblies for related species. Reference-guided assembly approaches may substantially increase the contiguity and completeness of a new genome using only low levels of genome coverage that might otherwise be insufficient for de novo genome assembly. We used low-coverage (∼3.5–5.5x) Illumina paired-end sequencing to assemble draft genomes of two bird species (the Gunnison Sage-Grouse, Centrocercus minimus, and the Clark's Nutcracker, Nucifraga columbiana). We used these data to estimate de novo genome assemblies and reference-guided assemblies, and compared the information content and completeness of these assemblies by comparing CEGMA gene set representation, repeat element content, simple sequence repeat content, and GC isochore structure among assemblies. Our results demonstrate that even lower-coverage genome sequencing projects are capable of producing informative and useful genomic resources, particularly through the use of reference-guided assemblies. PMID:25192061

  17. The Transcriptomics of Secondary Growth and Wood Formation in Conifers

    PubMed Central

    Carvalho, Ana; Paiva, Jorge; Louzada, José; Lima-Brito, José

    2013-01-01

    In the last years, forestry scientists have adapted genomics and next-generation sequencing (NGS) technologies to the search for candidate genes related to the transcriptomics of secondary growth and wood formation in several tree species. Gymnosperms, in particular, the conifers, are ecologically and economically important, namely, for the production of wood and other forestry end products. Until very recently, no whole genome sequencing of a conifer genome was available. Due to the gradual improvement of the NGS technologies and inherent bioinformatics tools, two draft assemblies of the whole genomes sequence of Picea abies and Picea glauca arose in the current year. These draft genome assemblies will bring new insights about the structure, content, and evolution of the conifer genomes. Furthermore, new directions in the forestry, breeding and research of conifers will be discussed in the following. The identification of genes associated with the xylem transcriptome and the knowledge of their regulatory mechanisms will provide less time-consuming breeding cycles and a high accuracy for the selection of traits related to wood production and quality. PMID:24288610

  18. Draft genome sequence of a multidrug-resistant Aeromonas hydrophila ST508 strain carrying rmtD and blaCTX-M-131 isolated from a bloodstream infection.

    PubMed

    Moura, Quézia; Fernandes, Miriam R; Cerdeira, Louise; Santos, Ana Carolina M; de Souza, Tiago A; Ienne, Susan; Pignatari, Antonio Carlos C; Gales, Ana C; Silva, Rosa M; Lincopan, Nilton

    2017-09-01

    Here we report the draft genome sequence of a multidrug-resistant (MDR) Aeromonas hydrophila strain belonging to sequence type 508 (ST508) isolated from a human bloodstream infection. Assembly and annotation of this draft genome resulted in 5028498bp and revealed the presence of 16S rRNA methylase rmtD and bla CTX-M-131 genes encoding high-level resistance to aminoglycosides and cephalosporins, respectively, as well as multiple virulence genes. This draft genome can provide significant information for understanding mechanisms on the establishment and treatment of infections caused by this pathogen. Copyright © 2017 International Society for Chemotherapy of Infection and Cancer. Published by Elsevier Ltd. All rights reserved.

  19. Draft genome sequence of Halomonas lutea strain YIM 91125 T (DSM 23508 T) isolated from the alkaline Lake Ebinur in Northwest China

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gao, Xiao-Yang; Zhi, Xiao-Yang; Li, Hong-Wei

    Species of the genus Halomonas are halophilic and their flexible adaption to changes of salinity and temperature brings considerable potential biotechnology applications, such as degradation of organic pollutants and enzyme production. The type strain Halomonas lutea YIM 91125 T was isolated from a hypersaline lake in China. The genome of strain YIM 91125 T becomes the twelfth species sequenced in Halomonas, and the thirteenth species sequenced in Halomonadaceae. We described the features of H. lutea YIM 91125 T, together with the high quality draft genome sequence and annotation of its type strain. The 4,533,090 bp long genome of strain YIMmore » 91125 T with its 4,284 protein-coding and 84 RNA genes is a part of Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes (KMG-I) project. From the viewpoint of comparative genomics, H. lutea has a larger genome size and more specific genes, which indicated acquisition of function bringing better adaption to its environment. Finally, DDH analysis demonstrated that H. lutea is a distinctive species, and halophilic features and nitrogen metabolism related genes were discovered in its genome.« less

  20. Draft genome sequence of Halomonas lutea strain YIM 91125 T (DSM 23508 T) isolated from the alkaline Lake Ebinur in Northwest China

    DOE PAGES

    Gao, Xiao-Yang; Zhi, Xiao-Yang; Li, Hong-Wei; ...

    2015-01-20

    Species of the genus Halomonas are halophilic and their flexible adaption to changes of salinity and temperature brings considerable potential biotechnology applications, such as degradation of organic pollutants and enzyme production. The type strain Halomonas lutea YIM 91125 T was isolated from a hypersaline lake in China. The genome of strain YIM 91125 T becomes the twelfth species sequenced in Halomonas, and the thirteenth species sequenced in Halomonadaceae. We described the features of H. lutea YIM 91125 T, together with the high quality draft genome sequence and annotation of its type strain. The 4,533,090 bp long genome of strain YIMmore » 91125 T with its 4,284 protein-coding and 84 RNA genes is a part of Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes (KMG-I) project. From the viewpoint of comparative genomics, H. lutea has a larger genome size and more specific genes, which indicated acquisition of function bringing better adaption to its environment. Finally, DDH analysis demonstrated that H. lutea is a distinctive species, and halophilic features and nitrogen metabolism related genes were discovered in its genome.« less

  1. Application of long sequence reads to improve genomes for Clostridium thermocellum AD2, Clostridium thermocellum LQRI, and Pelosinus fermentans R7

    DOE PAGES

    Utturkar, Sagar M.; Bayer, Edward A.; Borovok, Ilya; ...

    2016-09-29

    Here, we and others have shown the utility of long sequence reads to improve genome assembly quality. In this study, we generated PacBio DNA sequence data to improve the assemblies of draft genomes for Clostridium thermocellum AD2, Clostridium thermocellum LQRI, and Pelosinus fermentans R7.

  2. Draft Genome Sequences of Seven Thermophilic Spore-Forming Bacteria Isolated from Foods That Produce Highly Heat-Resistant Spores, Comprising Geobacillus spp., Caldibacillus debilis, and Anoxybacillus flavithermus

    PubMed Central

    Berendsen, Erwin M.; Wells-Bennik, Marjon H. J.; Krawczyk, Antonina O.; de Jong, Anne; van Heel, Auke; Holsappel, Siger; Eijlander, Robyn T.

    2016-01-01

    Here, we report the draft genomes of five strains of Geobacillus spp., one Caldibacillus debilis strain, and one draft genome of Anoxybacillus flavithermus, all thermophilic spore-forming Gram-positive bacteria. PMID:27151781

  3. Draft Genome Sequences of Three Escherichia coli Strains with Different In Vivo Pathogenicities in an Avian (Ascending) Infection Model of the Oviduct

    PubMed Central

    Thøfner, Ida Cecilie Naundrup; Pors, Susanne Elisabeth; Christensen, Henrik; Bisgaard, Magne; Christensen, Jens Peter

    2015-01-01

    Here, we present three draft genome sequences of Escherichia coli strains that experimentally were proven to possess low (strain D2-2), intermediate (Chronic_salp), or high virulence (Cp6salp3) in an avian (ascending) infection model of the oviduct. PMID:25953185

  4. Draft genome of the red harvester ant Pogonomyrmex barbatus.

    PubMed

    Smith, Chris R; Smith, Christopher D; Robertson, Hugh M; Helmkampf, Martin; Zimin, Aleksey; Yandell, Mark; Holt, Carson; Hu, Hao; Abouheif, Ehab; Benton, Richard; Cash, Elizabeth; Croset, Vincent; Currie, Cameron R; Elhaik, Eran; Elsik, Christine G; Favé, Marie-Julie; Fernandes, Vilaiwan; Gibson, Joshua D; Graur, Dan; Gronenberg, Wulfila; Grubbs, Kirk J; Hagen, Darren E; Viniegra, Ana Sofia Ibarraran; Johnson, Brian R; Johnson, Reed M; Khila, Abderrahman; Kim, Jay W; Mathis, Kaitlyn A; Munoz-Torres, Monica C; Murphy, Marguerite C; Mustard, Julie A; Nakamura, Rin; Niehuis, Oliver; Nigam, Surabhi; Overson, Rick P; Placek, Jennifer E; Rajakumar, Rajendhran; Reese, Justin T; Suen, Garret; Tao, Shu; Torres, Candice W; Tsutsui, Neil D; Viljakainen, Lumi; Wolschin, Florian; Gadau, Jürgen

    2011-04-05

    We report the draft genome sequence of the red harvester ant, Pogonomyrmex barbatus. The genome was sequenced using 454 pyrosequencing, and the current assembly and annotation were completed in less than 1 y. Analyses of conserved gene groups (more than 1,200 manually annotated genes to date) suggest a high-quality assembly and annotation comparable to recently sequenced insect genomes using Sanger sequencing. The red harvester ant is a model for studying reproductive division of labor, phenotypic plasticity, and sociogenomics. Although the genome of P. barbatus is similar to other sequenced hymenopterans (Apis mellifera and Nasonia vitripennis) in GC content and compositional organization, and possesses a complete CpG methylation toolkit, its predicted genomic CpG content differs markedly from the other hymenopterans. Gene networks involved in generating key differences between the queen and worker castes (e.g., wings and ovaries) show signatures of increased methylation and suggest that ants and bees may have independently co-opted the same gene regulatory mechanisms for reproductive division of labor. Gene family expansions (e.g., 344 functional odorant receptors) and pseudogene accumulation in chemoreception and P450 genes compared with A. mellifera and N. vitripennis are consistent with major life-history changes during the adaptive radiation of Pogonomyrmex spp., perhaps in parallel with the development of the North American deserts.

  5. Draft Genome Sequences of Klebsiella oxytoca Isolates Originating from a Highly Contaminated Liquid Hand Soap Product.

    PubMed

    Hammerl, J A; Lasch, P; Nitsche, A; Dabrowski, P W; Hahmann, H; Wicke, A; Kleta, S; Al Dahouk, S; Dieckmann, R

    2015-07-23

    In 2013, contaminated liquid soap was detected by routine microbiological monitoring of consumer products through state health authorities. Because of its high load of Klebsiella oxytoca, the liquid soap was notified via the European Union Rapid Alert System for Dangerous Non-Food Products (EU-RAPEX) and recalled. Here, we present two draft genome sequences and a summary of their general features. Copyright © 2015 Hammerl et al.

  6. Draft Genome Sequence of the Entomopathogenic Bacterium Bacillus pumilus 15.1, a Strain Highly Toxic to the Mediterranean Fruit Fly Ceratitis capitata

    PubMed Central

    García-Ramón, Diana C.; Palma, Leopoldo; Berry, Colin; Osuna, Antonio

    2015-01-01

    We present the draft whole-genome sequence of the entomopathogenic Bacillus pumilus 15.1 strain that consists of 3,795,691 bp and 3,776 predicted protein-coding genes. This genome sequence provides the basis for understanding the potential mechanism behind the toxicity and virulence of B. pumilus 15.1 against the Mediterranean fruit fly. PMID:26404596

  7. High quality draft genome sequence of Janthinobacterium psychrotolerans sp. nov., isolated from a frozen freshwater pond.

    PubMed

    Gong, Xianzhe; Skrivergaard, Stig; Korsgaard, Benjamin Smed; Schreiber, Lars; Marshall, Ian P G; Finster, Kai; Schramm, Andreas

    2017-01-01

    Strain S3-2 T , isolated from sediment of a frozen freshwater pond, shares 99% 16S rRNA gene sequence identity with strains of the genus Janthinobacterium . Strain S3-2 T is a facultative anaerobe that lacks the ability to produce violacein but shows antibiotic resistance, psychrotolerance, incomplete denitrification, and fermentation. The draft genome of strain S3-2 T has a size of ~5.8 Mbp and contains 5,297 genes, including 115 RNA genes. Based on the phenotypic properties of the strain, the low in silico DNA-DNA hybridization (DDH) values with related genomes (<35%), and the low whole genome-based average nucleotide identity (ANI) (<86%) with other strains within the genus Janthinobacterium, we propose that strain S3-2 T is the type strain (= DSM 102223 = LMG 29653) of a new species within this genus. We propose the name Janthinobacterium psychrotolerans sp. nov. to emphasize the capability of the strain to grow at low temperatures.

  8. High-quality permanent draft genome sequence of the extremely osmotolerant diphenol degrading bacterium Halotalea alkalilenta AW-7T, and emended description of the genus Halotalea

    DOE PAGES

    Ntougias, Spyridon; Lapidus, Alla; Copeland, Alex; ...

    2015-08-13

    Members of the genus Halotalea (family Halomonadaceae) are of high significance since they can tolerate the greatest glucose and maltose concentrations ever reported for known bacteria and are involved in the degradation of industrial effluents. Here, the characteristics and the permanent-draft genome sequence and annotation of Halotalea alkalilenta AW-7T are described. The microorganism was sequenced as a part of the Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes (KMG) project at the DOE Joint Genome Institute, and it is the only strain within the genus Halotalea having its genome sequenced. The genome is 4,467,826 bp longmore » and consists of 40 scaffolds with 64.62 % average GC content. A total of 4,104 genes were predicted, comprising of 4,028 protein-coding and 76 RNA genes. Most protein-coding genes (87.79 %) were assigned to a putative function. Halotalea alkalilenta AW-7T encodes the catechol and protocatechuate degradation to β-ketoadipate via the β-ketoadipate and protocatechuate ortho-cleavage degradation pathway, and it possesses the genetic ability to detoxify fluoroacetate, cyanate and acrylonitrile. Lastly, an emended description of the genus Halotalea Ntougias et al. 2007 is also provided in order to describe the delayed fermentation ability of the type strain.« less

  9. Massively parallel whole genome amplification for single-cell sequencing using droplet microfluidics.

    PubMed

    Hosokawa, Masahito; Nishikawa, Yohei; Kogawa, Masato; Takeyama, Haruko

    2017-07-12

    Massively parallel single-cell genome sequencing is required to further understand genetic diversities in complex biological systems. Whole genome amplification (WGA) is the first step for single-cell sequencing, but its throughput and accuracy are insufficient in conventional reaction platforms. Here, we introduce single droplet multiple displacement amplification (sd-MDA), a method that enables massively parallel amplification of single cell genomes while maintaining sequence accuracy and specificity. Tens of thousands of single cells are compartmentalized in millions of picoliter droplets and then subjected to lysis and WGA by passive droplet fusion in microfluidic channels. Because single cells are isolated in compartments, their genomes are amplified to saturation without contamination. This enables the high-throughput acquisition of contamination-free and cell specific sequence reads from single cells (21,000 single-cells/h), resulting in enhancement of the sequence data quality compared to conventional methods. This method allowed WGA of both single bacterial cells and human cancer cells. The obtained sequencing coverage rivals those of conventional techniques with superior sequence quality. In addition, we also demonstrate de novo assembly of uncultured soil bacteria and obtain draft genomes from single cell sequencing. This sd-MDA is promising for flexible and scalable use in single-cell sequencing.

  10. Draft Genome Sequences of Two Protease-Producing Strains of Arsukibacterium, Isolated from Two Cold and Alkaline Environments

    PubMed Central

    Lylloff, Jeanette E.; Hansen, Lea B. S.; Jepsen, Morten; Hallin, Peter F.; Sørensen, Søren J.; Glaring, Mikkel A.

    2015-01-01

    Arsukibacterium ikkense GCM72T and a close relative, Arsukibacterium sp. MJ3, were isolated from two cold and alkaline environments as producers of extracellular proteolytic enzymes active at high pH and low temperature. This report describes the two draft genome sequences, which may serve as sources of future industrial enzymes. PMID:26044431

  11. Draft Genome Sequences of Three Escherichia coli Strains with Different In Vivo Pathogenicities in an Avian (Ascending) Infection Model of the Oviduct.

    PubMed

    Olsen, Rikke Heidemann; Thøfner, Ida Cecilie Naundrup; Pors, Susanne Elisabeth; Christensen, Henrik; Bisgaard, Magne; Christensen, Jens Peter

    2015-05-07

    Here, we present three draft genome sequences of Escherichia coli strains that experimentally were proven to possess low (strain D2-2), intermediate (Chronic_salp), or high virulence (Cp6salp3) in an avian (ascending) infection model of the oviduct. Copyright © 2015 Olsen et al.

  12. Draft Genome Sequence of Corynebacterium kefirresidentii SB, Isolated from Kefir.

    PubMed

    Blasche, Sonja; Kim, Yongkyu; Patil, Kiran R

    2017-09-14

    The genus Corynebacterium includes Gram-positive species with a high G+C content. We report here a novel species, Corynebacterium kefirresidentii SB, isolated from kefir grains collected in Germany. Its draft genome sequence was remarkably dissimilar (average nucleotide identity, 76.54%) to those of other Corynebacterium spp., confirming that this is a unique novel species. Copyright © 2017 Blasche et al.

  13. Opera: reconstructing optimal genomic scaffolds with high-throughput paired-end sequences.

    PubMed

    Gao, Song; Sung, Wing-Kin; Nagarajan, Niranjan

    2011-11-01

    Scaffolding, the problem of ordering and orienting contigs, typically using paired-end reads, is a crucial step in the assembly of high-quality draft genomes. Even as sequencing technologies and mate-pair protocols have improved significantly, scaffolding programs still rely on heuristics, with no guarantees on the quality of the solution. In this work, we explored the feasibility of an exact solution for scaffolding and present a first tractable solution for this problem (Opera). We also describe a graph contraction procedure that allows the solution to scale to large scaffolding problems and demonstrate this by scaffolding several large real and synthetic datasets. In comparisons with existing scaffolders, Opera simultaneously produced longer and more accurate scaffolds demonstrating the utility of an exact approach. Opera also incorporates an exact quadratic programming formulation to precisely compute gap sizes (Availability: http://sourceforge.net/projects/operasf/ ).

  14. Opera: Reconstructing Optimal Genomic Scaffolds with High-Throughput Paired-End Sequences

    PubMed Central

    Gao, Song; Sung, Wing-Kin

    2011-01-01

    Abstract Scaffolding, the problem of ordering and orienting contigs, typically using paired-end reads, is a crucial step in the assembly of high-quality draft genomes. Even as sequencing technologies and mate-pair protocols have improved significantly, scaffolding programs still rely on heuristics, with no guarantees on the quality of the solution. In this work, we explored the feasibility of an exact solution for scaffolding and present a first tractable solution for this problem (Opera). We also describe a graph contraction procedure that allows the solution to scale to large scaffolding problems and demonstrate this by scaffolding several large real and synthetic datasets. In comparisons with existing scaffolders, Opera simultaneously produced longer and more accurate scaffolds demonstrating the utility of an exact approach. Opera also incorporates an exact quadratic programming formulation to precisely compute gap sizes (Availability: http://sourceforge.net/projects/operasf/). PMID:21929371

  15. Genome sequence, comparative analysis and haplotype structure of the domestic dog.

    PubMed

    Lindblad-Toh, Kerstin; Wade, Claire M; Mikkelsen, Tarjei S; Karlsson, Elinor K; Jaffe, David B; Kamal, Michael; Clamp, Michele; Chang, Jean L; Kulbokas, Edward J; Zody, Michael C; Mauceli, Evan; Xie, Xiaohui; Breen, Matthew; Wayne, Robert K; Ostrander, Elaine A; Ponting, Chris P; Galibert, Francis; Smith, Douglas R; DeJong, Pieter J; Kirkness, Ewen; Alvarez, Pablo; Biagi, Tara; Brockman, William; Butler, Jonathan; Chin, Chee-Wye; Cook, April; Cuff, James; Daly, Mark J; DeCaprio, David; Gnerre, Sante; Grabherr, Manfred; Kellis, Manolis; Kleber, Michael; Bardeleben, Carolyne; Goodstadt, Leo; Heger, Andreas; Hitte, Christophe; Kim, Lisa; Koepfli, Klaus-Peter; Parker, Heidi G; Pollinger, John P; Searle, Stephen M J; Sutter, Nathan B; Thomas, Rachael; Webber, Caleb; Baldwin, Jennifer; Abebe, Adal; Abouelleil, Amr; Aftuck, Lynne; Ait-Zahra, Mostafa; Aldredge, Tyler; Allen, Nicole; An, Peter; Anderson, Scott; Antoine, Claudel; Arachchi, Harindra; Aslam, Ali; Ayotte, Laura; Bachantsang, Pasang; Barry, Andrew; Bayul, Tashi; Benamara, Mostafa; Berlin, Aaron; Bessette, Daniel; Blitshteyn, Berta; Bloom, Toby; Blye, Jason; Boguslavskiy, Leonid; Bonnet, Claude; Boukhgalter, Boris; Brown, Adam; Cahill, Patrick; Calixte, Nadia; Camarata, Jody; Cheshatsang, Yama; Chu, Jeffrey; Citroen, Mieke; Collymore, Alville; Cooke, Patrick; Dawoe, Tenzin; Daza, Riza; Decktor, Karin; DeGray, Stuart; Dhargay, Norbu; Dooley, Kimberly; Dooley, Kathleen; Dorje, Passang; Dorjee, Kunsang; Dorris, Lester; Duffey, Noah; Dupes, Alan; Egbiremolen, Osebhajajeme; Elong, Richard; Falk, Jill; Farina, Abderrahim; Faro, Susan; Ferguson, Diallo; Ferreira, Patricia; Fisher, Sheila; FitzGerald, Mike; Foley, Karen; Foley, Chelsea; Franke, Alicia; Friedrich, Dennis; Gage, Diane; Garber, Manuel; Gearin, Gary; Giannoukos, Georgia; Goode, Tina; Goyette, Audra; Graham, Joseph; Grandbois, Edward; Gyaltsen, Kunsang; Hafez, Nabil; Hagopian, Daniel; Hagos, Birhane; Hall, Jennifer; Healy, Claire; Hegarty, Ryan; Honan, Tracey; Horn, Andrea; Houde, Nathan; Hughes, Leanne; Hunnicutt, Leigh; Husby, M; Jester, Benjamin; Jones, Charlien; Kamat, Asha; Kanga, Ben; Kells, Cristyn; Khazanovich, Dmitry; Kieu, Alix Chinh; Kisner, Peter; Kumar, Mayank; Lance, Krista; Landers, Thomas; Lara, Marcia; Lee, William; Leger, Jean-Pierre; Lennon, Niall; Leuper, Lisa; LeVine, Sarah; Liu, Jinlei; Liu, Xiaohong; Lokyitsang, Yeshi; Lokyitsang, Tashi; Lui, Annie; Macdonald, Jan; Major, John; Marabella, Richard; Maru, Kebede; Matthews, Charles; McDonough, Susan; Mehta, Teena; Meldrim, James; Melnikov, Alexandre; Meneus, Louis; Mihalev, Atanas; Mihova, Tanya; Miller, Karen; Mittelman, Rachel; Mlenga, Valentine; Mulrain, Leonidas; Munson, Glen; Navidi, Adam; Naylor, Jerome; Nguyen, Tuyen; Nguyen, Nga; Nguyen, Cindy; Nguyen, Thu; Nicol, Robert; Norbu, Nyima; Norbu, Choe; Novod, Nathaniel; Nyima, Tenchoe; Olandt, Peter; O'Neill, Barry; O'Neill, Keith; Osman, Sahal; Oyono, Lucien; Patti, Christopher; Perrin, Danielle; Phunkhang, Pema; Pierre, Fritz; Priest, Margaret; Rachupka, Anthony; Raghuraman, Sujaa; Rameau, Rayale; Ray, Verneda; Raymond, Christina; Rege, Filip; Rise, Cecil; Rogers, Julie; Rogov, Peter; Sahalie, Julie; Settipalli, Sampath; Sharpe, Theodore; Shea, Terrance; Sheehan, Mechele; Sherpa, Ngawang; Shi, Jianying; Shih, Diana; Sloan, Jessie; Smith, Cherylyn; Sparrow, Todd; Stalker, John; Stange-Thomann, Nicole; Stavropoulos, Sharon; Stone, Catherine; Stone, Sabrina; Sykes, Sean; Tchuinga, Pierre; Tenzing, Pema; Tesfaye, Senait; Thoulutsang, Dawa; Thoulutsang, Yama; Topham, Kerri; Topping, Ira; Tsamla, Tsamla; Vassiliev, Helen; Venkataraman, Vijay; Vo, Andy; Wangchuk, Tsering; Wangdi, Tsering; Weiand, Michael; Wilkinson, Jane; Wilson, Adam; Yadav, Shailendra; Yang, Shuli; Yang, Xiaoping; Young, Geneva; Yu, Qing; Zainoun, Joanne; Zembek, Lisa; Zimmer, Andrew; Lander, Eric S

    2005-12-08

    Here we report a high-quality draft genome sequence of the domestic dog (Canis familiaris), together with a dense map of single nucleotide polymorphisms (SNPs) across breeds. The dog is of particular interest because it provides important evolutionary information and because existing breeds show great phenotypic diversity for morphological, physiological and behavioural traits. We use sequence comparison with the primate and rodent lineages to shed light on the structure and evolution of genomes and genes. Notably, the majority of the most highly conserved non-coding sequences in mammalian genomes are clustered near a small subset of genes with important roles in development. Analysis of SNPs reveals long-range haplotypes across the entire dog genome, and defines the nature of genetic diversity within and across breeds. The current SNP map now makes it possible for genome-wide association studies to identify genes responsible for diseases and traits, with important consequences for human and companion animal health.

  16. Sequencing and comparative analyses of the genomes of zoysiagrasses

    PubMed Central

    Tanaka, Hidenori; Hirakawa, Hideki; Kosugi, Shunichi; Nakayama, Shinobu; Ono, Akiko; Watanabe, Akiko; Hashiguchi, Masatsugu; Gondo, Takahiro; Ishigaki, Genki; Muguerza, Melody; Shimizu, Katsuya; Sawamura, Noriko; Inoue, Takayasu; Shigeki, Yuichi; Ohno, Naoki; Tabata, Satoshi; Akashi, Ryo; Sato, Shusei

    2016-01-01

    Zoysia is a warm-season turfgrass, which comprises 11 allotetraploid species (2n = 4x = 40), each possessing different morphological and physiological traits. To characterize the genetic systems of Zoysia plants and to analyse their structural and functional differences in individual species and accessions, we sequenced the genomes of Zoysia species using HiSeq and MiSeq platforms. As a reference sequence of Zoysia species, we generated a high-quality draft sequence of the genome of Z. japonica accession ‘Nagirizaki’ (334 Mb) in which 59,271 protein-coding genes were predicted. In parallel, draft genome sequences of Z. matrella ‘Wakaba’ and Z. pacifica ‘Zanpa’ were also generated for comparative analyses. To investigate the genetic diversity among the Zoysia species, genome sequence reads of three additional accessions, Z. japonica ‘Kyoto’, Z. japonica ‘Miyagi’ and Z. matrella ‘Chiba Fair Green’, were accumulated, and aligned against the reference genome of ‘Nagirizaki’ along with those from ‘Wakaba’ and ‘Zanpa’. As a result, we detected 7,424,163 single-nucleotide polymorphisms and 852,488 short indels among these species. The information obtained in this study will be valuable for basic studies on zoysiagrass evolution and genetics as well as for the breeding of zoysiagrasses, and is made available in the ‘Zoysia Genome Database’ at http://zoysia.kazusa.or.jp. PMID:26975196

  17. Sequencing and comparative analyses of the genomes of zoysiagrasses.

    PubMed

    Tanaka, Hidenori; Hirakawa, Hideki; Kosugi, Shunichi; Nakayama, Shinobu; Ono, Akiko; Watanabe, Akiko; Hashiguchi, Masatsugu; Gondo, Takahiro; Ishigaki, Genki; Muguerza, Melody; Shimizu, Katsuya; Sawamura, Noriko; Inoue, Takayasu; Shigeki, Yuichi; Ohno, Naoki; Tabata, Satoshi; Akashi, Ryo; Sato, Shusei

    2016-04-01

    Zoysiais a warm-season turfgrass, which comprises 11 allotetraploid species (2n= 4x= 40), each possessing different morphological and physiological traits. To characterize the genetic systems of Zoysia plants and to analyse their structural and functional differences in individual species and accessions, we sequenced the genomes of Zoysia species using HiSeq and MiSeq platforms. As a reference sequence of Zoysia species, we generated a high-quality draft sequence of the genome of Z. japonica accession 'Nagirizaki' (334 Mb) in which 59,271 protein-coding genes were predicted. In parallel, draft genome sequences of Z. matrella 'Wakaba' and Z. pacifica 'Zanpa' were also generated for comparative analyses. To investigate the genetic diversity among the Zoysia species, genome sequence reads of three additional accessions, Z. japonica'Kyoto', Z. japonica'Miyagi' and Z. matrella'Chiba Fair Green', were accumulated, and aligned against the reference genome of 'Nagirizaki' along with those from 'Wakaba' and 'Zanpa'. As a result, we detected 7,424,163 single-nucleotide polymorphisms and 852,488 short indels among these species. The information obtained in this study will be valuable for basic studies on zoysiagrass evolution and genetics as well as for the breeding of zoysiagrasses, and is made available in the 'Zoysia Genome Database' at http://zoysia.kazusa.or.jp. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  18. High-Quality Draft Genome Sequence of Desulfovibrio carbinoliphilus FW-101-2B, an Organic Acid-Oxidizing Sulfate-Reducing Bacterium Isolated from Uranium(VI)-Contaminated Groundwater

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ramsay, Bradley D.; Hwang, Chiachi; Woo, Hannah L.

    2015-03-12

    Desulfovibrio carbinoliphilus subsp. oakridgensis FW-101-2B is an anaerobic, organic acid/alcohol-oxidizing, sulfate-reducing δ-proteobacterium. FW-101-2B was isolated from contaminated groundwater at The Field Research Center at Oak Ridge National Lab after in situ stimulation for heavy metal-reducing conditions. The genome will help elucidate the metabolic potential of sulfate-reducing bacteria during uranium reduction.

  19. Genome sequence of Bradyrhizobium sp. WSM1253; a microsymbiont of Ornithopus compressus from the Greek Island of Sifnos

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tiwari, Ravi; Howieson, John; Yates, Ron

    Bradyrhizobium sp. WSM1253 is a novel N 2-fixing bacterium isolated from a root nodule of the herbaceous annual legume Ornithopus compressus that was growing on the Greek Island of Sifnos. WSM1253 emerged as a strain of interest in an Australian program that was selecting inoculant quality bradyrhizobial strains for inoculation of Mediterranean species of lupins ( Lupinus angustifolius, L. princei, L. atlanticus, L. pilosus ). In this report we describe, for the first time, the genome sequence information and annotation of this legume microsymbiont. The 8,719,808 bp genome has a G + C content of 63.09 % with 71 contigsmore » arranged into two scaffolds. The assembled genome contains 8,432 protein-coding genes, 66 RNA genes and a single rRNA operon. In conclusion, this improved-high-quality draft rhizobial genome is one of 20 sequenced through a DOE Joint Genome Institute 2010 Community Sequencing Project.« less

  20. Genome sequence of Bradyrhizobium sp. WSM1253; a microsymbiont of Ornithopus compressus from the Greek Island of Sifnos

    DOE PAGES

    Tiwari, Ravi; Howieson, John; Yates, Ron; ...

    2015-11-30

    Bradyrhizobium sp. WSM1253 is a novel N 2-fixing bacterium isolated from a root nodule of the herbaceous annual legume Ornithopus compressus that was growing on the Greek Island of Sifnos. WSM1253 emerged as a strain of interest in an Australian program that was selecting inoculant quality bradyrhizobial strains for inoculation of Mediterranean species of lupins ( Lupinus angustifolius, L. princei, L. atlanticus, L. pilosus ). In this report we describe, for the first time, the genome sequence information and annotation of this legume microsymbiont. The 8,719,808 bp genome has a G + C content of 63.09 % with 71 contigsmore » arranged into two scaffolds. The assembled genome contains 8,432 protein-coding genes, 66 RNA genes and a single rRNA operon. In conclusion, this improved-high-quality draft rhizobial genome is one of 20 sequenced through a DOE Joint Genome Institute 2010 Community Sequencing Project.« less

  1. Draft Genome Sequence of Mycobacterium chimaera Type Strain Fl-0169.

    PubMed

    Pfaller, Stacy; Tokarev, Vasily; Kessler, Collin; McLimans, Christopher; Gomez-Alvarez, Vicente; Wright, Justin; King, Dawn; Lamendella, Regina

    2017-02-23

    We report here the draft genome sequence of the type strain Mycobacterium chimaera Fl-0169, a member of the Mycobacterium avium complex (MAC). M. chimaera Fl-0169 T was isolated from a patient in Italy and is highly similar to strains of M. chimaera isolated in Ireland, although Fl-0169 T possesses unique virulence genes. Copyright © 2017 Pfaller et al.

  2. Draft Genome Sequence of Sphingobium chinhatense Strain IP26T, Isolated from a Hexachlorocyclohexane Dumpsite

    PubMed Central

    Niharika, Neha; Sangwan, Naseer; Ahmad, Salar; Singh, Priya; Khurana, J. P.

    2013-01-01

    Sphingobium chinhatense strain IP26T is a conducive hexachlorocyclohexane (HCH) degrader isolated from a heavily contaminated (450 mg HCH/g soil) HCH dumpsite. IP26T degrades α-, β-, γ-, and δ-HCH, which are highly persistent in the environment. Here we report the draft genome sequence (~5.8 Mbp) of this strain. PMID:23990581

  3. Draft Genome Sequence of the Polyextremophilic Halorubrum sp. Strain AJ67, Isolated from Hyperarsenic Lakes in the Argentinian Puna.

    PubMed

    Burguener, Germán F; Maldonado, Marcos J; Revale, Santiago; Fernández Do Porto, Darío; Rascován, Nicolás; Vázquez, Martín; Farías, María Eugenia; Marti, Marcelo A; Turjanski, Adrián Gustavo

    2014-02-06

    Halorubrum sp. strain AJ67, an extreme halophilic UV-resistant archaeon, was isolated from Laguna Antofalla in the Argentinian Puna. The draft genome sequence suggests the presence of potent enzyme candidates that are essential for survival under multiple environmental extreme conditions, such as high UV radiation, elevated salinity, and the presence of critical arsenic concentrations.

  4. Initial sequencing and comparative analysis of the mouse genome

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Waterston, Robert H.; Lindblad-Toh, Kerstin; Birney, Ewan

    2002-12-15

    The sequence of the mouse genome is a key informational tool for understanding the contents of the human genome and a key experimental tool for biomedical research. Here, we report the results of an international collaboration to produce a high-quality draft sequence of the mouse genome. We also present an initial comparative analysis of the mouse and human genomes, describing some of the insights that can be gleaned from the two sequences. We discuss topics including the analysis of the evolutionary forces shaping the size, structure and sequence of the genomes; the conservation of large-scale synteny across most of themore » genomes; the much lower extent of sequence orthology covering less than half of the genomes; the proportions of the genomes under selection; the number of protein-coding genes; the expansion of gene families related to reproduction and immunity; the evolution of proteins; and the identification of intraspecies polymorphism.« less

  5. Reducing assembly complexity of microbial genomes with single-molecule sequencing.

    PubMed

    Koren, Sergey; Harhay, Gregory P; Smith, Timothy P L; Bono, James L; Harhay, Dayna M; Mcvey, Scott D; Radune, Diana; Bergman, Nicholas H; Phillippy, Adam M

    2013-01-01

    The short reads output by first- and second-generation DNA sequencing instruments cannot completely reconstruct microbial chromosomes. Therefore, most genomes have been left unfinished due to the significant resources required to manually close gaps in draft assemblies. Third-generation, single-molecule sequencing addresses this problem by greatly increasing sequencing read length, which simplifies the assembly problem. To measure the benefit of single-molecule sequencing on microbial genome assembly, we sequenced and assembled the genomes of six bacteria and analyzed the repeat complexity of 2,267 complete bacteria and archaea. Our results indicate that the majority of known bacterial and archaeal genomes can be assembled without gaps, at finished-grade quality, using a single PacBio RS sequencing library. These single-library assemblies are also more accurate than typical short-read assemblies and hybrid assemblies of short and long reads. Automated assembly of long, single-molecule sequencing data reduces the cost of microbial finishing to $1,000 for most genomes, and future advances in this technology are expected to drive the cost lower. This is expected to increase the number of completed genomes, improve the quality of microbial genome databases, and enable high-fidelity, population-scale studies of pan-genomes and chromosomal organization.

  6. Genome sequence of the mud-dwelling archaeon Methanoplanus limicola type strain (DSM 2279 T), reclassification of Methanoplanus petrolearius as Methanolacinia petrolearia and emended descriptions of the genera Methanoplanus and Methanolacinia

    DOE PAGES

    Goker, Markus; Lu, Megan; Fiebig, Anne; ...

    2014-06-15

    Methanoplanus limicola Wildgruber et al. 1984 is a mesophilic methanogen that was isolated from a swamp composed of drilling waste near Naples, Italy, shortly after the Archaea were recognized as a separate domain of life. Methanoplanus is the type genus in the family Methanoplanaceae, a taxon that felt into disuse since modern 16S rRNA gene sequences-based taxonomy was established. Methanoplanus is now placed within the Methanomicrobiaceae, a family that is so far poorly characterized at the genome level. The only other type strain of the genus with a sequenced genome, Methanoplanus petrolearius SEBR 4847 T, turned out to be misclassifiedmore » and required reclassification to Methanolacinia. Both, Methanoplanus and Methanolacinia, needed taxonomic emendations due to a significant deviation of the G+C content of their genomes from previously published (pregenome-sequence era) values. Until now genome sequences were published for only four of the 33 species with validly published names in the Methanomicrobiaceae. Here we describe the features of M. limicola, together with the improved-high-quality draft genome sequence and an notation of the type strain, M3 T. The 3,200,946 bp long chromosome (permanent draft sequence) with its 3,064 protein-coding and 65 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.« less

  7. dBBQs: dataBase of Bacterial Quality scores.

    PubMed

    Wanchai, Visanu; Patumcharoenpol, Preecha; Nookaew, Intawat; Ussery, David

    2017-12-28

    It is well-known that genome sequencing technologies are becoming significantly cheaper and faster. As a result of this, the exponential growth in sequencing data in public databases allows us to explore ever growing large collections of genome sequences. However, it is less known that the majority of available sequenced genome sequences in public databases are not complete, drafts of varying qualities. We have calculated quality scores for around 100,000 bacterial genomes from all major genome repositories and put them in a fast and easy-to-use database. Prokaryotic genomic data from all sources were collected and combined to make a non-redundant set of bacterial genomes. The genome quality score for each was calculated by four different measurements: assembly quality, number of rRNA and tRNA genes, and the occurrence of conserved functional domains. The dataBase of Bacterial Quality scores (dBBQs) was designed to store and retrieve quality scores. It offers fast searching and download features which the result can be used for further analysis. In addition, the search results are shown in interactive JavaScript chart framework using DC.js. The analysis of quality scores across major public genome databases find that around 68% of the genomes are of acceptable quality for many uses. dBBQs (available at http://arc-gem.uams.edu/dbbqs ) provides genome quality scores for all available prokaryotic genome sequences with a user-friendly Web-interface. These scores can be used as cut-offs to get a high-quality set of genomes for testing bioinformatics tools or improving the analysis. Moreover, all data of the four measurements that were combined to make the quality score for each genome, which can potentially be used for further analysis. dBBQs will be updated regularly and is freely use for non-commercial purpose.

  8. Draft genome sequence of Bradyrhizobium manausense strain BR 3351T, an effective symbiont isolated from Amazon rainforest.

    PubMed

    Simões-Araújo, Jean Luiz; Rumjanek, Norma Gouvêa; Xavier, Gustavo Ribeiro; Zilli, Jerri Édson

    The strain BR 3351 T (Bradyrhizobium manausense) was obtained from nodules of cowpea (Vigna unguiculata L. Walp) growing in soil collected from Amazon rainforest. Furthermore, it was observed that the strain has high capacity to fix nitrogen symbiotically in symbioses with cowpea. We report here the draft genome sequence of strain BR 3351 T . The information presented will be important for comparative analysis of nodulation and nitrogen fixation for diazotrophic bacteria. A draft genome with 9,145,311bp and 62.9% of GC content was assembled in 127 scaffolds using 100bp pair-end Illumina MiSeq system. The RAST annotation identified 8603 coding sequences, 51 RNAs genes, classified in 504 subsystems. Published by Elsevier Editora Ltda.

  9. Draft genome sequence of the docosahexaenoic acid producing thraustochytrid Aurantiochytrium sp. T66.

    PubMed

    Liu, Bin; Ertesvåg, Helga; Aasen, Inga Marie; Vadstein, Olav; Brautaset, Trygve; Heggeset, Tonje Marita Bjerkan

    2016-06-01

    Thraustochytrids are unicellular, marine protists, and there is a growing industrial interest in these organisms, particularly because some species, including strains belonging to the genus Aurantiochytrium, accumulate high levels of docosahexaenoic acid (DHA). Here, we report the draft genome sequence of Aurantiochytrium sp. T66 (ATCC PRA-276), with a size of 43 Mbp, and 11,683 predicted protein-coding sequences. The data has been deposited at DDBJ/EMBL/Genbank under the accession LNGJ00000000. The genome sequence will contribute new insight into DHA biosynthesis and regulation, providing a basis for metabolic engineering of thraustochytrids.

  10. Draft Genome Sequence of Thermoanaerobacter sp. Strain A7A, Reconstructed from a Metagenome Obtained from a High-Temperature Hydrocarbon Reservoir in the Bass Strait, Australia

    PubMed Central

    Li, Dongmei; Greenfield, Paul; Rosewarne, Carly P.

    2013-01-01

    The draft genome sequence of Thermoanaerobacter sp. strain A7A was reconstructed from a metagenome of a microbial consortium obtained from the Tuna oil field in the Gippsland Basin, Australia. The organism is a strict anaerobe that is predicted to ferment a range of simple sugars and undertake sulfur reduction. PMID:24029756

  11. Draft Genome Sequence of the Polyextremophilic Exiguobacterium sp. Strain S17, Isolated from Hyperarsenic Lakes in the Argentinian Puna.

    PubMed

    Ordoñez, Omar F; Lanzarotti, Esteban; Kurth, Daniel; Gorriti, Marta F; Revale, Santiago; Cortez, Néstor; Vazquez, Martin P; Farías, María E; Turjanski, Adrian G

    2013-07-25

    Exiguobacterium sp. strain S17 is a moderately halotolerant, arsenic-resistant bacterium that was isolated from Laguna Socompa stromatolites in the Argentinian Puna. The draft genome sequence suggests potent enzyme candidates that are essential for survival under multiple environmental extreme conditions, such as high levels of UV radiation, elevated salinity, and the presence of critical arsenic concentrations.

  12. Draft Genome Sequence of the Polyextremophilic Halorubrum sp. Strain AJ67, Isolated from Hyperarsenic Lakes in the Argentinian Puna

    PubMed Central

    Burguener, Germán F.; Maldonado, Marcos J.; Revale, Santiago; Fernández Do Porto, Darío; Rascován, Nicolás; Vázquez, Martín; Farías, María Eugenia; Marti, Marcelo A.

    2014-01-01

    Halorubrum sp. strain AJ67, an extreme halophilic UV-resistant archaeon, was isolated from Laguna Antofalla in the Argentinian Puna. The draft genome sequence suggests the presence of potent enzyme candidates that are essential for survival under multiple environmental extreme conditions, such as high UV radiation, elevated salinity, and the presence of critical arsenic concentrations. PMID:24503991

  13. The Nuclear and Mitochondrial Genomes of the Facultatively Eusocial Orchid Bee Euglossa dilemma

    PubMed Central

    Brand, Philipp; Saleh, Nicholas; Pan, Hailin; Li, Cai; Kapheim, Karen M.; Ramírez, Santiago R.

    2017-01-01

    Bees provide indispensable pollination services to both agricultural crops and wild plant populations, and several species of bees have become important models for the study of learning and memory, plant–insect interactions, and social behavior. Orchid bees (Apidae: Euglossini) are especially important to the fields of pollination ecology, evolution, and species conservation. Here we report the nuclear and mitochondrial genome sequences of the orchid bee Euglossa dilemma Bembé & Eltz. E. dilemma was selected because it is widely distributed, highly abundant, and it was recently naturalized in the southeastern United States. We provide a high-quality assembly of the 3.3 Gb genome, and an official gene set of 15,904 gene annotations. We find high conservation of gene synteny with the honey bee throughout 80 MY of divergence time. This genomic resource represents the first draft genome of the orchid bee genus Euglossa, and the first draft orchid bee mitochondrial genome, thus representing a valuable resource to the research community. PMID:28701376

  14. The Nuclear and Mitochondrial Genomes of the Facultatively Eusocial Orchid Bee Euglossa dilemma.

    PubMed

    Brand, Philipp; Saleh, Nicholas; Pan, Hailin; Li, Cai; Kapheim, Karen M; Ramírez, Santiago R

    2017-09-07

    Bees provide indispensable pollination services to both agricultural crops and wild plant populations, and several species of bees have become important models for the study of learning and memory, plant-insect interactions, and social behavior. Orchid bees (Apidae: Euglossini) are especially important to the fields of pollination ecology, evolution, and species conservation. Here we report the nuclear and mitochondrial genome sequences of the orchid bee Euglossa dilemma Bembé & Eltz. E. dilemma was selected because it is widely distributed, highly abundant, and it was recently naturalized in the southeastern United States. We provide a high-quality assembly of the 3.3 Gb genome, and an official gene set of 15,904 gene annotations. We find high conservation of gene synteny with the honey bee throughout 80 MY of divergence time. This genomic resource represents the first draft genome of the orchid bee genus Euglossa , and the first draft orchid bee mitochondrial genome, thus representing a valuable resource to the research community. Copyright © 2017 Brand et al.

  15. Draft genome sequence of ramie, Boehmeria nivea (L.) Gaudich.

    PubMed

    Luan, Ming-Bao; Jian, Jian-Bo; Chen, Ping; Chen, Jun-Hui; Chen, Jian-Hua; Gao, Qiang; Gao, Gang; Zhou, Ju-Hong; Chen, Kun-Mei; Guang, Xuan-Min; Chen, Ji-Kang; Zhang, Qian-Qian; Wang, Xiao-Fei; Fang, Long; Sun, Zhi-Min; Bai, Ming-Zhou; Fang, Xiao-Dong; Zhao, Shan-Cen; Xiong, He-Ping; Yu, Chun-Ming; Zhu, Ai-Guo

    2018-05-01

    Ramie, Boehmeria nivea (L.) Gaudich, family Urticaceae, is a plant native to eastern Asia, and one of the world's oldest fibre crops. It is also used as animal feed and for the phytoremediation of heavy metal-contaminated farmlands. Thus, the genome sequence of ramie was determined to explore the molecular basis of its fibre quality, protein content and phytoremediation. For further understanding ramie genome, different paired-end and mate-pair libraries were combined to generate 134.31 Gb of raw DNA sequences using the Illumina whole-genome shotgun sequencing approach. The highly heterozygous B. nivea genome was assembled using the Platanus Genome Assembler, which is an effective tool for the assembly of highly heterozygous genome sequences. The final length of the draft genome of this species was approximately 341.9 Mb (contig N50 = 22.62 kb, scaffold N50 = 1,126.36 kb). Based on ramie genome annotations, 30,237 protein-coding genes were predicted, and the repetitive element content was 46.3%. The completeness of the final assembly was evaluated by benchmarking universal single-copy orthologous genes (BUSCO); 90.5% of the 1,440 expected embryophytic genes were identified as complete, and 4.9% were identified as fragmented. Phylogenetic analysis based on single-copy gene families and one-to-one orthologous genes placed ramie with mulberry and cannabis, within the clade of urticalean rosids. Genome information of ramie will be a valuable resource for the conservation of endangered Boehmeria species and for future studies on the biogeography and characteristic evolution of members of Urticaceae. © 2018 John Wiley & Sons Ltd.

  16. High-quality draft genome sequence of Gracilimonas tropica CL-CB462 T (DSM 19535 T), isolated from a Synechococcus culture

    DOE PAGES

    Choi, Dong Han; Ahn, Chisang; Jang, Gwang Il; ...

    2015-11-11

    Gracilimonas tropica Choi et al. 2009 is a member of order Sphingobacteriales, class Sphingobacteriia. Three species of the genus Gracilimonas have been isolated from marine seawater or a salt mine and showed extremely halotolerant and mesophilic features, although close relatives are extremely halophilic or thermophilic. The type strain of the type species of Gracilimonas, G. tropica DSM19535 T, was isolated from a Synechococcus culture which was established from the tropical sea-surface water of the Pacific Ocean. The genome of the strain DSM19535 T was sequenced through the Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes project.more » Here, we describe the genomic features of the strain. The 3,831,242 bp long draft genome consists of 48 contigs with 3373 protein-coding and 53 RNA genes. Finally, the strain seems to adapt to phosphate limitation and requires amino acids from external environment. In addition, genomic analyses and pasteurization experiment suggested that G. tropica DSM19535 T did not form spore.« less

  17. Assembly of the Lactuca sativa, L. cv. Tizian draft genome sequence reveals differences within major resistance complex 1 as compared to the cv. Salinas reference genome.

    PubMed

    Verwaaijen, Bart; Wibberg, Daniel; Nelkner, Johanna; Gordin, Miriam; Rupp, Oliver; Winkler, Anika; Bremges, Andreas; Blom, Jochen; Grosch, Rita; Pühler, Alfred; Schlüter, Andreas

    2018-02-10

    Lettuce (Lactuca sativa, L.) is an important annual plant of the family Asteraceae (Compositae). The commercial lettuce cultivar Tizian has been used in various scientific studies investigating the interaction of the plant with phytopathogens or biological control agents. Here, we present the de novo draft genome sequencing and gene prediction for this specific cultivar derived from transcriptome sequence data. The assembled scaffolds amount to a size of 2.22 Gb. Based on RNAseq data, 31,112 transcript isoforms were identified. Functional predictions for these transcripts were determined within the GenDBE annotation platform. Comparison with the cv. Salinas reference genome revealed a high degree of sequence similarity on genome and transcriptome levels, with an average amino acid identity of 99%. Furthermore, it was observed that two large regions are either missing or are highly divergent within the cv. Tizian genome compared to cv. Salinas. One of these regions covers the major resistance complex 1 region of cv. Salinas. The cv. Tizian draft genome sequence provides a valuable resource for future functional and transcriptome analyses focused on this lettuce cultivar. Copyright © 2017 Elsevier B.V. All rights reserved.

  18. Draft sequencing and comparative genomics of Xylella fastidiosa strains reveal novel biological insights.

    PubMed

    Bhattacharyya, Anamitra; Stilwagen, Stephanie; Reznik, Gary; Feil, Helene; Feil, William S; Anderson, Iain; Bernal, Axel; D'Souza, Mark; Ivanova, Natalia; Kapatral, Vinayak; Larsen, Niels; Los, Tamara; Lykidis, Athanasios; Selkov, Eugene; Walunas, Theresa L; Purcell, Alexander; Edwards, Rob A; Hawkins, Trevor; Haselkorn, Robert; Overbeek, Ross; Kyrpides, Nikos C; Predki, Paul F

    2002-10-01

    Draft sequencing is a rapid and efficient method for determining the near-complete sequence of microbial genomes. Here we report a comparative analysis of one complete and two draft genome sequences of the phytopathogenic bacterium, Xylella fastidiosa, which causes serious disease in plants, including citrus, almond, and oleander. We present highlights of an in silico analysis based on a comparison of reconstructions of core biological subsystems. Cellular pathway reconstructions have been used to identify a small number of genes, which are likely to reside within the draft genomes but are not captured in the draft assembly. These represented only a small fraction of all genes and were predominantly large and small ribosomal subunit protein components. By using this approach, some of the inherent limitations of draft sequence can be significantly reduced. Despite the incomplete nature of the draft genomes, it is possible to identify several phage-related genes, which appear to be absent from the draft genomes and not the result of insufficient sequence sampling. This region may therefore identify potential host-specific functions. Based on this first functional reconstruction of a phytopathogenic microbe, we spotlight an unusual respiration machinery as a potential target for biological control. We also predicted and developed a new defined growth medium for Xylella.

  19. The value of new genome references.

    PubMed

    Worley, Kim C; Richards, Stephen; Rogers, Jeffrey

    2017-09-15

    Genomic information has become a ubiquitous and almost essential aspect of biological research. Over the last 10-15 years, the cost of generating sequence data from DNA or RNA samples has dramatically declined and our ability to interpret those data increased just as remarkably. Although it is still possible for biologists to conduct interesting and valuable research on species for which genomic data are not available, the impact of having access to a high quality whole genome reference assembly for a given species is nothing short of transformational. Research on a species for which we have no DNA or RNA sequence data is restricted in fundamental ways. In contrast, even access to an initial draft quality genome (see below for definitions) opens a wide range of opportunities that are simply not available without that reference genome assembly. Although a complete discussion of the impact of genome sequencing and assembly is beyond the scope of this short paper, the goal of this review is to summarize the most common and highest impact contributions that whole genome sequencing and assembly has had on comparative and evolutionary biology. Copyright © 2016. Published by Elsevier Inc.

  20. High quality draft genome sequence of Olivibacter sitiensis type strain (AW-6T), a diphenol degrader with genes involved in the catechol pathway

    PubMed Central

    Ntougias, Spyridon; Lapidus, Alla; Han, James; Mavromatis, Konstantinos; Pati, Amrita; Chen, Amy; Klenk, Hans-Peter; Woyke, Tanja; Fasseas, Constantinos; Kyrpides, Nikos C.; Zervakis, Georgios I.

    2014-01-01

    Olivibacter sitiensis Ntougias et al. 2007 is a member of the family Sphingobacteriaceae, phylum Bacteroidetes. Members of the genus Olivibacter are phylogenetically diverse and of significant interest. They occur in diverse habitats, such as rhizosphere and contaminated soils, viscous wastes, composts, biofilter clean-up facilities on contaminated sites and cave environments, and they are involved in the degradation of complex and toxic compounds. Here we describe the features of O. sitiensis AW-6T, together with the permanent-draft genome sequence and annotation. The organism was sequenced under the Genomic Encyclopedia for Bacteria and Archaea (GEBA) project at the DOE Joint Genome Institute and is the first genome sequence of a species within the genus Olivibacter. The genome is 5,053,571 bp long and is comprised of 110 scaffolds with an average GC content of 44.61%. Of the 4,565 genes predicted, 4,501 were protein-coding genes and 64 were RNA genes. Most protein-coding genes (68.52%) were assigned to a putative function. The identification of 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase-coding genes indicates involvement of this organism in the catechol catabolic pathway. In addition, genes encoding for β-1,4-xylanases and β-1,4-xylosidases reveal the xylanolytic action of O. sitiensis. PMID:25197463

  1. EU-US ABWG AgENCODE Workshop

    USDA-ARS?s Scientific Manuscript database

    As considerable progress has been made on producing draft quality genomic sequence for many food animal species, the next goal for genomics research is a greater understanding of gene regulation and expression. The EU-US Animal Biotechnology Working Group (ABWG), established by the EU-US Biotechnolo...

  2. Draft Genome Sequence of Ideonella sp. Strain A 288, Isolated from an Iron-Precipitating Biofilm

    PubMed Central

    Künzel, Sven; Szewzyk, Ulrich

    2017-01-01

    ABSTRACT Here, we report the draft genome sequence of the betaproteobacterium Ideonella sp. strain A_228. This isolate, obtained from a bog iron ore-containing floodplain area in Germany, provides valuable information about the genetic diversity of neutrophilic iron-depositing bacteria. The Illumina NextSeq technique was used to sequence the draft genome sequence of the strain. PMID:28818902

  3. ProDeGe: A computational protocol for fully automated decontamination of genomes

    DOE PAGES

    Tennessen, Kristin; Andersen, Evan; Clingenpeel, Scott; ...

    2015-06-09

    Single amplified genomes and genomes assembled from metagenomes have enabled the exploration of uncultured microorganisms at an unprecedented scale. However, both these types of products are plagued by contamination. Since these genomes are now being generated in a high-throughput manner and sequences from them are propagating into public databases to drive novel scientific discoveries, rigorous quality controls and decontamination protocols are urgently needed. Here, we present ProDeGe (Protocol for fully automated Decontamination of Genomes), the first computational protocol for fully automated decontamination of draft genomes. ProDeGe classifies sequences into two classes—clean and contaminant—using a combination of homology and feature-based methodologies.more » On average, 84% of sequence from the non-target organism is removed from the data set (specificity) and 84% of the sequence from the target organism is retained (sensitivity). Lastly, the procedure operates successfully at a rate of ~0.30 CPU core hours per megabase of sequence and can be applied to any type of genome sequence.« less

  4. ProDeGe: A computational protocol for fully automated decontamination of genomes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tennessen, Kristin; Andersen, Evan; Clingenpeel, Scott

    Single amplified genomes and genomes assembled from metagenomes have enabled the exploration of uncultured microorganisms at an unprecedented scale. However, both these types of products are plagued by contamination. Since these genomes are now being generated in a high-throughput manner and sequences from them are propagating into public databases to drive novel scientific discoveries, rigorous quality controls and decontamination protocols are urgently needed. Here, we present ProDeGe (Protocol for fully automated Decontamination of Genomes), the first computational protocol for fully automated decontamination of draft genomes. ProDeGe classifies sequences into two classes—clean and contaminant—using a combination of homology and feature-based methodologies.more » On average, 84% of sequence from the non-target organism is removed from the data set (specificity) and 84% of the sequence from the target organism is retained (sensitivity). Lastly, the procedure operates successfully at a rate of ~0.30 CPU core hours per megabase of sequence and can be applied to any type of genome sequence.« less

  5. The Asian arowana (Scleropages formosus) genome provides new insights into the evolution of an early lineage of teleosts

    PubMed Central

    Bian, Chao; Hu, Yinchang; Ravi, Vydianathan; Kuznetsova, Inna S.; Shen, Xueyan; Mu, Xidong; Sun, Ying; You, Xinxin; Li, Jia; Li, Xiaofeng; Qiu, Ying; Tay, Boon-Hui; Thevasagayam, Natascha May; Komissarov, Aleksey S.; Trifonov, Vladimir; Kabilov, Marsel; Tupikin, Alexey; Luo, Jianren; Liu, Yi; Song, Hongmei; Liu, Chao; Wang, Xuejie; Gu, Dangen; Yang, Yexin; Li, Wujiao; Polgar, Gianluca; Fan, Guangyi; Zeng, Peng; Zhang, He; Xiong, Zijun; Tang, Zhujing; Peng, Chao; Ruan, Zhiqiang; Yu, Hui; Chen, Jieming; Fan, Mingjun; Huang, Yu; Wang, Min; Zhao, Xiaomeng; Hu, Guojun; Yang, Huanming; Wang, Jian; Wang, Jun; Xu, Xun; Song, Linsheng; Xu, Gangchun; Xu, Pao; Xu, Junmin; O’Brien, Stephen J.; Orbán, László; Venkatesh, Byrappa; Shi, Qiong

    2016-01-01

    The Asian arowana (Scleropages formosus), one of the world’s most expensive cultivated ornamental fishes, is an endangered species. It represents an ancient lineage of teleosts: the Osteoglossomorpha. Here, we provide a high-quality chromosome-level reference genome of a female golden-variety arowana using a combination of deep shotgun sequencing and high-resolution linkage mapping. In addition, we have also generated two draft genome assemblies for the red and green varieties. Phylogenomic analysis supports a sister group relationship between Osteoglossomorpha (bonytongues) and Elopomorpha (eels and relatives), with the two clades together forming a sister group of Clupeocephala which includes all the remaining teleosts. The arowana genome retains the full complement of eight Hox clusters unlike the African butterfly fish (Pantodon buchholzi), another bonytongue fish, which possess only five Hox clusters. Differential gene expression among three varieties provides insights into the genetic basis of colour variation. A potential heterogametic sex chromosome is identified in the female arowana karyotype, suggesting that the sex is determined by a ZW/ZZ sex chromosomal system. The high-quality reference genome of the golden arowana and the draft assemblies of the red and green varieties are valuable resources for understanding the biology, adaptation and behaviour of Asian arowanas. PMID:27089831

  6. High-quality draft genome sequence of Sedimenticola selenatireducens strain AK4OH1T, a gammaproteobacterium isolated from estuarine sediment

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Louie, Tiffany S.; Giovannelli, Donato; Yee, Nathan

    Sedimenticola selenatireducens strain AK4OH1 T (= DSM 17993 T = ATCC BAA-1233 T) is a microaerophilic bacterium isolated from sediment from the Arthur Kill intertidal strait between New Jersey and Staten Island, NY. S. selenatireducens is Gram-negative and belongs to the Gammaproteobacteria. Strain AK4OH1 T was the first representative of its genus to be isolated for its unique coupling of the oxidation of aromatic acids to the respiration of selenate. It is a versatile heterotroph and can use a variety of carbon compounds, but can also grow lithoautotrophically under hypoxic and anaerobic conditions. Furthermore, the draft genome comprises 4,588,530 bpmore » and 4276 predicted protein-coding genes including genes for the anaerobic degradation of 4-hydroxybenzoate and benzoate. We report the main features of the genome of S. selenatireducens strain AK4OH1 T.« less

  7. Draft genome sequence of the novel strain Pseudomonas sp. 10B238 with potential ability to produce antibiotics from deep-sea sediment.

    PubMed

    Pan, Hua-Qi; Hu, Jiang-Chun

    2015-10-01

    Pseudomonas sp. 10B238 was a putatively novel species of Pseudomonas, isolated from a deep-sea sediment of the South China Sea, which had the genetic potential to produce secondary metabolites related to nonribosomal peptides (NRPs), as well as showed moderate antimicrobial activities. Here we report a high quality draft genome of Pseudomonas sp. 10B238, which comprises 4,933,052bp with the G+C content of 60.23%. A total of 11 potential secondary metabolite biosynthetic gene clusters were predicted, including a NRP for new peptide siderophore. And many anaerobic respiratory terminal enzymes were found for life in deep-sea environments. Our results may provide insights into biosynthetic pathway for antimicrobial bioactive compounds and be helpful to understand the physiological characteristic of this species. Copyright © 2015 Elsevier B.V. All rights reserved.

  8. High-quality draft genome sequence of Sedimenticola selenatireducens strain AK4OH1T, a gammaproteobacterium isolated from estuarine sediment

    DOE PAGES

    Louie, Tiffany S.; Giovannelli, Donato; Yee, Nathan; ...

    2016-09-08

    Sedimenticola selenatireducens strain AK4OH1 T (= DSM 17993 T = ATCC BAA-1233 T) is a microaerophilic bacterium isolated from sediment from the Arthur Kill intertidal strait between New Jersey and Staten Island, NY. S. selenatireducens is Gram-negative and belongs to the Gammaproteobacteria. Strain AK4OH1 T was the first representative of its genus to be isolated for its unique coupling of the oxidation of aromatic acids to the respiration of selenate. It is a versatile heterotroph and can use a variety of carbon compounds, but can also grow lithoautotrophically under hypoxic and anaerobic conditions. Furthermore, the draft genome comprises 4,588,530 bpmore » and 4276 predicted protein-coding genes including genes for the anaerobic degradation of 4-hydroxybenzoate and benzoate. We report the main features of the genome of S. selenatireducens strain AK4OH1 T.« less

  9. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds.

    PubMed

    Dudchenko, Olga; Batra, Sanjit S; Omer, Arina D; Nyquist, Sarah K; Hoeger, Marie; Durand, Neva C; Shamim, Muhammad S; Machol, Ido; Lander, Eric S; Aiden, Aviva Presser; Aiden, Erez Lieberman

    2017-04-07

    The Zika outbreak, spread by the Aedes aegypti mosquito, highlights the need to create high-quality assemblies of large genomes in a rapid and cost-effective way. Here we combine Hi-C data with existing draft assemblies to generate chromosome-length scaffolds. We validate this method by assembling a human genome, de novo, from short reads alone (67× coverage). We then combine our method with draft sequences to create genome assemblies of the mosquito disease vectors Ae aegypti and Culex quinquefasciatus , each consisting of three scaffolds corresponding to the three chromosomes in each species. These assemblies indicate that almost all genomic rearrangements among these species occur within, rather than between, chromosome arms. The genome assembly procedure we describe is fast, inexpensive, and accurate, and can be applied to many species. Copyright © 2017, American Association for the Advancement of Science.

  10. High-quality permanent draft genome sequence of Bradyrhizobium sp. Ai1a-2; a microsymbiont of Andira inermis discovered in Costa Rica

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tian, Rui; Parker, Matthew; Seshadri, Rekha

    Bradyrhizobium sp. Ai1a-2 is is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen fixing root nodule of Andira inermis collected from Tres Piedras in Costa Rica. In this report we describe, for the first time, the genome sequence information and annotation of this legume microsymbiont. The 9,029,266 bp genome has a GC content of 62.56% with 247 contigs arranged into 246 scaffolds. The assembled genome contains 8,482 protein-coding genes and 102 RNA-only encoding genes. Lastly, this rhizobial genome was sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Rootmore » Nodule Bacteria (GEBA-RNB) project proposal.« less

  11. Genome sequence and description of Corynebacterium ihumii sp. nov.

    PubMed Central

    Padmanabhan, Roshan; Dubourg, Grégory; Lagier, Jean-Christophe; Couderc, Carine; Michelle, Caroline; Raoult, Didier; Fournier, Pierre-Edouard

    2014-01-01

    Corynebacterium ihumii strain GD7T sp. nov. is proposed as the type strain of a new species, which belongs to the family Corynebacteriaceae of the class Actinobacteria. This strain was isolated from the fecal flora of a 62 year-old male patient, as a part of the culturomics study. Corynebacterium ihumii is a Gram positive, facultativly anaerobic, nonsporulating bacillus. Here, we describe the features of this organism, together with the high quality draft genome sequence, annotation and the comparison with other member of the genus Corynebacteria. C. ihumii genome is 2,232,265 bp long (one chromosome but no plasmid) containing 2,125 protein-coding and 53 RNA genes, including 4 rRNA genes. The whole-genome shotgun sequence of Corynebacterium ihumii strain GD7T sp. nov has been deposited in EMBL under accession number GCA_000403725. PMID:25197488

  12. High-quality permanent draft genome sequence of Bradyrhizobium sp. Ai1a-2; a microsymbiont of Andira inermis discovered in Costa Rica

    DOE PAGES

    Tian, Rui; Parker, Matthew; Seshadri, Rekha; ...

    2015-06-14

    Bradyrhizobium sp. Ai1a-2 is is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen fixing root nodule of Andira inermis collected from Tres Piedras in Costa Rica. In this report we describe, for the first time, the genome sequence information and annotation of this legume microsymbiont. The 9,029,266 bp genome has a GC content of 62.56% with 247 contigs arranged into 246 scaffolds. The assembled genome contains 8,482 protein-coding genes and 102 RNA-only encoding genes. Lastly, this rhizobial genome was sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Rootmore » Nodule Bacteria (GEBA-RNB) project proposal.« less

  13. Draft Genome Sequence of the 2-Chloro-4-Nitrophenol-Degrading Bacterium Arthrobacter sp. Strain SJCon

    PubMed Central

    Vikram, Surendra; Kumar, Shailesh; Vaidya, Bhumika; Pinnaka, Anil Kumar

    2013-01-01

    We report the 4.39-Mb draft genome sequence of the 2-chloro-4-nitrophenol-degrading bacterium Arthrobacter sp. strain SJCon, isolated from a pesticide-contaminated site. The draft genome sequence of strain SJCon will be helpful in studying the genetic pathways involved in the degradation of several aromatic compounds. PMID:23516196

  14. De novo Assembly of a 40 Mb Eukaryotic Genome from Short Sequence Reads: Sordaria macrospora, a Model Organism for Fungal Morphogenesis

    PubMed Central

    Nowrousian, Minou; Stajich, Jason E.; Chu, Meiling; Engh, Ines; Espagne, Eric; Halliday, Karen; Kamerewerd, Jens; Kempken, Frank; Knab, Birgit; Kuo, Hsiao-Che; Osiewacz, Heinz D.; Pöggeler, Stefanie; Read, Nick D.; Seiler, Stephan; Smith, Kristina M.; Zickler, Denise; Kück, Ulrich; Freitag, Michael

    2010-01-01

    Filamentous fungi are of great importance in ecology, agriculture, medicine, and biotechnology. Thus, it is not surprising that genomes for more than 100 filamentous fungi have been sequenced, most of them by Sanger sequencing. While next-generation sequencing techniques have revolutionized genome resequencing, e.g. for strain comparisons, genetic mapping, or transcriptome and ChIP analyses, de novo assembly of eukaryotic genomes still presents significant hurdles, because of their large size and stretches of repetitive sequences. Filamentous fungi contain few repetitive regions in their 30–90 Mb genomes and thus are suitable candidates to test de novo genome assembly from short sequence reads. Here, we present a high-quality draft sequence of the Sordaria macrospora genome that was obtained by a combination of Illumina/Solexa and Roche/454 sequencing. Paired-end Solexa sequencing of genomic DNA to 85-fold coverage and an additional 10-fold coverage by single-end 454 sequencing resulted in ∼4 Gb of DNA sequence. Reads were assembled to a 40 Mb draft version (N50 of 117 kb) with the Velvet assembler. Comparative analysis with Neurospora genomes increased the N50 to 498 kb. The S. macrospora genome contains even fewer repeat regions than its closest sequenced relative, Neurospora crassa. Comparison with genomes of other fungi showed that S. macrospora, a model organism for morphogenesis and meiosis, harbors duplications of several genes involved in self/nonself-recognition. Furthermore, S. macrospora contains more polyketide biosynthesis genes than N. crassa. Phylogenetic analyses suggest that some of these genes may have been acquired by horizontal gene transfer from a distantly related ascomycete group. Our study shows that, for typical filamentous fungi, de novo assembly of genomes from short sequence reads alone is feasible, that a mixture of Solexa and 454 sequencing substantially improves the assembly, and that the resulting data can be used for comparative studies to address basic questions of fungal biology. PMID:20386741

  15. De novo assembly of a 40 Mb eukaryotic genome from short sequence reads: Sordaria macrospora, a model organism for fungal morphogenesis.

    PubMed

    Nowrousian, Minou; Stajich, Jason E; Chu, Meiling; Engh, Ines; Espagne, Eric; Halliday, Karen; Kamerewerd, Jens; Kempken, Frank; Knab, Birgit; Kuo, Hsiao-Che; Osiewacz, Heinz D; Pöggeler, Stefanie; Read, Nick D; Seiler, Stephan; Smith, Kristina M; Zickler, Denise; Kück, Ulrich; Freitag, Michael

    2010-04-08

    Filamentous fungi are of great importance in ecology, agriculture, medicine, and biotechnology. Thus, it is not surprising that genomes for more than 100 filamentous fungi have been sequenced, most of them by Sanger sequencing. While next-generation sequencing techniques have revolutionized genome resequencing, e.g. for strain comparisons, genetic mapping, or transcriptome and ChIP analyses, de novo assembly of eukaryotic genomes still presents significant hurdles, because of their large size and stretches of repetitive sequences. Filamentous fungi contain few repetitive regions in their 30-90 Mb genomes and thus are suitable candidates to test de novo genome assembly from short sequence reads. Here, we present a high-quality draft sequence of the Sordaria macrospora genome that was obtained by a combination of Illumina/Solexa and Roche/454 sequencing. Paired-end Solexa sequencing of genomic DNA to 85-fold coverage and an additional 10-fold coverage by single-end 454 sequencing resulted in approximately 4 Gb of DNA sequence. Reads were assembled to a 40 Mb draft version (N50 of 117 kb) with the Velvet assembler. Comparative analysis with Neurospora genomes increased the N50 to 498 kb. The S. macrospora genome contains even fewer repeat regions than its closest sequenced relative, Neurospora crassa. Comparison with genomes of other fungi showed that S. macrospora, a model organism for morphogenesis and meiosis, harbors duplications of several genes involved in self/nonself-recognition. Furthermore, S. macrospora contains more polyketide biosynthesis genes than N. crassa. Phylogenetic analyses suggest that some of these genes may have been acquired by horizontal gene transfer from a distantly related ascomycete group. Our study shows that, for typical filamentous fungi, de novo assembly of genomes from short sequence reads alone is feasible, that a mixture of Solexa and 454 sequencing substantially improves the assembly, and that the resulting data can be used for comparative studies to address basic questions of fungal biology.

  16. Draft Genome Sequence of Sphingobium ummariense Strain RL-3, a Hexachlorocyclohexane-Degrading Bacterium

    PubMed Central

    Kohli, Puneet; Dua, Ankita; Sangwan, Naseer; Oldach, Phoebe; Khurana, J. P.

    2013-01-01

    Here, we report the draft genome sequence of the hexachlorocyclohexane (HCH)-degrading bacterium Sphingobium ummariense strain RL-3, which was isolated from the HCH dumpsite located in Lucknow, India (27°00′N and 81°09′E). The annotated draft genome sequence (4.75 Mb) of strain RL-3 consisted of 139 contigs, 4,645 coding sequences, and 65% G+C content. PMID:24233594

  17. Draft Genome Sequence of Tolypothrix boutellei Strain VB521301

    PubMed Central

    Chandrababunaidu, Mathu Malar; Singh, Deeksha; Sen, Diya; Bhan, Sushma; Das, Subhadeep; Gupta, Akash

    2015-01-01

    We report here the draft genome sequence of the filamentous nitrogen-fixing cyanobacterium Tolypothrix boutellei strain VB521301. The organism is lipid rich and hydrophobic and produces polyunsaturated fatty acids which can be harnessed for industrial purpose. The draft genome sequence assembled into 11,572,263 bp with 70 scaffolds and 7,777 protein coding genes. PMID:25700407

  18. Alignment-free design of highly discriminatory diagnostic primer sets for Escherichia coli O104:H4 outbreak strains.

    PubMed

    Pritchard, Leighton; Holden, Nicola J; Bielaszewska, Martina; Karch, Helge; Toth, Ian K

    2012-01-01

    An Escherichia coli O104:H4 outbreak in Germany in summer 2011 caused 53 deaths, over 4000 individual infections across Europe, and considerable economic, social and political impact. This outbreak was the first in a position to exploit rapid, benchtop high-throughput sequencing (HTS) technologies and crowdsourced data analysis early in its investigation, establishing a new paradigm for rapid response to disease threats. We describe a novel strategy for design of diagnostic PCR primers that exploited this rapid draft bacterial genome sequencing to distinguish between E. coli O104:H4 outbreak isolates and other pathogenic E. coli isolates, including the historical hæmolytic uræmic syndrome (HUSEC) E. coli HUSEC041 O104:H4 strain, which possesses the same serotype as the outbreak isolates. Primers were designed using a novel alignment-free strategy against eleven draft whole genome assemblies of E. coli O104:H4 German outbreak isolates from the E. coli O104:H4 Genome Analysis Crowd-Sourcing Consortium website, and a negative sequence set containing 69 E. coli chromosome and plasmid sequences from public databases. Validation in vitro against 21 'positive' E. coli O104:H4 outbreak and 32 'negative' non-outbreak EHEC isolates indicated that individual primer sets exhibited 100% sensitivity for outbreak isolates, with false positive rates of between 9% and 22%. A minimal combination of two primers discriminated between outbreak and non-outbreak E. coli isolates with 100% sensitivity and 100% specificity. Draft genomes of isolates of disease outbreak bacteria enable high throughput primer design and enhanced diagnostic performance in comparison to traditional molecular assays. Future outbreak investigations will be able to harness HTS rapidly to generate draft genome sequences and diagnostic primer sets, greatly facilitating epidemiology and clinical diagnostics. We expect that high throughput primer design strategies will enable faster, more precise responses to future disease outbreaks of bacterial origin, and help to mitigate their societal impact.

  19. Draft genome sequence of Staphylococcus aureus KT/312045, an ST1-MSSA PVL positive isolated from pus sample in East Coast Malaysia.

    PubMed

    Suhaili, Zarizal; Lean, Soo-Sum; Mohamad, Noor Muzamil; Rachman, Abdul R Abdul; Desa, Mohd Nasir Mohd; Yeo, Chew Chieng

    2016-09-01

    Most of the efforts in elucidating the molecular relatedness and epidemiology of Staphylococcus aureus in Malaysia have been largely focused on methicillin-resistant S. aureus (MRSA). Therefore, here we report the draft genome sequence of the methicillin-susceptible Staphylococcus aureus (MSSA) with sequence type 1 (ST1), spa type t127 with Panton-Valentine Leukocidin (pvl) pathogenic determinant isolated from pus sample designated as KT/314250 strain. The size of the draft genome is 2.86 Mbp with 32.7% of G + C content consisting 2673 coding sequences. The draft genome sequence has been deposited in DDBJ/EMBL/GenBank under the accession number AOCP00000000.

  20. Lineage-Specific Biology Revealed by a Finished Genome Assembly of the Mouse

    PubMed Central

    Hillier, LaDeana W.; Zody, Michael C.; Goldstein, Steve; She, Xinwe; Bult, Carol J.; Agarwala, Richa; Cherry, Joshua L.; DiCuccio, Michael; Hlavina, Wratko; Kapustin, Yuri; Meric, Peter; Maglott, Donna; Birtle, Zoë; Marques, Ana C.; Graves, Tina; Zhou, Shiguo; Teague, Brian; Potamousis, Konstantinos; Churas, Christopher; Place, Michael; Herschleb, Jill; Runnheim, Ron; Forrest, Daniel; Amos-Landgraf, James; Schwartz, David C.; Cheng, Ze; Lindblad-Toh, Kerstin; Eichler, Evan E.; Ponting, Chris P.

    2009-01-01

    The mouse (Mus musculus) is the premier animal model for understanding human disease and development. Here we show that a comprehensive understanding of mouse biology is only possible with the availability of a finished, high-quality genome assembly. The finished clone-based assembly of the mouse strain C57BL/6J reported here has over 175,000 fewer gaps and over 139 Mb more of novel sequence, compared with the earlier MGSCv3 draft genome assembly. In a comprehensive analysis of this revised genome sequence, we are now able to define 20,210 protein-coding genes, over a thousand more than predicted in the human genome (19,042 genes). In addition, we identified 439 long, non–protein-coding RNAs with evidence for transcribed orthologs in human. We analyzed the complex and repetitive landscape of 267 Mb of sequence that was missing or misassembled in the previously published assembly, and we provide insights into the reasons for its resistance to sequencing and assembly by whole-genome shotgun approaches. Duplicated regions within newly assembled sequence tend to be of more recent ancestry than duplicates in the published draft, correcting our initial understanding of recent evolution on the mouse lineage. These duplicates appear to be largely composed of sequence regions containing transposable elements and duplicated protein-coding genes; of these, some may be fixed in the mouse population, but at least 40% of segmentally duplicated sequences are copy number variable even among laboratory mouse strains. Mouse lineage-specific regions contain 3,767 genes drawn mainly from rapidly-changing gene families associated with reproductive functions. The finished mouse genome assembly, therefore, greatly improves our understanding of rodent-specific biology and allows the delineation of ancestral biological functions that are shared with human from derived functions that are not. PMID:19468303

  1. Draft Genome Sequence of Thermus sp. Strain RL, Isolated from a Hot Water Spring Located atop the Himalayan Ranges at Manikaran, India

    PubMed Central

    Dwivedi, Vatsala; Sangwan, Naseer; Nigam, Aeshna; Garg, Nidhi; Niharika, Neha; Khurana, Paramjit; Khurana, Jitendra P.

    2012-01-01

    Thermus sp. strain RL was isolated from a hot water spring (90°C to 98°C) at Manikaran, Himachal Pradesh, India. Here we report the draft genome sequence (20,36,600 bp) of this strain. The draft genome sequence consists of 17 contigs and 1,986 protein-coding sequences and has an average G+C content of 68.77%. PMID:22689228

  2. Draft Genome Sequence, and a Sequence-Defined Genetic Linkage Map of the Legume Crop Species Lupinus angustifolius L

    PubMed Central

    Zheng, Zequn; Zhang, Qisen; Zhou, Gaofeng; Sweetingham, Mark W.; Howieson, John G.; Li, Chengdao

    2013-01-01

    Lupin (Lupinus angustifolius L.) is the most recently domesticated crop in major agricultural cultivation. Its seeds are high in protein and dietary fibre, but low in oil and starch. Medical and dietetic studies have shown that consuming lupin-enriched food has significant health benefits. We report the draft assembly from a whole genome shotgun sequencing dataset for this legume species with 26.9x coverage of the genome, which is predicted to contain 57,807 genes. Analysis of the annotated genes with metabolic pathways provided a partial understanding of some key features of lupin, such as the amino acid profile of storage proteins in seeds. Furthermore, we applied the NGS-based RAD-sequencing technology to obtain 8,244 sequence-defined markers for anchoring the genomic sequences. A total of 4,214 scaffolds from the genome sequence assembly were aligned into the genetic map. The combination of the draft assembly and a sequence-defined genetic map made it possible to locate and study functional genes of agronomic interest. The identification of co-segregating SNP markers, scaffold sequences and gene annotation facilitated the identification of a candidate R gene associated with resistance to the major lupin disease anthracnose. We demonstrated that the combination of medium-depth genome sequencing and a high-density genetic linkage map by application of NGS technology is a cost-effective approach to generating genome sequence data and a large number of molecular markers to study the genomics, genetics and functional genes of lupin, and to apply them to molecular plant breeding. This strategy does not require prior genome knowledge, which potentiates its application to a wide range of non-model species. PMID:23734219

  3. Draft genome sequence, and a sequence-defined genetic linkage map of the legume crop species Lupinus angustifolius L.

    PubMed

    Yang, Huaan; Tao, Ye; Zheng, Zequn; Zhang, Qisen; Zhou, Gaofeng; Sweetingham, Mark W; Howieson, John G; Li, Chengdao

    2013-01-01

    Lupin (Lupinus angustifolius L.) is the most recently domesticated crop in major agricultural cultivation. Its seeds are high in protein and dietary fibre, but low in oil and starch. Medical and dietetic studies have shown that consuming lupin-enriched food has significant health benefits. We report the draft assembly from a whole genome shotgun sequencing dataset for this legume species with 26.9x coverage of the genome, which is predicted to contain 57,807 genes. Analysis of the annotated genes with metabolic pathways provided a partial understanding of some key features of lupin, such as the amino acid profile of storage proteins in seeds. Furthermore, we applied the NGS-based RAD-sequencing technology to obtain 8,244 sequence-defined markers for anchoring the genomic sequences. A total of 4,214 scaffolds from the genome sequence assembly were aligned into the genetic map. The combination of the draft assembly and a sequence-defined genetic map made it possible to locate and study functional genes of agronomic interest. The identification of co-segregating SNP markers, scaffold sequences and gene annotation facilitated the identification of a candidate R gene associated with resistance to the major lupin disease anthracnose. We demonstrated that the combination of medium-depth genome sequencing and a high-density genetic linkage map by application of NGS technology is a cost-effective approach to generating genome sequence data and a large number of molecular markers to study the genomics, genetics and functional genes of lupin, and to apply them to molecular plant breeding. This strategy does not require prior genome knowledge, which potentiates its application to a wide range of non-model species.

  4. Draft Genome Sequence of Tolypothrix boutellei Strain VB521301.

    PubMed

    Chandrababunaidu, Mathu Malar; Singh, Deeksha; Sen, Diya; Bhan, Sushma; Das, Subhadeep; Gupta, Akash; Adhikary, Siba Prasad; Tripathy, Sucheta

    2015-02-19

    We report here the draft genome sequence of the filamentous nitrogen-fixing cyanobacterium Tolypothrix boutellei strain VB521301. The organism is lipid rich and hydrophobic and produces polyunsaturated fatty acids which can be harnessed for industrial purpose. The draft genome sequence assembled into 11,572,263 bp with 70 scaffolds and 7,777 protein coding genes. Copyright © 2015 Chandrababunaidu et al.

  5. Draft Genome Sequence of Lactobacillus reuteri Strain CRL 1098, an Interesting Candidate for Functional Food Development.

    PubMed

    Torres, Andrea C; Suárez, Nadia E; Font, Graciela; Saavedra, Lucila; Taranto, María Pía

    2016-08-25

    We report here the draft genome sequence of Lactobacillus reuteri strain CRL 1098. This strain represents an interesting candidate for functional food development because of its proven probiotic properties. The draft genome sequence is composed of 1,969,471 bp assembled into 45 contigs and an average G+C content of 38.8%. Copyright © 2016 Torres et al.

  6. Genome sequence of Phytophthora ramorum: implications for management

    Treesearch

    Brett Tyler; Sucheta Tripathy; Nik Grunwald; Kurt Lamour; Kelly Ivors; Matteo Garbelotto; Daniel Rokhsar; Nik Putnam; Igor Grigoriev; Jeffrey Boore

    2006-01-01

    A draft genome sequence has been determined for Phytophthora ramorum, together with a draft sequence of the soybean pathogen Phytophthora sojae. The P. ramorum genome was sequenced to a depth of 7-fold coverage, while the P. sojae genome was sequenced to a depth of 9-fold coverage. The genome...

  7. A high-resolution cattle CNV map by population-scale genome sequencing

    USDA-ARS?s Scientific Manuscript database

    Copy Number Variations (CNVs) are common genomic structural variations that have been linked to human diseases and phenotypic traits. Prior studies in cattle have produced low-resolution CNV maps. We constructed a draft, high-resolution map of cattle CNVs based on whole genome sequencing data from 7...

  8. Raineya orbicola gen. nov., sp. nov. a slightly thermophilic bacterium of the phylum Bacteroidetes and the description of Raineyaceae fam. nov.

    PubMed

    Albuquerque, Luciana; Polónia, Ana Rita M; Barroso, Cristina; Froufe, Hugo J C; Lage, Olga; Lobo-da-Cunha, Alexandre; Egas, Conceição; da Costa, Milton S

    2018-04-01

    An isolate, designated SPSPC-11 T , with an optimum growth temperature of about 50 °C and an optimum pH for growth between 7.5 and 8.0, was recovered from a hot spring in central Portugal. Based on phylogenetic analysis of its 16S rRNA sequence, the new organism is most closely related to the species of the genus Thermonema but with a pairwise sequence similarity of <85 %. The isolate was orange-pigmented, formed non-motile long filaments and rod-shaped cells that stain Gram-negative. The organism was strictly aerobic, oxidase-positive and catalase-positive. The major fatty acids were iso-C15:0, iso-C15 : 0 2-OH and iso-C17 : 0 3-OH. The major polar lipids were one aminophospholipid, two aminolipids and three unidentified lipids. Menaquinone 7 was the major respiratory quinone. The DNA G+C content of strain SPSPC-11 T was 37.6 mol% (draft genome sequence). The high quality draft genome sequence corroborated many of the phenotypic characteristics of strain SPSPC-11 T . Based on genotypic, phylogenetic, physiological and biochemical characterization we describe a new species of a novel genus represented by strain SPSPC-11 T (=CECT 9012 T =LMG 29233 T ) for which we propose the name Raineya orbicola gen. nov., sp. nov. We also describe the family Raineyaceae to accommodate this new genus and species.

  9. Draft genome sequence of Camellia sinensis var. sinensis provides insights into the evolution of the tea genome and tea quality.

    PubMed

    Wei, Chaoling; Yang, Hua; Wang, Songbo; Zhao, Jian; Liu, Chun; Gao, Liping; Xia, Enhua; Lu, Ying; Tai, Yuling; She, Guangbiao; Sun, Jun; Cao, Haisheng; Tong, Wei; Gao, Qiang; Li, Yeyun; Deng, Weiwei; Jiang, Xiaolan; Wang, Wenzhao; Chen, Qi; Zhang, Shihua; Li, Haijing; Wu, Junlan; Wang, Ping; Li, Penghui; Shi, Chengying; Zheng, Fengya; Jian, Jianbo; Huang, Bei; Shan, Dai; Shi, Mingming; Fang, Congbing; Yue, Yi; Li, Fangdong; Li, Daxiang; Wei, Shu; Han, Bin; Jiang, Changjun; Yin, Ye; Xia, Tao; Zhang, Zhengzhu; Bennetzen, Jeffrey L; Zhao, Shancen; Wan, Xiaochun

    2018-05-01

    Tea, one of the world's most important beverage crops, provides numerous secondary metabolites that account for its rich taste and health benefits. Here we present a high-quality sequence of the genome of tea, Camellia sinensis var. sinensis (CSS), using both Illumina and PacBio sequencing technologies. At least 64% of the 3.1-Gb genome assembly consists of repetitive sequences, and the rest yields 33,932 high-confidence predictions of encoded proteins. Divergence between two major lineages, CSS and Camellia sinensis var. assamica (CSA), is calculated to ∼0.38 to 1.54 million years ago (Mya). Analysis of genic collinearity reveals that the tea genome is the product of two rounds of whole-genome duplications (WGDs) that occurred ∼30 to 40 and ∼90 to 100 Mya. We provide evidence that these WGD events, and subsequent paralogous duplications, had major impacts on the copy numbers of secondary metabolite genes, particularly genes critical to producing three key quality compounds: catechins, theanine, and caffeine. Analyses of transcriptome and phytochemistry data show that amplification and transcriptional divergence of genes encoding a large acyltransferase family and leucoanthocyanidin reductases are associated with the characteristic young leaf accumulation of monomeric galloylated catechins in tea, while functional divergence of a single member of the glutamine synthetase gene family yielded theanine synthetase. This genome sequence will facilitate understanding of tea genome evolution and tea metabolite pathways, and will promote germplasm utilization for breeding improved tea varieties. Copyright © 2018 the Author(s). Published by PNAS.

  10. Draft genome sequence of Camellia sinensis var. sinensis provides insights into the evolution of the tea genome and tea quality

    PubMed Central

    Wei, Chaoling; Yang, Hua; Wang, Songbo; Zhao, Jian; Liu, Chun; Gao, Liping; Xia, Enhua; Lu, Ying; Tai, Yuling; She, Guangbiao; Sun, Jun; Cao, Haisheng; Tong, Wei; Gao, Qiang; Li, Yeyun; Deng, Weiwei; Jiang, Xiaolan; Wang, Wenzhao; Chen, Qi; Zhang, Shihua; Li, Haijing; Wu, Junlan; Wang, Ping; Li, Penghui; Shi, Chengying; Zheng, Fengya; Jian, Jianbo; Huang, Bei; Shan, Dai; Shi, Mingming; Fang, Congbing; Yue, Yi; Li, Fangdong; Li, Daxiang; Wei, Shu; Han, Bin; Jiang, Changjun; Yin, Ye; Xia, Tao; Zhang, Zhengzhu; Bennetzen, Jeffrey L.; Zhao, Shancen; Wan, Xiaochun

    2018-01-01

    Tea, one of the world’s most important beverage crops, provides numerous secondary metabolites that account for its rich taste and health benefits. Here we present a high-quality sequence of the genome of tea, Camellia sinensis var. sinensis (CSS), using both Illumina and PacBio sequencing technologies. At least 64% of the 3.1-Gb genome assembly consists of repetitive sequences, and the rest yields 33,932 high-confidence predictions of encoded proteins. Divergence between two major lineages, CSS and Camellia sinensis var. assamica (CSA), is calculated to ∼0.38 to 1.54 million years ago (Mya). Analysis of genic collinearity reveals that the tea genome is the product of two rounds of whole-genome duplications (WGDs) that occurred ∼30 to 40 and ∼90 to 100 Mya. We provide evidence that these WGD events, and subsequent paralogous duplications, had major impacts on the copy numbers of secondary metabolite genes, particularly genes critical to producing three key quality compounds: catechins, theanine, and caffeine. Analyses of transcriptome and phytochemistry data show that amplification and transcriptional divergence of genes encoding a large acyltransferase family and leucoanthocyanidin reductases are associated with the characteristic young leaf accumulation of monomeric galloylated catechins in tea, while functional divergence of a single member of the glutamine synthetase gene family yielded theanine synthetase. This genome sequence will facilitate understanding of tea genome evolution and tea metabolite pathways, and will promote germplasm utilization for breeding improved tea varieties. PMID:29678829

  11. High quality draft genome sequence of Brachymonas chironomi AIMA4T (DSM 19884T) isolated from a Chironomus sp. egg mass

    DOE PAGES

    Laviad, Sivan; Lapidus, Alla; Han, James; ...

    2015-05-27

    Brachymonas chironomi strain AIMA4T (Halpern et al., 2009) is a Gram-negative, non-motile, aerobic, chemoorganotroph bacterium. B. chironomi is a member of the Comamonadaceae, a family within the class Betaproteobacteria. This species was isolated from a chironomid (Diptera; Chironomidae) egg mass, sampled from a waste stabilization pond in northern Israel. Phylogenetic analysis based on the 16S rRNA gene sequences placed strain AIMA4T in the genus Brachymonas. Here we describe the features of this organism, together with the complete genome sequence and annotation. We find the DNA GC content is 63.5%. The chromosome length is 2,509,395 bp. It encodes 2,382 proteins andmore » 68 RNA genes. Brachymonas chironomi genome is part of the Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes (KMG) project.« less

  12. Draft Genome Sequences of Clostridium tyrobutyricum Strains FAM22552 and FAM22553, Isolated from Swiss Semihard Red-Smear Cheese

    PubMed Central

    Wüthrich, Daniel; Bruggmann, Rémy; Berthoud, Hélène; Arias-Roth, Emmanuelle

    2015-01-01

    Clostridium tyrobutyricum is the main microorganism responsible for late blowing defect in cheeses. Here, we present the draft genome sequences of two C. tyrobutyricum strains isolated from a Swiss semihard red-smear cheese. The two draft genomes comprise 3.05 and 3.08 Mbp and contain 3,030 and 3,089 putative coding sequences, respectively. PMID:25767226

  13. Draft Genome Sequences of blaKPC-Containing Enterobacter aerogenes, Citrobacter freundii, and Citrobacter koseri Strains

    PubMed Central

    Hazen, Tracy H.; Mettus, Roberta T.; McElheny, Christi L.; Bowler, Sarah L.

    2018-01-01

    ABSTRACT We report here the draft genome sequences of four blaKPC-containing bacteria identified as Klebsiella aerogenes, Citrobacter freundii, and Citrobacter koseri. Additionally, we report the draft genome sequence of a K. aerogenes strain that did not contain a blaKPC gene but was isolated from the patient who had the blaKPC-2-containing K. aerogenes strain. PMID:29472325

  14. Draft Genome Sequences of blaKPC-Containing Enterobacter aerogenes, Citrobacter freundii, and Citrobacter koseri Strains.

    PubMed

    Hazen, Tracy H; Mettus, Roberta T; McElheny, Christi L; Bowler, Sarah L; Doi, Yohei; Rasko, David A

    2018-02-22

    We report here the draft genome sequences of four bla KPC -containing bacteria identified as Klebsiella aerogenes , Citrobacter freundii , and Citrobacter koseri Additionally, we report the draft genome sequence of a K. aerogenes strain that did not contain a bla KPC gene but was isolated from the patient who had the bla KPC-2 -containing K. aerogenes strain. Copyright © 2018 Hazen et al.

  15. Draft Genome Sequence of Agrobacterium sp. Strain UHFBA-218, Isolated from Rhizosphere Soil of Crown Gall-Infected Cherry Rootstock Colt

    PubMed Central

    Dua, Ankita; Sangwan, Naseer; Kaur, Jasvinder; Saxena, Anjali; Kohli, Puneet; Gupta, A. K.

    2013-01-01

    We report here the draft genome sequence of the alphaproteobacterium Agrobacterium sp. strain UHFBA-218, which was isolated from rhizosphere soil of crown gall-infected cherry rootstock Colt. The draft genome of strain UHFBA-218 consists of 112 contigs (5,425,303 bp) and 5,063 coding sequences with a G+C content of 59.8%. PMID:23723402

  16. Human genetics and genomics a decade after the release of the draft sequence of the human genome.

    PubMed

    Naidoo, Nasheen; Pawitan, Yudi; Soong, Richie; Cooper, David N; Ku, Chee-Seng

    2011-10-01

    Substantial progress has been made in human genetics and genomics research over the past ten years since the publication of the draft sequence of the human genome in 2001. Findings emanating directly from the Human Genome Project, together with those from follow-on studies, have had an enormous impact on our understanding of the architecture and function of the human genome. Major developments have been made in cataloguing genetic variation, the International HapMap Project, and with respect to advances in genotyping technologies. These developments are vital for the emergence of genome-wide association studies in the investigation of complex diseases and traits. In parallel, the advent of high-throughput sequencing technologies has ushered in the 'personal genome sequencing' era for both normal and cancer genomes, and made possible large-scale genome sequencing studies such as the 1000 Genomes Project and the International Cancer Genome Consortium. The high-throughput sequencing and sequence-capture technologies are also providing new opportunities to study Mendelian disorders through exome sequencing and whole-genome sequencing. This paper reviews these major developments in human genetics and genomics over the past decade.

  17. Human genetics and genomics a decade after the release of the draft sequence of the human genome

    PubMed Central

    2011-01-01

    Substantial progress has been made in human genetics and genomics research over the past ten years since the publication of the draft sequence of the human genome in 2001. Findings emanating directly from the Human Genome Project, together with those from follow-on studies, have had an enormous impact on our understanding of the architecture and function of the human genome. Major developments have been made in cataloguing genetic variation, the International HapMap Project, and with respect to advances in genotyping technologies. These developments are vital for the emergence of genome-wide association studies in the investigation of complex diseases and traits. In parallel, the advent of high-throughput sequencing technologies has ushered in the 'personal genome sequencing' era for both normal and cancer genomes, and made possible large-scale genome sequencing studies such as the 1000 Genomes Project and the International Cancer Genome Consortium. The high-throughput sequencing and sequence-capture technologies are also providing new opportunities to study Mendelian disorders through exome sequencing and whole-genome sequencing. This paper reviews these major developments in human genetics and genomics over the past decade. PMID:22155605

  18. Identification of Putative Nuclear Receptors and Steroidogenic Enzymes in Murray-Darling Rainbowfish (Melanotaenia fluviatilis) Using RNA-Seq and De Novo Transcriptome Assembly.

    PubMed

    Bain, Peter A; Papanicolaou, Alexie; Kumar, Anupama

    2015-01-01

    Murray-Darling rainbowfish (Melanotaenia fluviatilis [Castelnau, 1878]; Atheriniformes: Melanotaeniidae) is a small-bodied teleost currently under development in Australasia as a test species for aquatic toxicological studies. To date, efforts towards the development of molecular biomarkers of contaminant exposure have been hindered by the lack of available sequence data. To address this, we sequenced messenger RNA from brain, liver and gonads of mature male and female fish and generated a high-quality draft transcriptome using a de novo assembly approach. 149,742 clusters of putative transcripts were obtained, encompassing 43,841 non-redundant protein-coding regions. Deduced amino acid sequences were annotated by functional inference based on similarity with sequences from manually curated protein sequence databases. The draft assembly contained protein-coding regions homologous to 95.7% of the complete cohort of predicted proteins from the taxonomically related species, Oryzias latipes (Japanese medaka). The mean length of rainbowfish protein-coding sequences relative to their medaka homologues was 92.1%, indicating that despite the limited number of tissues sampled a large proportion of the total expected number of protein-coding genes was captured in the study. Because of our interest in the effects of environmental contaminants on endocrine pathways, we manually curated subsets of coding regions for putative nuclear receptors and steroidogenic enzymes in the rainbowfish transcriptome, revealing 61 candidate nuclear receptors encompassing all known subfamilies, and 41 putative steroidogenic enzymes representing all major steroidogenic enzymes occurring in teleosts. The transcriptome presented here will be a valuable resource for researchers interested in biomarker development, protein structure and function, and contaminant-response genomics in Murray-Darling rainbowfish.

  19. 77 FR 39959 - Draft Guidance To Implement Requirements for the Treatment of Air Quality Monitoring Data...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-07-06

    ... of the revised draft High Winds Guidance document, the EPA identifies example technical analyses that... identified analyses and any additional technical analyses that air agencies could use to demonstrate that the... Web site at http://www.epa.gov/ttn/analysis/exevents.htm for additional details on the draft non...

  20. An improved high-quality draft genome sequence of Carnobacterium inhibens subsp. inhibens strain K1 T

    DOE PAGES

    Nicholson, Wayne L.; Davis, Christina L.; Shapiro, Nicole; ...

    2016-09-08

    Despite their ubiquity and their involvement in food spoilage, the genus Carnobacterium remains rather sparsely characterized at the genome level. Carnobacterium inhibens K1 T is a member of the Carnobacteriaceae family within the class Bacilli. This strain is a Gram-positive, rod-shaped bacterium isolated from the intestine of an Atlantic salmon. The present study determined the genome sequence and annotation of Carnobacterium inhibens K1 T. The genome comprised 2,748,608 bp with a G+C content of 34.85 %, which included 2621 protein-coding genes and 116 RNA genes. The strain contained five contigs corresponding to presumptive plasmids of sizes: 19,036; 24,250; 26,581; 65,272;more » and 65,904 bp.« less

  1. An improved high-quality draft genome sequence of Carnobacterium inhibens subsp. inhibens strain K1 T

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nicholson, Wayne L.; Davis, Christina L.; Shapiro, Nicole

    Despite their ubiquity and their involvement in food spoilage, the genus Carnobacterium remains rather sparsely characterized at the genome level. Carnobacterium inhibens K1 T is a member of the Carnobacteriaceae family within the class Bacilli. This strain is a Gram-positive, rod-shaped bacterium isolated from the intestine of an Atlantic salmon. The present study determined the genome sequence and annotation of Carnobacterium inhibens K1 T. The genome comprised 2,748,608 bp with a G+C content of 34.85 %, which included 2621 protein-coding genes and 116 RNA genes. The strain contained five contigs corresponding to presumptive plasmids of sizes: 19,036; 24,250; 26,581; 65,272;more » and 65,904 bp.« less

  2. Draft genome sequence of the rubber tree Hevea brasiliensis.

    PubMed

    Rahman, Ahmad Yamin Abdul; Usharraj, Abhilash O; Misra, Biswapriya B; Thottathil, Gincy P; Jayasekaran, Kandakumar; Feng, Yun; Hou, Shaobin; Ong, Su Yean; Ng, Fui Ling; Lee, Ling Sze; Tan, Hock Siew; Sakaff, Muhd Khairul Luqman Muhd; Teh, Beng Soon; Khoo, Bee Feong; Badai, Siti Suriawati; Aziz, Nurohaida Ab; Yuryev, Anton; Knudsen, Bjarne; Dionne-Laporte, Alexandre; Mchunu, Nokuthula P; Yu, Qingyi; Langston, Brennick J; Freitas, Tracey Allen K; Young, Aaron G; Chen, Rui; Wang, Lei; Najimudin, Nazalan; Saito, Jennifer A; Alam, Maqsudul

    2013-02-02

    Hevea brasiliensis, a member of the Euphorbiaceae family, is the major commercial source of natural rubber (NR). NR is a latex polymer with high elasticity, flexibility, and resilience that has played a critical role in the world economy since 1876. Here, we report the draft genome sequence of H. brasiliensis. The assembly spans ~1.1 Gb of the estimated 2.15 Gb haploid genome. Overall, ~78% of the genome was identified as repetitive DNA. Gene prediction shows 68,955 gene models, of which 12.7% are unique to Hevea. Most of the key genes associated with rubber biosynthesis, rubberwood formation, disease resistance, and allergenicity have been identified. The knowledge gained from this genome sequence will aid in the future development of high-yielding clones to keep up with the ever increasing need for natural rubber.

  3. Draft genome sequence of the rubber tree Hevea brasiliensis

    PubMed Central

    2013-01-01

    Background Hevea brasiliensis, a member of the Euphorbiaceae family, is the major commercial source of natural rubber (NR). NR is a latex polymer with high elasticity, flexibility, and resilience that has played a critical role in the world economy since 1876. Results Here, we report the draft genome sequence of H. brasiliensis. The assembly spans ~1.1 Gb of the estimated 2.15 Gb haploid genome. Overall, ~78% of the genome was identified as repetitive DNA. Gene prediction shows 68,955 gene models, of which 12.7% are unique to Hevea. Most of the key genes associated with rubber biosynthesis, rubberwood formation, disease resistance, and allergenicity have been identified. Conclusions The knowledge gained from this genome sequence will aid in the future development of high-yielding clones to keep up with the ever increasing need for natural rubber. PMID:23375136

  4. Draft Genome Sequences of Clostridium tyrobutyricum Strains FAM22552 and FAM22553, Isolated from Swiss Semihard Red-Smear Cheese.

    PubMed

    Storari, Michelangelo; Wüthrich, Daniel; Bruggmann, Rémy; Berthoud, Hélène; Arias-Roth, Emmanuelle

    2015-03-12

    Clostridium tyrobutyricum is the main microorganism responsible for late blowing defect in cheeses. Here, we present the draft genome sequences of two C. tyrobutyricum strains isolated from a Swiss semihard red-smear cheese. The two draft genomes comprise 3.05 and 3.08 Mbp and contain 3,030 and 3,089 putative coding sequences, respectively. Copyright © 2015 Storari et al.

  5. Draft Genome Sequence of the First New Delhi Metallo-β-Lactamase (NDM-1)-Producing Escherichia coli Strain Isolated in Peru.

    PubMed

    Tamariz, Jesus; Llanos, Carlos; Seas, Carlos; Montenegro, Paola; Lagos, Jose; Fernandes, Miriam R; Cerdeira, Louise; Lincopan, Nilton

    2018-03-29

    We present here the draft genome sequence of the first New Delhi metallo-β-lactamase (NDM-1)-producing Escherichia coli strain, belonging to sequence type 155 (ST155), isolated in Peru. Assembly of this draft genome resulted in 5,061,184 bp, revealing a clinically significant resistome for β-lactams, aminoglycosides, tetracyclines, phenicols, sulfonamides, trimethoprim, and fluoroquinolones. Copyright © 2018 Tamariz et al.

  6. Complete Genome Sequence of Magnetospirillum gryphiswaldense MSR-1

    PubMed Central

    Wang, Xu; Wang, Qing; Zhang, Weijia; Wang, Yinjia; Li, Li; Wen, Tong; Zhang, Tongwei; Zhang, Yang; Xu, Jun; Hu, Junying; Li, Shuqi; Liu, Lingzi; Liu, Jinxin; Jiang, Wei; Tian, Jiesheng; Wang, Lei; Li, Jilun

    2014-01-01

    We report the complete genomic sequence of Magnetospirillum gryphiswaldense MSR-1 (DSM 6361), a type strain of the genus Magnetospirillum belonging to the Alphaproteobacteria. Compared to the reported draft sequence, extensive rearrangements and differences were found, indicating high genomic flexibility and “domestication” by accelerated evolution of the strain upon repeated passaging. PMID:24625872

  7. Draft genome sequence of a CTX-M-8, CTX-M-55 and FosA3 co-producing Escherichia coli ST117/B2 isolated from an asymptomatic carrier.

    PubMed

    Fernandes, Miriam R; Sellera, Fábio P; Moura, Quézia; Souza, Tiago A; Lincopan, Nilton

    2018-03-01

    Asymptomatic carriers can act as reservoirs of multidrug-resistant (MDR) bacteria. The aim of this study was to describe the draft genome sequence of a MDR Escherichia coli lineage recovered from a faecal sample of a healthy carrier. Genomic DNA was sequenced on an Illumina NextSeq platform. Sequence reads were de novo assembled using CLC Genomics Workbench and the whole genome sequence was evaluated through bioinformatics tools available from the Center of Genomic Epidemiology as well as additional in silico analysis. The genome size was calculated as 5178340 bp, with 5442 protein-coding sequences and 5492 total genes. Presence of the bla CTX-M-8 , bla CTX-M-55 and fosA3 genes was detected in addition to other antimicrobial resistance genes. Interestingly, the strain was assigned to serotype O8:H4-fimH97 and was classified within the highly virulent phylogroup B2. This draft genome can provide helpful information to elucidate genetic features that contribute to colonisation and adaptation of MDR and virulent pathogens in asymptomatic carriers. Copyright © 2018 International Society for Chemotherapy of Infection and Cancer. Published by Elsevier Ltd. All rights reserved.

  8. Draft Genome Sequence of Corynebacterium ulcerans FRC58, Isolated from the Bronchitic Aspiration of a Patient in France

    PubMed Central

    Silva, Andréia do Socorro de Sousa; Baraúna, Rafael Azevedo; de Sá, Pablo Caracciolo Gomes; das Graças, Diego Assis; Carneiro, Adriana Ribeiro; Thouvenin, Maxime; Azevedo, Vasco; Badell, Edgar; Guiso, Nicole; da Silva, Artur Luiz da Costa

    2014-01-01

    Corynebacterium ulcerans is a bacterial species with high importance because it causes infections in animals and, rarely, in humans. Its virulence mechanisms remain unclear. The current study describes the draft genome of C. ulcerans FRC58, which was isolated from the bronchitic aspiration of a patient in France. PMID:24407640

  9. Draft sequencing and analysis of the genome of pufferfish Takifugu flavidus.

    PubMed

    Gao, Yang; Gao, Qiang; Zhang, Huan; Wang, Lingling; Zhang, Fuchong; Yang, Chuanyan; Song, Linsheng

    2014-12-01

    The pufferfish Takifugu flavidus is an important economic species due to its outstanding flavour and high market value. It has been regarded as an excellent model of genetic study for decades as well. In the present study, three mate-pair libraries of T. flavidus genome were sequenced by the SOLiD 4 next-generation sequencing platform, and the draft genome was constructed with the short reads using an assisted assembly strategy. The draft consists of 50,947 scaffolds with an N50 value of 305.7 kb, and the average GC content was 45.2%. The combined length of repetitive sequences was 26.5 Mb, which accounted for 6.87% of the genome, indicating that the compactness of T. flavidus genome was approximative with that of T. rubripes genome. A total of 1,253 non-coding RNA genes and 30,285 protein-encoding genes were assigned to the genome. There were 132,775 and 394 presumptive genes playing roles in the colour pattern variation, the relatively slow growth and the lipid metabolism, respectively. Among them, genes involved in the microtubule-dependent transport system, angiogenesis, decapentaplegic pathway and lipid mobilization were significantly expanded in the T. flavidus genome. This draft genome provides a valuable resource for understanding and improving both fundamental and applied research with pufferfish in the future. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  10. The Draft Genome Sequence of Clostridium sp. Strain NJ4, a Bacterium Capable of Producing Butanol from Inulin Through Consolidated Bioprocessing.

    PubMed

    Jiang, Yujia; Lu, Jiasheng; Chen, Tianpeng; Yan, Wei; Dong, Weiliang; Zhou, Jie; Zhang, Wenming; Ma, Jiangfeng; Jiang, Min; Xin, Fengxue

    2018-05-23

    A novel butanogenic Clostridium sp. NJ4 was successfully isolated and characterized, which could directly produce relatively high titer of butanol from inulin through consolidated bioprocessing (CBP). The assembled draft genome of strain NJ4 is 4.09 Mp, containing 3891 encoded protein sequences with G+C content of 30.73%. Among these annotated genes, a levanase, a hypothetical inulinase, and two bifunctional alcohol/aldehyde dehydrogenases (AdhE) were found to play key roles in the achievement of ABE production from inulin through CBP.

  11. Draft Genome Sequence of Bacillus sp. GZT, a 2,4,6-Tribromophenol-Degrading Strain Isolated from the River Sludge of an Electronic Waste-Dismantling Region

    PubMed Central

    Liang, Zhishu; Li, Guiying; Das, Ranjit

    2016-01-01

    Here, we report the draft genome sequence of Bacillus sp. strain GZT, a 2,4,6-tribromophenol (TBP)-degrading bacterium previously isolated from an electronic waste-dismantling region. The draft genome sequence is 5.18 Mb and has a G+C content of 35.1%. This is the first genome report of a brominated flame retardant-degrading strain. PMID:27257197

  12. The draft genome sequence of Mangrovibacter sp. strain MP23, an endophyte isolated from the roots of Phragmites karka.

    PubMed

    Behera, Pratiksha; Vaishampayan, Parag; Singh, Nitin K; Mishra, Samir R; Raina, Vishakha; Suar, Mrutyunjay; Pattnaik, Ajit K; Rastogi, Gurdeep

    2016-09-01

    Till date, only one draft genome has been reported within the genus Mangrovibacter. Here, we report the second draft genome shotgun sequence of a Mangrovibacter sp. strain MP23 that was isolated from the roots of Phargmites karka (P. karka), an invasive weed growing in the Chilika Lagoon, Odisha, India. Strain MP23 is a facultative anaerobic, nitrogen-fixing endophytic bacteria that grows optimally at 37 °C, 7.0 pH, and 1% NaCl concentration. The draft genome sequence of strain MP23 contains 4,947,475 bp with an estimated G + C content of 49.9% and total 4392 protein coding genes. The genome sequence has provided information on putative genes that code for proteins involved in oxidative stress, uptake of nutrients, and nitrogen fixation that might offer niche specific ecological fitness and explain the invasive success of P. karka in Chilika Lagoon. The draft genome sequence and annotation have been deposited at DDBJ/EMBL/GenBank under the accession number LYRP00000000.

  13. Draft Genome Sequence of a Rare Smut Relative, Tilletiaria anomala UBC 951

    DOE PAGES

    Toome, Merje; Kuo, Alan; Henrissat, Bernard; ...

    2014-06-12

    We present the draft genome sequence of the smut fungus Tilletiaria anomala UBC 951 (Basidiomycota, Ustilaginomycotina). The sequenced genome size is 18.7 Mb, consisting of 289 scaffolds and a total of 6,810 predicted genes. This is the first genome sequence published for a fungus in the order Georgefisheriales (Exobasidiomycetes).

  14. Draft genome sequence of Thermoanaerobacterium sp. strain PSU-2 isolated from thermophilic hydrogen producing reactor.

    PubMed

    O-Thong, Sompong; Khongkliang, Peerawat; Mamimin, Chonticha; Singkhala, Apinya; Prasertsan, Poonsuk; Birkeland, Nils-Kåre

    2017-06-01

    Thermoanaerobacterium sp. strain PSU-2 was isolated from thermophilic hydrogen producing reactor and subjected to draft genome sequencing on 454 pyrosequencing and annotated on RAST. The draft genome sequence of strain PSU-2 contains 2,552,497 bases with an estimated G + C content of 35.2%, 2555 CDS, 8 rRNAs and 57 tRNAs. The strain had a number of genes responsible for carbohydrates metabolic, amino acids and derivatives, and protein metabolism of 17.7%, 14.39% and 9.81%, respectively. Strain PSU-2 also had gene responsible for hydrogen biosynthesis as well as the genes related to Ni-Fe hydrogenase. Comparative genomic analysis indicates strain PSU-2 shares about 94% genome sequence similarity with Thermoanaerobacterium xylanolyticum LX-11. The nucleotide sequence of this draft genome was deposited into DDBJ/ENA/GenBank under the accession MSQD00000000.

  15. Draft Genome Sequence of a Pseudomonas aeruginosa NA04 Bacterium Isolated from an Entomopathogenic Nematode.

    PubMed

    Salgado-Morales, Rosalba; Rivera-Gómez, Nancy; Lozano-Aguirre Beltrán, Luis Fernando; Hernández-Mendoza, Armando; Dantán-González, Edgar

    2017-09-07

    We report the draft genome sequence of Gram-negative bacterium Pseudomonas aeruginosa NA04, isolated from the entomopathogenic nematode Heterorhabditis indica MOR03. The draft genome consists of 54 contigs, a length of 6.37 Mb, and a G+C content 66.49%. Copyright © 2017 Salgado-Morales et al.

  16. Draft genome analysis provides insights into the fiber yield, crude protein biosynthesis, and vegetative growth of domesticated ramie (Boehmeria nivea L. Gaud).

    PubMed

    Liu, Chan; Zeng, Liangbin; Zhu, Siyuan; Wu, Lingqing; Wang, Yanzhou; Tang, Shouwei; Wang, Hongwu; Zheng, Xia; Zhao, Jian; Chen, Xiaorong; Dai, Qiuzhong; Liu, Touming

    2017-11-15

    Plentiful bast fiber, a high crude protein content, and vigorous vegetative growth make ramie a popular fiber and forage crop. Here, we report the draft genome of ramie, along with a genomic comparison and evolutionary analysis. The draft genome contained a sequence of approximately 335.6 Mb with 42,463 predicted genes. A high-density genetic map with 4,338 single nucleotide polymorphisms (SNPs) was developed and used to anchor the genome sequence, thus, creating an integrated genetic and physical map containing a 58.2-Mb genome sequence and 4,304 molecular markers. A genomic comparison identified 1,075 unique gene families in ramie, containing 4,082 genes. Among these unique genes, five were cellulose synthase genes that were specifically expressed in stem bark, and 3 encoded a WAT1-related protein, suggesting that they are probably related to high bast fiber yield. An evolutionary analysis detected 106 positively selected genes, 22 of which were related to nitrogen metabolism, indicating that they are probably responsible for the crude protein content and vegetative growth of domesticated varieties. This study is the first to characterize the genome and develop a high-density genetic map of ramie and provides a basis for the genetic and molecular study of this crop. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  17. Draft genome sequence of the phenazine-producing Pseudomonas fluorescens strain 2-79

    USDA-ARS?s Scientific Manuscript database

    Pseudomonas fluorescens strain 2-79, a natural isolate of the rhizosphere of wheat (Triticum aestivum L.), possesses antagonistic potential toward several fungal pathogens. We report the draft genome sequence of strain 2-79, which comprises 5,674 protein-coding sequences....

  18. Alignment-Free Design of Highly Discriminatory Diagnostic Primer Sets for Escherichia coli O104:H4 Outbreak Strains

    PubMed Central

    Bielaszewska, Martina; Karch, Helge; Toth, Ian K.

    2012-01-01

    Background An Escherichia coli O104:H4 outbreak in Germany in summer 2011 caused 53 deaths, over 4000 individual infections across Europe, and considerable economic, social and political impact. This outbreak was the first in a position to exploit rapid, benchtop high-throughput sequencing (HTS) technologies and crowdsourced data analysis early in its investigation, establishing a new paradigm for rapid response to disease threats. We describe a novel strategy for design of diagnostic PCR primers that exploited this rapid draft bacterial genome sequencing to distinguish between E. coli O104:H4 outbreak isolates and other pathogenic E. coli isolates, including the historical hæmolytic uræmic syndrome (HUSEC) E. coli HUSEC041 O104:H4 strain, which possesses the same serotype as the outbreak isolates. Methodology/Principal Findings Primers were designed using a novel alignment-free strategy against eleven draft whole genome assemblies of E. coli O104:H4 German outbreak isolates from the E. coli O104:H4 Genome Analysis Crowd-Sourcing Consortium website, and a negative sequence set containing 69 E. coli chromosome and plasmid sequences from public databases. Validation in vitro against 21 ‘positive’ E. coli O104:H4 outbreak and 32 ‘negative’ non-outbreak EHEC isolates indicated that individual primer sets exhibited 100% sensitivity for outbreak isolates, with false positive rates of between 9% and 22%. A minimal combination of two primers discriminated between outbreak and non-outbreak E. coli isolates with 100% sensitivity and 100% specificity. Conclusions/Significance Draft genomes of isolates of disease outbreak bacteria enable high throughput primer design and enhanced diagnostic performance in comparison to traditional molecular assays. Future outbreak investigations will be able to harness HTS rapidly to generate draft genome sequences and diagnostic primer sets, greatly facilitating epidemiology and clinical diagnostics. We expect that high throughput primer design strategies will enable faster, more precise responses to future disease outbreaks of bacterial origin, and help to mitigate their societal impact. PMID:22496820

  19. Draft genome sequence of Mycobacterium tuberculosis strain B9741 of Beijing B0/W lineage from HIV positive patient from Siberia.

    PubMed

    Shur, K V; Zaychikova, M V; Mikheecheva, N E; Klimina, K M; Bekker, O B; Zhdanova, S N; Ogarkov, O B; Danilenko, V N

    2016-12-01

    We report a draft genome sequence of Mycobacterium tuberculosis strain B9741 belonging to Beijing B0/W lineage isolated from a HIV patient from Siberia, Russia. This clinical isolate showed MDR phenotype and resistance to isoniazid, rifampin, streptomycin and pyrazinamide. We analyzed SNPs associated with virulence and resistance. The draft genome sequence and annotation have been deposited at GenBank under the accession NZ_LVJJ00000000.

  20. Sequence-based Network Completion Reveals the Integrality of Missing Reactions in Metabolic Networks*

    PubMed Central

    Krumholz, Elias W.; Libourel, Igor G. L.

    2015-01-01

    Genome-scale metabolic models are central in connecting genotypes to metabolic phenotypes. However, even for well studied organisms, such as Escherichia coli, draft networks do not contain a complete biochemical network. Missing reactions are referred to as gaps. These gaps need to be filled to enable functional analysis, and gap-filling choices influence model predictions. To investigate whether functional networks existed where all gap-filling reactions were supported by sequence similarity to annotated enzymes, four draft networks were supplemented with all reactions from the Model SEED database for which minimal sequence similarity was found in their genomes. Quadratic programming revealed that the number of reactions that could partake in a gap-filling solution was vast: 3,270 in the case of E. coli, where 72% of the metabolites in the draft network could connect a gap-filling solution. Nonetheless, no network could be completed without the inclusion of orphaned enzymes, suggesting that parts of the biochemistry integral to biomass precursor formation are uncharacterized. However, many gap-filling reactions were well determined, and the resulting networks showed improved prediction of gene essentiality compared with networks generated through canonical gap filling. In addition, gene essentiality predictions that were sensitive to poorly determined gap-filling reactions were of poor quality, suggesting that damage to the network structure resulting from the inclusion of erroneous gap-filling reactions may be predictable. PMID:26041773

  1. Genome sequence and plasmid transformation of the model high-yield bacterial cellulose producer Gluconacetobacter hansenii ATCC 53582

    NASA Astrophysics Data System (ADS)

    Florea, Michael; Reeve, Benjamin; Abbott, James; Freemont, Paul S.; Ellis, Tom

    2016-03-01

    Bacterial cellulose is a strong, highly pure form of cellulose that is used in a range of applications in industry, consumer goods and medicine. Gluconacetobacter hansenii ATCC 53582 is one of the highest reported bacterial cellulose producing strains and has been used as a model organism in numerous studies of bacterial cellulose production and studies aiming to increased cellulose productivity. Here we present a high-quality draft genome sequence for G. hansenii ATCC 53582 and find that in addition to the previously described cellulose synthase operon, ATCC 53582 contains two additional cellulose synthase operons and several previously undescribed genes associated with cellulose production. In parallel, we also develop optimized protocols and identify plasmid backbones suitable for transformation of ATCC 53582, albeit with low efficiencies. Together, these results provide important information for further studies into cellulose synthesis and for future studies aiming to genetically engineer G. hansenii ATCC 53582 for increased cellulose productivity.

  2. Draft Genome Sequence of Aspergillus oryzae ATCC 12892

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Deng, Shuang; Pomraning, Kyle R.; Bohutskyi, Pavlo

    The draft genome sequence ofAspergillus oryzaeATCC 12892 is presented here.A. oryzaeproduces 3-nitropropionic acid, which has been investigated with regard to understanding the biosynthesis of nitroorganic compounds.

  3. Strategies for high-altitude adaptation revealed from high-quality draft genome of non-violacein producing Janthinobacterium lividum ERGS5:01.

    PubMed

    Kumar, Rakshak; Acharya, Vishal; Singh, Dharam; Kumar, Sanjay

    2018-01-01

    A light pink coloured bacterial strain ERGS5:01 isolated from glacial stream water of Sikkim Himalaya was affiliated to Janthinobacterium lividum based on 16S rRNA gene sequence identity and phylogenetic clustering. Whole genome sequencing was performed for the strain to confirm its taxonomy as it lacked the typical violet pigmentation of the genus and also to decipher its survival strategy at the aquatic ecosystem of high elevation. The PacBio RSII sequencing generated genome of 5,168,928 bp with 4575 protein-coding genes and 118 RNA genes. Whole genome-based multilocus sequence analysis clustering, in silico DDH similarity value of 95.1% and, the ANI value of 99.25% established the identity of the strain ERGS5:01 (MCC 2953) as a non-violacein producing J. lividum . The genome comparisons across genus Janthinobacterium revealed an open pan-genome with the scope of the addition of new orthologous cluster to complete the genomic inventory. The genomic insight provided the genetic basis of freezing and frequent freeze-thaw cycle tolerance and, for industrially important enzymes. Extended insight into the genome provided clues of crucial genes associated with adaptation in the harsh aquatic ecosystem of high altitude.

  4. Draft genome sequences of Actinomyces timonensis strain 7400942T and its prophage.

    PubMed

    Gorlas, Aurore; Gimenez, Grégory; Raoult, Didier; Roux, Véronique

    2012-12-01

    A draft genome sequence of Actinomyces timonensis, an anaerobic bacterium isolated from a human clinical osteoarticular sample, is described here. CRISPR-associated proteins, insertion sequence, and toxin-antitoxin loci were found on the genome. A new virus or provirus, AT-1, was characterized.

  5. Draft Genome Sequence of a Picorna-Like Virus Associated with Gill Tissue in Clinically Normal Brook Trout, Salvelinus fontinalis.

    PubMed

    Iwanowicz, Luke R; Iwanowicz, Deborah D; Adams, Cynthia R; Galbraith, Heather; Aunins, Aaron; Cornman, Robert S

    2017-10-12

    Here, we report a draft genome sequence of a picorna-like virus associated with brook trout, Salvelinus fontinalis , gill tissue. The draft genome comprises 8,681 nucleotides, excluding the poly(A) tract, and contains two open reading frames. It is most similar to picorna-like viruses that infect invertebrates.

  6. Draft genome sequence of a picorna-like virus associated with gill tissue in clinically normal brook trout, Salvelinus fontinalis

    USGS Publications Warehouse

    Iwanowicz, Luke R.; Iwanowicz, Deborah; Adams, Cynthia; Galbraith, Heather S.; Aunins, Aaron W.; Cornman, Robert S.

    2017-01-01

    Here, we report a draft genome sequence of a picorna-like virus associated with brook trout, Salvelinus fontinalis, gill tissue. The draft genome comprises 8,681 nucleotides, excluding the poly(A) tract, and contains two open reading frames. It is most similar to picorna-like viruses that infect invertebrates.

  7. Genome sequence of Ensifer meliloti strain WSM1022; a highly effective microsymbiont of the model legume Medicago truncatula A17.

    PubMed

    Terpolilli, Jason; Hill, Yvette; Tian, Rui; Howieson, John; Bräu, Lambert; Goodwin, Lynne; Han, James; Liolios, Konstantinos; Huntemann, Marcel; Pati, Amrita; Woyke, Tanja; Mavromatis, Konstantinos; Markowitz, Victor; Ivanova, Natalia; Kyrpides, Nikos; Reeve, Wayne

    2013-12-20

    Ensifer meliloti WSM1022 is an aerobic, motile, Gram-negative, non-spore-forming rod that can exist as a soil saprophyte or as a legume microsymbiont of Medicago. WSM1022 was isolated in 1987 from a nodule recovered from the roots of the annual Medicago orbicularis growing on the Cyclades Island of Naxos in Greece. WSM1022 is highly effective at fixing nitrogen with M. truncatula and other annual species such as M. tornata and M. littoralis and is also highly effective with the perennial M. sativa (alfalfa or lucerne). In common with other characterized E. meliloti strains, WSM1022 will nodulate but fixes poorly with M. polymorpha and M. sphaerocarpos and does not nodulate M. murex. Here we describe the features of E. meliloti WSM1022, together with genome sequence information and its annotation. The 6,649,661 bp high-quality-draft genome is arranged into 121 scaffolds of 125 contigs containing 6,323 protein-coding genes and 75 RNA-only encoding genes, and is one of 100 rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.

  8. High quality draft genome sequence of Bacteroides barnesiae type strain BL2T (DSM 18169T) from chicken caecum

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sakamoto, Mitsuo; Lapidus, Alla L.; Han, James

    Bacteroides barnesiae Lan et al. 2006 is a species of the genus Bacteroides, which belongs to the family Bacteroidaceae. Strain BL2T is of interest because it was isolated from the gut of a chicken and the growing awareness that the anaerobic microbiota of the caecum is of benefit for the host and may impact poultry farming. We report that the 3,621,509 bp long genome with its 3,059 protein-coding and 97 RNA genes is a part of the Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes (KMG) project.

  9. High quality draft genome sequence of Bacteroides barnesiae type strain BL2T (DSM 18169T) from chicken caecum

    DOE PAGES

    Sakamoto, Mitsuo; Lapidus, Alla L.; Han, James; ...

    2015-08-02

    Bacteroides barnesiae Lan et al. 2006 is a species of the genus Bacteroides, which belongs to the family Bacteroidaceae. Strain BL2T is of interest because it was isolated from the gut of a chicken and the growing awareness that the anaerobic microbiota of the caecum is of benefit for the host and may impact poultry farming. We report that the 3,621,509 bp long genome with its 3,059 protein-coding and 97 RNA genes is a part of the Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes (KMG) project.

  10. Finishing and Special Motifs: Lessons Learned from CRISPR Analysis Using Next-Generation Draft Sequences (7th Annual SFAF Meeting, 2012)

    ScienceCinema

    Campbell, Catherine

    2018-01-22

    Catherine Campbell on "Finishing and Special Motifs: Lessons learned from CRISPR analysis using next-generation draft sequences" at the 2012 Sequencing, Finishing, Analysis in the Future Meeting held June 5-7, 2012 in Santa Fe, New Mexico.

  11. Finishing and Special Motifs: Lessons Learned from CRISPR Analysis Using Next-Generation Draft Sequences (7th Annual SFAF Meeting, 2012)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Campbell, Catherine

    Catherine Campbell on "Finishing and Special Motifs: Lessons learned from CRISPR analysis using next-generation draft sequences" at the 2012 Sequencing, Finishing, Analysis in the Future Meeting held June 5-7, 2012 in Santa Fe, New Mexico.

  12. Draft Genome Sequence of Streptococcus orisasini SH06, Isolated from a Healthy Thoroughbred Gastrointestinal Tract.

    PubMed

    Takagi, Misako; Nakano, Akiyo; Toh, Hidehiro; Oshima, Kenshiro; Arakawa, Kensuke; Nakajima, Fumihiko; Tashiro, Kosuke; Kikusui, Tekefumi; Yanagida, Fujitoshi; Morita, Hidetoshi

    2016-01-14

    Streptococcus orisasini SH06 was isolated from a healthy thoroughbred gastrointestinal tract. Here, we report the draft genome sequence of this organism. This paper is the first published report of the genomic sequence of S. orisasini. Copyright © 2016 Takagi et al.

  13. Draft Genome Sequence of Sphingobium sp. Strain HDIPO4, an Avid Degrader of Hexachlorocyclohexane

    PubMed Central

    Mukherjee, Udita; Kumar, Roshan; Mahato, Nitish Kumar; Khurana, J. P.

    2013-01-01

    Sphingobium sp. strain HDIPO4 was isolated from a hexachlorocyclohexane (HCH) dumpsite and degraded HCH isomers rapidly. The draft genome sequence of HDIPO4 (~4.7 Mbp) contains 143 contigs and 4,646 coding sequences with a G+C content of 65%. PMID:24051321

  14. Genetic analysis of the Hungarian draft horse population using partial mitochondrial DNA D-loop sequencing.

    PubMed

    Csizmár, Nikolett; Mihók, Sándor; Jávor, András; Kusza, Szilvia

    2018-01-01

    The Hungarian draft is a horse breed with a recent mixed ancestry created in the 1920s by crossing local mares with draught horses imported from France and Belgium. The interest in its conservation and characterization has increased over the last few years. The aim of this work is to contribute to the characterization of the endangered Hungarian heavy draft horse populations in order to obtain useful information to implement conservation strategies for these genetic stocks. To genetically characterize the breed and to set up the basis for a conservation program, in the present study a hypervariable region of the mitochrondial DNA (D-loop) was used to assess genetic diversity in Hungarian draft horses. Two hundred and eighty five sequences obtained in our laboratory and 419 downloaded sequences available from Genbank were analyzed. One hundred and sixty-four haplotypes and thirty-six polymorphic sites were observed. High haplotype and nucleotide diversity values ( H d  = 0.954 ± 0.004; π  = 0.028 ± 0.0004) were identified in Hungarian population, although they were higher within than among the different populations ( H d  = 0.972 ± 0.002; π  = 0.03097 ± 0.002). Fourteen of the previously observed seventeen haplogroups were detected. Our samples showed a large intra- and interbreed variation. There was no clear clustering on the median joining network figure. The overall information collected in this work led us to consider that the genetic scenario observed for Hungarian draft breed is more likely the result of contributions from 'ancestrally' different genetic backgrounds. This study could contribute to the development of a breeding plan for Hungarian draft horses and help to formulate a genetic conservation plan, avoiding inbreeding while.

  15. Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution.

    PubMed

    2004-12-09

    We present here a draft genome sequence of the red jungle fowl, Gallus gallus. Because the chicken is a modern descendant of the dinosaurs and the first non-mammalian amniote to have its genome sequenced, the draft sequence of its genome--composed of approximately one billion base pairs of sequence and an estimated 20,000-23,000 genes--provides a new perspective on vertebrate genome evolution, while also improving the annotation of mammalian genomes. For example, the evolutionary distance between chicken and human provides high specificity in detecting functional elements, both non-coding and coding. Notably, many conserved non-coding sequences are far from genes and cannot be assigned to defined functional classes. In coding regions the evolutionary dynamics of protein domains and orthologous groups illustrate processes that distinguish the lineages leading to birds and mammals. The distinctive properties of avian microchromosomes, together with the inferred patterns of conserved synteny, provide additional insights into vertebrate chromosome architecture.

  16. Draft Genome Sequence of a Picorna-Like Virus Associated with Gill Tissue in Clinically Normal Brook Trout, Salvelinus fontinalis

    PubMed Central

    2017-01-01

    ABSTRACT Here, we report a draft genome sequence of a picorna-like virus associated with brook trout, Salvelinus fontinalis, gill tissue. The draft genome comprises 8,681 nucleotides, excluding the poly(A) tract, and contains two open reading frames. It is most similar to picorna-like viruses that infect invertebrates. PMID:29025930

  17. Draft Genome Sequence of Bioactive-Compound-Producing Cyanobacterium Tolypothrix campylonemoides Strain VB511288

    PubMed Central

    Das, Subhadeep; Singh, Deeksha; Madduluri, Madhavi; Chandrababunaidu, Mathu Malar; Gupta, Akash

    2015-01-01

    We report here the draft genome sequence of Tolypothrix campylonemoides VB511288, isolated from building facades in Santiniketan, India. The members of this genus produce several compounds of commercial importance. The draft assembly is 10,627,177 bases in 135 scaffolds, and it contains 7,886 protein-coding genes, 994 pseudogenes, 18 rRNA genes, and 76 tRNA genes. PMID:25838485

  18. Mississippi Curriculum Framework for Drafting and Design Technology (Program CIP: 48.0102--Architectural Drafting Technology) (Program CIP: 48.0101--General Drafting). Postsecondary Programs.

    ERIC Educational Resources Information Center

    Mississippi Research and Curriculum Unit for Vocational and Technical Education, State College.

    This document, which is intended for use by community and junior colleges throughout Mississippi, contains curriculum frameworks for the two course sequences of the state's postsecondary-level drafting and design technology program: architectural drafting technology and drafting and design technology. Presented first are a program description and…

  19. Draft genome sequence of field isolate Brucella melitensis strain 2007BM/1 from India.

    PubMed

    Singh, D K; Kumar, Bablu; Shrinet, Garima; Singh, R P; Das, Aparajita; Mantur, B G; Abhishek; Pandey, Aruna; Mondal, Piyali; Sajjanar, B K; Doimari, Soni; Singh, Vijayata; Kumari, Reena; Tiwari, A K; Gandham, Ravi Kumar

    2018-04-21

    Brucellosis is among one of the most widespread important global zoonotic diseases that is endemic in many parts of India. Brucella melitensis is supposed to be the most pathogenic species for humans. Here we report the draft genome sequence of B. melitensis strain 2007BM/1 isolated from a human in India. Genomic DNA was extracted from Brucella culture and was sequenced using an Illumina MiSeq platform. The generated reads were assembled using three de novo assemblers and the draft genome was annotated. This monoisolate, with a genome length of 3268756bp, was found to be resistant to azithromycin and trimethoprim/sulfamethoxazole but susceptible to tetracycline, ofloxacin, rifampicin, ciprofloxacin and doxycycline. The presence of virulence genes in the strain was identified. The results obtained will help in understanding drug resistance mechanisms and virulence factors in highly zoonotic B. melitensis and suggest the need for judicious use of antibiotics in livestock health and management practices. Copyright © 2018 International Society for Chemotherapy of Infection and Cancer. Published by Elsevier Ltd. All rights reserved.

  20. Dramatic improvement in genome assembly achieved using doubled-haploid genomes.

    PubMed

    Zhang, Hong; Tan, Engkong; Suzuki, Yutaka; Hirose, Yusuke; Kinoshita, Shigeharu; Okano, Hideyuki; Kudoh, Jun; Shimizu, Atsushi; Saito, Kazuyoshi; Watabe, Shugo; Asakawa, Shuichi

    2014-10-27

    Improvement in de novo assembly of large genomes is still to be desired. Here, we improved draft genome sequence quality by employing doubled-haploid individuals. We sequenced wildtype and doubled-haploid Takifugu rubripes genomes, under the same conditions, using the Illumina platform and assembled contigs with SOAPdenovo2. We observed 5.4-fold and 2.6-fold improvement in the sizes of the N50 contig and scaffold of doubled-haploid individuals, respectively, compared to the wildtype, indicating that the use of a doubled-haploid genome aids in accurate genome analysis.

  1. Draft Genome Sequence of Sphingobium fuliginis OMI, a Bacterium That Degrades Alkylphenols and Bisphenols

    PubMed Central

    Ogata, Yuka; Yahara, Tatsuya; Yokoyama, Takashi; Ishizawa, Hidehiro; Takada, Kazuki; Inoue, Daisuke; Sei, Kazunari

    2017-01-01

    ABSTRACT Sphingobium fuliginis OMI is a bacterium that can degrade a variety of recalcitrant alkylphenols and bisphenols. This study reports the draft genome sequence of S. fuliginis OMI. PMID:29167253

  2. Draft genome sequences of 64 swine associated LA-MRSA ST5 isolates from the USA

    USDA-ARS?s Scientific Manuscript database

    Methicillin resistant Staphylococcus aureus colonizes humans and other animals such as swine. LA-MRSA sequence type (ST) 5 isolates are a public concern due to their pathogenicity and ability to acquire mobile genetic elements. This report presents draft genome sequences for 64 LA-MRSA ST5 isolates ...

  3. Draft Genome Sequence of Saccharomyces cerevisiae Barra Grande (BG-1), a Brazilian Industrial Bioethanol-Producing Strain

    PubMed Central

    Coutouné, Natalia; Mulato, Aline Tieppo Nogueira

    2017-01-01

    ABSTRACT Here, we present the draft genome sequence of Saccharomyces cerevisiae BG-1, a Brazilian industrial strain widely used for bioethanol production from sugarcane. The 11.7-Mb genome sequence consists of 216 scaffolds and harbors 5,607 predicted protein-coding genes. PMID:28360170

  4. Improved genomic resources and new bioinformatic workflow for the carcinogenic parasite Clonorchis sinensis: Biotechnological implications.

    PubMed

    Wang, Daxi; Korhonen, Pasi K; Gasser, Robin B; Young, Neil D

    Clonorchis sinensis (family Opisthorchiidae) is an important foodborne parasite that has a major socioeconomic impact on ~35 million people predominantly in China, Vietnam, Korea and the Russian Far East. In humans, infection with C. sinensis causes clonorchiasis, a complex hepatobiliary disease that can induce cholangiocarcinoma (CCA), a malignant cancer of the bile ducts. Central to understanding the epidemiology of this disease is knowledge of genetic variation within and among populations of this parasite. Although most published molecular studies seem to suggest that C. sinensis represents a single species, evidence of karyotypic variation within C. sinensis and cryptic species within a related opisthorchiid fluke (Opisthorchis viverrini) emphasise the importance of studying and comparing the genes and genomes of geographically distinct isolates of C. sinensis. Recently, we sequenced, assembled and characterised a draft nuclear genome of a C. sinensis isolate from Korea and compared it with a published draft genome of a Chinese isolate of this species using a bioinformatic workflow established for comparing draft genome assemblies and their gene annotations. We identified that 50.6% and 51.3% of the Korean and Chinese C. sinensis genomic scaffolds were syntenic, respectively. Within aligned syntenic blocks, the genomes had a high level of nucleotide identity (99.1%) and encoded 15 variable proteins likely to be involved in diverse biological processes. Here, we review current technical challenges of using draft genome assemblies to undertake comparative genomic analyses to quantify genetic variation between isolates of the same species. Using a workflow that overcomes these challenges, we report on a high-quality draft genome for C. sinensis from Korea and comparative genomic analyses, as a basis for future investigations of the genetic structures of C. sinensis populations, and discuss the biotechnological implications of these explorations. Copyright © 2018 Elsevier Inc. All rights reserved.

  5. Extreme Sensory Complexity Encoded in the 10-Megabase Draft Genome Sequence of the Chromatically Acclimating Cyanobacterium Tolypothrix sp. PCC 7601

    PubMed Central

    Yerrapragada, Shaila; Shukla, Animesh; Hallsworth-Pepin, Kymberlie; Choi, Kwangmin; Wollam, Aye; Clifton, Sandra; Qin, Xiang; Muzny, Donna; Raghuraman, Sriram; Ashki, Haleh; Uzman, Akif; Highlander, Sarah K.; Fryszczyn, Bartlomiej G.; Fox, George E.; Tirumalai, Madhan R.; Liu, Yamei; Kim, Sun

    2015-01-01

    Tolypothrix sp. PCC 7601 is a freshwater filamentous cyanobacterium with complex responses to environmental conditions. Here, we present its 9.96-Mbp draft genome sequence, containing 10,065 putative protein-coding sequences, including 305 predicted two-component system proteins and 27 putative phytochrome-class photoreceptors, the most such proteins in any sequenced genome. PMID:25953173

  6. A reference genome of the European beech (Fagus sylvatica L.).

    PubMed

    Mishra, Bagdevi; Gupta, Deepak K; Pfenninger, Markus; Hickler, Thomas; Langer, Ewald; Nam, Bora; Paule, Juraj; Sharma, Rahul; Ulaszewski, Bartosz; Warmbier, Joanna; Burczyk, Jaroslaw; Thines, Marco

    2018-06-01

    The European beech is arguably the most important climax broad-leaved tree species in Central Europe, widely planted for its valuable wood. Here, we report the 542 Mb draft genome sequence of an up to 300-year-old individual (Bhaga) from an undisturbed stand in the Kellerwald-Edersee National Park in central Germany. Using a hybrid assembly approach, Illumina reads with short- and long-insert libraries, coupled with long Pacific Biosciences reads, we obtained an assembled genome size of 542 Mb, in line with flow cytometric genome size estimation. The largest scaffold was of 1.15 Mb, the N50 length was 145 kb, and the L50 count was 983. The assembly contained 0.12% of Ns. A Benchmarking with Universal Single-Copy Orthologs (BUSCO) analysis retrieved 94% complete BUSCO genes, well in the range of other high-quality draft genomes of trees. A total of 62,012 protein-coding genes were predicted, assisted by transcriptome sequencing. In addition, we are reporting an efficient method for extracting high-molecular-weight DNA from dormant buds, by which contamination by environmental bacteria and fungi was kept at a minimum. The assembled genome will be a valuable resource and reference for future population genomics studies on the evolution and past climate change adaptation of beech and will be helpful for identifying genes, e.g., involved in drought tolerance, in order to select and breed individuals to adapt forestry to climate change in Europe. A continuously updated genome browser and download page can be accessed from beechgenome.net, which will include future genome versions of the reference individual Bhaga, as new sequencing approaches develop.

  7. Random Amplification and Pyrosequencing for Identification of Novel Viral Genome Sequences

    PubMed Central

    Hang, Jun; Forshey, Brett M.; Kochel, Tadeusz J.; Li, Tao; Solórzano, Víctor Fiestas; Halsey, Eric S.; Kuschner, Robert A.

    2012-01-01

    ssRNA viruses have high levels of genomic divergence, which can lead to difficulty in genomic characterization of new viruses using traditional PCR amplification and sequencing methods. In this study, random reverse transcription, anchored random PCR amplification, and high-throughput pyrosequencing were used to identify orthobunyavirus sequences from total RNA extracted from viral cultures of acute febrile illness specimens. Draft genome sequence for the orthobunyavirus L segment was assembled and sequentially extended using de novo assembly contigs from pyrosequencing reads and orthobunyavirus sequences in GenBank as guidance. Accuracy and continuous coverage were achieved by mapping all reads to the L segment draft sequence. Subsequently, RT-PCR and Sanger sequencing were used to complete the genome sequence. The complete L segment was found to be 6936 bases in length, encoding a 2248-aa putative RNA polymerase. The identified L segment was distinct from previously published South American orthobunyaviruses, sharing 63% and 54% identity at the nucleotide and amino acid level, respectively, with the complete Oropouche virus L segment and 73% and 81% identity at the nucleotide and amino acid level, respectively, with a partial Caraparu virus L segment. The result demonstrated the effectiveness of a sequence-independent amplification and next-generation sequencing approach for obtaining complete viral genomes from total nucleic acid extracts and its use in pathogen discovery. PMID:22468136

  8. Draft Genome Sequence of Bioactive-Compound-Producing Cyanobacterium Tolypothrix campylonemoides Strain VB511288.

    PubMed

    Das, Subhadeep; Singh, Deeksha; Madduluri, Madhavi; Chandrababunaidu, Mathu Malar; Gupta, Akash; Adhikary, Siba Prasad; Tripathy, Sucheta

    2015-04-02

    We report here the draft genome sequence of Tolypothrix campylonemoides VB511288, isolated from building facades in Santiniketan, India. The members of this genus produce several compounds of commercial importance. The draft assembly is 10,627,177 bases in 135 scaffolds, and it contains 7,886 protein-coding genes, 994 pseudogenes, 18 rRNA genes, and 76 tRNA genes. Copyright © 2015 Das et al.

  9. Draft Genome Sequence of the Terrestrial Cyanobacterium Scytonema millei VB511283, Isolated from Eastern India

    PubMed Central

    Sen, Diya; Chandrababunaidu, Mathu Malar; Singh, Deeksha; Sanghi, Neha; Ghorai, Arpita; Mishra, Gyan Prakash; Madduluri, Madhavi

    2015-01-01

    We report here the draft genome sequence of Scytonema millei VB511283, a cyanobacterium isolated from biofilms on the exterior of stone monuments in Santiniketan, eastern India. The draft genome is 11,627,246 bp long (11.63 Mb), with 118 scaffolds. About 9,011 protein-coding genes, 117 tRNAs, and 12 rRNAs are predicted from this assembly. PMID:25744984

  10. Draft Genome Sequence of Aeromonas caviae Strain 429865 INP, Isolated from a Mexican Patient

    PubMed Central

    Padilla, Juan Carlos A.; Bustos, Patricia; Sánchez-Varela, Alejandro; Palma-Martinez, Ingrid; Arzate-Barbosa, Patricia; García-Pérez, Carlos A.; López-López, María de Jesús; González, Víctor

    2015-01-01

    Aeromonas caviae is an emerging human pathogen. Here, we report the draft genome sequence of Aeromonas caviae strain 429865 INP which shows the presence of various putative virulence-related genes. PMID:26494682

  11. Draft Genome Sequence of Lactobacillus plantarum Strain IPLA 88

    PubMed Central

    Ladero, Victor; Alvarez-Sieiro, Patricia; Redruello, Begoña; del Rio, Beatriz; Linares, Daniel M.; Martin, M. Cruz; Fernández, María

    2013-01-01

    Here, we report a 3.2-Mbp draft assembly for the genome of Lactobacillus plantarum IPLA 88. The sequence of this sourdough isolate provides insight into the adaptation of this versatile species to different environments. PMID:23887921

  12. Development of genomic tools in a widespread tropical tree, Symphonia globulifera L.f.: a new low-coverage draft genome, SNP and SSR markers.

    PubMed

    Olsson, Sanna; Seoane-Zonjic, Pedro; Bautista, Rocío; Claros, M Gonzalo; González-Martínez, Santiago C; Scotti, Ivan; Scotti-Saintagne, Caroline; Hardy, Olivier J; Heuertz, Myriam

    2017-07-01

    Population genetic studies in tropical plants are often challenging because of limited information on taxonomy, phylogenetic relationships and distribution ranges, scarce genomic information and logistic challenges in sampling. We describe a strategy to develop robust and widely applicable genetic markers based on a modest development of genomic resources in the ancient tropical tree species Symphonia globulifera L.f. (Clusiaceae), a keystone species in African and Neotropical rainforests. We provide the first low-coverage (11X) fragmented draft genome sequenced on an individual from Cameroon, covering 1.027 Gbp or 67.5% of the estimated genome size. Annotation of 565 scaffolds (7.57 Mbp) resulted in the prediction of 1046 putative genes (231 of them containing a complete open reading frame) and 1523 exact simple sequence repeats (SSRs, microsatellites). Aligning a published transcriptome of a French Guiana population against this draft genome produced 923 high-quality single nucleotide polymorphisms. We also preselected genic SSRs in silico that were conserved and polymorphic across a wide geographical range, thus reducing marker development tests on rare DNA samples. Of 23 SSRs tested, 19 amplified and 18 were successfully genotyped in four S. globulifera populations from South America (Brazil and French Guiana) and Africa (Cameroon and São Tomé island, F ST  = 0.34). Most loci showed only population-specific deviations from Hardy-Weinberg proportions, pointing to local population effects (e.g. null alleles). The described genomic resources are valuable for evolutionary studies in Symphonia and for comparative studies in plants. The methods are especially interesting for widespread tropical or endangered taxa with limited DNA availability. © 2016 John Wiley & Sons Ltd.

  13. Draft genome sequences of Streptococcus bovis strains ATCC 33317 and JB1

    USDA-ARS?s Scientific Manuscript database

    We report the draft genome sequences of Streptococcus bovis type strain ATTC 33317 (CVM42251) isolated from cow dung and strain JB1 (CVM42252) isolated from a cow rumen in 1977. Strains were subjected to Next Generation sequencing and the genome sizes are approximately 2 MB and 2.2 MB, respectively....

  14. Draft Genome Sequence of a Bacillus Bacterium from the Atacama Desert Wetlands Metagenome

    PubMed Central

    Vilo, Claudia; Galetovic, Alexandra; Araya, Jorge E.; Dong, Qunfeng

    2015-01-01

    We report here the draft genome sequence of a Bacillus bacterium isolated from the microflora of Nostoc colonies grown at the Andean wetlands in northern Chile. We consider this genome sequence to be a molecular tool for exploring microbial relationships and adaptation strategies to the prevailing extreme conditions at the Atacama Desert. PMID:26294639

  15. Draft genome sequences of 9 LA-MRSA ST5 isolates obtained from humans after short term swine contact

    USDA-ARS?s Scientific Manuscript database

    Livestock associated methicillin resistant Staphylococcus aureus (LA-MRSA) sequence type 5 have raised concerns surrounding the potential for these isolates to colonize or cause disease in humans with swine contact. Here, we report draft genome sequences for 9 LA-MRSA ST5 isolates obtained from huma...

  16. Draft genome sequences of 14 swine associated LA-MRSA ST398 isolates from the U.S.

    USDA-ARS?s Scientific Manuscript database

    Livestock associated methicillin resistant Staphylococcus aureus (LA-MRSA) is part of the normal microbiota of swine. The initial and predominant swine associated LA-MRSA sequence type (ST) identified is ST398. Here, we present 14 draft genome sequence from LA-MRSA ST398 isolates found in the US....

  17. Draft Genome Sequence of Thiostrepton-Producing Streptomyces azureus ATCC 14921

    PubMed Central

    Sakihara, Kengo; Maeda, Jumpei; Tashiro, Kosuke; Fujino, Yasuhiro; Kuhara, Satoru; Ohshima, Toshihisa; Ogata, Seiya

    2015-01-01

    Streptomyces azureus ATCC 14921 belongs to the Streptomyces cyaneus cluster and is known to be a thiostrepton producer. Here, we report a draft genome sequence for this strain, consisting of 350 contigs containing a total of 8,790,525 bp, 8,164 predicted coding sequences, and a G+C content of 70.9%. PMID:26494661

  18. Draft Genome Sequence of “Cohnella kolymensis” B-2846

    PubMed Central

    Kudryashova, Ekaterina B.; Ariskina, Elena V.

    2016-01-01

    A draft genome sequence of “Cohnella kolymensis” strain B-2846 was derived using IonTorrent sequencing technology. The size of the assembly and G+C content were in agreement with those of other species of this genus. Characterization of the genome of a novel species of Cohnella will assist in bacterial systematics. PMID:26769947

  19. Draft Genome Sequence of Fish Pathogen Aeromonas bestiarum GA97-22.

    PubMed

    Kumru, Salih; Tekedar, Hasan C; Griffin, Matt J; Waldbieser, Geoffrey C; Liles, Mark R; Sonstegard, Tad; Schroeder, Steven G; Lawrence, Mark L; Karsi, Attila

    2018-06-14

    Aeromonas bestiarum is a Gram-negative mesophilic motile bacterium causing acute hemorrhagic septicemia or chronic skin ulcers in fish. Here, we report the draft genome sequence of A. bestiarum strain GA97-22, which was isolated from rainbow trout in 1997. This genome sequence will improve our understanding of the complex taxonomy of motile aeromonads.

  20. Sequence-based Network Completion Reveals the Integrality of Missing Reactions in Metabolic Networks.

    PubMed

    Krumholz, Elias W; Libourel, Igor G L

    2015-07-31

    Genome-scale metabolic models are central in connecting genotypes to metabolic phenotypes. However, even for well studied organisms, such as Escherichia coli, draft networks do not contain a complete biochemical network. Missing reactions are referred to as gaps. These gaps need to be filled to enable functional analysis, and gap-filling choices influence model predictions. To investigate whether functional networks existed where all gap-filling reactions were supported by sequence similarity to annotated enzymes, four draft networks were supplemented with all reactions from the Model SEED database for which minimal sequence similarity was found in their genomes. Quadratic programming revealed that the number of reactions that could partake in a gap-filling solution was vast: 3,270 in the case of E. coli, where 72% of the metabolites in the draft network could connect a gap-filling solution. Nonetheless, no network could be completed without the inclusion of orphaned enzymes, suggesting that parts of the biochemistry integral to biomass precursor formation are uncharacterized. However, many gap-filling reactions were well determined, and the resulting networks showed improved prediction of gene essentiality compared with networks generated through canonical gap filling. In addition, gene essentiality predictions that were sensitive to poorly determined gap-filling reactions were of poor quality, suggesting that damage to the network structure resulting from the inclusion of erroneous gap-filling reactions may be predictable. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.

  1. Single haplotype assembly of the human genome from a hydatidiform mole.

    PubMed

    Steinberg, Karyn Meltz; Schneider, Valerie A; Graves-Lindsay, Tina A; Fulton, Robert S; Agarwala, Richa; Huddleston, John; Shiryev, Sergey A; Morgulis, Aleksandr; Surti, Urvashi; Warren, Wesley C; Church, Deanna M; Eichler, Evan E; Wilson, Richard K

    2014-12-01

    A complete reference assembly is essential for accurately interpreting individual genomes and associating variation with phenotypes. While the current human reference genome sequence is of very high quality, gaps and misassemblies remain due to biological and technical complexities. Large repetitive sequences and complex allelic diversity are the two main drivers of assembly error. Although increasing the length of sequence reads and library fragments can improve assembly, even the longest available reads do not resolve all regions. In order to overcome the issue of allelic diversity, we used genomic DNA from an essentially haploid hydatidiform mole, CHM1. We utilized several resources from this DNA including a set of end-sequenced and indexed BAC clones and 100× Illumina whole-genome shotgun (WGS) sequence coverage. We used the WGS sequence and the GRCh37 reference assembly to create an assembly of the CHM1 genome. We subsequently incorporated 382 finished BAC clone sequences to generate a draft assembly, CHM1_1.1 (NCBI AssemblyDB GCA_000306695.2). Analysis of gene, repetitive element, and segmental duplication content show this assembly to be of excellent quality and contiguity. However, comparison to assembly-independent resources, such as BAC clone end sequences and PacBio long reads, indicate misassembled regions. Most of these regions are enriched for structural variation and segmental duplication, and can be resolved in the future. This publicly available assembly will be integrated into the Genome Reference Consortium curation framework for further improvement, with the ultimate goal being a completely finished gap-free assembly. © 2014 Steinberg et al.; Published by Cold Spring Harbor Laboratory Press.

  2. Single haplotype assembly of the human genome from a hydatidiform mole

    PubMed Central

    Steinberg, Karyn Meltz; Schneider, Valerie A.; Graves-Lindsay, Tina A.; Fulton, Robert S.; Agarwala, Richa; Huddleston, John; Shiryev, Sergey A.; Morgulis, Aleksandr; Surti, Urvashi; Warren, Wesley C.; Church, Deanna M.; Eichler, Evan E.; Wilson, Richard K.

    2014-01-01

    A complete reference assembly is essential for accurately interpreting individual genomes and associating variation with phenotypes. While the current human reference genome sequence is of very high quality, gaps and misassemblies remain due to biological and technical complexities. Large repetitive sequences and complex allelic diversity are the two main drivers of assembly error. Although increasing the length of sequence reads and library fragments can improve assembly, even the longest available reads do not resolve all regions. In order to overcome the issue of allelic diversity, we used genomic DNA from an essentially haploid hydatidiform mole, CHM1. We utilized several resources from this DNA including a set of end-sequenced and indexed BAC clones and 100× Illumina whole-genome shotgun (WGS) sequence coverage. We used the WGS sequence and the GRCh37 reference assembly to create an assembly of the CHM1 genome. We subsequently incorporated 382 finished BAC clone sequences to generate a draft assembly, CHM1_1.1 (NCBI AssemblyDB GCA_000306695.2). Analysis of gene, repetitive element, and segmental duplication content show this assembly to be of excellent quality and contiguity. However, comparison to assembly-independent resources, such as BAC clone end sequences and PacBio long reads, indicate misassembled regions. Most of these regions are enriched for structural variation and segmental duplication, and can be resolved in the future. This publicly available assembly will be integrated into the Genome Reference Consortium curation framework for further improvement, with the ultimate goal being a completely finished gap-free assembly. PMID:25373144

  3. 75 FR 16459 - Draft Document Related to the Review of the National Ambient Air Quality Standards for...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-04-01

    ... Review of the National Ambient Air Quality Standards for Particulate Matter AGENCY: Environmental... Review of the Particulate Matter National Ambient Air Quality Standards--First External Review Draft (75... Particulate Matter National Ambient Air Quality Standards--First External Review Draft (March 2010), please...

  4. Draft Genome Sequence of Bacillus altitudinis YNP4-TSU, Isolated from Yellowstone National Park

    PubMed Central

    OHair, Joshua A.; Li, Hui; Thapa, Santosh; Scholz, Matthew

    2017-01-01

    ABSTRACT Undisturbed hot springs inside Yellowstone National Park remain a dynamic biome for novel cellulolytic thermophiles. We report here the draft genome sequence of one of these isolates, Bacillus altitudinis YNP4-TSU. PMID:28705979

  5. Draft genome sequence of Xylella fastidiosa subsp. fastidiosa strain Stag’s Leap

    USDA-ARS?s Scientific Manuscript database

    Xylella fastidiosa subsp. fastidiosa causes Pierce’s disease of grapevine. Presented here is the draft genome sequence of the Stag’s Leap strain, previously used in pathogenicity/virulence assays to evaluate grapevine germplasm bearing Pierce’s disease....

  6. Draft Genome Sequence of Sphingobium fuliginis OMI, a Bacterium That Degrades Alkylphenols and Bisphenols.

    PubMed

    Kuroda, Masashi; Ogata, Yuka; Yahara, Tatsuya; Yokoyama, Takashi; Ishizawa, Hidehiro; Takada, Kazuki; Inoue, Daisuke; Sei, Kazunari; Ike, Michihiko

    2017-11-22

    Sphingobium fuliginis OMI is a bacterium that can degrade a variety of recalcitrant alkylphenols and bisphenols. This study reports the draft genome sequence of S. fuliginis OMI. Copyright © 2017 Kuroda et al.

  7. Extreme Sensory Complexity Encoded in the 10-Megabase Draft Genome Sequence of the Chromatically Acclimating Cyanobacterium Tolypothrix sp. PCC 7601.

    PubMed

    Yerrapragada, Shaila; Shukla, Animesh; Hallsworth-Pepin, Kymberlie; Choi, Kwangmin; Wollam, Aye; Clifton, Sandra; Qin, Xiang; Muzny, Donna; Raghuraman, Sriram; Ashki, Haleh; Uzman, Akif; Highlander, Sarah K; Fryszczyn, Bartlomiej G; Fox, George E; Tirumalai, Madhan R; Liu, Yamei; Kim, Sun; Kehoe, David M; Weinstock, George M

    2015-05-07

    Tolypothrix sp. PCC 7601 is a freshwater filamentous cyanobacterium with complex responses to environmental conditions. Here, we present its 9.96-Mbp draft genome sequence, containing 10,065 putative protein-coding sequences, including 305 predicted two-component system proteins and 27 putative phytochrome-class photoreceptors, the most such proteins in any sequenced genome. Copyright © 2015 Yerrapragada et al.

  8. Single-molecule sequencing and optical mapping yields an improved genome of woodland strawberry (Fragaria vesca) with chromosome-scale contiguity.

    PubMed

    Edger, Patrick P; VanBuren, Robert; Colle, Marivi; Poorten, Thomas J; Wai, Ching Man; Niederhuth, Chad E; Alger, Elizabeth I; Ou, Shujun; Acharya, Charlotte B; Wang, Jie; Callow, Pete; McKain, Michael R; Shi, Jinghua; Collier, Chad; Xiong, Zhiyong; Mower, Jeffrey P; Slovin, Janet P; Hytönen, Timo; Jiang, Ning; Childs, Kevin L; Knapp, Steven J

    2018-02-01

    Although draft genomes are available for most agronomically important plant species, the majority are incomplete, highly fragmented, and often riddled with assembly and scaffolding errors. These assembly issues hinder advances in tool development for functional genomics and systems biology. Here we utilized a robust, cost-effective approach to produce high-quality reference genomes. We report a near-complete genome of diploid woodland strawberry (Fragaria vesca) using single-molecule real-time sequencing from Pacific Biosciences (PacBio). This assembly has a contig N50 length of ∼7.9 million base pairs (Mb), representing a ∼300-fold improvement of the previous version. The vast majority (>99.8%) of the assembly was anchored to 7 pseudomolecules using 2 sets of optical maps from Bionano Genomics. We obtained ∼24.96 Mb of sequence not present in the previous version of the F. vesca genome and produced an improved annotation that includes 1496 new genes. Comparative syntenic analyses uncovered numerous, large-scale scaffolding errors present in each chromosome in the previously published version of the F. vesca genome. Our results highlight the need to improve existing short-read based reference genomes. Furthermore, we demonstrate how genome quality impacts commonly used analyses for addressing both fundamental and applied biological questions. © The Authors 2017. Published by Oxford University Press.

  9. Draft Genome Sequence of the Terrestrial Cyanobacterium Scytonema millei VB511283, Isolated from Eastern India.

    PubMed

    Sen, Diya; Chandrababunaidu, Mathu Malar; Singh, Deeksha; Sanghi, Neha; Ghorai, Arpita; Mishra, Gyan Prakash; Madduluri, Madhavi; Adhikary, Siba Prasad; Tripathy, Sucheta

    2015-03-05

    We report here the draft genome sequence of Scytonema millei VB511283, a cyanobacterium isolated from biofilms on the exterior of stone monuments in Santiniketan, eastern India. The draft genome is 11,627,246 bp long (11.63 Mb), with 118 scaffolds. About 9,011 protein-coding genes, 117 tRNAs, and 12 rRNAs are predicted from this assembly. Copyright © 2015 Sen et al.

  10. GI-POP: a combinational annotation and genomic island prediction pipeline for ongoing microbial genome projects.

    PubMed

    Lee, Chi-Ching; Chen, Yi-Ping Phoebe; Yao, Tzu-Jung; Ma, Cheng-Yu; Lo, Wei-Cheng; Lyu, Ping-Chiang; Tang, Chuan Yi

    2013-04-10

    Sequencing of microbial genomes is important because of microbial-carrying antibiotic and pathogenetic activities. However, even with the help of new assembling software, finishing a whole genome is a time-consuming task. In most bacteria, pathogenetic or antibiotic genes are carried in genomic islands. Therefore, a quick genomic island (GI) prediction method is useful for ongoing sequencing genomes. In this work, we built a Web server called GI-POP (http://gipop.life.nthu.edu.tw) which integrates a sequence assembling tool, a functional annotation pipeline, and a high-performance GI predicting module, in a support vector machine (SVM)-based method called genomic island genomic profile scanning (GI-GPS). The draft genomes of the ongoing genome projects in contigs or scaffolds can be submitted to our Web server, and it provides the functional annotation and highly probable GI-predicting results. GI-POP is a comprehensive annotation Web server designed for ongoing genome project analysis. Researchers can perform annotation and obtain pre-analytic information include possible GIs, coding/non-coding sequences and functional analysis from their draft genomes. This pre-analytic system can provide useful information for finishing a genome sequencing project. Copyright © 2012 Elsevier B.V. All rights reserved.

  11. High quality draft genome sequence of Corynebacterium ulceribovis type strain IMMIB-L1395T (DSM 45146T)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yassin, Atteyet F.; Lapidus, Alla; Han, James

    We report that the Corynebacterium ulceribovis strain IMMIB L-1395T (= DSM 45146T) is an aerobic to facultative anaerobic, Gram-positive, non-spore-forming, non-motile rod-shaped bacterium that was isolated from the skin of the udder of a cow, in Schleswig Holstein, Germany. The cell wall of C. ulceribovis contains corynemycolic acids. The cellular fatty acids are those described for the genus Corynebacterium, but tuberculostearic acid is not present. Here we describe the features of C. ulceribovis strain IMMIB L-1395T, together with genome sequence information and its annotation. The 2,300,451 bp long genome containing 2,104 protein-coding genes and 54 RNA-encoding genes and is partmore » of the Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes (KMG) project.« less

  12. High quality draft genome sequence of Corynebacterium ulceribovis type strain IMMIB-L1395T (DSM 45146T)

    DOE PAGES

    Yassin, Atteyet F.; Lapidus, Alla; Han, James; ...

    2015-08-05

    We report that the Corynebacterium ulceribovis strain IMMIB L-1395T (= DSM 45146T) is an aerobic to facultative anaerobic, Gram-positive, non-spore-forming, non-motile rod-shaped bacterium that was isolated from the skin of the udder of a cow, in Schleswig Holstein, Germany. The cell wall of C. ulceribovis contains corynemycolic acids. The cellular fatty acids are those described for the genus Corynebacterium, but tuberculostearic acid is not present. Here we describe the features of C. ulceribovis strain IMMIB L-1395T, together with genome sequence information and its annotation. The 2,300,451 bp long genome containing 2,104 protein-coding genes and 54 RNA-encoding genes and is partmore » of the Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes (KMG) project.« less

  13. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lapidus, Alla L.

    From the date its role in heredity was discovered, DNA has been generating interest among scientists from different fields of knowledge: physicists have studied the three dimensional structure of the DNA molecule, biologists tried to decode the secrets of life hidden within these long molecules, and technologists invent and improve methods of DNA analysis. The analysis of the nucleotide sequence of DNA occupies a special place among the methods developed. Thanks to the variety of sequencing technologies available, the process of decoding the sequence of genomic DNA (or whole genome sequencing) has become robust and inexpensive. Meanwhile the assembly ofmore » whole genome sequences remains a challenging task. In addition to the need to assemble millions of DNA fragments of different length (from 35 bp (Solexa) to 800 bp (Sanger)), great interest in analysis of microbial communities (metagenomes) of different complexities raises new problems and pushes some new requirements for sequence assembly tools to the forefront. The genome assembly process can be divided into two steps: draft assembly and assembly improvement (finishing). Despite the fact that automatically performed assembly (or draft assembly) is capable of covering up to 98% of the genome, in most cases, it still contains incorrectly assembled reads. The error rate of the consensus sequence produced at this stage is about 1/2000 bp. A finished genome represents the genome assembly of much higher accuracy (with no gaps or incorrectly assembled areas) and quality ({approx}1 error/10,000 bp), validated through a number of computer and laboratory experiments.« less

  14. Draft Genome Sequence of Mycobacterium chimaera Type ...

    EPA Pesticide Factsheets

    We report the draft genome sequence of the type strain Mycobacterium chimaera Fl-0169T, a member of the Mycobacterium avium complex (MAC). M. chimaera Fl-0169T was isolated from a patient in Italy and is highly similar to strains of M. chimaera isolated in Ireland, though Fl-0169T possesses unique virulence genes. Evidence suggests that M. avium, M. intracellulare, and M. chimaera are differently virulent and a comparative genomic analysis is critically needed to identify diagnostic targets that reliably differentiate species of MAC. With treatment costs for Mycobacterium infections estimated to be >$1.8 B annually in the U.S., correct species identification will result in improved treatment selection, lower costs, and improved patient outcomes.

  15. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes

    PubMed Central

    Parks, Donovan H.; Imelfort, Michael; Skennerton, Connor T.; Hugenholtz, Philip; Tyson, Gene W.

    2015-01-01

    Large-scale recovery of genomes from isolates, single cells, and metagenomic data has been made possible by advances in computational methods and substantial reductions in sequencing costs. Although this increasing breadth of draft genomes is providing key information regarding the evolutionary and functional diversity of microbial life, it has become impractical to finish all available reference genomes. Making robust biological inferences from draft genomes requires accurate estimates of their completeness and contamination. Current methods for assessing genome quality are ad hoc and generally make use of a limited number of “marker” genes conserved across all bacterial or archaeal genomes. Here we introduce CheckM, an automated method for assessing the quality of a genome using a broader set of marker genes specific to the position of a genome within a reference genome tree and information about the collocation of these genes. We demonstrate the effectiveness of CheckM using synthetic data and a wide range of isolate-, single-cell-, and metagenome-derived genomes. CheckM is shown to provide accurate estimates of genome completeness and contamination and to outperform existing approaches. Using CheckM, we identify a diverse range of errors currently impacting publicly available isolate genomes and demonstrate that genomes obtained from single cells and metagenomic data vary substantially in quality. In order to facilitate the use of draft genomes, we propose an objective measure of genome quality that can be used to select genomes suitable for specific gene- and genome-centric analyses of microbial communities. PMID:25977477

  16. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes.

    PubMed

    Parks, Donovan H; Imelfort, Michael; Skennerton, Connor T; Hugenholtz, Philip; Tyson, Gene W

    2015-07-01

    Large-scale recovery of genomes from isolates, single cells, and metagenomic data has been made possible by advances in computational methods and substantial reductions in sequencing costs. Although this increasing breadth of draft genomes is providing key information regarding the evolutionary and functional diversity of microbial life, it has become impractical to finish all available reference genomes. Making robust biological inferences from draft genomes requires accurate estimates of their completeness and contamination. Current methods for assessing genome quality are ad hoc and generally make use of a limited number of "marker" genes conserved across all bacterial or archaeal genomes. Here we introduce CheckM, an automated method for assessing the quality of a genome using a broader set of marker genes specific to the position of a genome within a reference genome tree and information about the collocation of these genes. We demonstrate the effectiveness of CheckM using synthetic data and a wide range of isolate-, single-cell-, and metagenome-derived genomes. CheckM is shown to provide accurate estimates of genome completeness and contamination and to outperform existing approaches. Using CheckM, we identify a diverse range of errors currently impacting publicly available isolate genomes and demonstrate that genomes obtained from single cells and metagenomic data vary substantially in quality. In order to facilitate the use of draft genomes, we propose an objective measure of genome quality that can be used to select genomes suitable for specific gene- and genome-centric analyses of microbial communities. © 2015 Parks et al.; Published by Cold Spring Harbor Laboratory Press.

  17. Draft Genome Sequence of Magnesium-Dissolving Lactococcus garvieae A1, Isolated from Soil

    PubMed Central

    Altın, Gonca; Şahin, Fikrettin

    2017-01-01

    ABSTRACT The probiotic bacterium Lactococcus garvieae A1, isolated from soil, is interesting for biomining applications. Here, we report the draft genome sequence and annotation of this strain, with a focus on metal transporter enzymes. PMID:28546485

  18. Genome Sequence of the Historical Clinical Isolate Burkholderia pseudomallei PHLS 6

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    D’haeseleer, Patrik; Johnson, Shannon L.; Davenport, Karen W.

    We present the draft genome sequence ofBurkholderia pseudomalleiPHLS 6, a virulent clinical strain isolated from a melioidosis patient in Bangladesh in 1960. This draft genome consists of 39 contigs and is 7,322,181 bp long.

  19. Genome Sequence of the Historical Clinical Isolate Burkholderia pseudomallei PHLS 6

    DOE PAGES

    D’haeseleer, Patrik; Johnson, Shannon L.; Davenport, Karen W.; ...

    2016-06-30

    We present the draft genome sequence ofBurkholderia pseudomalleiPHLS 6, a virulent clinical strain isolated from a melioidosis patient in Bangladesh in 1960. This draft genome consists of 39 contigs and is 7,322,181 bp long.

  20. Draft Genome Sequence of Herbaspirillum lusitanum P6-12, an Endophyte Isolated from Root Nodules of Phaseolus vulgaris

    PubMed Central

    Weiss, Vinícius Almir; Faoro, Helisson; Tadra-Sfeir, Michelle Zibbetti; Raittz, Roberto Tadeu; de Souza, Emanuel Maltempi; Monteiro, Rose Adele; Cardoso, Rodrigo Luis Alves; Wassem, Roseli; Chubatsu, Leda Satie; Huergo, Luciano Fernandes; Müller-Santos, Marcelo; Steffens, Maria Berenice Reynaud; Rigo, Liu Un; Pedrosa, Fábio de Oliveira

    2012-01-01

    Herbaspirillum lusitanum strain P6-12 (DSM 17154) is, so far, the only species of Herbaspirillum isolated from plant root nodules. Here we report a draft genome sequence of this organism. PMID:22815451

  1. 75 FR 8045 - National Environmental Policy Act (NEPA) Draft Guidance, Establishing, Applying, and Revising...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-02-23

    ... COUNCIL ON ENVIRONMENTAL QUALITY National Environmental Policy Act (NEPA) Draft Guidance...: Council on Environmental Quality. ACTION: Notice of Availability, Draft Guidance, ``Establishing, Applying... February 18, 2010, the Council on Environmental Quality (CEQ) announced four steps to modernize...

  2. Draft Genome Sequence of Microbacterium sp. Strain UCD-TDU (Phylum Actinobacteria)

    PubMed Central

    Bendiks, Zachary A.; Lang, Jenna M.; Darling, Aaron E.; Coil, David A.

    2013-01-01

    Here, we present the draft genome sequence of Microbacterium sp. strain UCD-TDU, a member of the phylum Actinobacteria. The assembly contains 3,746,321 bp (in 8 scaffolds). This strain was isolated from a residential toilet as part of an undergraduate student research project to sequence reference genomes of microbes from the built environment. PMID:23516225

  3. Draft Genome Sequence of a Violacein-Producing Iodobacter sp. from the Hudson Valley Watershed

    PubMed Central

    Doing, Georgia

    2018-01-01

    ABSTRACT Iodobacter species are among a number of freshwater Gram-negative violacein-producing bacteria. Janthinobacterium lividum and Chromobacterium violaceum have had their whole genomes sequenced and annotated. This is the first report of a draft whole-genome sequence of a violacein-producing Iodobacter strain that was isolated from the Hudson Valley watershed. PMID:29301892

  4. Draft Genome Sequence and Description of Janthinobacterium sp. Strain CG3, a Psychrotolerant Antarctic Supraglacial Stream Bacterium

    PubMed Central

    Smith, Heidi; Akiyama, Tatsuya; Franklin, Michael; Woyke, Tanja; Teshima, Hazuki; Davenport, Karen; Daligault, Hajnalka; Erkkila, Tracy; Goodwin, Lynne; Gu, Wei; Xu, Yan; Chain, Patrick

    2013-01-01

    Here we present the draft genome sequence of Janthinobacterium sp. strain CG3, a psychrotolerant non-violacein-producing bacterium that was isolated from the Cotton Glacier supraglacial stream. The genome sequence of this organism will provide insight as to the mechanisms necessary for bacteria to survive in UV-stressed icy environments. PMID:24265494

  5. Draft Genome Sequence of a Violacein-Producing Iodobacter sp. from the Hudson Valley Watershed.

    PubMed

    Doing, Georgia; Perron, Gabriel G; Jude, Brooke A

    2018-01-04

    Iodobacter species are among a number of freshwater Gram-negative violacein-producing bacteria. Janthinobacterium lividum and Chromobacterium violaceum have had their whole genomes sequenced and annotated. This is the first report of a draft whole-genome sequence of a violacein-producing Iodobacter strain that was isolated from the Hudson Valley watershed. Copyright © 2018 Doing et al.

  6. Draft Genome Sequence of Pediococcus lolii NGRI 0510QT Isolated from Ryegrass Silage

    PubMed Central

    Mori, Kazuki; Tashiro, Kosuke; Fujino, Yasuhiro; Nagayoshi, Yuko; Hayashi, Yoshiharu; Kuhara, Satoru; Ohshima, Toshihisa

    2013-01-01

    Pediococcus lolii NGRI 0510QT was isolated from ryegrass silage produced on Ishigaki Island, Okinawa Prefecture, Japan. Here we present a draft genome sequence for this strain, consisting of 103 contigs for a total of 2,047,078 bp, 2,154 predicted coding sequences, and a G+C content of 42.1%. PMID:23405350

  7. Draft Genome Sequence of Lactobacillus crispatus EM-LC1, an Isolate with Antimicrobial Activity Cultured from an Elderly Subject

    PubMed Central

    Power, Susan E.; Harris, Hugh M. B.; Bottacini, Francesca; Ross, R. Paul; O’Toole, Paul W.

    2013-01-01

    Here we report the 1.86-Mb draft genome sequence of Lactobacillus crispatus EM-LC1, a fecal isolate with antimicrobial activity. This genome sequence is expected to provide insights into the antimicrobial activity of L. crispatus and improve our knowledge of its potential probiotic traits. PMID:24356836

  8. Assembly of highly repetitive genomes using short reads: the genome of discrete typing unit III Trypanosoma cruzi strain 231.

    PubMed

    Baptista, Rodrigo P; Reis-Cunha, Joao Luis; DeBarry, Jeremy D; Chiari, Egler; Kissinger, Jessica C; Bartholomeu, Daniella C; Macedo, Andrea M

    2018-02-14

    Next-generation sequencing (NGS) methods are low-cost high-throughput technologies that produce thousands to millions of sequence reads. Despite the high number of raw sequence reads, their short length, relative to Sanger, PacBio or Nanopore reads, complicates the assembly of genomic repeats. Many genome tools are available, but the assembly of highly repetitive genome sequences using only NGS short reads remains challenging. Genome assembly of organisms responsible for important neglected diseases such as Trypanosoma cruzi, the aetiological agent of Chagas disease, is known to be challenging because of their repetitive nature. Only three of six recognized discrete typing units (DTUs) of the parasite have their draft genomes published and therefore genome evolution analyses in the taxon are limited. In this study, we developed a computational workflow to assemble highly repetitive genomes via a combination of de novo and reference-based assembly strategies to better overcome the intrinsic limitations of each, based on Illumina reads. The highly repetitive genome of the human-infecting parasite T. cruzi 231 strain was used as a test subject. The combined-assembly approach shown in this study benefits from the reference-based assembly ability to resolve highly repetitive sequences and from the de novo capacity to assemble genome-specific regions, improving the quality of the assembly. The acceptable confidence obtained by analyzing our results showed that our combined approach is an attractive option to assemble highly repetitive genomes with NGS short reads. Phylogenomic analysis including the 231 strain, the first representative of DTU III whose genome was sequenced, was also performed and provides new insights into T. cruzi genome evolution.

  9. Draft Genome Sequence of Leuconostoc mesenteroides 406 Isolated from the Traditional Fermented Mare Milk Airag in Tuv Aimag, Mongolia

    PubMed Central

    Toh, Hidehiro; Oshima, Kenshiro; Nakano, Akiyo; Hano, Chihiro; Yoshida, Saki; Nguyen, Tien Thi Thuy; Wulijideligen; Tashiro, Kosuke; Arakawa, Kensuke; Miyamoto, Taku

    2016-01-01

    Leuconostoc mesenteroides 406 was isolated from the traditional fermented mare milk airag in Tuv Aimag, Mongolia. This strain produces an antilisterial bacteriocin. Here, we report the draft genome sequence of this organism. PMID:27013047

  10. The draft genome of a diploid cotton Gossypium raimondii

    USDA-ARS?s Scientific Manuscript database

    We have sequenced and assembled the draft genome of Gossypium raimondii, whose progenitor is considered the contributor of the D-subgenome to the economically important natural textile fiber producer, G. hirsutum. Next-generation Illumina pair-end (PE) sequencing strategies were employed to obtain ...

  11. Draft Genome Sequence of Catellicoccus marimammalium, a Novel Species Commonly Found in Gull Feces

    EPA Science Inventory

    Catellicoccus marimammalium is a relatively uncharacterized Gram-positive, facultative anaerobe with potential utility as an indicator of waterfowl fecal contamination. Here we report an annotated draft genome sequence that suggests this organism may be a symbiotic gut microbe.

  12. High-quality permanent draft genome sequence of the Bradyrhizobium elkanii type strain USDA 76T, isolated from Glycine max (L.) Merr

    DOE PAGES

    Reeve, Wayne; van Berkum, Peter; Ardley, Julie; ...

    2017-03-04

    Bradyrhizobium elkanii USDA 76 T (INSCD = ARAG00000000), the type strain for Bradyrhizobium elkanii, is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen-fixing root nodule of Glycine max (L. Merr) grown in the USA. Because of its significance as a microsymbiont of this economically important legume, B. elkanii USDA 76 T was selected as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria sequencing project. Here the symbiotic abilities of B. elkanii USDA 76 T are described, together with its genome sequence information and annotation. The 9,484,767 bpmore » high-quality draft genome is arranged in 2 scaffolds of 25 contigs, containing 9060 protein-coding genes and 91 RNA-only encoding genes. The B. elkanii USDA 76 T genome contains a low GC content region with symbiotic nod and fix genes, indicating the presence of a symbiotic island integration. A comparison of five B. elkanii genomes that formed a clique revealed that 356 of the 9060 protein coding genes of USDA 76 T were unique, including 22 genes of an intact resident prophage. A conserved set of 7556 genes were also identified for this species, including genes encoding a general secretion pathway as well as type II, III, IV and VI secretion system proteins. The type III secretion system has previously been characterized as a host determinant for Rj and/or rj soybean cultivars. Here we show that the USDA 76 T genome contains genes encoding all the type III secretion system components, including a translocon complex protein NopX required for the introduction of effector proteins into host cells. While many bradyrhizobial strains are unable to nodulate the soybean cultivar Clark (rj1), USDA 76 T was able to elicit nodules on Clark (rj1), although in reduced numbers, when plants were grown in Leonard jars containing sand or vermiculite. In these conditions, we postulate that the presence of NopX allows USDA 76 T to introduce various effector molecules into this host to enable nodulation.« less

  13. High-quality permanent draft genome sequence of the Bradyrhizobium elkanii type strain USDA 76T, isolated from Glycine max (L.) Merr

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Reeve, Wayne; van Berkum, Peter; Ardley, Julie

    Bradyrhizobium elkanii USDA 76 T (INSCD = ARAG00000000), the type strain for Bradyrhizobium elkanii, is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen-fixing root nodule of Glycine max (L. Merr) grown in the USA. Because of its significance as a microsymbiont of this economically important legume, B. elkanii USDA 76 T was selected as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria sequencing project. Here the symbiotic abilities of B. elkanii USDA 76 T are described, together with its genome sequence information and annotation. The 9,484,767 bpmore » high-quality draft genome is arranged in 2 scaffolds of 25 contigs, containing 9060 protein-coding genes and 91 RNA-only encoding genes. The B. elkanii USDA 76 T genome contains a low GC content region with symbiotic nod and fix genes, indicating the presence of a symbiotic island integration. A comparison of five B. elkanii genomes that formed a clique revealed that 356 of the 9060 protein coding genes of USDA 76 T were unique, including 22 genes of an intact resident prophage. A conserved set of 7556 genes were also identified for this species, including genes encoding a general secretion pathway as well as type II, III, IV and VI secretion system proteins. The type III secretion system has previously been characterized as a host determinant for Rj and/or rj soybean cultivars. Here we show that the USDA 76 T genome contains genes encoding all the type III secretion system components, including a translocon complex protein NopX required for the introduction of effector proteins into host cells. While many bradyrhizobial strains are unable to nodulate the soybean cultivar Clark (rj1), USDA 76 T was able to elicit nodules on Clark (rj1), although in reduced numbers, when plants were grown in Leonard jars containing sand or vermiculite. In these conditions, we postulate that the presence of NopX allows USDA 76 T to introduce various effector molecules into this host to enable nodulation.« less

  14. Draft Genome Sequence of Roseovarius sp. A-2, an Iodide-Oxidizing Bacterium Isolated from Natural Gas Brine Water, Chiba, Japan.

    PubMed

    Yuliana, Tri; Nakajima, Nobuyoshi; Yamamura, Shigeki; Tomita, Masaru; Suzuki, Haruo; Amachi, Seigo

    2017-01-01

    Roseovarius sp. A-2 is a heterotrophic iodide (I - )-oxidizing bacterium isolated from iodide-rich natural gas brine water in Chiba, Japan. This strain oxidizes iodide to molecular iodine (I 2 ) by means of an extracellular multicopper oxidase. Here we report the draft genome sequence of strain A-2. The draft genome contained 46 tRNA genes, 1 copy of a 16S-23S-5S rRNA operon, and 4,514 protein coding DNA sequences, of which 1,207 (27%) were hypothetical proteins. The genome contained a gene encoding IoxA, a multicopper oxidase previously found to catalyze the oxidation of iodide in Iodidimonas sp. Q-1. This draft genome provides detailed insights into the metabolism and potential application of Roseovarius sp. A-2.

  15. Genome assembly and transcriptome resource for river buffalo, Bubalus bubalis (2n = 50)

    PubMed Central

    Iamartino, Daniela; Pruitt, Kim D; Sonstegard, Tad; Smith, Timothy P L; Low, Wai Yee; Biagini, Tommaso; Bomba, Lorenzo; Capomaccio, Stefano; Castiglioni, Bianca; Coletta, Angelo; Corrado, Federica; Ferré, Fabrizio; Iannuzzi, Leopoldo; Lawley, Cynthia; Macciotta, Nicolò; McClure, Matthew; Mancini, Giordano; Matassino, Donato; Mazza, Raffaele; Milanesi, Marco; Moioli, Bianca; Morandi, Nicola; Ramunno, Luigi; Peretti, Vincenzo; Pilla, Fabio; Ramelli, Paola; Schroeder, Steven; Strozzi, Francesco; Thibaud-Nissen, Francoise; Zicarelli, Luigi; Ajmone-Marsan, Paolo; Valentini, Alessio; Chillemi, Giovanni; Zimin, Aleksey

    2017-01-01

    Abstract Water buffalo is a globally important species for agriculture and local economies. A de novo assembled, well-annotated reference sequence for the water buffalo is an important prerequisite for studying the biology of this species, and is necessary to manage genetic diversity and to use modern breeding and genomic selection techniques. However, no such genome assembly has been previously reported. There are 2 species of domestic water buffalo, the river (2n = 50) and the swamp (2n = 48) buffalo. Here we describe a draft quality reference sequence for the river buffalo created from Illumina GA and Roche 454 short read sequences using the MaSuRCA assembler. The assembled sequence is 2.83 Gb, consisting of 366 983 scaffolds with a scaffold N50 of 1.41 Mb and contig N50 of 21 398 bp. Annotation of the genome was supported by transcriptome data from 30 tissues and identified 21 711 predicted protein coding genes. Searches for complete mammalian BUSCO gene groups found 98.6% of curated single copy orthologs present among predicted genes, which suggests a high level of completeness of the genome. The annotated sequence is available from NCBI at accession GCA_000471725.1. PMID:29048578

  16. High-quality draft genome sequence of Flavobacterium suncheonense GH29-5 T (DSM 17707 T) isolated from greenhouse soil in South Korea, and emended description of Flavobacterium suncheonense GH29-5 T

    DOE PAGES

    Tashkandy, Nisreen; Sabban, Sari; Fakieh, Mohammad; ...

    2016-06-16

    Flavobacterium suncheonense is a member of the family Flavobacteriaceae in the phylum Bacteroidetes. Strain GH29-5 T (DSM 17707 T ) was isolated from greenhouse soil in Suncheon, South Korea. F. suncheonense GH29-5 T is part of the Genomic Encyclopedia of Bacteria and Archaea project. The 2,880,663 bp long draft genome consists of 54 scaffolds with 2739 protein-coding genes and 82 RNA genes. The genome of strain GH29-5 T has 117 genes encoding peptidases but a small number of genes encoding carbohydrate active enzymes (51 CAZymes). Metallo and serine peptidases were found most frequently. Among CAZymes, eight glycoside hydrolase families, ninemore » glycosyl transferase families, two carbohydrate binding module families and four carbohydrate esterase families were identified. Suprisingly, polysaccharides utilization loci (PULs) were not found in strain GH29-5 T . Based on the coherent physiological and genomic characteristics we suggest that F. suncheonense GH29-5 T feeds rather on proteins than saccharides and lipids.« less

  17. Draft genome sequence of Enterococcus faecium strain LMG 8148.

    PubMed

    Michiels, Joran E; Van den Bergh, Bram; Fauvart, Maarten; Michiels, Jan

    2016-01-01

    Enterococcus faecium, traditionally considered a harmless gut commensal, is emerging as an important nosocomial pathogen showing increasing rates of multidrug resistance. We report the draft genome sequence of E. faecium strain LMG 8148, isolated in 1968 from a human in Gothenburg, Sweden. The draft genome has a total length of 2,697,490 bp, a GC-content of 38.3 %, and 2,402 predicted protein-coding sequences. The isolation of this strain predates the emergence of E. faecium as a nosocomial pathogen. Consequently, its genome can be useful in comparative genomic studies investigating the evolution of E. faecium as a pathogen.

  18. Draft genome of the reindeer (Rangifer tarandus).

    PubMed

    Li, Zhipeng; Lin, Zeshan; Ba, Hengxing; Chen, Lei; Yang, Yongzhi; Wang, Kun; Qiu, Qiang; Wang, Wen; Li, Guangyu

    2017-12-01

    The reindeer (Rangifer tarandus) is the only fully domesticated species in the Cervidae family, and it is the only cervid with a circumpolar distribution. Unlike all other cervids, female reindeer, as well as males, regularly grow cranial appendages (antlers, the defining characteristics of cervids). Moreover, reindeer milk contains more protein and less lactose than bovids' milk. A high-quality reference genome of this species will assist efforts to elucidate these and other important features in the reindeer. We obtained 615 Gb (Gigabase) of usable sequences by filtering the low-quality reads of the raw data generated from the Illumina Hiseq 4000 platform, and a 2.64-Gb final assembly, representing 95.7% of the estimated genome (2.76 Gb according to k-mer analysis), including 92.6% of expected genes according to BUSCO analysis. The contig N50 and scaffold N50 sizes were 89.7 kilo base (kb) and 0.94 mega base (Mb), respectively. We annotated 21 555 protein-coding genes and 1.07 Gb of repetitive sequences by de novo and homology-based prediction. Homology-based searches detected 159 rRNA, 547 miRNA, 1339 snRNA, and 863 tRNA sequences in the genome of R. tarandus. The divergence time between R. tarandus and ancestors of Bos taurus and Capra hircus is estimated to be about 29.5 million years ago. Our results provide the first high-quality reference genome for the reindeer and a valuable resource for studying the evolution, domestication, and other unusual characteristics of the reindeer. © The Authors 2017. Published by Oxford University Press.

  19. Genome sequences of five Lactobacillus sp. isolates from traditional Turkish sourdough

    USDA-ARS?s Scientific Manuscript database

    A high level of variation in microflora can be observed in lactic acid bacteria (LAB) profiles of sourdoughs. Here, we present draft genome sequences of Lactobacillus reuteri E81, L. reuteri LR5A, L. rhamnosus LR2, L. plantarum PFC-311 and a novel Lactobacillus sp. PFC-70 isolated from traditional T...

  20. Draft Genome Sequence of Clostridium mangenotii TR, Isolated from the Fecal Material of a Timber Rattlesnake

    PubMed Central

    Cochran, Philip A.; Dowd, Scot E.; Andersen, Kylie; Anderson, Nichole; Brennan, Rachel; Brook, Nicole; Callaway, Tracie; Diamante, Kimberly; Duberstine, Annie; Fitch, Karla; Freiheit, Heidi; Godlewski, Chantel; Gorman, Kelly; Haubrich, Mark; Hernandez, Mercedes; Hirtreiter, Amber; Ivanoski, Beth; Jaminet, Xochitl; Kirkpatrick, Travis; Kratowicz, Jennifer; Latus, Casey; Leable, Tiegen; Lingafelt, Nicole; Lowe, DeAnna; Lowrance, Holly; Malsack, Latiffa; Mazurkiewicz, Julie; Merlos, Persida; Messley, Jamie; Montemurro, Dawn; Nakitare, Samora; Nelson, Christine; Nye, Amber; Pazera, Valerie; Pierangeli, Gina; Rellora, Ashley; Reyes, Angelica; Roberts, Jennifer; Robins, Shadara; Robinson, Jeshannah; Schultz, Alissa; Seifert, Sara; Sigler, Elona; Spangler, Julie; Swift, Ebony; TenCate, Rebecca; Thurber, Jessica; Vallee, Kristin; Wamboldt, Jennifer; Whitten, Shannon; Woods, De’andrea; Wright, Amanda; Yankunas, Darin

    2014-01-01

    Here, we report the draft genome sequence of Clostridium mangenotii strain TR, which was isolated from the fecal material of a timber rattlesnake. This bacterium is nonpathogenic but contains 68 genes involved in virulence, disease, and defense. PMID:24407632

  1. Draft Genome Sequence of Magnesium-Dissolving Lactococcus garvieae A1, Isolated from Soil.

    PubMed

    Altın, Gonca; Nikerel, Emrah; Şahin, Fikrettin

    2017-05-25

    The probiotic bacterium Lactococcus garvieae A1, isolated from soil, is interesting for biomining applications. Here, we report the draft genome sequence and annotation of this strain, with a focus on metal transporter enzymes. Copyright © 2017 Altın et al.

  2. Draft Genome Sequences of Six Mycobacterium immunogenum, Strains Obtained from a Chloraminated Drinking Water Distribution System Simulator

    EPA Science Inventory

    We report the draft genome sequences of six Mycobacterium immunogenum isolated from a chloraminated drinking water distribution system simulator subjected to changes in operational parameters. M. immunogenum, a rapidly growing mycobacteria previously reported as the cause of hyp...

  3. Draft Genome Sequence of the Dimorphic Yeast Yarrowia lipolytica Strain W29

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Pomraning, Kyle R.; Baker, Scott E.

    Here, we present the draft genome sequence of the dimorphic ascomycete yeastYarrowia lipolyticastrain W29 (ATCC 20460).Y. lipolyticais a commonly employed model for the industrial production of lipases, small molecules, and more recently for its ability to accumulate lipids.

  4. Draft Genome Sequence of Lactobacillus helveticus ATCC 12046

    PubMed Central

    2018-01-01

    ABSTRACT Lactobacillus helveticus is a lactic acid bacterium used traditionally in the dairy industry, especially in the manufacture of cheeses. We present here the 2,141,841-bp draft genome sequence of L. helveticus strain ATCC 12046, a potential starter strain for improving cheese production. PMID:29449405

  5. Draft genome sequence of rice orange leaf phytoplasma from Guangdong, China

    USDA-ARS?s Scientific Manuscript database

    The genome of rice orange leaf phytoplasma strain LD1 from Luoding City, Guangdong, P. R. China, was sequenced. The draft LD1genome is 599,264 bp with GC content of 28.2%, 647 predicted open reading frames and 33 RNA genes....

  6. An Annotated Draft Genome for Radix auricularia (Gastropoda, Mollusca)

    PubMed Central

    Feldmeyer, Barbara; Schmidt, Hanno; Greshake, Bastian; Tills, Oliver; Truebano, Manuela; Rundle, Simon D.; Paule, Juraj; Ebersberger, Ingo; Pfenninger, Markus

    2017-01-01

    Molluscs are the second most species-rich phylum in the animal kingdom, yet only 11 genomes of this group have been published so far. Here, we present the draft genome sequence of the pulmonate freshwater snail Radix auricularia. Six whole genome shotgun libraries with different layouts were sequenced. The resulting assembly comprises 4,823 scaffolds with a cumulative length of 910 Mb and an overall read coverage of 72×. The assembly contains 94.6% of a metazoan core gene collection, indicating an almost complete coverage of the coding fraction. The discrepancy of ∼690 Mb compared with the estimated genome size of R. auricularia (1.6 Gb) results from a high repeat content of 70% mainly comprising DNA transposons. The annotation of 17,338 protein coding genes was supported by the use of publicly available transcriptome data. This draft will serve as starting point for further genomic and population genetic research in this scientifically important phylum. PMID:28204581

  7. Draft Genome Sequence of Thermus scotoductus Strain K1, Isolated from a Geothermal Spring in Karvachar, Nagorno Karabakh

    PubMed Central

    Saghatelyan, Ani; Poghosyan, Lianna

    2015-01-01

    The 2,379,636-bp draft genome sequence of Thermus scotoductus strain K1, isolated from geothermal spring outlet located in the Karvachar region in Nagorno Karabakh is presented. Strain K1 shares about 80% genome sequence similarity with T. scotoductus strain SA-01, recovered from a deep gold mine in South Africa. PMID:26564055

  8. Draft Genome Sequence of Bacillus licheniformis Strain YNP1-TSU Isolated from Whiterock Springs in Yellowstone National Park

    PubMed Central

    O'Hair, Joshua A.; Li, Hui; Thapa, Santosh; Scholz, Matthew B.

    2017-01-01

    ABSTRACT Novel cellulolytic microorganisms can potentially influence second-generation biofuel production. This paper reports the draft genome sequence of Bacillus licheniformis strain YNP1-TSU, isolated from hydrothermal-vegetative microbiomes inside Yellowstone National Park. The assembled sequence contigs predicted 4,230 coding genes, 66 tRNAs, and 10 rRNAs through automated annotation. PMID:28254968

  9. Draft Genome Sequence of the d-Xylose-Fermenting Yeast Spathaspora arborariae UFMG-HM19.1AT

    PubMed Central

    Lobo, Francisco P.; Gonçalves, Davi L.; Alves, Sergio L.; Gerber, Alexandra L.; de Vasconcelos, Ana Tereza R.; Basso, Luiz C.; Franco, Glória R.; Soares, Marco A.; Cadete, Raquel M.; Rosa, Carlos A.

    2014-01-01

    The draft genome sequence of the yeast Spathaspora arborariae UFMG-HM19.1AT (CBS 11463 = NRRL Y-48658) is presented here. The sequenced genome size is 12.7 Mb, consisting of 41 scaffolds containing a total of 5,625 predicted open reading frames, including many genes encoding enzymes and transporters involved in d-xylose fermentation. PMID:24435867

  10. Draft Genome Sequence of the Phytopathogenic Fungus Ganoderma boninense, the Causal Agent of Basal Stem Rot Disease on Oil Palm

    PubMed Central

    Tanjung, Zulfikar Achmad; Aditama, Redi; Buana, Rika Fithri Nurani; Pratomo, Antonius Dony Madu; Tryono, Reno; Liwang, Tony

    2018-01-01

    ABSTRACT Ganoderma boninense is the dominant fungal pathogen of basal stem rot (BSR) disease on Elaeis guineensis. We sequenced the nuclear genome of mycelia using both Illumina and Pacific Biosciences platforms for assembly of scaffolds. The draft genome comprised 79.24 Mb, 495 scaffolds, and 26,226 predicted coding sequences. PMID:29700132

  11. Draft Genome Sequence of Cellulolytic and Xylanolytic Paenibacillus sp. A59, Isolated from Decaying Forest Soil from Patagonia, Argentina

    PubMed Central

    Ghio, Silvina; Martinez Cáceres, Alfredo I.; Talia, Paola; Grasso, Daniel H.

    2015-01-01

    Paenibacillus sp. A59 was isolated from decaying forest soil in Argentina and characterized as a xylanolytic strain. We report the draft genome sequence of this isolate, with an estimated genome size of 7 Mb which harbor 6,424 coding sequences. Genes coding for hydrolytic enzymes involved in lignocellulose deconstruction were predicted. PMID:26494679

  12. Draft Genome Sequence of Sphingobium lactosutens Strain DS20T, Isolated from a Hexachlorocyclohexane Dumpsite

    PubMed Central

    Kumar, Roshan; Dwivedi, Vatsala; Negi, Vivek; Khurana, J. P.

    2013-01-01

    Sphingobium lactosutens DS20T has been isolated from the hexachlorocyclohexane (HCH) dumpsite in Lucknow, India, but does not degrade any of the HCH isomers. Here, we present the ~5.36-Mb draft genome sequence of strain DS20T, which consists of 110 contigs and 5,288 coding sequences, with a G+C content of 63.1%. PMID:24051323

  13. Assembly of the draft genome of buckwheat and its applications in identifying agronomically useful genes

    PubMed Central

    Yasui, Yasuo; Hirakawa, Hideki; Ueno, Mariko; Matsui, Katsuhiro; Katsube-Tanaka, Tomoyuki; Yang, Soo Jung; Aii, Jotaro; Sato, Shingo; Mori, Masashi

    2016-01-01

    Buckwheat (Fagopyrum esculentum Moench; 2n = 2x = 16) is a nutritionally dense annual crop widely grown in temperate zones. To accelerate molecular breeding programmes of this important crop, we generated a draft assembly of the buckwheat genome using short reads obtained by next-generation sequencing (NGS), and constructed the Buckwheat Genome DataBase. After assembling short reads, we determined 387,594 scaffolds as the draft genome sequence (FES_r1.0). The total length of FES_r1.0 was 1,177,687,305 bp, and the N50 of the scaffolds was 25,109 bp. Gene prediction analysis revealed 286,768 coding sequences (CDSs; FES_r1.0_cds) including those related to transposable elements. The total length of FES_r1.0_cds was 212,917,911 bp, and the N50 was 1,101 bp. Of these, the functions of 35,816 CDSs excluding those for transposable elements were annotated by BLAST analysis. To demonstrate the utility of the database, we conducted several test analyses using BLAST and keyword searches. Furthermore, we used the draft genome as a reference sequence for NGS-based markers, and successfully identified novel candidate genes controlling heteromorphic self-incompatibility of buckwheat. The database and draft genome sequence provide a valuable resource that can be used in efforts to develop buckwheat cultivars with superior agronomic traits. PMID:27037832

  14. Draft Genome Sequences of Acinetobacter and Bacillus Strains Isolated from Spacecraft-Associated Surfaces

    PubMed Central

    Seuylemezian, Arman; Vaishampayan, Parag; Cooper, Kerry

    2018-01-01

    ABSTRACT We report here the draft genome sequences of four strains isolated from spacecraft-associated surfaces exhibiting increased resistance to stressors such as UV radiation and exposure to H2O2. The draft genomes of strains 1P01SCT, FO-92T, 50v1, and 2P01AA had sizes of 5,500,894 bp, 4,699,376 bp, 3,174,402 bp, and 4,328,804 bp, respectively. PMID:29439046

  15. Draft Genome Sequence of Candida pseudohaemulonii Isolated from the Blood of a Neutropenic Patient.

    PubMed

    Mohd Tap, Ratna; Kamarudin, Nur Amalina; Ginsapu, Stephanie Jane; Ahmed Bakri, Ahmed Rafezzan; Ahmad, Norazah; Amran, Fairuz; Sipiczki, Matthias

    2018-04-05

    Candida pseudohaemulonii is phylogenetically close to the C. haemulonii complex and exhibits resistance to amphotericin B and azole agents. We report here the draft genome sequence of C. pseudohaemulonii UZ153_17 isolated from the blood culture of a neutropenic patient. The draft genome is 3,532,003,666 bp in length, with 579,838 reads, 130 contigs, and a G+C content of 47.15%. Copyright © 2018 Mohd Tap et al.

  16. Draft Genome Sequence of Ezakiella peruensis Strain M6.X2, a Human Gut Gram-Positive Anaerobic Coccus.

    PubMed

    Diop, Awa; Diop, Khoudia; Tomei, Enora; Raoult, Didier; Fenollar, Florence; Fournier, Pierre-Edouard

    2018-03-01

    We report here the draft genome sequence of Ezakiella peruensis strain M6.X2 T The draft genome is 1,672,788 bp long and harbors 1,589 predicted protein-encoding genes, including 26 antibiotic resistance genes with 1 gene encoding vancomycin resistance. The genome also exhibits 1 clustered regularly interspaced short palindromic repeat region and 333 genes acquired by horizontal gene transfer. Copyright © 2018 Diop et al.

  17. 75 FR 8046 - National Environmental Policy Act (NEPA) Draft Guidance, “NEPA Mitigation and Monitoring.”

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-02-23

    ... COUNCIL ON ENVIRONMENTAL QUALITY National Environmental Policy Act (NEPA) Draft Guidance, ``NEPA Mitigation and Monitoring.'' AGENCY: Council On Environmental Quality. ACTION: Notice of Availability, Draft... Quality (CEQ) announced four steps to modernize, reinvigorate, and ease the use and increase the...

  18. Genetic analysis of the Hungarian draft horse population using partial mitochondrial DNA D-loop sequencing

    PubMed Central

    2018-01-01

    Background The Hungarian draft is a horse breed with a recent mixed ancestry created in the 1920s by crossing local mares with draught horses imported from France and Belgium. The interest in its conservation and characterization has increased over the last few years. The aim of this work is to contribute to the characterization of the endangered Hungarian heavy draft horse populations in order to obtain useful information to implement conservation strategies for these genetic stocks. Methods To genetically characterize the breed and to set up the basis for a conservation program, in the present study a hypervariable region of the mitochrondial DNA (D-loop) was used to assess genetic diversity in Hungarian draft horses. Two hundred and eighty five sequences obtained in our laboratory and 419 downloaded sequences available from Genbank were analyzed. Results One hundred and sixty-four haplotypes and thirty-six polymorphic sites were observed. High haplotype and nucleotide diversity values (Hd = 0.954 ± 0.004; π = 0.028 ± 0.0004) were identified in Hungarian population, although they were higher within than among the different populations (Hd = 0.972 ± 0.002; π = 0.03097 ± 0.002). Fourteen of the previously observed seventeen haplogroups were detected. Discussion Our samples showed a large intra- and interbreed variation. There was no clear clustering on the median joining network figure. The overall information collected in this work led us to consider that the genetic scenario observed for Hungarian draft breed is more likely the result of contributions from ‘ancestrally’ different genetic backgrounds. This study could contribute to the development of a breeding plan for Hungarian draft horses and help to formulate a genetic conservation plan, avoiding inbreeding while. PMID:29404201

  19. The high-quality draft genome of peach (Prunus persica) identifies unique patterns of genetic diversity, domestication and genome evolution.

    PubMed

    Verde, Ignazio; Abbott, Albert G; Scalabrin, Simone; Jung, Sook; Shu, Shengqiang; Marroni, Fabio; Zhebentyayeva, Tatyana; Dettori, Maria Teresa; Grimwood, Jane; Cattonaro, Federica; Zuccolo, Andrea; Rossini, Laura; Jenkins, Jerry; Vendramin, Elisa; Meisel, Lee A; Decroocq, Veronique; Sosinski, Bryon; Prochnik, Simon; Mitros, Therese; Policriti, Alberto; Cipriani, Guido; Dondini, Luca; Ficklin, Stephen; Goodstein, David M; Xuan, Pengfei; Del Fabbro, Cristian; Aramini, Valeria; Copetti, Dario; Gonzalez, Susana; Horner, David S; Falchi, Rachele; Lucas, Susan; Mica, Erica; Maldonado, Jonathan; Lazzari, Barbara; Bielenberg, Douglas; Pirona, Raul; Miculan, Mara; Barakat, Abdelali; Testolin, Raffaele; Stella, Alessandra; Tartarini, Stefano; Tonutti, Pietro; Arús, Pere; Orellana, Ariel; Wells, Christina; Main, Dorrie; Vizzotto, Giannina; Silva, Herman; Salamini, Francesco; Schmutz, Jeremy; Morgante, Michele; Rokhsar, Daniel S

    2013-05-01

    Rosaceae is the most important fruit-producing clade, and its key commercially relevant genera (Fragaria, Rosa, Rubus and Prunus) show broadly diverse growth habits, fruit types and compact diploid genomes. Peach, a diploid Prunus species, is one of the best genetically characterized deciduous trees. Here we describe the high-quality genome sequence of peach obtained from a completely homozygous genotype. We obtained a complete chromosome-scale assembly using Sanger whole-genome shotgun methods. We predicted 27,852 protein-coding genes, as well as noncoding RNAs. We investigated the path of peach domestication through whole-genome resequencing of 14 Prunus accessions. The analyses suggest major genetic bottlenecks that have substantially shaped peach genome diversity. Furthermore, comparative analyses showed that peach has not undergone recent whole-genome duplication, and even though the ancestral triplicated blocks in peach are fragmentary compared to those in grape, all seven paleosets of paralogs from the putative paleoancestor are detectable.

  20. Draft Genome Sequence of Bifidobacterium animalis subsp. lactis Strain CECT 8145, Able To Improve Metabolic Syndrome In Vivo.

    PubMed

    Chenoll, E; Codoñer, F M; Silva, A; Martinez-Blanch, J F; Martorell, P; Ramón, D; Genovés, S

    2014-03-27

    Bifidobacterium animalis subsp. lactis strain CECT 8145 is able to reduce body fat content and improve metabolic syndrome biomarkers. Here, we report the draft genome sequence of this strain, which may provide insights into its safety status and functional role.

  1. Draft Genome Sequence of a Multidrug-Resistant Klebsiella quasipneumoniae subsp. similipneumoniae Isolate from a Clinical Source

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ozer, Egon A.; Morris, Andrew R.; Krapp, Fiorella

    We report here the draft genome sequence of a multidrug-resistant clinical isolate ofKlebsiella quasipneumoniaesubsp.similipneumoniae, KP_Z4175. This strain, isolated as part of a hospital infection-control screening program, is resistant to multiple β-lactam antibiotics, aminoglycosides, and trimethoprim-sulfamethoxazole.

  2. Draft Genome Sequence of Mycobacterium asiaticum Strain DSM 44297.

    PubMed

    Croce, Olivier; Robert, Catherine; Raoult, Didier; Drancourt, Michel

    2014-04-17

    We report the draft genome sequence of Mycobacterium asiaticum strain DSM 44297, a tropical mycobacterium seldom responsible for human infection. The genome of M. asiaticum has a size of 5,935,986 bp, with a 66.03% G+C content, encoding 5,591 proteins and 81 RNAs.

  3. Draft Genome Sequences of Historical Listeria monocytogenes from Human Listeriosis, 1933

    USDA-ARS?s Scientific Manuscript database

    We report here the draft genome sequences of two Listeria monocytogenes strains from some of the earliest reported cases of human listeriosis in North America. The strains were isolated in 1933 from patients in Massachusetts and Connecticut, USA, and belong to the widely disseminated hypervirulent c...

  4. Draft Genome Sequence of Leuconostoc mesenteroides 406 Isolated from the Traditional Fermented Mare Milk Airag in Tuv Aimag, Mongolia.

    PubMed

    Morita, Hidetoshi; Toh, Hidehiro; Oshima, Kenshiro; Nakano, Akiyo; Hano, Chihiro; Yoshida, Saki; Nguyen, Tien Thi Thuy; Wulijideligen; Tashiro, Kosuke; Arakawa, Kensuke; Miyamoto, Taku

    2016-03-24

    Leuconostoc mesenteroides406 was isolated from the traditional fermented mare milk airag in Tuv Aimag, Mongolia. This strain produces an antilisterial bacteriocin. Here, we report the draft genome sequence of this organism. Copyright © 2016 Morita et al.

  5. Draft Genome Sequence of Leuconostoc mesenteroides 213M0, Isolated from Traditional Fermented Mare Milk Airag in Bulgan Aimag, Mongolia

    PubMed Central

    Toh, Hidehiro; Oshima, Kenshiro; Nakano, Akiyo; Hano, Chihiro; Yoshida, Saki; Bolormaa, Tsognemekh; Burenjargal, Sedkhuu; Nguyen, Co Thi Kim; Tashiro, Kosuke; Arakawa, Kensuke; Miyamoto, Taku

    2016-01-01

    Leuconostoc mesenteroides 213M0 was isolated from traditional fermented mare milk airag in Bulgan Aimag, Mongolia. This strain produces a listericidal bacteriocin-like inhibitory substance. Here, we report the draft genome sequence of this organism. PMID:27034488

  6. Draft genome sequence of Cryptococcus terricola JCM 24523, an oleaginous yeast capable of expressing exogenous DNA

    DOE PAGES

    Close, Dan; Ojumu, John O.; Zhang, Gui X.

    2016-11-03

    Cryptococcus terricola JCM 24523 has recently been identified as an oleaginous yeast capable of converting starch into fatty acids. Here, this draft genome sequence provides a platform for elucidating its fatty acid production potential and supporting comparisons with other oleaginous species.

  7. Reconstruction of a Nearly Complete Pseudomonas Draft Genome Sequence from a Coalbed Methane-Produced Water Metagenome

    DOE PAGES

    Ross, Daniel E.; Gulliver, Djuna

    2016-10-06

    The draft genome sequence ofPseudomonas stutzeristrain K35 was separated from a metagenome derived from a produced water microbial community of a coalbed methane well. The genome encodes a complete nitrogen fixation pathway and the upper and lower naphthalene degradation pathways.

  8. Reconstruction of a Nearly Complete Pseudomonas Draft Genome Sequence from a Coalbed Methane-Produced Water Metagenome

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ross, Daniel E.; Gulliver, Djuna

    The draft genome sequence ofPseudomonas stutzeristrain K35 was separated from a metagenome derived from a produced water microbial community of a coalbed methane well. The genome encodes a complete nitrogen fixation pathway and the upper and lower naphthalene degradation pathways.

  9. Draft genome sequence of Cryptococcus terricola JCM 24523, an oleaginous yeast capable of expressing exogenous DNA

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Close, Dan; Ojumu, John O.; Zhang, Gui X.

    Cryptococcus terricola JCM 24523 has recently been identified as an oleaginous yeast capable of converting starch into fatty acids. Here, this draft genome sequence provides a platform for elucidating its fatty acid production potential and supporting comparisons with other oleaginous species.

  10. Sequencing and De novo Draft Assemblies of the Fathead Minnow (Pimphales promelas)Reference Genome

    EPA Science Inventory

    This study was undertaken to develop genome-scale resources for the fathead minnow (Pimphales promelas) an important model organism widely used in both aquatic ecotoxicology research and in regulatory toxicity testing. We report on the first sequencing and two draft assemblies fo...

  11. Draft Genome Sequence of Aldehyde-Degrading Strain Halomonas axialensis ACH-L-8

    PubMed Central

    Ye, Jun; Ren, Chong; Shan, Xiexie

    2016-01-01

    Halomonas axialensis ACH-L-8, a deep-sea strain isolated from the South China Sea, has the ability to degrade aldehydes. Here, we present an annotated draft genome sequence of this species, which could provide fundamental molecular information on the aldehydes-degrading mechanism. PMID:27081145

  12. Draft genome sequences of 50 MRSA ST5 isolates obtained from a U.S. hospital

    USDA-ARS?s Scientific Manuscript database

    Methicillin resistant Staphylococcus aureus (MRSA) can be a commensal or pathogen in humans. Pathogenicity and disease are related to the acquisition of mobile genetic elements encoding virulence and antimicrobial resistance genes. Here, we report draft genome sequences for 50 clinical MRSA isolates...

  13. Draft genome sequence of Erwinia tracheiphila, an economically important bacterial pathogen of cucurbits

    USDA-ARS?s Scientific Manuscript database

    Erwinia tracheiphila is one of the most economically important pathogen of cucumbers, melons, squashes, pumpkins, and gourds, in the Northeastern and Midwestern United States, yet the molecular pathology remains uninvestigated. Here we report the first draft genome sequence of an E. tracheiphila str...

  14. Draft Genome Sequence of Lactobacillus panis DSM 6035T, First Isolated from Sourdough

    PubMed Central

    Zhu, Yixin; Fang, Daiqiong; Shi, Ding; Li, Ang; Lv, Longxian; Yan, Ren; Yao, Jian; Hua, Dasong; Hu, Xinjun; Guo, Feifei; Wu, Wenrui; Guo, Jing; Chen, Yanfei; Jiang, Xiawei; Chen, Xiaoxiao

    2015-01-01

    We report a draft genome sequence of Lactobacillus panis DSM 6035T, isolated from sourdough. The genome of this strain is 2,082,789 bp long, with 47.9% G+C content. A total of 2,047 protein-coding genes were predicted. PMID:26205855

  15. Draft Genome Sequence of Lactobacillus helveticus ATCC 12046.

    PubMed

    Palomino, María Mercedes; Burguener, Germán F; Campos, Josefina; Allievi, Mariana; Fina-Martin, Joaquina; Prado Acosta, Mariano; Fernández Do Porto, Darío A; Ruzal, Sandra M

    2018-02-15

    Lactobacillus helveticus is a lactic acid bacterium used traditionally in the dairy industry, especially in the manufacture of cheeses. We present here the 2,141,841-bp draft genome sequence of L. helveticus strain ATCC 12046, a potential starter strain for improving cheese production. Copyright © 2018 Palomino et al.

  16. Draft genome sequences of seven 4-Formylaminooxyvinylglycine producers belonging to the Pseudomonas fluorescens species complex

    USDA-ARS?s Scientific Manuscript database

    Vinylglycines are non-proteinogenic amino acids that inhibit amino acid metabolism and ethylene production. In this report, we describe the draft genome sequences of seven isolates of Pseudomonas that produce 4-formylaminooxyvinylglycine, a compound known to inhibit the germination of grasses and t...

  17. Draft Genome Sequence of Enterococcus faecium Strain J19, Isolated from Cabbage

    PubMed Central

    2018-01-01

    ABSTRACT Herein, we report the draft genome sequence of a newly discovered probiotic strain, Enterococcus faecium J19, which was isolated from cabbage. Strain J19 has shown antagonistic effects against the human foodborne pathogen Listeria monocytogenes in coculture and in different food matrices. PMID:29622613

  18. Draft Genome Sequence of the Putrescine-Producing Strain Lactococcus lactis subsp. lactis 1AA59

    PubMed Central

    del Rio, Beatriz; Linares, Daniel M.; Fernandez, María; Mayo, Baltasar; Martín, M. Cruz

    2015-01-01

    We report here the 2,576,542-bp genome annotated draft assembly sequence of Lactococcus lactis subsp. lactis 1AA59. This strain—isolated from a traditional cheese—produces putrescine, one of the most frequently biogenic amines found in dairy products. PMID:26089428

  19. Draft genome sequence analysis of multidrug-resistant Escherichia coli strains isolated in 2013 from humans and chickens in Nigeria

    USDA-ARS?s Scientific Manuscript database

    Here, we present the draft genome sequences of nine multidrug-resistant Escherichia coli isolated from humans (n=6) and chicken carcass (n=3) from Lagos, Nigeria in 2013. Multiple extended-spectrum beta-lactamase (ESBL) genes were identified in these isolates. ...

  20. Draft Sequences of the Radish (Raphanus sativus L.) Genome

    PubMed Central

    Kitashiba, Hiroyasu; Li, Feng; Hirakawa, Hideki; Kawanabe, Takahiro; Zou, Zhongwei; Hasegawa, Yoichi; Tonosaki, Kaoru; Shirasawa, Sachiko; Fukushima, Aki; Yokoi, Shuji; Takahata, Yoshihito; Kakizaki, Tomohiro; Ishida, Masahiko; Okamoto, Shunsuke; Sakamoto, Koji; Shirasawa, Kenta; Tabata, Satoshi; Nishio, Takeshi

    2014-01-01

    Radish (Raphanus sativus L., n = 9) is one of the major vegetables in Asia. Since the genomes of Brassica and related species including radish underwent genome rearrangement, it is quite difficult to perform functional analysis based on the reported genomic sequence of Brassica rapa. Therefore, we performed genome sequencing of radish. Short reads of genomic sequences of 191.1 Gb were obtained by next-generation sequencing (NGS) for a radish inbred line, and 76,592 scaffolds of ≥300 bp were constructed along with the bacterial artificial chromosome-end sequences. Finally, the whole draft genomic sequence of 402 Mb spanning 75.9% of the estimated genomic size and containing 61,572 predicted genes was obtained. Subsequently, 221 single nucleotide polymorphism markers and 768 PCR-RFLP markers were used together with the 746 markers produced in our previous study for the construction of a linkage map. The map was combined further with another radish linkage map constructed mainly with expressed sequence tag-simple sequence repeat markers into a high-density integrated map of 1,166 cM with 2,553 DNA markers. A total of 1,345 scaffolds were assigned to the linkage map, spanning 116.0 Mb. Bulked PCR products amplified by 2,880 primer pairs were sequenced by NGS, and SNPs in eight inbred lines were identified. PMID:24848699

  1. Draft genome sequence of Trametes villosa (Sw.) Kreisel CCMB561, a tropical white-rot Basidiomycota from the semiarid region of Brazil.

    PubMed

    Ferreira, Dalila Souza Santos; Kato, Rodrigo Bentes; Miranda, Fábio Malcher; da Costa Pinheiro, Kenny; Fonseca, Paula Luize Camargos; Tomé, Luiz Marcelo Ribeiro; Vaz, Aline Bruna Martins; Badotti, Fernanda; Ramos, Rommel Thiago Jucá; Brenig, Bertram; Azevedo, Vasco Ariston de Carvalho; Benevides, Raquel Guimarães; Góes-Neto, Aristóteles

    2018-06-01

    Herein, we present the draft genome of Trametes villosa isolate CCMB561, a wood-decaying Basidiomycota commonly found in tropical semiarid climate. The genome assembly was 57.98 Mb in size with an L50 of 691. A total of 16,711 putative protein-encoding genes was predicted, including 590 genes coding for carbohydrate-active enzymes (CAZy), directly involved in the decomposition of lignocellulosic materials. This is the first genome of this species of high interest in bioenergy research. The draft genome of Trametes villosa isolate CCMB561 will provide an important resource for future investigations in biofuel production, bioremediation and other green technologies.

  2. Draft genome sequence and genetic transformation of the oleaginous alga Nannochloropis gaditana

    PubMed Central

    Radakovits, Randor; Jinkerson, Robert E.; Fuerstenberg, Susan I.; Tae, Hongseok; Settlage, Robert E.; Boore, Jeffrey L.; Posewitz, Matthew C.

    2012-01-01

    The potential use of algae in biofuels applications is receiving significant attention. However, none of the current algal model species are competitive production strains. Here we present a draft genome sequence and a genetic transformation method for the marine microalga Nannochloropsis gaditana CCMP526. We show that N. gaditana has highly favourable lipid yields, and is a promising production organism. The genome assembly includes nuclear (~29 Mb) and organellar genomes, and contains 9,052 gene models. We define the genes required for glycerolipid biogenesis and detail the differential regulation of genes during nitrogen-limited lipid biosynthesis. Phylogenomic analysis identifies genetic attributes of this organism, including unique stramenopile photosynthesis genes and gene expansions that may explain the distinguishing photoautotrophic phenotypes observed. The availability of a genome sequence and transformation methods will facilitate investigations into N. gaditana lipid biosynthesis and permit genetic engineering strategies to further improve this naturally productive alga. PMID:22353717

  3. Draft genome sequence and genetic transformation of the oleaginous alga Nannochloropis gaditana.

    PubMed

    Radakovits, Randor; Jinkerson, Robert E; Fuerstenberg, Susan I; Tae, Hongseok; Settlage, Robert E; Boore, Jeffrey L; Posewitz, Matthew C

    2012-02-21

    The potential use of algae in biofuels applications is receiving significant attention. However, none of the current algal model species are competitive production strains. Here we present a draft genome sequence and a genetic transformation method for the marine microalga Nannochloropsis gaditana CCMP526. We show that N. gaditana has highly favourable lipid yields, and is a promising production organism. The genome assembly includes nuclear (~29 Mb) and organellar genomes, and contains 9,052 gene models. We define the genes required for glycerolipid biogenesis and detail the differential regulation of genes during nitrogen-limited lipid biosynthesis. Phylogenomic analysis identifies genetic attributes of this organism, including unique stramenopile photosynthesis genes and gene expansions that may explain the distinguishing photoautotrophic phenotypes observed. The availability of a genome sequence and transformation methods will facilitate investigations into N. gaditana lipid biosynthesis and permit genetic engineering strategies to further improve this naturally productive alga.

  4. First draft genome sequence of a strain from the genus Fusibacter isolated from Salar de Ascotán in Northern Chile.

    PubMed

    Serrano, Antonio E; Escudero, Lorena V; Tebes-Cayo, Cinthya; Acosta, Mauricio; Encalada, Olga; Fernández-Moroso, Sebastián; Demergasso, Cecilia

    2017-01-01

    Fusibacter sp . 3D3 (ATCC BAA-2418) is an arsenate-reducing halotolerant strain within the Firmicutes phylum, isolated from the Salar de Ascotán, a hypersaline salt flat in Northern Chile. This high-Andean closed basin is an athalassohaline environment located at the bottom of a tectonic basin surrounded by mountain range, including some active volcanoes. This landscape can be an advantageous system to explore the effect of salinity on microorganisms that mediate biogeochemical reactions. Since 2000, microbial reduction of arsenic has been evidenced in the system, and the phylogenetic analysis of the original community plus the culture enrichments has revealed the predominance of Firmicutes phylum. Here, we describe the first whole draft genome sequence of an arsenic-reducing strain belonging to the Fusibacter genus showing the highest 16S rRNA gene sequence similarity (98%) with Fusibacter sp. strain Vns02. The draft genome consists of 57 contigs with 5,111,250 bp and an average G + C content of 37.6%. Out of 4780 total genes predicted, 4700 genes code for proteins and 80 genes for RNAs. Insights from the genome sequence and some microbiological features of the strain 3D3 are available under Bioproject accession PRJDB4973 and Biosample SAMD00055724. The release of the genome sequence of this strain could contribute to the understanding of the arsenic biogeochemistry in extreme environments.

  5. Draft Genome Sequence of Pseudomonas putida CA-3, a Bacterium Capable of Styrene Degradation and Medium-Chain-Length Polyhydroxyalkanoate Synthesis

    PubMed Central

    Almeida, Eduardo L.; Margassery, Lekha M.; O’Leary, Niall

    2018-01-01

    ABSTRACT Pseudomonas putida strain CA-3 is an industrial bioreactor isolate capable of synthesizing biodegradable polyhydroxyalkanoate polymers via the metabolism of styrene and other unrelated carbon sources. The pathways involved are subject to regulation by global cellular processes. The draft genome sequence is 6,177,154 bp long and contains 5,608 predicted coding sequences. PMID:29371359

  6. Draft Genome Sequence of Leuconostoc mesenteroides P45 Isolated from Pulque, a Traditional Mexican Alcoholic Fermented Beverage

    PubMed Central

    Riveros-Mckay, Fernando; Campos, Itzia; Giles-Gómez, Martha; Bolívar, Francisco

    2014-01-01

    Leuconostoc mesenteroides P45 was isolated from the traditional Mexican pulque beverage. We report its draft genome sequence, assembled in 6 contigs consisting of 1,874,188 bp and no plasmids. Genome annotation predicted a total of 1,800 genes, 1,687 coding sequences, 52 pseudogenes, 9 rRNAs, 51 tRNAs, 1 noncoding RNA, and 44 frameshifted genes. PMID:25377708

  7. Draft Genome Sequence of Pedobacter sp. Strain V48, Isolated from a Coastal Sand Dune in the Netherlands

    PubMed Central

    Bitzer, Adam S.; Garbeva, Paolina

    2014-01-01

    Pedobacter sp. strain V48 participates in an interaction with Pseudomonas fluorescens which elicits interaction-induced phenotypes. We report the draft genome sequence of Pedobacter sp. V48, consisting of 6.46 Mbp. The sequence will contribute to improved understanding of the genus and facilitate genomic analysis of the model interspecies interaction with P. fluorescens. PMID:24578271

  8. Draft Genome Sequence of Pseudomonas chlororaphis ATCC 9446, a Nonpathogenic Bacterium with Bioremediation and Industrial Potential.

    PubMed

    Moreno-Avitia, Fabian; Lozano, Luis; Utrilla, Jose; Bolívar, Francisco; Escalante, Adelfo

    2017-06-08

    Pseudomonas chlororaphis strain ATCC 9446 is a biocontrol-related organism. We report here its draft genome sequence assembled into 35 contigs consisting of 6,783,030 bp. Genome annotation predicted a total of 6,200 genes, 6,128 coding sequences, 81 pseudogenes, 58 tRNAs, 4 noncoding RNAs (ncRNAs), and 41 frameshifted genes. Copyright © 2017 Moreno-Avitia et al.

  9. Draft Genome Sequence of Geobacillus sp. Isolate T6, a Thermophilic Bacterium Collected from a Thermal Spring in Argentina

    PubMed Central

    Ortiz, Elio M.; Berretta, Marcelo F.; Benintende, Graciela B.; Zandomeni, Rubén O.

    2015-01-01

    Geobacillus sp. isolate T6 was collected from a thermal spring in Salta, Argentina. The draft genome sequence (3,767,773 bp) of this isolate is represented by one major scaffold of 3,46 Mbp, a second one of 207 kbp, and 20 scaffolds of <13 kbp. The assembled sequences revealed 3,919 protein-coding genes. PMID:26184933

  10. Draft Genome Sequence of the Phytopathogenic Fungus Ganoderma boninense, the Causal Agent of Basal Stem Rot Disease on Oil Palm.

    PubMed

    Utomo, Condro; Tanjung, Zulfikar Achmad; Aditama, Redi; Buana, Rika Fithri Nurani; Pratomo, Antonius Dony Madu; Tryono, Reno; Liwang, Tony

    2018-04-26

    Ganoderma boninense is the dominant fungal pathogen of basal stem rot (BSR) disease on Elaeis guineensis We sequenced the nuclear genome of mycelia using both Illumina and Pacific Biosciences platforms for assembly of scaffolds. The draft genome comprised 79.24 Mb, 495 scaffolds, and 26,226 predicted coding sequences. Copyright © 2018 Utomo et al.

  11. Draft Genome Sequence of Pseudomonas oceani DSM 100277T, a Deep-Sea Bacterium

    PubMed Central

    2018-01-01

    ABSTRACT Pseudomonas oceani DSM 100277T was isolated from deep seawater in the Okinawa Trough at 1390 m. P. oceani belongs to the Pseudomonas pertucinogena group. Here, we report the draft genome sequence of P. oceani, which has an estimated size of 4.1 Mb and exhibits 3,790 coding sequences, with a G+C content of 59.94 mol%. PMID:29650573

  12. Draft Genome Sequence of Algoriphagus sp. Strain NH1, a Multidrug-Resistant Bacterium Isolated from Coastal Sediments of the Northern Yellow Sea in China

    PubMed Central

    Mu, Dashuai; Zhao, Jinxin; Wang, Zongjie; Chen, Guanjun

    2016-01-01

    Algoriphagus sp. NH1 is a multidrug-resistant bacterium isolated from coastal sediments of the northern Yellow Sea in China. Here, we report the draft genome sequence of NH1, with a size of 6,131,579 bp, average G+C content of 42.68%, and 5,746 predicted protein-coding sequences. PMID:26769940

  13. Draft Genome Sequences of 510 Listeria monocytogenes Strains from Food Isolates and Human Listeriosis Cases from Northern Italy.

    PubMed

    Lomonaco, Sara; Gallina, Silvia; Filipello, Virginia; Sanchez Leon, Maria; Kastanis, George John; Allard, Marc; Brown, Eric; Amato, Ettore; Pontello, Mirella; Decastelli, Lucia

    2018-01-18

    Listeriosis outbreaks are frequently multistate/multicountry outbreaks, underlining the importance of molecular typing data for several diverse and well-characterized isolates. Large-scale whole-genome sequencing studies on Listeria monocytogenes isolates from non-U.S. locations have been limited. Herein, we describe the draft genome sequences of 510 L. monocytogenes isolates from northern Italy from different sources.

  14. Draft Genome Sequence of Pseudomonas sp. Strain B1, Isolated from a Contaminated Sediment

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Pathak, Ashish; Jaswal, Rajneesh; Stothard, Paul

    ABSTRACT The draft genome sequence of Pseudomonas sp. strain B1, isolated from a contaminated soil, is reported. The genome comprises 6,706,934 bases, 6,059 coding sequences, and 70 RNAs and has a G+C content of 60.3%. A suite of biodegradative genes, many located on genomic islands, were identified from strain B1, further enhancing our understanding of the versatile pseudomonads.

  15. Draft Genome Sequence of Lactobacillus sp. Strain TCF032-E4, Isolated from Fermented Radish.

    PubMed

    Mao, Yuejian; Chen, Meng; Horvath, Philippe

    2015-07-30

    Here, we report the draft genome sequence of Lactobacillus sp. strain TCF032-E4 (= CCTCC AB2015090 = DSM 100358), isolated from a Chinese fermented radish. The total length of the 57 contigs is about 2.9 Mb, with a G+C content of 43.5 mol% and 2,797 predicted coding sequences (CDSs). Copyright © 2015 Mao et al.

  16. Draft Genome Sequence of Geobacillus kaustophilus GBlys, a Lysogenic Strain with Bacteriophage ϕOH2

    PubMed Central

    Mori, Kazuki; Martono, Hindra; Nagayoshi, Yuko; Fujino, Yasuhiro; Tashiro, Kosuke; Kuhara, Satoru; Ohshima, Toshihisa

    2013-01-01

    Geobacillus kaustophilus strain GBlys was isolated along with the bacteriophage ϕOH2, which infects G. kaustophilus NBRC 102445T. Here we present a draft sequence of this strain’s genome, which consists of 216 contigs for a total of 3,541,481 bp, 3,679 predicted coding sequences, and a G+C content of 52.1%. PMID:23950135

  17. Draft Genome Sequence of Cellulolytic and Xylanolytic Cellulomonas sp. Strain B6 Isolated from Subtropical Forest Soil

    PubMed Central

    Piccinni, Florencia; Murua, Yanina; Ghio, Silvina; Talia, Paola; Rivarola, Máximo

    2016-01-01

    Cellulomonas sp. strain B6 was isolated from a subtropical forest soil sample and presented (hemi)cellulose-degrading activity. We report here its draft genome sequence, with an estimated genome size of 4 Mb, a G+C content of 75.1%, and 3,443 predicted protein-coding sequences, 92 of which are glycosyl hydrolases involved in polysaccharide degradation. PMID:27563050

  18. Draft Genome Sequence of Cellulolytic and Xylanolytic Paenibacillus sp. A59, Isolated from Decaying Forest Soil from Patagonia, Argentina.

    PubMed

    Ghio, Silvina; Martinez Cáceres, Alfredo I; Talia, Paola; Grasso, Daniel H; Campos, Eleonora

    2015-10-22

    Paenibacillus sp. A59 was isolated from decaying forest soil in Argentina and characterized as a xylanolytic strain. We report the draft genome sequence of this isolate, with an estimated genome size of 7 Mb which harbor 6,424 coding sequences. Genes coding for hydrolytic enzymes involved in lignocellulose deconstruction were predicted. Copyright © 2015 Ghio et al.

  19. Draft Genome Sequence of Photorhabdus luminescens Strain DSPV002N Isolated from Santa Fe, Argentina

    PubMed Central

    Del Valle, Eleodoro E.; Frizzo, Laureano; Berry, Colin; Caballero, Primitivo

    2016-01-01

    Here, we report the draft genome sequence of Photorhabdus luminescens strain DSPV002N, which consists of 177 contig sequences accounting for 5,518,143 bp, with a G+C content of 42.3% and 4,701 predicted protein-coding genes (CDSs). From these, 27 CDSs exhibited significant similarity with insecticidal toxin proteins from Photorhabdus luminescens subsp. laumondii TT01. PMID:27469965

  20. Draft Genome Sequence of Paenibacillus sp. Strain DMB20, Isolated from Alang Ship-Breaking Yard, Which Harbors Genes for Xenobiotic Degradation

    PubMed Central

    Shah, Binal; Jain, Kunal; Patel, Namrata; Pandit, Ramesh; Patel, Anand; Joshi, Chaitanya G.

    2015-01-01

    Paenibacillus sp. strain DMB20, in cometabolism with other Proteobacteria and Firmicutes, exhibits azoreduction of textile dyes. Here, we report the draft genome sequence of this bacterium, consisting of 6,647,181 bp with 7,668 coding sequences (CDSs). The data presented highlight multiple sets of functional genes associated with xenobiotic compound degradation. PMID:26067950

  1. Draft Genome Sequence of Pseudomonas sp. Strain B1, Isolated from a Contaminated Sediment

    DOE PAGES

    Pathak, Ashish; Jaswal, Rajneesh; Stothard, Paul; ...

    2018-06-21

    ABSTRACT The draft genome sequence of Pseudomonas sp. strain B1, isolated from a contaminated soil, is reported. The genome comprises 6,706,934 bases, 6,059 coding sequences, and 70 RNAs and has a G+C content of 60.3%. A suite of biodegradative genes, many located on genomic islands, were identified from strain B1, further enhancing our understanding of the versatile pseudomonads.

  2. Draft Genome Sequences of Biosafety Level 2 Opportunistic Pathogens Isolated from the Environmental Surfaces of the International Space Station.

    PubMed

    Checinska Sielaff, Aleksandra; Singh, Nitin K; Allen, Jonathan E; Thissen, James; Jaing, Crystal; Venkateswaran, Kasthuri

    2016-12-29

    The draft genome sequences of 20 biosafety level 2 (BSL-2) opportunistic pathogens isolated from the environmental surfaces of the International Space Station (ISS) were presented. These genomic sequences will help in understanding the influence of microgravity on the pathogenicity and virulence of these strains when compared with Earth strains. Copyright © 2016 Checinska Sielaff et al.

  3. Draft Genome Sequence of Thermus scotoductus Strain K1, Isolated from a Geothermal Spring in Karvachar, Nagorno Karabakh.

    PubMed

    Saghatelyan, Ani; Poghosyan, Lianna; Panosyan, Hovik; Birkeland, Nils-Kåre

    2015-11-12

    The 2,379,636-bp draft genome sequence of Thermus scotoductus strain K1, isolated from geothermal spring outlet located in the Karvachar region in Nagorno Karabakh is presented. Strain K1 shares about 80% genome sequence similarity with T. scotoductus strain SA-01, recovered from a deep gold mine in South Africa. Copyright © 2015 Saghatelyan et al.

  4. Draft Genome Sequences of Three Novel Low-Abundance Species Strains Isolated from Kefir Grain.

    PubMed

    Kim, Yongkyu; Blasche, Sonja; Patil, Kiran R

    2017-09-28

    We report here the genome sequences of three novel bacterial species strains- Bacillus kefirresidentii Opo, Rothia kefirresidentii KRP, and Streptococcus kefirresidentii YK-isolated from kefir grains collected in Germany. The draft genomes of these isolates were remarkably dissimilar (average nucleotide identities, 77.80%, 89.01%, and 92.10%, respectively) to those of the previously sequenced strains. Copyright © 2017 Kim et al.

  5. High quality draft genome of Nakamurella lactea type strain, a rock actinobacterium, and emended description of Nakamurella lactea

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nouioui, Imen; Göker, Markus; Carro, Lorena

    Nakamurella lactea DLS-10 T , isolated from rock in Korea, is one of the four type strains of the genus Nakamurella. In this study, we describe the high quality draft genome of N. lactea DLS-10 T and its annotation. A summary of phenotypic data collected from previously published studies was also included. The genome of strain DLS-10 T presents a size of 5.82 Mpb, 5100 protein coding genes, and a C + G content of 68.9%. Based on the genome analysis, emended description of N. lactea in terms of G + C content was also proposed.

  6. High quality draft genome of Nakamurella lactea type strain, a rock actinobacterium, and emended description of Nakamurella lactea

    DOE PAGES

    Nouioui, Imen; Göker, Markus; Carro, Lorena; ...

    2017-01-06

    Nakamurella lactea DLS-10 T , isolated from rock in Korea, is one of the four type strains of the genus Nakamurella. In this study, we describe the high quality draft genome of N. lactea DLS-10 T and its annotation. A summary of phenotypic data collected from previously published studies was also included. The genome of strain DLS-10 T presents a size of 5.82 Mpb, 5100 protein coding genes, and a C + G content of 68.9%. Based on the genome analysis, emended description of N. lactea in terms of G + C content was also proposed.

  7. Genomic Sequence of Saccharomyces cerevisiae BAW-6, a Yeast Strain Optimal for Brewing Barley Shochu.

    PubMed

    Kajiwara, Yasuhiro; Mori, Kazuki; Tashiro, Kosuke; Higuchi, Yujiro; Takegawa, Kaoru; Takashita, Hideharu

    2018-04-05

    Here, we report the draft genome sequence of Saccharomyces cerevisiae strain BAW-6, which is used for the production of barley shochu, a traditional Japanese spirit. This genomic information can be used to elucidate the genetic basis underlying the high alcohol production capacity and citric acid tolerance of shochu yeast. Copyright © 2018 Kajiwara et al.

  8. Genomic Sequence of Saccharomyces cerevisiae BAW-6, a Yeast Strain Optimal for Brewing Barley Shochu

    PubMed Central

    Mori, Kazuki; Tashiro, Kosuke; Higuchi, Yujiro; Takashita, Hideharu

    2018-01-01

    ABSTRACT Here, we report the draft genome sequence of Saccharomyces cerevisiae strain BAW-6, which is used for the production of barley shochu, a traditional Japanese spirit. This genomic information can be used to elucidate the genetic basis underlying the high alcohol production capacity and citric acid tolerance of shochu yeast. PMID:29622617

  9. [How to establish a good acupuncture-moxibustion standard?].

    PubMed

    Wu, Xiao-dong; Xiao, Hui

    2014-10-01

    At the beginning of a standard item, the standardized objects and involved contents should be demonstrated thoroughly, which is the precondition of establishing a good standard. After the proposal of this standard, a high-level drafting group should be built, led by top specialists who also draft the standard, which is essential to guarantee the quality of the standard. Before drafting the standard, literature regarding this standard should be searched completely, and Directives for Standardization should be learned to understand the basic requirements of establishing a standard; in the meanwhile, selections on standardized contents and quantitative boundaries of technical indices should be comprehensively and deeply studied. At the stage of consultation, focus should be paid on the scope of the consultation departments, level and personnel quality. As for standard review, it should be precise and truth-seeking. At the stage of submitting and authorization, it is necessary to have timely communication. Only by full cooperations of all parties, and by strictly following the procedure, method and rule of standard establishment, can a high-quality acupuncture-moxibustion standard be established.

  10. Draft genome sequences of four Streptomyces isolates from the Populus trichocarpa root endosphere and rhizosphere

    DOE PAGES

    Klingeman, Dawn M.; Utturkar, Sagar; Lu, Tse -Yuan S.; ...

    2015-11-12

    Draft genome sequences for four Actinobacteria from the genus Streptomyces are presented. Streptomyces is a metabolically diverse genus that is abundant in soils and has been reported in association with plants. The strains described in this study were isolated from the Populus trichocarpa endosphere and rhizosphere.

  11. Draft Genome Sequence of Leptospira interrogans Serovar Bataviae Strain LepIMR 22 Isolated from a Rodent in Johor, Malaysia

    PubMed Central

    Amran, Fairuz; Mohamad, Saharuddin; Mat Ripen, Adiratna; Ahmad, Norazah; Goris, Marga G. A.; Muhammad, Ayu Haslin; Noor Halim, Nurul Atiqah

    2016-01-01

    Leptospira interrogans serovar Bataviae was recently identified as one of the persistent Leptospira serovars in Malaysia. Here, we report the draft genome sequence of the L. interrogans serovar Bataviae strain LepIMR 22 isolated from kidney of a rodent in Johor, Malaysia. PMID:27609924

  12. Draft Genome Sequence of Staphylococcus aureus Strain HD1410, Isolated from a Persistent Nasal Carrier.

    PubMed

    Nurjadi, Dennis; Boutin, Sébastien; Dalpke, Alexander; Heeg, Klaus; Zanger, Philipp

    2018-05-10

    We report here the draft genome sequence of a Staphylococcus aureus strain isolated from the nares of an 18-year-old female healthy persistent-carrier individual, and it was used to investigate S. aureus -specific immune responses in colonized and noncolonized individuals. Copyright © 2018 Nurjadi et al.

  13. Draft Genome Sequence of Staphylococcus aureus Strain HD1410, Isolated from a Persistent Nasal Carrier

    PubMed Central

    Boutin, Sébastien; Dalpke, Alexander; Heeg, Klaus; Zanger, Philipp

    2018-01-01

    ABSTRACT We report here the draft genome sequence of a Staphylococcus aureus strain isolated from the nares of an 18-year-old female healthy persistent-carrier individual, and it was used to investigate S. aureus-specific immune responses in colonized and noncolonized individuals. PMID:29748411

  14. Draft Genome Sequence of Mycobacterium triplex DSM 44626.

    PubMed

    Sassi, Mohamed; Croce, Olivier; Robert, Catherine; Raoult, Didier; Drancourt, Michel

    2014-05-29

    We announce the draft genome sequence of Mycobacterium triplex strain DSM 44626, a nontuberculosis species responsible for opportunistic infections. The genome described here is composed of 6,382,840 bp, with a G+C content of 66.57%, and contains 5,988 protein-coding genes and 81 RNA genes. Copyright © 2014 Sassi et al.

  15. Draft Genome Sequence of Leuconostoc mesenteroides 213M0, Isolated from Traditional Fermented Mare Milk Airag in Bulgan Aimag, Mongolia.

    PubMed

    Morita, Hidetoshi; Toh, Hidehiro; Oshima, Kenshiro; Nakano, Akiyo; Hano, Chihiro; Yoshida, Saki; Bolormaa, Tsognemekh; Burenjargal, Sedkhuu; Nguyen, Co Thi Kim; Tashiro, Kosuke; Arakawa, Kensuke; Miyamoto, Taku

    2016-03-31

    Leuconostoc mesenteroides213M0 was isolated from traditional fermented mare milk airag in Bulgan Aimag, Mongolia. This strain produces a listericidal bacteriocin-like inhibitory substance. Here, we report the draft genome sequence of this organism. Copyright © 2016 Morita et al.

  16. Draft Genome Sequence of Komagataeibacter intermedius Strain AF2, a Producer of Cellulose, Isolated from Kombucha Tea

    PubMed Central

    dos Santos, Renato Augusto Corrêa; Berretta, Andresa Aparecida; Barud, Hernane da Silva; Ribeiro, Sidney José Lima; González-García, Laura Natalia; Zucchi, Tiago Domingues

    2015-01-01

    Here, we present the draft genome sequence of Komagataeibacter intermedius strain AF2, which was isolated from Kombucha tea and is capable of producing cellulose, although at lower levels compared to another bacterium from the same environment, K. rhaeticus strain AF1. PMID:26634755

  17. Draft genome sequence of the oleaginous yeast Cryptococcus curvatus ATCC 20509

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Close, Dan; Ojumu, John O.

    Cryptococcus curvatus ATCC 20509 is a commonly used nonmodel oleaginous yeast capable of converting a variety of carbon sources into fatty acids. In addition, we present the draft genome sequence of this popular organism to provide a means for more in-depth studies of its fatty acid production potential.

  18. Permanent Draft Genome Sequence of Nocardia sp. BMG111209, an Actinobacterium Isolated from Nodules of Casuarina glauca

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ghodhbane-Gtari, Faten; Beauchemin, Nicholas; Gueddou, Abdellatif

    Nocardiasp. strain BMG111209 is a non-Frankiaactinobacterium isolated from root nodules ofCasuarina glaucain Tunisia. Here, we report the 9.1-Mbp draft genome sequence ofNocardiasp. strain BMG111209 with a G + C content of 69.19% and 8,122 candidate protein-encoding genes.

  19. Draft Genome Sequences of Phenotypically Distinct Janthinobacterium sp. Isolates Cultured from the Hudson Valley Watershed

    PubMed Central

    Bettina, Alexandra M.; Doing, Georgia; O’Brien, Kelsey

    2018-01-01

    ABSTRACT Investigation of the Hudson Valley watershed reveals many violacein-producing bacteria. These are of interest for their biotherapeutic potential in treating chytrid infections of amphibians. The draft whole-genome sequences for seven Janthinobacterium isolates with a variety of phenotypes are provided in this study. PMID:29348334

  20. Draft genome sequences of 1 MSSA and 7 MRSA ST5 isolates obtained from California

    USDA-ARS?s Scientific Manuscript database

    Staphylococcus aureus is a commensal of humans that can cause a spectrum of diseases. An isolate’s capacity to cause disease is partially attributed to the acquisition of novel mobile genetic elements. This report provides the draft genome sequence of one methicillin susceptible and seven methicilli...

  1. Draft Genome Sequence of Lactobacillus pobuzihii E100301T.

    PubMed

    Chiu, Chi-Ming; Chang, Chi-Huan; Pan, Shwu-Fen; Wu, Hui-Chung; Li, Shiao-Wen; Chang, Chuan-Hsiung; Lee, Yun-Shien; Chiang, Chih-Ming; Chen, Yi-Sheng

    2013-05-09

    Lactobacillus pobuzihii E100301(T) is a novel Lactobacillus species previously isolated from pobuzihi (fermented cummingcordia) in Taiwan. Phylogenetically, this strain is closest to Lactobacillus acidipiscis, but its phenotypic characteristics can be clearly distinguished from those of L. acidipiscis. We present the draft genome sequence of strain L. pobuzihii E100301(T).

  2. Draft Genome Sequence of Bacillus coagulans GBI-30, 6086, a Widely Used Spore-Forming Probiotic Strain

    PubMed Central

    Orrù, Luigi; Salvetti, Elisa; Cattivelli, Luigi; Lamontanara, Antonella; Michelotti, Vania; Capozzi, Vittorio; Spano, Giuseppe; Keller, David; Cash, Howard; Martina, Alessia; Felis, Giovanna E.

    2014-01-01

    Bacillus coagulans GBI-30, 6086 is a safe strain, already available on the market, and characterized by certified beneficial effects. The draft genome sequence presented here constitutes the first pillar toward the identification of the molecular mechanisms responsible for its positive features and safety. PMID:25377698

  3. Draft Genome Sequence of Xylella fastidiosa subsp. fastidiosa Strain Stag's Leap.

    PubMed

    Chen, J; Wu, F; Zheng, Z; Deng, X; Burbank, L P; Stenger, D C

    2016-04-21

    ITALIC! Xylella fastidiosasubsp. ITALIC! fastidiosacauses Pierce's disease of grapevine. Presented here is the draft genome sequence of the Stag's Leap strain, previously used in pathogenicity/virulence assays to evaluate grapevine germplasm bearing Pierce's disease resistance and a phenotypic assessment of knockout mutants to determine gene function. Copyright © 2016 Chen et al.

  4. Draft genome sequence of ‘Candidatus Phytoplasma pruni’ strain CX, a plant pathogenic bacterium

    USDA-ARS?s Scientific Manuscript database

    ‘Candidatus Phytoplasma pruni’ strain CX, belonging to subgroup 16SrIII-A, is a plant pathogenic bacterium causing economically important diseases in many fruit crops. Here we report the draft genome sequence that consists of 598,508 bases, with a G+C content of 27.21 mol%. ...

  5. Draft Genome Sequences of 116 Campylobacter jejuni Strains Isolated from Humans, Animals, Food, and the Environment in Brazil.

    PubMed

    Frazão, Miliane Rodrigues; Cao, Guojie; Medeiros, Marta Inês Cazentini; Duque, Sheila da Silva; Leon, Maria Sanchez; Allard, Marc William; Falcão, Juliana Pfrimer

    2018-04-19

    Campylobacter jejuni is a major zoonotic pathogen that causes foodborne gastroenteritis worldwide. However, clinical cases of campylobacteriosis have been underreported and underdiagnosed in Brazil. Herein, we describe the draft genome sequences of 116 C. jejuni strains isolated from diverse sources in Brazil.

  6. Draft Genome Sequence of Lactobacillus farciminis NBRC 111452, Isolated from Kôso, a Japanese Sugar-Vegetable Fermented Beverage

    PubMed Central

    Oshima, Kenshiro; Suda, Wataru; Hattori, Masahira; Takahashi, Tomoya

    2016-01-01

    Here, we report the draft genome sequence of the Lactobacillus farciminis strain NBRC 111452, isolated from kôso, a Japanese sugar-vegetable fermented beverage. This genome information is of potential use in studies of Lactobacillus farciminis as a probiotic. PMID:26769925

  7. Draft genome sequence analysis of eight streptogramin-resistant Enterococcus species isolated from animal and environmental sources in the US

    USDA-ARS?s Scientific Manuscript database

    Here, we present the draft genome sequences of eight streptogramin-resistant Enterococcus spp. (n=8) isolated from animals and an environmental source in the US from 2001-2004. Antimicrobial resistance genes were identified conferring resistance to the macrolide-lincosamide-streptogramins, aminoglyc...

  8. Draft genome sequence of the oleaginous yeast Cryptococcus curvatus ATCC 20509

    DOE PAGES

    Close, Dan; Ojumu, John O.

    2016-11-03

    Cryptococcus curvatus ATCC 20509 is a commonly used nonmodel oleaginous yeast capable of converting a variety of carbon sources into fatty acids. In addition, we present the draft genome sequence of this popular organism to provide a means for more in-depth studies of its fatty acid production potential.

  9. Draft genomic sequencing of six potential extraintestinal pathogenic Escherichia coli isolates from retail chicken meat.

    USDA-ARS?s Scientific Manuscript database

    Potential Extraintestinal pathogenic Escherichia coli isolates DP254, WH333, WH398, F356, FEX675 and FEX725 were isolated from retail chicken meat products. Here, we report the draft genome sequences for these six E. coli isolates, which are currently being used in food safety research....

  10. Draft Genome Sequence of Campylobacter jejuni 11168H

    PubMed Central

    Macdonald, Sarah E.; Gundogdu, Ozan; Dorrell, Nick; Wren, Brendan W.; Blake, Damer

    2017-01-01

    ABSTRACT Campylobacter jejuni is the most prevalent cause of food-borne gastroenteritis in the developed world. The reference and original sequenced strain C. jejuni NCTC11168 has low levels of motility compared to clinical isolates. Here, we describe the draft genome of the laboratory derived hypermotile variant named 11168H. PMID:28153902

  11. Permanent Draft Genome Sequence of Nocardia sp. BMG111209, an Actinobacterium Isolated from Nodules of Casuarina glauca

    DOE PAGES

    Ghodhbane-Gtari, Faten; Beauchemin, Nicholas; Gueddou, Abdellatif; ...

    2016-08-04

    Nocardiasp. strain BMG111209 is a non-Frankiaactinobacterium isolated from root nodules ofCasuarina glaucain Tunisia. Here, we report the 9.1-Mbp draft genome sequence ofNocardiasp. strain BMG111209 with a G + C content of 69.19% and 8,122 candidate protein-encoding genes.

  12. Draft Genome Sequence of Lactobacillus salivarius SGL 03, a Novel Potential Probiotic Strain.

    PubMed

    Federici, Federica; Manna, Laura; Rizzi, Eleonora; Galantini, Elena; Marini, Umberto

    2017-12-07

    In this work, we report the draft genome sequence of Lactobacillus salivarius SGL 03, a novel potential probiotic strain isolated from healthy infant stools. Antibiotic resistance analysis revealed the presence of a tetracycline resistance gene without elements potentially responsible for interspecific horizontal gene transfer. Copyright © 2017 Federici et al.

  13. Draft Genome Sequence of Lactobacillus johnsonii Strain 16, Isolated from Mice.

    PubMed

    Buhnik-Rosenblau, Keren; Danin-Poleg, Yael; Elgavish, Sharona; Kashi, Yechezkel

    2015-10-08

    Here, we report the genome sequence of Lactobacillus johnsonii, a member of the gut lactobacilli. This draft genome of L. johnsonii strain 16 isolated from C57BL/6J mice enables the identification of bacterial genes responsible for host-specific gut persistence. Copyright © 2015 Buhnik-Rosenblau et al.

  14. Assembly of the draft genome of buckwheat and its applications in identifying agronomically useful genes.

    PubMed

    Yasui, Yasuo; Hirakawa, Hideki; Ueno, Mariko; Matsui, Katsuhiro; Katsube-Tanaka, Tomoyuki; Yang, Soo Jung; Aii, Jotaro; Sato, Shingo; Mori, Masashi

    2016-06-01

    Buckwheat (Fagopyrum esculentum Moench; 2n = 2x = 16) is a nutritionally dense annual crop widely grown in temperate zones. To accelerate molecular breeding programmes of this important crop, we generated a draft assembly of the buckwheat genome using short reads obtained by next-generation sequencing (NGS), and constructed the Buckwheat Genome DataBase. After assembling short reads, we determined 387,594 scaffolds as the draft genome sequence (FES_r1.0). The total length of FES_r1.0 was 1,177,687,305 bp, and the N50 of the scaffolds was 25,109 bp. Gene prediction analysis revealed 286,768 coding sequences (CDSs; FES_r1.0_cds) including those related to transposable elements. The total length of FES_r1.0_cds was 212,917,911 bp, and the N50 was 1,101 bp. Of these, the functions of 35,816 CDSs excluding those for transposable elements were annotated by BLAST analysis. To demonstrate the utility of the database, we conducted several test analyses using BLAST and keyword searches. Furthermore, we used the draft genome as a reference sequence for NGS-based markers, and successfully identified novel candidate genes controlling heteromorphic self-incompatibility of buckwheat. The database and draft genome sequence provide a valuable resource that can be used in efforts to develop buckwheat cultivars with superior agronomic traits. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  15. Draft genome sequence of two Shingopyxis sp. strains H107 and H115 isolated from a chloraminated drinking water distriburion system simulator

    EPA Pesticide Factsheets

    Draft genome sequence of two Shingopyxis sp. strains H107 and H115 isolated from a chloraminated drinking water distriburion system simulatorThis dataset is associated with the following publication:Gomez-Alvarez, V., S. Pfaller , and R. Revetta. Draft Genome of Two Sphingopyxis sp. Strains, Dominant Members of the Bacterial Community Associated with a Drinking Water Distribution System Simulator. Genome Announcements. American Society for Microbiology, Washington, DC, USA, 4(2): e00183-16, (2016).

  16. High quality draft genome sequence of Leucobacter chironomi strain MM2LBT (DSM 19883T) isolated from a Chironomus sp. egg mass

    DOE PAGES

    Laviad, Sivan; Lapidus, Alla; Copeland, Alex; ...

    2015-05-08

    Leucobacter chironomi strain MM2LBT (Halpern et al., Int J Syst Evol Microbiol 59:665-70 2009) is a Gram-positive, rod shaped, non-motile, aerobic, chemoorganotroph bacterium. L. chironomi belongs to the family Microbacteriaceae, a family within the class Actinobacteria. Strain MM2LBT was isolated from a chironomid (Diptera; Chironomidae) egg mass that was sampled from a waste stabilization pond in northern Israel. In a phylogenetic tree based on 16S rRNA gene sequences, strain MM2LBT formed a distinct branch within the radiation encompassing the genus Leucobacter. Here we describe the features of this organism, together with the complete genome sequence and annotation. We find thatmore » the DNA GC content is 69.90%. The chromosome length is 2,964,712 bp. It encodes 2,690 proteins and 61 RNA genes. L. chironomi genome is part of the Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes (KMG) project.« less

  17. Draft Genome Sequence of a Novel Chitinophaga sp. Strain, MD30, Isolated from a Biofilm in an Air Conditioner Condensate Pipe

    PubMed Central

    Darris, Maxwell

    2017-01-01

    ABSTRACT Most of the 24 known Chitinophaga species were originally isolated from soils. We report the draft genome sequence of a putatively novel Chitinophaga sp. from a biofilm in an air conditioner condensate pipe. The genome comprises 7,661,303 bp in one scaffold, 5,694 predicted protein-coding sequences, and a G+C content of 47.6%. PMID:29051259

  18. Draft Genome Sequence of Pseudomonas putida CA-3, a Bacterium Capable of Styrene Degradation and Medium-Chain-Length Polyhydroxyalkanoate Synthesis.

    PubMed

    Almeida, Eduardo L; Margassery, Lekha M; O'Leary, Niall; Dobson, Alan D W

    2018-01-25

    Pseudomonas putida strain CA-3 is an industrial bioreactor isolate capable of synthesizing biodegradable polyhydroxyalkanoate polymers via the metabolism of styrene and other unrelated carbon sources. The pathways involved are subject to regulation by global cellular processes. The draft genome sequence is 6,177,154 bp long and contains 5,608 predicted coding sequences. Copyright © 2018 Almeida et al.

  19. Draft Genome Sequence of d-Branched-Chain Amino Acid Producer Lactobacillus otakiensis JCM 15040T, Isolated from a Traditional Japanese Pickle

    PubMed Central

    Mori, Kazuki; Mutaguchi, Yuta; Tashiro, Kosuke; Fujino, Yasuhiro; Ohmori, Taketo; Kuhara, Satoru; Ohshima, Toshihisa

    2013-01-01

    Lactobacillus otakiensis strain JCM 15040T was isolated from an unsalted pickling solution used in the production of sunki, a traditional Japanese pickle. Here, we prepared a draft genome sequence for this strain consisting of 40 contigs containing a total of 2,347,132 bp, 2,310 predicted coding sequences, and a G+C content of 42.4%. PMID:23929467

  20. Draft Genome Sequence of Cellulolytic and Xylanolytic Cellulomonas sp. Strain B6 Isolated from Subtropical Forest Soil.

    PubMed

    Piccinni, Florencia; Murua, Yanina; Ghio, Silvina; Talia, Paola; Rivarola, Máximo; Campos, Eleonora

    2016-08-25

    Cellulomonas sp. strain B6 was isolated from a subtropical forest soil sample and presented (hemi)cellulose-degrading activity. We report here its draft genome sequence, with an estimated genome size of 4 Mb, a G+C content of 75.1%, and 3,443 predicted protein-coding sequences, 92 of which are glycosyl hydrolases involved in polysaccharide degradation. Copyright © 2016 Piccinni et al.

  1. Draft Genome Sequence of Pseudomonas sp. Strain Ep R1 Isolated from Echinacea purpurea Roots and Effective in the Growth Inhibition of Human Opportunistic Pathogens Belonging to the Burkholderia cepacia Complex.

    PubMed

    Maggini, Valentina; Presta, Luana; Miceli, Elisangela; Fondi, Marco; Bosi, Emanuele; Chiellini, Carolina; Fagorzi, Camilla; Bogani, Patrizia; Di Pilato, Vincenzo; Rossolini, Gian Maria; Mengoni, Alessio; Firenzuoli, Fabio; Perrin, Elena; Fani, Renato

    2017-05-18

    In this announcement, we detail the draft genome sequence of the Pseudomonas sp. strain Ep R1, isolated from the roots of the medicinal plant Echinacea purpurea The elucidation of this genome sequence may allow the identification of genes associated with the production of antimicrobial compounds. Copyright © 2017 Maggini et al.

  2. Draft Genome Sequence of Pseudomonas sp. Strain Ep R1 Isolated from Echinacea purpurea Roots and Effective in the Growth Inhibition of Human Opportunistic Pathogens Belonging to the Burkholderia cepacia Complex

    PubMed Central

    Maggini, Valentina; Presta, Luana; Miceli, Elisangela; Fondi, Marco; Bosi, Emanuele; Chiellini, Carolina; Fagorzi, Camilla; Bogani, Patrizia; Di Pilato, Vincenzo; Rossolini, Gian Maria; Mengoni, Alessio; Firenzuoli, Fabio; Perrin, Elena

    2017-01-01

    ABSTRACT In this announcement, we detail the draft genome sequence of the Pseudomonas sp. strain Ep R1, isolated from the roots of the medicinal plant Echinacea purpurea. The elucidation of this genome sequence may allow the identification of genes associated with the production of antimicrobial compounds. PMID:28522712

  3. Draft Genome Sequence of Paenibacillus sp. Strain DMB20, Isolated from Alang Ship-Breaking Yard, Which Harbors Genes for Xenobiotic Degradation.

    PubMed

    Shah, Binal; Jain, Kunal; Patel, Namrata; Pandit, Ramesh; Patel, Anand; Joshi, Chaitanya G; Madamwar, Datta

    2015-06-11

    Paenibacillus sp. strain DMB20, in cometabolism with other Proteobacteria and Firmicutes, exhibits azoreduction of textile dyes. Here, we report the draft genome sequence of this bacterium, consisting of 6,647,181 bp with 7,668 coding sequences (CDSs). The data presented highlight multiple sets of functional genes associated with xenobiotic compound degradation. Copyright © 2015 Shah et al.

  4. Draft Genome Sequence of Cryophilic Basidiomycetous Yeast Mrakia blollopis SK-4, Isolated from an Algal Mat of Naga-ike Lake in the Skarvsnes Ice-Free Area, East Antarctica.

    PubMed

    Tsuji, Masaharu; Kudoh, Sakae; Hoshino, Tamotsu

    2015-01-22

    Mrakia blollopis strain SK-4 was isolated from an algal mat of Naga-ike, a lake in Skarvsnes, East Antarctica. Here, we report the draft genome sequence of M. blollopis SK-4. This is the first report on the genome sequence of any cold-adapted fungal species. Copyright © 2015 Tsuji et al.

  5. Draft Genome Sequence of the Fish Pathogen Yersinia ruckeri Strain 37551, Serotype O1b, Isolated from Diseased, Vaccinated Atlantic Salmon (Salmo salar) in Chile

    PubMed Central

    Navas, Esteban; Bohle, Harry; Henríquez, Patricio; Grothusen, Horst; Bustamante, Fernando; Bustos, Patricio

    2014-01-01

    We sequenced the genome of a motile O1b Yersinia ruckeri field isolate from Chile, which is causing enteric redmouth disease (ERM) in vaccinated Atlantic salmon (Salmo salar). The draft genome has 3,775,486 bp, a G+C content of 47.1%, and is predicted to contain 3,406 coding sequences. PMID:25169862

  6. Genome Sequence of Novosphingobium lindaniclasticum LE124T, Isolated from a Hexachlorocyclohexane Dumpsite

    PubMed Central

    Saxena, Anjali; Nayyar, Namita; Sangwan, Naseer; Kumari, Rashmi; Khurana, J. P.

    2013-01-01

    Novosphingobium lindaniclasticum LE124T is a hexachlorocyclohexane (HCH)-degrading bacterium isolated from a high-dosage-point HCH dumpsite (450 mg HCH/g soil) located in Lucknow, India (27°00′N and 81°09′E). Here, we present the annotated draft genome sequence of strain LE124T, which has an estimated size of 4.86 Mb and is comprised of 4,566 coding sequences. PMID:24029761

  7. CoCoNUT: an efficient system for the comparison and analysis of genomes

    PubMed Central

    2008-01-01

    Background Comparative genomics is the analysis and comparison of genomes from different species. This area of research is driven by the large number of sequenced genomes and heavily relies on efficient algorithms and software to perform pairwise and multiple genome comparisons. Results Most of the software tools available are tailored for one specific task. In contrast, we have developed a novel system CoCoNUT (Computational Comparative geNomics Utility Toolkit) that allows solving several different tasks in a unified framework: (1) finding regions of high similarity among multiple genomic sequences and aligning them, (2) comparing two draft or multi-chromosomal genomes, (3) locating large segmental duplications in large genomic sequences, and (4) mapping cDNA/EST to genomic sequences. Conclusion CoCoNUT is competitive with other software tools w.r.t. the quality of the results. The use of state of the art algorithms and data structures allows CoCoNUT to solve comparative genomics tasks more efficiently than previous tools. With the improved user interface (including an interactive visualization component), CoCoNUT provides a unified, versatile, and easy-to-use software tool for large scale studies in comparative genomics. PMID:19014477

  8. Leukotriene signaling in the extinct human subspecies Homo denisovan and Homo neanderthalensis. Structural and functional comparison with Homo sapiens.

    PubMed

    Adel, Susan; Kakularam, Kumar Reddy; Horn, Thomas; Reddanna, Pallu; Kuhn, Hartmut; Heydeck, Dagmar

    2015-01-01

    Mammalian lipoxygenases (LOXs) have been implicated in cell differentiation and in the biosynthesis of pro- and anti-inflammatory lipid mediators. The initial draft sequence of the Homo neanderthalensis genome (coverage of 1.3-fold) suggested defective leukotriene signaling in this archaic human subspecies since expression of essential proteins appeared to be corrupted. Meanwhile high quality genomic sequence data became available for two extinct human subspecies (H. neanderthalensis, Homo denisovan) and completion of the human 1000 genome project provided a comprehensive database characterizing the genetic variability of the human genome. For this study we extracted the nucleotide sequences of selected eicosanoid relevant genes (ALOX5, ALOX15, ALOX12, ALOX15B, ALOX12B, ALOXE3, COX1, COX2, LTA4H, LTC4S, ALOX5AP, CYSLTR1, CYSLTR2, BLTR1, BLTR2) from the corresponding databases. Comparison of the deduced amino acid sequences in connection with site-directed mutagenesis studies and structural modeling suggested that the major enzymes and receptors of leukotriene signaling as well as the two cyclooxygenase isoforms were fully functional in these two extinct human subspecies. Copyright © 2014 Elsevier Inc. All rights reserved.

  9. Resources for Ensuring Quality School-to-Work Opportunities for Young Women. Draft.

    ERIC Educational Resources Information Center

    Wider Opportunities for Women, Inc., Washington, DC.

    This annotated bibliography lists 49 resources for ensuring high quality school-to-work opportunities for young women. These resources are grouped into 10 categories: print material for middle and high school girls; videos for middle and high school girls; administrator/school guides; curriculum guides/resources for teachers; resources for…

  10. The draft genome of whitefly Bemisia tabaci MEAM1, a global crop pest, provides novel insights into virus transmission, host adaptation, and insecticide resistance

    USDA-ARS?s Scientific Manuscript database

    Whiteflies are among the most important agricultural pests. They have a broad range of host plants and exceptional ability to transmit a large number of plant viruses, and can rapidly evolve insecticide resistance. Here we present a high-quality draft genome of the whitefly, Bemisia tabaci. Comparat...

  11. Draft genome sequence of Streptomyces sp. strain F1, a potential source for glycoside hydrolases isolated from Brazilian soil.

    PubMed

    Melo, Ricardo Rodrigues de; Persinoti, Gabriela Felix; Paixão, Douglas Antonio Alvaredo; Squina, Fábio Márcio; Ruller, Roberto; Sato, Helia Harumi

    Here, we show the draft genome sequence of Streptomyces sp. F1, a strain isolated from soil with great potential for secretion of hydrolytic enzymes used to deconstruct cellulosic biomass. The draft genome assembly of Streptomyces sp. strain F1 has 69 contigs with a total genome size of 8,142,296bp and G+C 72.65%. Preliminary genome analysis identified 175 proteins as Carbohydrate-Active Enzymes, being 85 glycoside hydrolases organized in 33 distinct families. This draft genome information provides new insights on the key genes encoding hydrolytic enzymes involved in biomass deconstruction employed by soil bacteria. Copyright © 2017 Sociedade Brasileira de Microbiologia. Published by Elsevier Editora Ltda. All rights reserved.

  12. Draft genome sequence of Anoxybacillus flavithermus KU2-6-11 isolated from hot-spring in Uzon caldera (Kamchatka, Russia).

    PubMed

    Rozanov, Aleksey S; Korzhuk, Anton V; Bryanskaya, Alla V; Peltek, Sergey E

    2018-02-01

    The Anoxybacillus flavithermus KU2-6-11 was isolated from sediments of a nameless hot spring. The hot spring is located in Uzon caldera (Kamchatka, Russia). The sequenced and annotated genome is 2,646,305 bp and encodes 2787genes. The draft genome sequence of the Anoxybacillus flavithermus KU2-6-11 has been deposited at DDBJ/EMBL/GenBank under the accession PEDM01000000 and the sequences could be found at the site https://www.ncbi.nlm.nih.gov/nuccore/PEDM01000000.

  13. FlyBase: genes and gene models

    PubMed Central

    Drysdale, Rachel A.; Crosby, Madeline A.

    2005-01-01

    FlyBase (http://flybase.org) is the primary repository of genetic and molecular data of the insect family Drosophilidae. For the most extensively studied species, Drosophila melanogaster, a wide range of data are presented in integrated formats. Data types include mutant phenotypes, molecular characterization of mutant alleles and aberrations, cytological maps, wild-type expression patterns, anatomical images, transgenic constructs and insertions, sequence-level gene models and molecular classification of gene product functions. There is a growing body of data for other Drosophila species; this is expected to increase dramatically over the next year, with the completion of draft-quality genomic sequences of an additional 11 Drosphila species. PMID:15608223

  14. Draft Genome Sequence of an Isolate of Colletotrichum fructicola, a Causal Agent of Mango Anthracnose.

    PubMed

    Li, Qili; Bu, Junyan; Yu, Zhihe; Tang, Lihua; Huang, Suiping; Guo, Tangxun; Mo, Jianyou; Hsiang, Tom

    2018-02-22

    Here, we present a draft genome sequence of isolate 15060 of Colletotrichum fructicola , a causal agent of mango anthracnose. The final assembly consists of 1,048 scaffolds totaling 56,493,063 bp (G+C content, 53.38%) and 15,180 predicted genes. Copyright © 2018 Li et al.

  15. Draft Genome Sequence of Limnobacter sp. Strain CACIAM 66H1, a Heterotrophic Bacterium Associated with Cyanobacteria

    PubMed Central

    da Silva, Fábio Daniel Florêncio; Lima, Alex Ranieri Jerônimo; Moraes, Pablo Henrique Gonçalves; Siqueira, Andrei Santos; Dall’Agnol, Leonardo Teixeira; Baraúna, Anna Rafaella Ferreira; Martins, Luisa Carício; Oliveira, Karol Guimarães; de Lima, Clayton Pereira Silva; Nunes, Márcio Roberto Teixeira; Vianez-Júnior, João Lídio Silva Gonçalves

    2016-01-01

    Ecological interactions between cyanobacteria and heterotrophic prokaryotes are poorly known. To improve the genomic studies of heterotrophic bacterium-cyanobacterium associations, the draft genome sequence (3.2 Mbp) of Limnobacter sp. strain CACIAM 66H1, found in a nonaxenic culture of Synechococcus sp. (cyanobacteria), is presented here. PMID:27198027

  16. Draft Genome Sequence of Aquitalea magnusonii Strain H3, a Plant Growth-Promoting Bacterium of Duckweed (Lemna minor)

    PubMed Central

    Ishizawa, Hidehiro; Kuroda, Masashi

    2017-01-01

    ABSTRACT Aquitalea magnusonii strain H3 is a promising plant growth-promoting bacterium for duckweed. Here, we report the draft genome sequence of strain H3 comprising 4,750,601 bp in 73 contigs. Several genes associated with plant root colonization were identified. PMID:28818906

  17. Draft Genome Sequence of Aquitalea magnusonii Strain H3, a Plant Growth-Promoting Bacterium of Duckweed (Lemna minor).

    PubMed

    Ishizawa, Hidehiro; Kuroda, Masashi; Ike, Michihiko

    2017-08-17

    Aquitalea magnusonii strain H3 is a promising plant growth-promoting bacterium for duckweed. Here, we report the draft genome sequence of strain H3 comprising 4,750,601 bp in 73 contigs. Several genes associated with plant root colonization were identified. Copyright © 2017 Ishizawa et al.

  18. Draft Genome Sequence of Leptospira interrogans Serovar Bataviae Strain LepIMR 22 Isolated from a Rodent in Johor, Malaysia.

    PubMed

    Amran, Fairuz; Mohd Khalid, Mohd Khairul Nizam; Mohamad, Saharuddin; Mat Ripen, Adiratna; Ahmad, Norazah; Goris, Marga G A; Muhammad, Ayu Haslin; Noor Halim, Nurul Atiqah

    2016-09-08

    Leptospira interrogans serovar Bataviae was recently identified as one of the persistent Leptospira serovars in Malaysia. Here, we report the draft genome sequence of the L. interrogans serovar Bataviae strain LepIMR 22 isolated from kidney of a rodent in Johor, Malaysia. Copyright © 2016 Amran et al.

  19. Draft genome sequence of the New Jersey aster yellows strain of ‘Candidatus Phytoplasma asteris’

    USDA-ARS?s Scientific Manuscript database

    The NJAY (New Jersey aster yellows) strain of ‘Candidatus Phytoplasma asteris’ is a significant plant pathogen responsible for causing severe lettuce yellows in the U.S. state of New Jersey. A draft genome sequence was prepared for this organism and used for genome- and gene-based comparative phylog...

  20. Draft genome sequence of Streptomyces sp. strain SS, which produces a series of uridyl peptide antibiotic sansanmycins.

    PubMed

    Wang, Lifei; Xie, Yunying; Li, Qinglian; He, Ning; Yao, Entai; Xu, Hongzhang; Yu, Ying; Chen, Ruxian; Hong, Bin

    2012-12-01

    Streptomyces sp. SS produces a series of uridyl peptide antibiotic sansanmycins. Here, we present a draft genome sequence of Streptomyces sp. SS containing the biosynthetic gene cluster for the antibiotics. The identification of the biosynthetic gene cluster of sansanmycins may provide further insight into biosynthetic mechanisms for uridyl peptide antibiotics.

  1. Draft Genome Sequences of Dickeya sp. Isolates B16 (NIB Z 2098) and S1 (NIB Z 2099) Causing Soft Rot of Phalaenopsis Orchids.

    PubMed

    Alič, Špela; Naglič, Tina; Llop, Pablo; Toplak, Nataša; Koren, Simon; Ravnikar, Maja; Dreo, Tanja

    2015-09-10

    The genus Dickeya contains bacteria causing soft rot of economically important crops and ornamental plants. Here, we report the draft genome sequences of two Dickeya sp. isolates from rotted leaves of Phalaenopsis orchids. Copyright © 2015 Alič et al.

  2. Draft Genome Sequence of Bacillus velezensis GF610, a Producer of Potent Anti-Listeria Agents

    PubMed Central

    Gerst, Michelle M.; Dudley, Edward G.; Xiaoli, Lingzi

    2017-01-01

    ABSTRACT Bacillus velezensis GF610 was isolated from soil in Illinois, USA, and found to produce amyloliquecidin GF610, a potent two-component antimicrobial peptide. We report here the GF610 strain draft genome sequence, which contains 4.29 Mb and an overall GC content of 45.91%. PMID:29025938

  3. Two Draft Genome Sequences of Chromobacterium violaceum Isolates from the Rio Negro.

    PubMed

    da Gama, Auricélia Matos; de Almeida, Luiz Gustavo; Yamane, Tetsuo; Spira, Beny

    2018-01-04

    The draft genome sequences of two Chromobacterium violaceum strains isolated from the Rio Negro are reported here. These bacteria carry most genetic systems associated with the production of bioactive compounds, but unlike other C. violaceum strains, they lack a dedicated operon for arsenic resistance. Copyright © 2018 da Gama et al.

  4. Draft Genome Sequence of Pedobacter sp. Strain Hv1, an Isolate from Medicinal Leech Mucosal Castings

    PubMed Central

    Ott, Brittany M.; Beka, Lidia; Graf, Joerg

    2015-01-01

    The Pedobacter sp. Hv1 strain was isolated from the medicinal leech, Hirudo verbana, mucosal castings. These mucosal sheds have been demonstrated to play a role in horizontal symbiont transmission. Here, we report the draft 4.9 Mbp genome sequence of Pedobacter sp. strain Hv1. PMID:26679583

  5. Draft Genome Sequences for Five Strains of Trabulsiella odontotermitis, Isolated from Heterotermes sp. Termite Gut

    PubMed Central

    Olvera-García, Myrna; Fontes-Perez, Héctor; Chávez-Martínez, America; Ruiz Barrera, Oscar; Rodríguez-Almeida, Felipe A.

    2015-01-01

    Trabulsiella odontotermitis represents a novel species in the genus Trabulsiella with no complete genome reported yet. Here, we describe the draft genome sequences of five isolates from termites present in the north of Mexico, which have an interesting pool of genes related to cellulose degradation with biotechnological application. PMID:26543120

  6. Draft genome sequence of Penicillium expansum (R19) that cause postharvest decay of apple fruit

    USDA-ARS?s Scientific Manuscript database

    Among the species that cause blue mold, isolates of P. expansum are the most prevalent and virulent species causing more than 50 percent of postharvest decay. We report the draft genome sequence of P. expansum (R19) in order to identify fungal virulence factors and to understand the mechanism of inf...

  7. Draft Genome Sequence of Komagataeibacter intermedius Strain AF2, a Producer of Cellulose, Isolated from Kombucha Tea.

    PubMed

    Dos Santos, Renato Augusto Corrêa; Berretta, Andresa Aparecida; Barud, Hernane da Silva; Ribeiro, Sidney José Lima; González-García, Laura Natalia; Zucchi, Tiago Domingues; Goldman, Gustavo H; Riaño-Pachón, Diego M

    2015-12-03

    Here, we present the draft genome sequence of Komagataeibacter intermedius strain AF2, which was isolated from Kombucha tea and is capable of producing cellulose, although at lower levels compared to another bacterium from the same environment, K. rhaeticus strain AF1. Copyright © 2015 dos Santos et al.

  8. Draft Genome Sequence of Sphingobacterium sp. CZ-UAM, Isolated from a Methanotrophic Consortium

    PubMed Central

    Steffani-Vallejo, José Luis; Zuñiga, Cristal; Cruz-Morales, Pablo; Lozano, Luis; Morales, Marcia; Licona-Cassani, Cuauhtemoc; Revah, Sergio

    2017-01-01

    ABSTRACT Sphingobacterium sp. CZ-UAM was isolated from a methanotrophic consortium in mineral medium using methane as the only carbon source. A draft genome of 5.84 Mb with a 40.77% G+C content is reported here. This genome sequence will allow the investigation of potential methanotrophy in this isolated strain. PMID:28818899

  9. Draft genome sequence of “Candidatus Liberibacter asiaticus” from Diaphorina citri in Guangdong, China

    USDA-ARS?s Scientific Manuscript database

    The draft genome sequence of “Candidatus Liberibacter asiaticus” strain YCPsy from an Asian citrus psyllid (Diaphorina citri) in Guangdong of China is reported. The YCPsy strain has a genome size of 1,233,647 bp, 36.5% G+C content, 1,171 open reading frames (ORFs), and 53 RNAs....

  10. Draft genome sequences of Escherichia coli O113:H21 strains recovered from a major produce-production region in California

    USDA-ARS?s Scientific Manuscript database

    Shiga toxin-producing Escherichia coli is a foodborne and waterborne pathogen and is responsible for outbreaks of human gastroenteritis. This report documents the draft genome sequences of seven O113:H21 strains recovered from livestock, wildlife, and soil samples collected in a major agricultural r...

  11. Draft Genome Sequence of the Butyric Acid Producer Clostridium tyrobutyricum Strain CIP I-776 (IFP923).

    PubMed

    Wasels, François; Clément, Benjamin; Lopes Ferreira, Nicolas

    2016-03-03

    Here, we report the draft genome sequence of Clostridium tyrobutyricum CIP I-776 (IFP923), an efficient producer of butyric acid. The genome consists of a single chromosome of 3.19 Mb and provides useful data concerning the metabolic capacities of the strain. Copyright © 2016 Wasels et al.

  12. Draft Genome Sequence of Lactobacillus pobuzihii E100301T

    PubMed Central

    Chiu, Chi-ming; Chang, Chi-huan; Pan, Shwu-fen; Wu, Hui-chung; Li, Shiao-wen; Chang, Chuan-hsiung; Lee, Yun-shien; Chiang, Chih-ming

    2013-01-01

    Lactobacillus pobuzihii E100301T is a novel Lactobacillus species previously isolated from pobuzihi (fermented cummingcordia) in Taiwan. Phylogenetically, this strain is closest to Lactobacillus acidipiscis, but its phenotypic characteristics can be clearly distinguished from those of L. acidipiscis. We present the draft genome sequence of strain L. pobuzihii E100301T. PMID:23661478

  13. Draft genome sequence of Pyrodictium occultum PL19 T, a marine hyperthermophilic species of Archaea that grows optimally at 105°C

    DOE PAGES

    Utturkar, Sagar M.; Huber, Harald; Leptihn, Sebastian; ...

    2016-02-25

    We report here the draft genome sequence of Pyrodictium occultum PL19 T, a marine hyperthermophilic archaeon. In addition, the genome provides insights into molecular and cellular adaptation mechanisms to life in extreme environments and the evolution of early organisms on Earth.

  14. Draft Genome Sequence of the Spore-Forming Probiotic Strain Bacillus coagulans Unique IS-2

    PubMed Central

    Upadrasta, Aditya; Pitta, Swetha

    2016-01-01

    Bacillus coagulans Unique IS-2 is a potential spore-forming probiotic that is commercially available on the market. The draft genome sequence presented here provides deep insight into the beneficial features of this strain for its safe use as a probiotic for various human and animal health applications. PMID:27103709

  15. Draft Genome Sequence of a Sphingomonas sp., an Endosymbiotic Bacterium Isolated from an Arctic Lichen Umbilicaria sp.

    PubMed Central

    Lee, Jungeun; Shin, Seung Chul; Kim, Su Jin; Kim, Bum-Keun; Hong, Soon Gyu; Kim, Eun Hye; Park, Hyun

    2012-01-01

    Sphingomonas sp. strain PAMC 26617 has been isolated from an Arctic lichen Umbilicaria sp. on the Svalbard Islands. Here we present the draft genome sequence of this strain, which represents a valuable resource for understanding the symbiotic mechanisms between endosymbiotic bacteria and lichens surviving in extreme environments. PMID:22582371

  16. Draft Genome Sequence of Lactobacillus farciminis NBRC 111452, Isolated from Kôso, a Japanese Sugar-Vegetable Fermented Beverage.

    PubMed

    Chiou, Tai-Ying; Oshima, Kenshiro; Suda, Wataru; Hattori, Masahira; Takahashi, Tomoya

    2016-01-14

    Here, we report the draft genome sequence of the Lactobacillus farciminis strain NBRC 111452, isolated from kôso, a Japanese sugar-vegetable fermented beverage. This genome information is of potential use in studies of Lactobacillus farciminis as a probiotic. Copyright © 2016 Chiou et al.

  17. Draft Genome Sequence of Amycolatopsis mediterranei DSM 40773, a Tangible Antibiotic Producer

    PubMed Central

    Mukherjee, Udita; Saxena, Anjali; Kumari, Rashmi; Singh, Priya

    2014-01-01

    Amycolatopsis mediterranei DSM 40773 has been of special interest as successors of this strain are in use for the commercial production of rifamycin B. Here we present the draft genome sequence (~10 Mb) of this strain, which contains 108 contigs, 9,198 genes, and has a G+C content of 71.3%. PMID:25081263

  18. Draft genome sequence of the D-Xylose-Fermenting yeast Spathaspora xylofermentans UFMG-HMD23.3

    USDA-ARS?s Scientific Manuscript database

    Here, we report the draft genome sequence of the yeast Spathaspora xylofermentans UFMG-HMD23.3 (CBMAI 1427=CBS 12681), a D-xylose fermenting yeast isolated from the Amazonian forest. The genome consists of 298 contigs, with a total size of 15.1 Mb, including the mitochondrial genome, and 5,948 predi...

  19. Draft Genome Sequence of a “Candidatus Liberibacter europaeus” Strain Assembled from Broom Psyllids (Arytainilla spartiophila) from New Zealand

    PubMed Central

    Thompson, Sarah M.; Kalamorz, Falk; David, Charles; Addison, Shea M.; Smith, Grant R.

    2018-01-01

    ABSTRACT Here, we report the draft genome sequence of “Candidatus Liberibacter europaeus” ASNZ1, assembled from broom psyllids (Arytainilla spartiophila) from New Zealand. The assembly comprises 15 contigs, with a total length of 1.33 Mb and a G+C content of 33.5%. PMID:29773636

  20. Draft Genome Sequence of Rhodotorula mucilaginosa, an Emergent Opportunistic Pathogen

    PubMed Central

    Deligios, Massimo; Fraumene, Cristina; Abbondio, Marcello; Mannazzu, Ilaria; Tanca, Alessandro; Addis, Maria Filippa

    2015-01-01

    Rhodotorula mucilaginosa, a yeast with valuable biotechnological features, has also been recorded as an emergent opportunistic pathogen that might cause disease in both immunocompetent and immunocompromised individuals. Here, we report the draft genome sequence of R. mucilaginosa strain C2.5t1, which was isolated from cacao seeds in Cameroon. PMID:25858834

  1. Draft genome sequences of four uropathogenic escherichia coli 04:H5 isolates (ATCC 700414,700415,700416 and 700417)

    USDA-ARS?s Scientific Manuscript database

    Uropathogenic Escherichia coli O4: H5 isolates ATCC 700414, 700415, 700416, and 700417 were recovered from women with first-time urinary tract infections. Here, we report the draft genome sequences for these four E. coli isolates, which are currently being used to validate food safety processing tec...

  2. Draft Genomic Sequencing of Six Potential Extraintestinal Pathogenic Escherichia coli Isolates from Retail Chicken Meat

    PubMed Central

    Xu, Aixia; Johnson, James R.; Sheen, Shiowshuh; Needleman, David S.

    2018-01-01

    ABSTRACT Potential extraintestinal pathogenic Escherichia coli strains DP254, WH333, WH398, F356, FEX675, and FEX725 were isolated from retail chicken meat products. Here, we report the draft genome sequences for these six E. coli isolates, which are currently being used in food safety research. PMID:29798928

  3. Draft Genome Sequences of Two Aspergillus fumigatus Strains, Isolated from the International Space Station.

    PubMed

    Singh, Nitin Kumar; Blachowicz, Adriana; Checinska, Aleksandra; Wang, Clay; Venkateswaran, Kasthuri

    2016-07-14

    Draft genome sequences of Aspergillus fumigatus strains (ISSFT-021 and IF1SW-F4), opportunistic pathogens isolated from the International Space Station (ISS), were assembled to facilitate investigations of the nature of the virulence characteristics of the ISS strains to other clinical strains isolated on Earth. Copyright © 2016 Singh et al.

  4. Draft Genome Sequence of Lactobacillus salivarius L28 Isolated from Ground Beef.

    PubMed

    Ayala, Diana I; Cook, Peter W; Campos, David L; Brashears, Mindy M; den Bakker, Henk; Nightingale, Kendra K

    2017-09-28

    In this report, we describe the draft genome sequence of a newly discovered probiotic strain, Lactobacillus salivarius L28. L. salivarius L28 demonstrates antagonistic effects against human foodborne pathogens, including Escherichia coli O157:H7, Salmonella spp., and Listeria monocytogenes , in coculture experiments and food matrices. Copyright © 2017 Ayala et al.

  5. Draft Genome Sequence of Lactobacillus salivarius L28 Isolated from Ground Beef

    PubMed Central

    Ayala, Diana I.; Cook, Peter W.; Campos, David L.; Brashears, Mindy M.; den Bakker, Henk

    2017-01-01

    ABSTRACT In this report, we describe the draft genome sequence of a newly discovered probiotic strain, Lactobacillus salivarius L28. L. salivarius L28 demonstrates antagonistic effects against human foodborne pathogens, including Escherichia coli O157:H7, Salmonella spp., and Listeria monocytogenes, in coculture experiments and food matrices. PMID:28963206

  6. Draft Genome Sequence of Chryseobacterium sp. JV274 Isolated from Maize Rhizosphere

    PubMed Central

    Vacheron, Jordan; Dubost, Audrey; Chapulliot, David; Prigent-Combaret, Claire

    2017-01-01

    ABSTRACT We report the draft genome sequence of Chryseobacterium sp. JV274. This strain was isolated from the rhizosphere of maize during a greenhouse experiment. JV274 harbors genes involved in flexirubin production (darA and darB genes), bacterial competition (type VI secretion system), and gliding (bacterial motility; type IX secretion system). PMID:28408666

  7. Draft Genome Sequence of Elizabethkingia anophelis Strain EM361-97 Isolated from the Blood of a Cancer Patient

    PubMed Central

    Lin, Jiun-Nong; Yang, Chih-Hui; Lai, Chung-Hsu; Huang, Yi-Han

    2016-01-01

    Elizabethkingia anophelis EM361-97 was isolated from the blood of a patient with nasopharyngeal carcinoma and lung cancer. We report the draft genome sequence of EM361-97, which contains a G+C content of 35.7% and 3,611 candidate protein-encoding genes. PMID:27789647

  8. A draft genome sequence of “Candidatus Liberibacter asiaticus” from California, USA

    USDA-ARS?s Scientific Manuscript database

    The draft genome sequence of “Candidatus Liberibacter asiaticus” strain HHCA, collected from a lemon tree in California, USA, is reported. The HHCA strain has a genome size of 1,118,244 bp, with G+C content of 36.6%. The HHCA genome encodes 1,191 predicted open reading frames and 51 RNA genes....

  9. Draft Genome Sequence of the Efficient Bioflocculant-Producing Bacterium Paenibacillus sp. Strain A9

    PubMed Central

    Liu, Jin-liang; Hu, Xiao-min

    2013-01-01

    Paenibacillus sp. strain A9 is an important bioflocculant-producing bacterium, isolated from a soil sample, and is pale pink-pigmented, aerobic, and Gram-positive. Here, we report the draft genome sequence and the initial findings from a preliminary analysis of strain A9, which is a novel species of Paenibacillus. PMID:23618713

  10. Genome analysis of Diploscapter coronatus: insights into molecular peculiarities of a nematode with parthenogenetic reproduction.

    PubMed

    Hiraki, Hideaki; Kagoshima, Hiroshi; Kraus, Christopher; Schiffer, Philipp H; Ueta, Yumiko; Kroiher, Michael; Schierenberg, Einhard; Kohara, Yuji

    2017-06-24

    Sexual reproduction involving the fusion of egg and sperm is prevailing among eukaryotes. In contrast, the nematode Diploscapter coronatus, a close relative of the model Caenorhabditis elegans, reproduces parthenogenetically. Neither males nor sperm have been observed and some steps of meiosis are apparently skipped in this species. To uncover the genomic changes associated with the evolution of parthenogenesis in this nematode, we carried out a genome analysis. We obtained a 170 Mbp draft genome in only 511 scaffolds with a N 50 length of 1 Mbp. Nearly 90% of these scaffolds constitute homologous pairs with a 5.7% heterozygosity on average and inversions and translocations, meaning that the 170 Mbp sequences correspond to the diploid genome. Fluorescent staining shows that the D. coronatus genome consists of two chromosomes (2n = 2). In our genome annotation, we found orthologs of 59% of the C. elegans genes. However, a number of genes were missing or very divergent. These include genes involved in sex determination (e.g. xol-1, tra-2) and meiosis (e.g. the kleisins rec-8 and coh-3/4) giving a possible explanation for the absence of males and the second meiotic division. The high degree of heterozygosity allowed us to analyze the expression level of individual alleles. Most of the homologous pairs show very similar expression levels but others exhibit a 2-5-fold difference. Our high-quality draft genome of D. coronatus reveals the peculiarities of the genome of parthenogenesis and provides some clues to the genetic basis for parthenogenetic reproduction. This draft genome should be the basis to elucidate fundamental questions related to parthenogenesis such as its origin and mechanisms through comparative analyses with other nematodes. Furthermore, being the closest outgroup to the genus Caenorhabditis, the draft genome will help to disclose many idiosyncrasies of the model C. elegans and its congeners in future studies.

  11. Metagenomic Insights into the Uncultured Diversity and Physiology of Microbes in Four Hypersaline Soda Lake Brines

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Vavourakis, Charlotte D.; Ghai, Rohit; Rodriguez-Valera, Francisco

    Soda lakes are salt lakes with a naturally alkaline pH due to evaporative concentration of sodium carbonates in the absence of major divalent cations. Hypersaline soda brines harbor microbial communities with a high species- and strain-level archaeal diversity and a large proportion of still uncultured poly-extremophiles compared to neutral brines of similar salinities. We present the first "metagenomic snapshots" of microbial communities thriving in the brines of four shallow soda lakes from the Kulunda Steppe (Altai, Russia) covering a salinity range from 170 to 400 g/L. Both amplicon sequencing of 16S rRNA fragments and direct metagenomic sequencing showed that themore » top-level taxa abundance was linked to the ambient salinity: Bacteroidetes, Alpha-, and Gamma-proteobacteria were dominant below a salinity of 250 g/L, Euryarchaeota at higher salinities. Within these taxa, amplicon sequences related to Halorubrum, Natrinema, Gracilimonas, purple non-sulfur bacteria (Rhizobiales, Rhodobacter, and Rhodobaca) and chemolithotrophic sulfur oxidizers (Thioalkalivibrio) were highly abundant. Twenty-four draft population genomes from novel members and ecotypes within the Nanohaloarchaea, Halobacteria, and Bacteroidetes were reconstructed to explore their metabolic features, environmental abundance and strategies for osmotic adaptation. The Halobacteria- and Bacteroidetes-related draft genomes belong to putative aerobic heterotrophs, likely with the capacity to ferment sugars in the absence of oxygen. Members from both taxonomic groups are likely involved in primary organic carbon degradation, since some of the reconstructed genomes encode the ability to hydrolyze recalcitrant substrates, such as cellulose and chitin. Putative sodium-pumping rhodopsins were found in both a Flavobacteriaceae- and a Chitinophagaceae-related draft genome. The predicted proteomes of both the latter and a Rhodothermace ae-related draft genome were indicative of a "salt-in" strategy of osmotic adaptation. The primary catabolic and respiratory pathways shared among all available reference genomes of Nanohaloarchaea and our novel genome reconstructions remain incomplete, but point to a primarily fermentative lifestyle. Encoded xenorhodopsins found in most drafts suggest that light plays an important role in the ecology of Nanohaloarchaea. Putative encoded halolysins and laccase-like oxidases might indicate the potential for extracellular degradation of proteins and peptides, and phenolic or aromatic compounds.« less

  12. Metagenomic Insights into the Uncultured Diversity and Physiology of Microbes in Four Hypersaline Soda Lake Brines

    PubMed Central

    Vavourakis, Charlotte D.; Ghai, Rohit; Rodriguez-Valera, Francisco; Sorokin, Dimitry Y.; Tringe, Susannah G.; Hugenholtz, Philip; Muyzer, Gerard

    2016-01-01

    Soda lakes are salt lakes with a naturally alkaline pH due to evaporative concentration of sodium carbonates in the absence of major divalent cations. Hypersaline soda brines harbor microbial communities with a high species- and strain-level archaeal diversity and a large proportion of still uncultured poly-extremophiles compared to neutral brines of similar salinities. We present the first “metagenomic snapshots” of microbial communities thriving in the brines of four shallow soda lakes from the Kulunda Steppe (Altai, Russia) covering a salinity range from 170 to 400 g/L. Both amplicon sequencing of 16S rRNA fragments and direct metagenomic sequencing showed that the top-level taxa abundance was linked to the ambient salinity: Bacteroidetes, Alpha-, and Gamma-proteobacteria were dominant below a salinity of 250 g/L, Euryarchaeota at higher salinities. Within these taxa, amplicon sequences related to Halorubrum, Natrinema, Gracilimonas, purple non-sulfur bacteria (Rhizobiales, Rhodobacter, and Rhodobaca) and chemolithotrophic sulfur oxidizers (Thioalkalivibrio) were highly abundant. Twenty-four draft population genomes from novel members and ecotypes within the Nanohaloarchaea, Halobacteria, and Bacteroidetes were reconstructed to explore their metabolic features, environmental abundance and strategies for osmotic adaptation. The Halobacteria- and Bacteroidetes-related draft genomes belong to putative aerobic heterotrophs, likely with the capacity to ferment sugars in the absence of oxygen. Members from both taxonomic groups are likely involved in primary organic carbon degradation, since some of the reconstructed genomes encode the ability to hydrolyze recalcitrant substrates, such as cellulose and chitin. Putative sodium-pumping rhodopsins were found in both a Flavobacteriaceae- and a Chitinophagaceae-related draft genome. The predicted proteomes of both the latter and a Rhodothermaceae-related draft genome were indicative of a “salt-in” strategy of osmotic adaptation. The primary catabolic and respiratory pathways shared among all available reference genomes of Nanohaloarchaea and our novel genome reconstructions remain incomplete, but point to a primarily fermentative lifestyle. Encoded xenorhodopsins found in most drafts suggest that light plays an important role in the ecology of Nanohaloarchaea. Putative encoded halolysins and laccase-like oxidases might indicate the potential for extracellular degradation of proteins and peptides, and phenolic or aromatic compounds. PMID:26941731

  13. Metagenomic Insights into the Uncultured Diversity and Physiology of Microbes in Four Hypersaline Soda Lake Brines

    DOE PAGES

    Vavourakis, Charlotte D.; Ghai, Rohit; Rodriguez-Valera, Francisco; ...

    2016-02-25

    Soda lakes are salt lakes with a naturally alkaline pH due to evaporative concentration of sodium carbonates in the absence of major divalent cations. Hypersaline soda brines harbor microbial communities with a high species- and strain-level archaeal diversity and a large proportion of still uncultured poly-extremophiles compared to neutral brines of similar salinities. We present the first "metagenomic snapshots" of microbial communities thriving in the brines of four shallow soda lakes from the Kulunda Steppe (Altai, Russia) covering a salinity range from 170 to 400 g/L. Both amplicon sequencing of 16S rRNA fragments and direct metagenomic sequencing showed that themore » top-level taxa abundance was linked to the ambient salinity: Bacteroidetes, Alpha-, and Gamma-proteobacteria were dominant below a salinity of 250 g/L, Euryarchaeota at higher salinities. Within these taxa, amplicon sequences related to Halorubrum, Natrinema, Gracilimonas, purple non-sulfur bacteria (Rhizobiales, Rhodobacter, and Rhodobaca) and chemolithotrophic sulfur oxidizers (Thioalkalivibrio) were highly abundant. Twenty-four draft population genomes from novel members and ecotypes within the Nanohaloarchaea, Halobacteria, and Bacteroidetes were reconstructed to explore their metabolic features, environmental abundance and strategies for osmotic adaptation. The Halobacteria- and Bacteroidetes-related draft genomes belong to putative aerobic heterotrophs, likely with the capacity to ferment sugars in the absence of oxygen. Members from both taxonomic groups are likely involved in primary organic carbon degradation, since some of the reconstructed genomes encode the ability to hydrolyze recalcitrant substrates, such as cellulose and chitin. Putative sodium-pumping rhodopsins were found in both a Flavobacteriaceae- and a Chitinophagaceae-related draft genome. The predicted proteomes of both the latter and a Rhodothermace ae-related draft genome were indicative of a "salt-in" strategy of osmotic adaptation. The primary catabolic and respiratory pathways shared among all available reference genomes of Nanohaloarchaea and our novel genome reconstructions remain incomplete, but point to a primarily fermentative lifestyle. Encoded xenorhodopsins found in most drafts suggest that light plays an important role in the ecology of Nanohaloarchaea. Putative encoded halolysins and laccase-like oxidases might indicate the potential for extracellular degradation of proteins and peptides, and phenolic or aromatic compounds.« less

  14. Draft Genome Sequence of a Novel Chitinophaga sp. Strain, MD30, Isolated from a Biofilm in an Air Conditioner Condensate Pipe.

    PubMed

    Wan, Xuehua; Darris, Maxwell; Hou, Shaobin; Donachie, Stuart P

    2017-10-19

    Most of the 24 known Chitinophaga species were originally isolated from soils. We report the draft genome sequence of a putatively novel Chitinophaga sp. from a biofilm in an air conditioner condensate pipe. The genome comprises 7,661,303 bp in one scaffold, 5,694 predicted protein-coding sequences, and a G+C content of 47.6%. Copyright © 2017 Wan et al.

  15. Draft Genome Sequence of Leuconostoc mesenteroides P45 Isolated from Pulque, a Traditional Mexican Alcoholic Fermented Beverage.

    PubMed

    Riveros-Mckay, Fernando; Campos, Itzia; Giles-Gómez, Martha; Bolívar, Francisco; Escalante, Adelfo

    2014-11-06

    Leuconostoc mesenteroides P45 was isolated from the traditional Mexican pulque beverage. We report its draft genome sequence, assembled in 6 contigs consisting of 1,874,188 bp and no plasmids. Genome annotation predicted a total of 1,800 genes, 1,687 coding sequences, 52 pseudogenes, 9 rRNAs, 51 tRNAs, 1 noncoding RNA, and 44 frameshifted genes. Copyright © 2014 Riveros-Mckay et al.

  16. Draft Genome Sequences of Two Salmonella enterica Serotype Infantis Strains Isolated from a Captive Western Lowland Gorilla (Gorilla gorilla gorilla) and a Cohabitant Black and White Tegu (Tupinambis merianae) in Brazil

    PubMed Central

    Paixão, Tatiane A.; Coura, Fernanda M.; Malta, Marcelo C. C.; Tinoco, Herlandes P.; Pessanha, Angela T.; Pereira, Felipe L.; Leal, Carlos A. G.; Heinemann, Marcos B.; Figueiredo, Henrique C. P.

    2016-01-01

    The draft genome sequences of two Salmonella enterica serotype Infantis isolates are reported here. One of the strains was isolated from a western lowland gorilla (Gorilla gorilla gorilla) with colitis. The second strain was isolated from a reptile that inhabited the same premises. Whole-genome sequencing demonstrated that these isolates were not clonal. PMID:26798099

  17. Draft Genome Sequence for ICMP 5702, the Type Strain of Pectobacterium carotovorum subsp. carotovorum That Causes Soft Rot Disease on Potato

    PubMed Central

    Lu, Ashley; Armstrong, Karen F.

    2015-01-01

    Pectobacterium species are economically important bacteria that cause soft rotting of potato tubers in the field and in storage. Here, we report the draft genome sequence of the type strain for P. carotovorum subsp. carotovorum, ICMP 5702 (ATCC 15713). The genome sequence of ICMP 5702 will provide an important reference for future phylogenomic and taxonomic studies of the phytopathogenic Enterobacteriaceae. PMID:26251498

  18. Draft Genome Sequence of Pseudomonas oceani DSM 100277T, a Deep-Sea Bacterium.

    PubMed

    García-Valdés, Elena; Gomila, Margarita; Mulet, Magdalena; Lalucat, Jorge

    2018-04-12

    Pseudomonas oceani DSM 100277 T was isolated from deep seawater in the Okinawa Trough at 1390 m. P. oceani belongs to the Pseudomonas pertucinogena group. Here, we report the draft genome sequence of P. oceani , which has an estimated size of 4.1 Mb and exhibits 3,790 coding sequences, with a G+C content of 59.94 mol%. Copyright © 2018 García-Valdés et al.

  19. Metagenome-Assembled Genome Sequences of Acetobacterium sp. Strain MES1 and Desulfovibrio sp. Strain MES5 from a Cathode-Associated Acetogenic Microbial Community.

    PubMed

    Ross, Daniel E; Marshall, Christopher W; May, Harold D; Norman, R Sean

    2017-09-07

    Draft genome sequences of Acetobacterium sp. strain MES1 and Desulfovibrio sp. strain MES5 were obtained from the metagenome of a cathode-associated community enriched within a microbial electrosynthesis system (MES). The draft genome sequences provide insight into the functional potential of these microorganisms within an MES and a foundation for future comparative analyses. Copyright © 2017 Ross et al.

  20. Draft Genome Sequence of a Hexachlorocyclohexane-Degrading Bacterium, Sphingobium baderi Strain LL03T

    PubMed Central

    Kaur, Jasvinder; Verma, Helianthous; Tripathi, Charu; Khurana, J. P.

    2013-01-01

    Sphingobium baderi strain LL03T was isolated from hexachlorocyclohexane (HCH)-contaminated soil from Spolana, Czech Republic. Strain LL03T is a mutant that is deficient in linB and linC (genes that encode hexachlorocyclohexane haloalkane dehalogenase and dehydrogenase, respectively). The draft genome sequence of LL03T (~4.85 Mb) consists of 92 contigs and 4,914 coding sequences, with a G+C content of 63.5%. PMID:24051322

  1. Draft Genome Sequence of the Fish Pathogen Yersinia ruckeri Strain 37551, Serotype O1b, Isolated from Diseased, Vaccinated Atlantic Salmon (Salmo salar) in Chile.

    PubMed

    Navas, Esteban; Bohle, Harry; Henríquez, Patricio; Grothusen, Horst; Bustamante, Fernando; Bustos, Patricio; Mancilla, Marcos

    2014-08-28

    We sequenced the genome of a motile O1b Yersinia ruckeri field isolate from Chile, which is causing enteric redmouth disease (ERM) in vaccinated Atlantic salmon (Salmo salar). The draft genome has 3,775,486 bp, a G+C content of 47.1%, and is predicted to contain 3,406 coding sequences. Copyright © 2014 Navas et al.

  2. Draft genome sequence for virulent and avirulent strains of Xanthomonas arboricola isolated from Prunus spp. in Spain.

    PubMed

    Garita-Cambronero, Jerson; Palacio-Bielsa, Ana; López, María M; Cubero, Jaime

    2016-01-01

    Xanthomonas arboricola is a species in genus Xanthomonas which is mainly comprised of plant pathogens. Among the members of this taxon, X. arboricola pv. pruni, the causal agent of bacterial spot disease of stone fruits and almond, is distributed worldwide although it is considered a quarantine pathogen in the European Union. Herein, we report the draft genome sequence, the classification, the annotation and the sequence analyses of a virulent strain, IVIA 2626.1, and an avirulent strain, CITA 44, of X. arboricola associated with Prunus spp. The draft genome sequence of IVIA 2626.1 consists of 5,027,671 bp, 4,720 protein coding genes and 50 RNA encoding genes. The draft genome sequence of strain CITA 44 consists of 4,760,482 bp, 4,250 protein coding genes and 56 RNA coding genes. Initial comparative analyses reveals differences in the presence of structural and regulatory components of the type IV pilus, the type III secretion system, the type III effectors as well as variations in the number of the type IV secretion systems. The genome sequence data for these strains will facilitate the development of molecular diagnostics protocols that differentiate virulent and avirulent strains. In addition, comparative genome analysis will provide insights into the plant-pathogen interaction during the bacterial spot disease process.

  3. Cluster: Drafting. Course: Architectural Drafting. Research Project.

    ERIC Educational Resources Information Center

    Sanford - Lee County Schools, NC.

    The sequence of 10 units is designed for use with an instructor in architectural drafting, and is also keyed to other texts. Each unit contains several task packages specifying prerequisites, rationale for learning, objectives, learning activities to be supervised by the instructor, and learning practice. The units cover: architectural lettering…

  4. Teaching Drafting 101: What Comes First?

    ERIC Educational Resources Information Center

    Carkhuff, Don

    2006-01-01

    Employers require pristine drawings that convey clarity and precision for the production of goods. Can a change in sequence of instruction be expeditious and help teachers better prepare their students for the workplace? Research suggests that combining traditional drafting and computer-aided drafting (CAD) instruction makes sense. It is analogous…

  5. Draft Assembly of Elite Inbred Line PH207 Provides Insights into Genomic and Transcriptome Diversity in Maize

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hirsch, Candice N.; Hirsch, Cory D.; Brohammer, Alex B.

    Intense artificial selection over the last 100 years has produced elite maize (Zea mays) inbred lines that combine to produce high-yielding hybrids. To further our understanding of how genome and transcriptome variation contribute to the production of high-yielding hybrids, we generated a draft genome assembly of the inbred line PH207 to complement and compare with the existing B73 reference sequence. B73 is a founder of the Stiff Stalk germplasm pool, while PH207 is a founder of Iodent germplasm, both of which have contributed substantially to the production of temperate commercial maize and are combined to make heterotic hybrids. Comparison ofmore » these two assemblies revealed over 2500 genes present in only one of the two genotypes and 136 gene families that have undergone extensive expansion or contraction. Transcriptome profiling revealed extensive expression variation, with as many as 10,564 differentially expressed transcripts and 7128 transcripts expressed in only one of the two genotypes in a single tissue. Genotype-specific genes were more likely to have tissue/condition-specific expression and lower transcript abundance. The availability of a high-quality genome assembly for the elite maize inbred PH207 expands our knowledge of the breadth of natural genome and transcriptome variation in elite maize inbred lines across heterotic pools.« less

  6. Draft Assembly of Elite Inbred Line PH207 Provides Insights into Genomic and Transcriptome Diversity in Maize

    DOE PAGES

    Hirsch, Candice N.; Hirsch, Cory D.; Brohammer, Alex B.; ...

    2016-11-01

    Intense artificial selection over the last 100 years has produced elite maize (Zea mays) inbred lines that combine to produce high-yielding hybrids. To further our understanding of how genome and transcriptome variation contribute to the production of high-yielding hybrids, we generated a draft genome assembly of the inbred line PH207 to complement and compare with the existing B73 reference sequence. B73 is a founder of the Stiff Stalk germplasm pool, while PH207 is a founder of Iodent germplasm, both of which have contributed substantially to the production of temperate commercial maize and are combined to make heterotic hybrids. Comparison ofmore » these two assemblies revealed over 2500 genes present in only one of the two genotypes and 136 gene families that have undergone extensive expansion or contraction. Transcriptome profiling revealed extensive expression variation, with as many as 10,564 differentially expressed transcripts and 7128 transcripts expressed in only one of the two genotypes in a single tissue. Genotype-specific genes were more likely to have tissue/condition-specific expression and lower transcript abundance. The availability of a high-quality genome assembly for the elite maize inbred PH207 expands our knowledge of the breadth of natural genome and transcriptome variation in elite maize inbred lines across heterotic pools.« less

  7. Draft Assembly of Elite Inbred Line PH207 Provides Insights into Genomic and Transcriptome Diversity in Maize[OPEN

    PubMed Central

    Soifer, Ilya; Barad, Omer; Shem-Tov, Doron; Baruch, Kobi; Lu, Fei; Hernandez, Alvaro G.; Wright, Chris L.; Koehler, Klaus; Buell, C. Robin; de Leon, Natalia

    2016-01-01

    Intense artificial selection over the last 100 years has produced elite maize (Zea mays) inbred lines that combine to produce high-yielding hybrids. To further our understanding of how genome and transcriptome variation contribute to the production of high-yielding hybrids, we generated a draft genome assembly of the inbred line PH207 to complement and compare with the existing B73 reference sequence. B73 is a founder of the Stiff Stalk germplasm pool, while PH207 is a founder of Iodent germplasm, both of which have contributed substantially to the production of temperate commercial maize and are combined to make heterotic hybrids. Comparison of these two assemblies revealed over 2500 genes present in only one of the two genotypes and 136 gene families that have undergone extensive expansion or contraction. Transcriptome profiling revealed extensive expression variation, with as many as 10,564 differentially expressed transcripts and 7128 transcripts expressed in only one of the two genotypes in a single tissue. Genotype-specific genes were more likely to have tissue/condition-specific expression and lower transcript abundance. The availability of a high-quality genome assembly for the elite maize inbred PH207 expands our knowledge of the breadth of natural genome and transcriptome variation in elite maize inbred lines across heterotic pools. PMID:27803309

  8. Toward Universal Forward Genetics: Using a Draft Genome Sequence of the Nematode Oscheius tipulae To Identify Mutations Affecting Vulva Development

    PubMed Central

    Besnard, Fabrice; Koutsovoulos, Georgios; Dieudonné, Sana; Blaxter, Mark; Félix, Marie-Anne

    2017-01-01

    Mapping-by-sequencing has become a standard method to map and identify phenotype-causing mutations in model species. Here, we show that a fragmented draft assembly is sufficient to perform mapping-by-sequencing in nonmodel species. We generated a draft assembly and annotation of the genome of the free-living nematode Oscheius tipulae, a distant relative of the model Caenorhabditis elegans. We used this draft to identify the likely causative mutations at the O. tipulae cov-3 locus, which affect vulval development. The cov-3 locus encodes the O. tipulae ortholog of C. elegans mig-13, and we further show that Cel-mig-13 mutants also have an unsuspected vulval-development phenotype. In a virtuous circle, we were able to use the linkage information collected during mutant mapping to improve the genome assembly. These results showcase the promise of genome-enabled forward genetics in nonmodel species. PMID:28630114

  9. Toward Universal Forward Genetics: Using a Draft Genome Sequence of the Nematode Oscheius tipulae To Identify Mutations Affecting Vulva Development.

    PubMed

    Besnard, Fabrice; Koutsovoulos, Georgios; Dieudonné, Sana; Blaxter, Mark; Félix, Marie-Anne

    2017-08-01

    Mapping-by-sequencing has become a standard method to map and identify phenotype-causing mutations in model species. Here, we show that a fragmented draft assembly is sufficient to perform mapping-by-sequencing in nonmodel species. We generated a draft assembly and annotation of the genome of the free-living nematode Oscheius tipulae , a distant relative of the model Caenorhabditis elegans We used this draft to identify the likely causative mutations at the O. tipulae cov -3 locus, which affect vulval development. The cov-3 locus encodes the O. tipulae ortholog of C. elegans mig-13 , and we further show that Cel-mig-13 mutants also have an unsuspected vulval-development phenotype. In a virtuous circle, we were able to use the linkage information collected during mutant mapping to improve the genome assembly. These results showcase the promise of genome-enabled forward genetics in nonmodel species. Copyright © 2017 by the Genetics Society of America.

  10. A draft annotation and overview of the human genome

    PubMed Central

    Wright, Fred A; Lemon, William J; Zhao, Wei D; Sears, Russell; Zhuo, Degen; Wang, Jian-Ping; Yang, Hee-Yung; Baer, Troy; Stredney, Don; Spitzner, Joe; Stutz, Al; Krahe, Ralf; Yuan, Bo

    2001-01-01

    Background The recent draft assembly of the human genome provides a unified basis for describing genomic structure and function. The draft is sufficiently accurate to provide useful annotation, enabling direct observations of previously inferred biological phenomena. Results We report here a functionally annotated human gene index placed directly on the genome. The index is based on the integration of public transcript, protein, and mapping information, supplemented with computational prediction. We describe numerous global features of the genome and examine the relationship of various genetic maps with the assembly. In addition, initial sequence analysis reveals highly ordered chromosomal landscapes associated with paralogous gene clusters and distinct functional compartments. Finally, these annotation data were synthesized to produce observations of gene density and number that accord well with historical estimates. Such a global approach had previously been described only for chromosomes 21 and 22, which together account for 2.2% of the genome. Conclusions We estimate that the genome contains 65,000-75,000 transcriptional units, with exon sequences comprising 4%. The creation of a comprehensive gene index requires the synthesis of all available computational and experimental evidence. PMID:11516338

  11. Genome assembly and transcriptome resource for river buffalo, Bubalus bubalis (2n = 50).

    PubMed

    Williams, John L; Iamartino, Daniela; Pruitt, Kim D; Sonstegard, Tad; Smith, Timothy P L; Low, Wai Yee; Biagini, Tommaso; Bomba, Lorenzo; Capomaccio, Stefano; Castiglioni, Bianca; Coletta, Angelo; Corrado, Federica; Ferré, Fabrizio; Iannuzzi, Leopoldo; Lawley, Cynthia; Macciotta, Nicolò; McClure, Matthew; Mancini, Giordano; Matassino, Donato; Mazza, Raffaele; Milanesi, Marco; Moioli, Bianca; Morandi, Nicola; Ramunno, Luigi; Peretti, Vincenzo; Pilla, Fabio; Ramelli, Paola; Schroeder, Steven; Strozzi, Francesco; Thibaud-Nissen, Francoise; Zicarelli, Luigi; Ajmone-Marsan, Paolo; Valentini, Alessio; Chillemi, Giovanni; Zimin, Aleksey

    2017-10-01

    Water buffalo is a globally important species for agriculture and local economies. A de novo assembled, well-annotated reference sequence for the water buffalo is an important prerequisite for studying the biology of this species, and is necessary to manage genetic diversity and to use modern breeding and genomic selection techniques. However, no such genome assembly has been previously reported. There are 2 species of domestic water buffalo, the river (2 n = 50) and the swamp (2 n = 48) buffalo. Here we describe a draft quality reference sequence for the river buffalo created from Illumina GA and Roche 454 short read sequences using the MaSuRCA assembler. The assembled sequence is 2.83 Gb, consisting of 366 983 scaffolds with a scaffold N50 of 1.41 Mb and contig N50 of 21 398 bp. Annotation of the genome was supported by transcriptome data from 30 tissues and identified 21 711 predicted protein coding genes. Searches for complete mammalian BUSCO gene groups found 98.6% of curated single copy orthologs present among predicted genes, which suggests a high level of completeness of the genome. The annotated sequence is available from NCBI at accession GCA_000471725.1. © The Author 2017. Published by Oxford University Press.

  12. Novel proteases from the genome of the carnivorous plant Drosera capensis: structural prediction and comparative analysis

    PubMed Central

    Butts, Carter T.; Bierma, Jan C.; Martin, Rachel W.

    2016-01-01

    In his 1875 monograph on insectivorous plants, Darwin described the feeding reactions of Drosera flypaper traps and predicted that their secretions contained a “ferment” similar to mammalian pepsin, an aspartic protease. Here we report a high-quality draft genome sequence for the cape sundew, Drosera capensis, the first genome of a carnivorous plant from order Caryophyllales, which also includes the Venus flytrap (Dionaea) and the tropical pitcher plants (Nepenthes). This species was selected in part for its hardiness and ease of cultivation, making it an excellent model organism for further investigations of plant carnivory. Analysis of predicted protein sequences yields genes encoding proteases homologous to those found in other plants, some of which display sequence and structural features that suggest novel functionalities. Because the sequence similarity to proteins of known structure is in most cases too low for traditional homology modeling, 3D structures of representative proteases are predicted using comparative modeling with all-atom refinement. Although the overall folds and active residues for these proteins are conserved, we find structural and sequence differences consistent with a diversity of substrate recognition patterns. Finally, we predict differences in substrate specificities using in silico experiments, providing targets for structure/function studies of novel enzymes with biological and technological significance. PMID:27353064

  13. Draft Genome Sequence of Highly Virulent Race 4/Biovar 3 of Ralstonia solanacearum CaRs_Mep Causing Bacterial Wilt in Zingiberaceae Plants in India.

    PubMed

    Kumar, Aundy; Munjal, Vibhuti; Sheoran, Neelam; Prameela, Thekkan Puthiyaveedu; Suseelabhai, Rajamma; Aggarwal, Rashmi; Jain, Rakesh Kumar; Eapen, Santhosh J

    2017-01-05

    The genome of Ralstonia solanacearum CaRs_Mep, a race 4/biovar 3/phylotype I bacterium causing wilt in small cardamom and other Zingiberaceae plants, was sequenced. Analysis of the 5.7-Mb genome sequence will aid in better understanding of the genetic determinants of host range, host jump, survival, pathogenicity, and virulence of race 4 of R. solanacearum. Copyright © 2017 Kumar et al.

  14. The draft genome sequence of the ascomycete fungus Penicillium subrubescens reveals a highly enriched content of plant biomass related CAZymes compared to related fungi.

    PubMed

    Peng, Mao; Dilokpimol, Adiphol; Mäkelä, Miia R; Hildén, Kristiina; Bervoets, Sander; Riley, Robert; Grigoriev, Igor V; Hainaut, Matthieu; Henrissat, Bernard; de Vries, Ronald P; Granchi, Zoraide

    2017-03-20

    Here we report the genome sequence of the ascomycete saprobic fungus Penicillium subrubescens FBCC1632/CBS132785 isolated from a Jerusalem artichoke field in Finland. The 39.75Mb genome containing 14,188 gene models is highly similar for that reported for other Penicillium species, but contains a significantly higher number of putative carbohydrate active enzyme (CAZyme) encoding genes. Copyright © 2017 Elsevier B.V. All rights reserved.

  15. High-quality draft genome sequence of the Thermus amyloliquefaciens type strain YIM 77409 T with an incomplete denitrification pathway

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhou, En -Min; Murugapiran, Senthil K.; Mefferd, Chrisabelle C.

    Thermus amyloliquefaciens type strain YIM 77409 T is a thermophilic, Gram-negative, non-motile and rod-shaped bacterium isolated from Niujie Hot Spring in Eryuan County, Yunnan Province, southwest China. In the present study we describe the features of strain YIM 77409 T together with its genome sequence and annotation. The genome is 2,160,855 bp long and consists of 6 scaffolds with 67.4 % average GC content. A total of 2,313 genes were predicted, comprising 2,257 protein-coding and 56 RNA genes. The genome is predicted to encode a complete glycolysis, pentose phosphate pathway, and tricarboxylic acid cycle. Additionally, a large number of transportersmore » and enzymes for heterotrophy highlight the broad heterotrophic lifestyle of this organism. Furthermore, a denitrification gene cluster included genes predicted to encode enzymes for the sequential reduction of nitrate to nitrous oxide, consistent with the incomplete denitrification phenotype of this strain.« less

  16. High-quality draft genome sequence of the Thermus amyloliquefaciens type strain YIM 77409 T with an incomplete denitrification pathway

    DOE PAGES

    Zhou, En -Min; Murugapiran, Senthil K.; Mefferd, Chrisabelle C.; ...

    2016-02-27

    Thermus amyloliquefaciens type strain YIM 77409 T is a thermophilic, Gram-negative, non-motile and rod-shaped bacterium isolated from Niujie Hot Spring in Eryuan County, Yunnan Province, southwest China. In the present study we describe the features of strain YIM 77409 T together with its genome sequence and annotation. The genome is 2,160,855 bp long and consists of 6 scaffolds with 67.4 % average GC content. A total of 2,313 genes were predicted, comprising 2,257 protein-coding and 56 RNA genes. The genome is predicted to encode a complete glycolysis, pentose phosphate pathway, and tricarboxylic acid cycle. Additionally, a large number of transportersmore » and enzymes for heterotrophy highlight the broad heterotrophic lifestyle of this organism. Furthermore, a denitrification gene cluster included genes predicted to encode enzymes for the sequential reduction of nitrate to nitrous oxide, consistent with the incomplete denitrification phenotype of this strain.« less

  17. Draft Genome Sequence of Limnobacter sp. Strain CACIAM 66H1, a Heterotrophic Bacterium Associated with Cyanobacteria.

    PubMed

    da Silva, Fábio Daniel Florêncio; Lima, Alex Ranieri Jerônimo; Moraes, Pablo Henrique Gonçalves; Siqueira, Andrei Santos; Dall'Agnol, Leonardo Teixeira; Baraúna, Anna Rafaella Ferreira; Martins, Luisa Carício; Oliveira, Karol Guimarães; de Lima, Clayton Pereira Silva; Nunes, Márcio Roberto Teixeira; Vianez-Júnior, João Lídio Silva Gonçalves; Gonçalves, Evonnildo Costa

    2016-05-19

    Ecological interactions between cyanobacteria and heterotrophic prokaryotes are poorly known. To improve the genomic studies of heterotrophic bacterium-cyanobacterium associations, the draft genome sequence (3.2 Mbp) of Limnobacter sp. strain CACIAM 66H1, found in a nonaxenic culture of Synechococcus sp. (cyanobacteria), is presented here. Copyright © 2016 da Silva et al.

  18. Draft Genome Sequence of Acinetobacter calcoaceticus Strain P23, a Plant Growth-Promoting Bacterium of Duckweed

    PubMed Central

    Hosoyama, Akira; Yamazoe, Atsushi; Morikawa, Masaaki

    2015-01-01

    Acinetobacter calcoaceticus strain P23 is a plant growth-promoting bacterium, which was isolated from the surface of duckweed. We report here the draft genome sequence of strain P23. The genome data will serve as a valuable reference for understanding the molecular mechanism of plant growth promotion in aquatic plants. PMID:25720680

  19. Draft Genome Sequences of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8, Soil Bacteria That Cooperate To Degrade the Poly-γ-d-Glutamic Acid Anthrax Capsule.

    PubMed

    Stabler, Richard A; Negus, David; Pain, Arnab; Taylor, Peter W

    2013-01-01

    A mixed culture of Pseudomonas fluorescens BS2 and Pusillimonas noertemannii BS8 degraded poly-γ-d-glutamic acid; when the 2 strains were cultured separately, no hydrolytic activity was apparent. Here we report the draft genome sequences of both soil isolates.

  20. Draft Genome Sequence of Sphingopyxis sp. Strain MWB1, a Crude-Oil-Degrading Marine Bacterium

    PubMed Central

    Kim, Jonghyun; Kim, Soo Jung; Kim, Seon Hee; Kim, Seung Il; Moon, Yoon-Jung; Park, Sung-Joon

    2014-01-01

    Sphingopyxis sp. strain MWB1, which is capable of degrading crude oil, diesel, and kerosene, was isolated from crude oil–contaminated seashore in Tae-an, South Korea. Here, we report the draft genome sequence of this strain, which comprises 3,118,428 bp with a G+C content of 62.85 mol%. PMID:25477411

Top