mb draft genome: Topics by Science.gov

Sample records for mb draft genome

Draft genome resources for the phytopathogenic fungi Monilinia fructicola, M. fructigena, M. polystroma and M. laxa, the causal agents of brown rot.

PubMed

Rivera, Yazmin; Zeller, Kurt; Srivastava, Subodh K; Sutherland, Jeremy; Galvez, Marco E; Nakhla, Mark K; Poniatowska, Anna; Schnabel, Guido; Sundin, George W; Abad, Gloria

2018-05-03

Fungi in the genus Monilinia are known to cause devastating brown rot disease of stone and pome fruits. Here, we report the draft genome assemblies of four important phytopathogenic species: Monilinia fructicola, Monilinia fructigena, Monilinia polystroma, and Monilinia laxa. The draft genome assemblies were 39 Mb (M. fructigena), 42 Mb (M. laxa), 43 Mb (M. fructicola), and 45 Mb (M. polystroma) with as few as 550 contigs (M. laxa). These are the first draft genome resources publicly available for M. laxa, M. fructigena, and M. polystroma.
Draft genome sequences of Streptococcus bovis strains ATCC 33317 and JB1

USDA-ARS?s Scientific Manuscript database

We report the draft genome sequences of Streptococcus bovis type strain ATTC 33317 (CVM42251) isolated from cow dung and strain JB1 (CVM42252) isolated from a cow rumen in 1977. Strains were subjected to Next Generation sequencing and the genome sizes are approximately 2 MB and 2.2 MB, respectively....
Draft Genome Sequences of Two Kocuria Isolates, K. salsicia G1 and K. rhizophila G2, Isolated from a Slaughterhouse in Denmark

PubMed Central

Herschend, Jakob; Raghupathi, Prem K.; Røder, Henriette L.; Sørensen, Søren J.

2016-01-01

We report here the draft genome sequences of Kocuria salsicia G1 and Kocuria rhizophila G2, which were isolated from a meat chopper at a small slaughterhouse in Denmark. The two annotated genomes are 2.99 Mb and 2.88 Mb in size, respectively. PMID:27034479
De novo genome assembly of the red silk cotton tree (Bombax ceiba).

PubMed

Gao, Yong; Wang, Haibo; Liu, Chao; Chu, Honglong; Dai, Dongqin; Song, Shengnan; Yu, Long; Han, Lihong; Fu, Yi; Tian, Bin; Tang, Lizhou

2018-05-01

Bombax ceiba L. (the red silk cotton tree) is a large deciduous tree that is distributed in tropical and sub-tropical Asia as well as northern Australia. It has great economic and ecological importance, with several applications in industry and traditional medicine in many Asian countries. To facilitate further utilization of this plant resource, we present here the draft genome sequence for B. ceiba. We assembled a relatively intact genome of B. ceiba by using PacBio single-molecule sequencing and BioNano optical mapping technologies. The final draft genome is approximately 895 Mb long, with contig and scaffold N50 sizes of 1.0 Mb and 2.06 Mb, respectively. The high-quality draft genome assembly of B. ceiba will be a valuable resource enabling further genetic improvement and more effective use of this tree species.
Draft Genome Sequence of the 2-Chloro-4-Nitrophenol-Degrading Bacterium Arthrobacter sp. Strain SJCon

PubMed Central

Vikram, Surendra; Kumar, Shailesh; Vaidya, Bhumika; Pinnaka, Anil Kumar

2013-01-01

We report the 4.39-Mb draft genome sequence of the 2-chloro-4-nitrophenol-degrading bacterium Arthrobacter sp. strain SJCon, isolated from a pesticide-contaminated site. The draft genome sequence of strain SJCon will be helpful in studying the genetic pathways involved in the degradation of several aromatic compounds. PMID:23516196
Draft Genome Sequence of a Pseudomonas aeruginosa NA04 Bacterium Isolated from an Entomopathogenic Nematode.

PubMed

Salgado-Morales, Rosalba; Rivera-Gómez, Nancy; Lozano-Aguirre Beltrán, Luis Fernando; Hernández-Mendoza, Armando; Dantán-González, Edgar

2017-09-07

We report the draft genome sequence of Gram-negative bacterium Pseudomonas aeruginosa NA04, isolated from the entomopathogenic nematode Heterorhabditis indica MOR03. The draft genome consists of 54 contigs, a length of 6.37 Mb, and a G+C content 66.49%. Copyright © 2017 Salgado-Morales et al.
Venturia carpophila draft genome sequence

USDA-ARS?s Scientific Manuscript database

Venturia carpophila causes peach scab, a disease that renders peach fruit unmarketable. We report a high-quality draft genome sequence (36.9 Mb) of V. carpophila from an isolate collected from a peach tree in central Georgia in the United States. The genome sequence described will be a useful resour...
Draft Genome Sequence of a Rare Smut Relative, Tilletiaria anomala UBC 951

DOE PAGES

Toome, Merje; Kuo, Alan; Henrissat, Bernard; ...

2014-06-12

We present the draft genome sequence of the smut fungus Tilletiaria anomala UBC 951 (Basidiomycota, Ustilaginomycotina). The sequenced genome size is 18.7 Mb, consisting of 289 scaffolds and a total of 6,810 predicted genes. This is the first genome sequence published for a fungus in the order Georgefisheriales (Exobasidiomycetes).
Draft genome of the Peruvian scallop Argopecten purpuratus.

PubMed

Li, Chao; Liu, Xiao; Liu, Bo; Ma, Bin; Liu, Fengqiao; Liu, Guilong; Shi, Qiong; Wang, Chunde

2018-04-01

The Peruvian scallop, Argopecten purpuratus, is mainly cultured in southern Chile and Peru was introduced into China in the last century. Unlike other Argopecten scallops, the Peruvian scallop normally has a long life span of up to 7 to 10 years. Therefore, researchers have been using it to develop hybrid vigor. Here, we performed whole genome sequencing, assembly, and gene annotation of the Peruvian scallop, with an important aim to develop genomic resources for genetic breeding in scallops. A total of 463.19-Gb raw DNA reads were sequenced. A draft genome assembly of 724.78 Mb was generated (accounting for 81.87% of the estimated genome size of 885.29 Mb), with a contig N50 size of 80.11 kb and a scaffold N50 size of 1.02 Mb. Repeat sequences were calculated to reach 33.74% of the whole genome, and 26,256 protein-coding genes and 3,057 noncoding RNAs were predicted from the assembly. We generated a high-quality draft genome assembly of the Peruvian scallop, which will provide a solid resource for further genetic breeding and for the analysis of the evolutionary history of this economically important scallop.
Draft genome sequence of the D-Xylose-Fermenting yeast Spathaspora xylofermentans UFMG-HMD23.3

USDA-ARS?s Scientific Manuscript database

Here, we report the draft genome sequence of the yeast Spathaspora xylofermentans UFMG-HMD23.3 (CBMAI 1427=CBS 12681), a D-xylose fermenting yeast isolated from the Amazonian forest. The genome consists of 298 contigs, with a total size of 15.1 Mb, including the mitochondrial genome, and 5,948 predi...
Draft Genome Sequence of the Terrestrial Cyanobacterium Scytonema millei VB511283, Isolated from Eastern India

PubMed Central

Sen, Diya; Chandrababunaidu, Mathu Malar; Singh, Deeksha; Sanghi, Neha; Ghorai, Arpita; Mishra, Gyan Prakash; Madduluri, Madhavi

2015-01-01

We report here the draft genome sequence of Scytonema millei VB511283, a cyanobacterium isolated from biofilms on the exterior of stone monuments in Santiniketan, eastern India. The draft genome is 11,627,246 bp long (11.63 Mb), with 118 scaffolds. About 9,011 protein-coding genes, 117 tRNAs, and 12 rRNAs are predicted from this assembly. PMID:25744984
Draft Genome Sequence of Bacillus sp. GZT, a 2,4,6-Tribromophenol-Degrading Strain Isolated from the River Sludge of an Electronic Waste-Dismantling Region

PubMed Central

Liang, Zhishu; Li, Guiying; Das, Ranjit

2016-01-01

Here, we report the draft genome sequence of Bacillus sp. strain GZT, a 2,4,6-tribromophenol (TBP)-degrading bacterium previously isolated from an electronic waste-dismantling region. The draft genome sequence is 5.18 Mb and has a G+C content of 35.1%. This is the first genome report of a brominated flame retardant-degrading strain. PMID:27257197
Draft Genomes of Anopheles cracens and Anopheles maculatus: Comparison of Simian Malaria and Human Malaria Vectors in Peninsular Malaysia

PubMed Central

Chen, Junhui; Zhong, Zhen; Jian, Jianbo; Amir, Amirah; Cheong, Fei-Wen; Sum, Jia-Siang; Fong, Mun-Yik

2016-01-01

Anopheles cracens has been incriminated as the vector of human knowlesi malaria in peninsular Malaysia. Besides, it is a good laboratory vector of Plasmodium falciparum and P. vivax. The distribution of An. cracens overlaps with that of An. maculatus, the human malaria vector in peninsular Malaysia that seems to be refractory to P. knowlesi infection in natural settings. Whole genome sequencing was performed on An. cracens and An. maculatus collected here. The draft genome of An. cracens was 395 Mb in size whereas the size of An. maculatus draft genome was 499 Mb. Comparison with the published Malaysian An. maculatus genome suggested the An. maculatus specimen used in this study as a different geographical race. Comparative analyses highlighted the similarities and differences between An. cracens and An. maculatus, providing new insights into their biological behavior and characteristics. PMID:27347683
Draft Genome Sequence of Streptomyces specialis Type Strain GW41-1564 (DSM 41924).

PubMed

Loucif, Lotfi; Michelle, Caroline; Terras, Jérôme; Rolain, Jean-Marc; Raoult, Didier; Fournier, Pierre-Edouard

2017-03-30

Here, we report the draft genome sequence of Streptomyces specialis type strain GW41-1564, which was isolated from soil. This 5.87-Mb genome exhibits a high G+C content of 72.72% and contains 5,486 protein-coding genes. Copyright © 2017 Loucif et al.
High-Quality Draft Genome Sequence of Babesia divergens, the Etiological Agent of Cattle and Human Babesiosis

PubMed Central

Cuesta, Isabel; González, Luis M.; Estrada, Karel; Grande, Ricardo; Zaballos, Ángel; Lobo, Cheryl A.; Barrera, Jorge

2014-01-01

Babesia divergens causes significant morbidity and mortality in cattle and splenectomized or immunocompromised individuals. Here, we present a 10.7-Mb high-quality draft genome of this parasite close to chromosome resolution that will enable comparative genome analyses and synteny studies among related parasites. PMID:25395649
Draft genome sequence of Venturia carpophila, the causal agent of peach scab

USDA-ARS?s Scientific Manuscript database

Venturia carpophila causes peach scab, a disease that renders peach fruit unmarketable. We report a high-quality draft genome sequence (36.9 Mb) of V. carpophila from an isolate collected from a peach tree in central Georgia in the United States. The genome sequence described will be a useful resour...
Draft Genome Sequence of Saccharomyces cerevisiae Barra Grande (BG-1), a Brazilian Industrial Bioethanol-Producing Strain

PubMed Central

Coutouné, Natalia; Mulato, Aline Tieppo Nogueira

2017-01-01

ABSTRACT Here, we present the draft genome sequence of Saccharomyces cerevisiae BG-1, a Brazilian industrial strain widely used for bioethanol production from sugarcane. The 11.7-Mb genome sequence consists of 216 scaffolds and harbors 5,607 predicted protein-coding genes. PMID:28360170
Draft Genome Sequence of the Terrestrial Cyanobacterium Scytonema millei VB511283, Isolated from Eastern India.

PubMed

Sen, Diya; Chandrababunaidu, Mathu Malar; Singh, Deeksha; Sanghi, Neha; Ghorai, Arpita; Mishra, Gyan Prakash; Madduluri, Madhavi; Adhikary, Siba Prasad; Tripathy, Sucheta

2015-03-05

We report here the draft genome sequence of Scytonema millei VB511283, a cyanobacterium isolated from biofilms on the exterior of stone monuments in Santiniketan, eastern India. The draft genome is 11,627,246 bp long (11.63 Mb), with 118 scaffolds. About 9,011 protein-coding genes, 117 tRNAs, and 12 rRNAs are predicted from this assembly. Copyright © 2015 Sen et al.
Draft Genome Sequence of Sphingobium ummariense Strain RL-3, a Hexachlorocyclohexane-Degrading Bacterium

PubMed Central

Kohli, Puneet; Dua, Ankita; Sangwan, Naseer; Oldach, Phoebe; Khurana, J. P.

2013-01-01

Here, we report the draft genome sequence of the hexachlorocyclohexane (HCH)-degrading bacterium Sphingobium ummariense strain RL-3, which was isolated from the HCH dumpsite located in Lucknow, India (27°00′N and 81°09′E). The annotated draft genome sequence (4.75 Mb) of strain RL-3 consisted of 139 contigs, 4,645 coding sequences, and 65% G+C content. PMID:24233594
Draft Genome Sequence of Sphingobacterium sp. CZ-UAM, Isolated from a Methanotrophic Consortium

PubMed Central

Steffani-Vallejo, José Luis; Zuñiga, Cristal; Cruz-Morales, Pablo; Lozano, Luis; Morales, Marcia; Licona-Cassani, Cuauhtemoc; Revah, Sergio

2017-01-01

ABSTRACT Sphingobacterium sp. CZ-UAM was isolated from a methanotrophic consortium in mineral medium using methane as the only carbon source. A draft genome of 5.84 Mb with a 40.77% G+C content is reported here. This genome sequence will allow the investigation of potential methanotrophy in this isolated strain. PMID:28818899

Draft Genome Sequence of the Butyric Acid Producer Clostridium tyrobutyricum Strain CIP I-776 (IFP923).

PubMed

Wasels, François; Clément, Benjamin; Lopes Ferreira, Nicolas

2016-03-03

Here, we report the draft genome sequence of Clostridium tyrobutyricum CIP I-776 (IFP923), an efficient producer of butyric acid. The genome consists of a single chromosome of 3.19 Mb and provides useful data concerning the metabolic capacities of the strain. Copyright © 2016 Wasels et al.
Draft Genome Sequence of Marine Sponge Symbiont Pseudoalteromonas luteoviolacea IPB1, Isolated from Hilo, Hawaii

PubMed Central

Yakym, Christopher J.; Helmkampf, Martin; Hagiwara, Kehau; Ip, Courtney G.; Antonio, Brandi J.; Armstrong, Ellie; Ulloa, Wesley J.; Awaya, Jonathan D.

2016-01-01

We report here the 6.0-Mb draft genome assembly of Pseudoalteromonas luteoviolacea strain IPB1 that was isolated from the Hawaiian marine sponge Iotrochota protea. Genome mining complemented with bioassay studies will elucidate secondary metabolite biosynthetic pathways and will help explain the ecological interaction between host sponge and microorganism. PMID:27660784
An Annotated Draft Genome for Radix auricularia (Gastropoda, Mollusca)

PubMed Central

Feldmeyer, Barbara; Schmidt, Hanno; Greshake, Bastian; Tills, Oliver; Truebano, Manuela; Rundle, Simon D.; Paule, Juraj; Ebersberger, Ingo; Pfenninger, Markus

2017-01-01

Molluscs are the second most species-rich phylum in the animal kingdom, yet only 11 genomes of this group have been published so far. Here, we present the draft genome sequence of the pulmonate freshwater snail Radix auricularia. Six whole genome shotgun libraries with different layouts were sequenced. The resulting assembly comprises 4,823 scaffolds with a cumulative length of 910 Mb and an overall read coverage of 72×. The assembly contains 94.6% of a metazoan core gene collection, indicating an almost complete coverage of the coding fraction. The discrepancy of ∼690 Mb compared with the estimated genome size of R. auricularia (1.6 Gb) results from a high repeat content of 70% mainly comprising DNA transposons. The annotation of 17,338 protein coding genes was supported by the use of publicly available transcriptome data. This draft will serve as starting point for further genomic and population genetic research in this scientifically important phylum. PMID:28204581
Draft Genome Sequence of Sphingobacterium sp. CZ-UAM, Isolated from a Methanotrophic Consortium.

PubMed

Steffani-Vallejo, José Luis; Zuñiga, Cristal; Cruz-Morales, Pablo; Lozano, Luis; Morales, Marcia; Licona-Cassani, Cuauhtemoc; Revah, Sergio; Utrilla, José

2017-08-17

Sphingobacterium sp. CZ-UAM was isolated from a methanotrophic consortium in mineral medium using methane as the only carbon source. A draft genome of 5.84 Mb with a 40.77% G+C content is reported here. This genome sequence will allow the investigation of potential methanotrophy in this isolated strain. Copyright © 2017 Steffani-Vallejo et al.
Draft Genome Sequence of Escherichia coli Strain SN137, a Bacterium with Extracellular Proteolytic Activity on Immunoglobulins and Persistence in Human Tissue Blood

PubMed Central

Najera-Hernandez, Salustio; Sanchez-Alonso, Maria Patricia; Anastacio-Marcelino, Estela; Negrete-Abascal, Erasmo

2018-01-01

ABSTRACT The draft genome sequence of Escherichia coli strain SN137 is reported here. The genome comprises 172 contigs, corresponding to 4.9 Mb with 50% G+C content, and contains several genes related to pathogenicity that explain its survival in human hematic tissue. PMID:29348341
Draft Genome Sequence of Lactobacillus crispatus EM-LC1, an Isolate with Antimicrobial Activity Cultured from an Elderly Subject

PubMed Central

Power, Susan E.; Harris, Hugh M. B.; Bottacini, Francesca; Ross, R. Paul; O’Toole, Paul W.

2013-01-01

Here we report the 1.86-Mb draft genome sequence of Lactobacillus crispatus EM-LC1, a fecal isolate with antimicrobial activity. This genome sequence is expected to provide insights into the antimicrobial activity of L. crispatus and improve our knowledge of its potential probiotic traits. PMID:24356836
Draft genome sequence of Xylaria sp., the causal agent of taproot decline of soybean in the southern United States.

PubMed

Sharma, Sandeep; Zaccaron, Alex Z; Ridenour, John B; Allen, Tom W; Conner, Kassie; Doyle, Vinson P; Price, Trey; Sikora, Edward; Singh, Raghuwinder; Spurlock, Terry; Tomaso-Peterson, Maria; Wilkerson, Tessie; Bluhm, Burton H

2018-04-01

The draft genome of Xylaria sp. isolate MSU_SB201401, causal agent of taproot decline of soybean in the southern U.S., is presented here. The genome assembly was 56.7 Mb in size with an L50 of 246. A total of 10,880 putative protein-encoding genes were predicted, including 647 genes encoding carbohydrate-active enzymes and 1053 genes encoding secreted proteins. This is the first draft genome of a plant-pathogenic Xylaria sp. associated with soybean. The draft genome of Xylaria sp. isolate MSU_SB201401 will provide an important resource for future experiments to determine the molecular basis of pathogenesis.
Draft Genome Sequence of Cyanobacterium Hassallia byssoidea Strain VB512170, Isolated from Monuments in India

PubMed Central

Singh, Deeksha; Chandrababunaidu, Mathu Malar; Panda, Arijit; Sen, Diya; Bhattacharyya, Sourav

2015-01-01

The draft genome assembly of Hassallia byssoidea strain VB512170 with a genome size of ~13 Mb and 10,183 protein-coding genes in 62 scaffolds is reported here for the first time. This is a terrestrial hydrophobic cyanobacterium isolated from monuments in India. We report several copies of luciferase and antibiotic genes in this organism. PMID:25745001
Draft Genome Sequence of the d-Xylose-Fermenting Yeast Spathaspora arborariae UFMG-HM19.1AT

PubMed Central

Lobo, Francisco P.; Gonçalves, Davi L.; Alves, Sergio L.; Gerber, Alexandra L.; de Vasconcelos, Ana Tereza R.; Basso, Luiz C.; Franco, Glória R.; Soares, Marco A.; Cadete, Raquel M.; Rosa, Carlos A.

2014-01-01

The draft genome sequence of the yeast Spathaspora arborariae UFMG-HM19.1AT (CBS 11463 = NRRL Y-48658) is presented here. The sequenced genome size is 12.7 Mb, consisting of 41 scaffolds containing a total of 5,625 predicted open reading frames, including many genes encoding enzymes and transporters involved in d-xylose fermentation. PMID:24435867
Draft Genome Sequence of the Phytopathogenic Fungus Ganoderma boninense, the Causal Agent of Basal Stem Rot Disease on Oil Palm

PubMed Central

Tanjung, Zulfikar Achmad; Aditama, Redi; Buana, Rika Fithri Nurani; Pratomo, Antonius Dony Madu; Tryono, Reno; Liwang, Tony

2018-01-01

ABSTRACT Ganoderma boninense is the dominant fungal pathogen of basal stem rot (BSR) disease on Elaeis guineensis. We sequenced the nuclear genome of mycelia using both Illumina and Pacific Biosciences platforms for assembly of scaffolds. The draft genome comprised 79.24 Mb, 495 scaffolds, and 26,226 predicted coding sequences. PMID:29700132
Draft Genome Sequence of Escherichia coli Strain SN137, a Bacterium with Extracellular Proteolytic Activity on Immunoglobulins and Persistence in Human Tissue Blood.

PubMed

Najera-Hernandez, Salustio; Sanchez-Alonso, Maria Patricia; Anastacio-Marcelino, Estela; Negrete-Abascal, Erasmo; Vazquez-Cruz, Candelario

2018-01-18

The draft genome sequence of Escherichia coli strain SN137 is reported here. The genome comprises 172 contigs, corresponding to 4.9 Mb with 50% G+C content, and contains several genes related to pathogenicity that explain its survival in human hematic tissue. Copyright © 2018 Najera-Hernandez et al.
Draft Genome Sequence of Cellulolytic and Xylanolytic Paenibacillus sp. A59, Isolated from Decaying Forest Soil from Patagonia, Argentina

PubMed Central

Ghio, Silvina; Martinez Cáceres, Alfredo I.; Talia, Paola; Grasso, Daniel H.

2015-01-01

Paenibacillus sp. A59 was isolated from decaying forest soil in Argentina and characterized as a xylanolytic strain. We report the draft genome sequence of this isolate, with an estimated genome size of 7 Mb which harbor 6,424 coding sequences. Genes coding for hydrolytic enzymes involved in lignocellulose deconstruction were predicted. PMID:26494679
Draft Genome Sequence of Marine Sponge Symbiont Pseudoalteromonas luteoviolacea IPB1, Isolated from Hilo, Hawaii.

PubMed

Sakai-Kawada, Francis E; Yakym, Christopher J; Helmkampf, Martin; Hagiwara, Kehau; Ip, Courtney G; Antonio, Brandi J; Armstrong, Ellie; Ulloa, Wesley J; Awaya, Jonathan D

2016-09-22

We report here the 6.0-Mb draft genome assembly of Pseudoalteromonas luteoviolacea strain IPB1 that was isolated from the Hawaiian marine sponge Iotrochota protea Genome mining complemented with bioassay studies will elucidate secondary metabolite biosynthetic pathways and will help explain the ecological interaction between host sponge and microorganism. Copyright © 2016 Sakai-Kawada et al.
Draft genome sequence of Coniochaeta ligniaria NRRL 30616, a lignocellulolytic fungus for bioabatement of inhibitors in plant biomass hydrolysates

USDA-ARS?s Scientific Manuscript database

Here, we report the first draft genome sequence (42.38 Mb that contains 13,657 genes) of Coniochaeta ligniaria NRRL30616, an ascomycete with high biotechnological relevance in the bioenergy field given its high potential for bioabatement of toxic furanic compounds in plant biomass hydrolysates and i...
Draft Genome Sequence of Bacillus velezensis GF610, a Producer of Potent Anti-Listeria Agents

PubMed Central

Gerst, Michelle M.; Dudley, Edward G.; Xiaoli, Lingzi

2017-01-01

ABSTRACT Bacillus velezensis GF610 was isolated from soil in Illinois, USA, and found to produce amyloliquecidin GF610, a potent two-component antimicrobial peptide. We report here the GF610 strain draft genome sequence, which contains 4.29 Mb and an overall GC content of 45.91%. PMID:29025938
Draft Genome Sequence of Amycolatopsis mediterranei DSM 40773, a Tangible Antibiotic Producer

PubMed Central

Mukherjee, Udita; Saxena, Anjali; Kumari, Rashmi; Singh, Priya

2014-01-01

Amycolatopsis mediterranei DSM 40773 has been of special interest as successors of this strain are in use for the commercial production of rifamycin B. Here we present the draft genome sequence (~10 Mb) of this strain, which contains 108 contigs, 9,198 genes, and has a G+C content of 71.3%. PMID:25081263
Draft Genome Sequence of a “Candidatus Liberibacter europaeus” Strain Assembled from Broom Psyllids (Arytainilla spartiophila) from New Zealand

PubMed Central

Thompson, Sarah M.; Kalamorz, Falk; David, Charles; Addison, Shea M.; Smith, Grant R.

2018-01-01

ABSTRACT Here, we report the draft genome sequence of “Candidatus Liberibacter europaeus” ASNZ1, assembled from broom psyllids (Arytainilla spartiophila) from New Zealand. The assembly comprises 15 contigs, with a total length of 1.33 Mb and a G+C content of 33.5%. PMID:29773636
Draft genome sequences of the oomycete Pythium insidiosum strain CBS 573.85 from a horse with pythiosis and strain CR02 from the environment.

PubMed

Patumcharoenpol, Preecha; Rujirawat, Thidarat; Lohnoo, Tassanee; Yingyong, Wanta; Vanittanakom, Nongnuch; Kittichotirat, Weerayuth; Krajaejun, Theerapong

2018-02-01

Pythium insidiosum is an aquatic oomycete microorganism that causes the fatal infectious disease, pythiosis, in humans and animals. The organism has been successfully isolated from the environment worldwide. Diagnosis and treatment of pythiosis is difficult and challenging. Genome sequences of P. insidiosum , isolated from humans, are available and accessible in public databases. To further facilitate biology-, pathogenicity-, and evolution-related genomic and genetic studies of P. insidiosum , we report two additional draft genome sequences of the P. insidiosum strain CBS 573.85 (35.6 Mb in size; accession number, BCFO00000000.1) isolated from a horse with pythiosis, and strain CR02 (37.7 Mb in size; accession number, BCFR00000000.1) isolated from the environment.
Draft Genome Sequence of Cyanobacterium Hassallia byssoidea Strain VB512170, Isolated from Monuments in India.

PubMed

Singh, Deeksha; Chandrababunaidu, Mathu Malar; Panda, Arijit; Sen, Diya; Bhattacharyya, Sourav; Adhikary, Siba Prasad; Tripathy, Sucheta

2015-03-05

The draft genome assembly of Hassallia byssoidea strain VB512170 with a genome size of ~13 Mb and 10,183 protein-coding genes in 62 scaffolds is reported here for the first time. This is a terrestrial hydrophobic cyanobacterium isolated from monuments in India. We report several copies of luciferase and antibiotic genes in this organism. Copyright © 2015 Singh et al.
Draft Genome Sequence of the Phytopathogenic Fungus Ganoderma boninense, the Causal Agent of Basal Stem Rot Disease on Oil Palm.

PubMed

Utomo, Condro; Tanjung, Zulfikar Achmad; Aditama, Redi; Buana, Rika Fithri Nurani; Pratomo, Antonius Dony Madu; Tryono, Reno; Liwang, Tony

2018-04-26

Ganoderma boninense is the dominant fungal pathogen of basal stem rot (BSR) disease on Elaeis guineensis We sequenced the nuclear genome of mycelia using both Illumina and Pacific Biosciences platforms for assembly of scaffolds. The draft genome comprised 79.24 Mb, 495 scaffolds, and 26,226 predicted coding sequences. Copyright © 2018 Utomo et al.

Draft Genome Sequence of Deep-Sea Alteromonas sp. Strain V450 Isolated from the Marine Sponge Leiodermatium sp.

PubMed Central

Barrett, Nolan H.; McCarthy, Peter J.

2017-01-01

ABSTRACT The proteobacterium Alteromonas sp. strain V450 was isolated from the Atlantic deep-sea sponge Leiodermatium sp. Here, we report the draft genome sequence of this strain, with a genome size of approx. 4.39 Mb and a G+C content of 44.01%. The results will aid deep-sea microbial ecology, evolution, and sponge-microbe association studies. PMID:28153886
Draft Genome Sequence of Cellulolytic and Xylanolytic Cellulomonas sp. Strain B6 Isolated from Subtropical Forest Soil

PubMed Central

Piccinni, Florencia; Murua, Yanina; Ghio, Silvina; Talia, Paola; Rivarola, Máximo

2016-01-01

Cellulomonas sp. strain B6 was isolated from a subtropical forest soil sample and presented (hemi)cellulose-degrading activity. We report here its draft genome sequence, with an estimated genome size of 4 Mb, a G+C content of 75.1%, and 3,443 predicted protein-coding sequences, 92 of which are glycosyl hydrolases involved in polysaccharide degradation. PMID:27563050
Draft Genome Sequence of Cellulolytic and Xylanolytic Paenibacillus sp. A59, Isolated from Decaying Forest Soil from Patagonia, Argentina.

PubMed

Ghio, Silvina; Martinez Cáceres, Alfredo I; Talia, Paola; Grasso, Daniel H; Campos, Eleonora

2015-10-22

Paenibacillus sp. A59 was isolated from decaying forest soil in Argentina and characterized as a xylanolytic strain. We report the draft genome sequence of this isolate, with an estimated genome size of 7 Mb which harbor 6,424 coding sequences. Genes coding for hydrolytic enzymes involved in lignocellulose deconstruction were predicted. Copyright © 2015 Ghio et al.
Draft Genome Sequence of a Copper-Resistant Marine Bacterium, Pantoea agglomerans Strain LMAE-2, a Bacterial Strain with Potential Use in Bioremediation

PubMed Central

Corsini, Gino; Valdés, Natalia; Pradel, Paulina; Tello, Mario; Cottet, Luis; Karahanian, Eduardo; Castillo, Antonio

2016-01-01

Pantoea agglomerans LMAE-2 was isolated from seabed sediment moderately contaminated with Cu2+. Here, we report its draft genome sequence, which has a size of 4.98 Mb. The presence of cop genes related with copper homeostasis in its genome may explain the resistance and strengthen its potential for use as bioremediation agent. PMID:27313292
Draft Genome Sequence of Photorhabdus luminescens HIM3 Isolated from an Entomopathogenic Nematode in Agricultural Soils.

PubMed

Salgado-Morales, Rosalba; Rivera-Gómez, Nancy; Martínez-Ocampo, Fernando; Lozano-Aguirre Beltrán, Luis Fernando; Hernández-Mendoza, Armando; Dantán-González, Edgar

2017-08-31

In this work, we report the draft genome sequence of Photorhabdus luminescens strain HIM3, a symbiotic bacterium associated with the entomopathogenic nematode Heterorhabditis indica MOR03, isolated from soil sugarcane in Yautepec, Morelos, Mexico. These bacteria have a G+C content of 42.6% and genome size of 5.47 Mb. Copyright © 2017 Salgado-Morales et al.
Draft genome of the Northern snakehead, Channa argus.

PubMed

Xu, Jian; Bian, Chao; Chen, Kunci; Liu, Guiming; Jiang, Yanliang; Luo, Qing; You, Xinxin; Peng, Wenzhu; Li, Jia; Huang, Yu; Yi, Yunhai; Dong, Chuanju; Deng, Hua; Zhang, Songhao; Zhang, Hanyuan; Shi, Qiong; Xu, Peng

2017-04-01

The Northern snakehead (Channa argus), a member of the Channidae family of the Perciformes, is an economically important freshwater fish native to East Asia. In North America, it has become notorious as an intentionally released invasive species. Its ability to breathe air with gills and migrate short distances over land makes it a good model for bimodal breath research. Therefore, recent research has focused on the identification of relevant candidate genes. Here, we performed whole genome sequencing of C. argus to construct its draft genome, aiming to offer useful information for further functional studies and identification of target genes related to its unusual facultative air breathing. Findings: We assembled the C. argus genome with a total of 140.3 Gb of raw reads, which were sequenced using the Illumina HiSeq2000 platform. The final draft genome assembly was approximately 615.3 Mb, with a contig N50 of 81.4 kb and scaffold N50 of 4.5 Mb. The identified repeat sequences account for 18.9% of the whole genome. The 19 877 protein-coding genes were predicted from the genome assembly, with an average of 10.5 exons per gene. Conclusion: We generated a high-quality draft genome of C. argus, which will provide a valuable genetic resource for further biomedical investigations of this economically important teleost fish. © The Author 2017. Published by Oxford University Press.
Draft Genome Sequence of Salmonella enterica subsp. enterica Serovar Infantis Strain SPE101, Isolated from a Chronic Human Infection

PubMed Central

Iriarte, Andrés; Giner-Lamia, Joaquín; Betancor, Laura; Astocondor, Lizeth; Cestero, Juan J.; Ochoa, Theresa; García, Coralith; Puente, José L.; Chabalgoity, José A.

2017-01-01

ABSTRACT We report a 4.99-Mb draft genome sequence of Salmonella enterica subsp. enterica serovar Infantis strain SPE101, isolated from feces of a 5-month-old breast-fed female showing diarrhea associated with severe dehydration and malnutrition. The infection prolonged for 6 months despite antibiotic treatment. PMID:28729277
Draft Genome Sequence of the Obligately Alkaliphilic Sulfate-Reducing Bacterium Desulfonatronum thiodismutans Strain MLF1

PubMed Central

Trubitsyn, Denis; Geurink, Corey; Pikuta, Elena; Lefèvre, Christopher T.; McShan, W. Michael; Gillaspy, Allison F.

2014-01-01

Desulfonatronum thiodismutans strain MLF1, an alkaliphilic bacterium capable of sulfate reduction, was isolated from Mono Lake, California. Here we report the 3.92-Mb draft genome sequence comprising 34 contigs and some results of its automated annotation. These data will improve our knowledge of mechanisms by which bacteria withstand extreme environments. PMID:25081260
Draft genome sequence of Bacillus azotoformans MEV2011, a (Co-) denitrifying strain unable to grow with oxygen.

PubMed

Nielsen, Maja; Schreiber, Lars; Finster, Kai; Schramm, Andreas

2015-01-01

Bacillus azotoformans MEV2011, isolated from soil, is a microaerotolerant obligate denitrifier, which can also produce N2 by co-denitrification. Oxygen is consumed but not growth-supportive. The draft genome has a size of 4.7 Mb and contains key genes for both denitrification and dissimilatory nitrate reduction to ammonium.
Draft genome sequence of Bacillus azotoformans MEV2011, a (Co-) denitrifying strain unable to grow with oxygen.

PubMed

Nielsen, Maja; Schreiber, Lars; Finster, Kai; Schramm, Andreas

2014-01-01

Bacillus azotoformans MEV2011, isolated from soil, is a microaerotolerant obligate denitrifier, which can also produce N2 by co-denitrification. Oxygen is consumed but not growth-supportive. The draft genome has a size of 4.7 Mb and contains key genes for both denitrification and dissimilatory nitrate reduction to ammonium.
Draft genome sequence of Bacillus azotoformans MEV2011, a (Co-) denitrifying strain unable to grow with oxygen

PubMed Central

2014-01-01

Bacillus azotoformans MEV2011, isolated from soil, is a microaerotolerant obligate denitrifier, which can also produce N2 by co-denitrification. Oxygen is consumed but not growth-supportive. The draft genome has a size of 4.7 Mb and contains key genes for both denitrification and dissimilatory nitrate reduction to ammonium. PMID:25685261
Draft Genome Sequence of Sphingobium quisquiliarum Strain P25T, a Novel Hexachlorocyclohexane (HCH)-Degrading Bacterium Isolated from an HCH Dumpsite

PubMed Central

Kumar Singh, Amit; Sangwan, Naseer; Sharma, Anukriti; Gupta, Vipin; Khurana, J. P.

2013-01-01

Here, we report the draft genome sequence (4.2 Mb) of Sphingobium quisquiliarum strain P25T, a natural lin (genes involved in degradation of hexachlorocyclohexane [HCH] isomers) variant genotype, isolated from a heavily contaminated (450 mg HCH/g of soil) HCH dumpsite. PMID:24029763
Draft Genome Sequence of Pontibacter sp. nov. BAB1700, a Halotolerant, Industrially Important Bacterium

PubMed Central

Joshi, M. N.; Sharma, A. C.; Pandya, R. V.; Patel, R. P.; Saiyed, Z. M.; Saxena, A. K.

2012-01-01

Pontibacter sp. nov. BAB1700 is a halotolerant, Gram-negative, rod-shaped, pink-pigmented, menaquinone-7-producing bacterium isolated from sediments of a drilling well. The draft genome sequence of the strain, consisting of one chromosome of 4.5 Mb, revealed vital gene clusters involved in vitamin biosynthesis and resistance against various metals and antibiotics. PMID:23105068
Draft Genome Sequence of Deep-Sea Alteromonas sp. Strain V450 Isolated from the Marine Sponge Leiodermatium sp.

PubMed

Wang, Guojun; Barrett, Nolan H; McCarthy, Peter J

2017-02-02

The proteobacterium Alteromonas sp. strain V450 was isolated from the Atlantic deep-sea sponge Leiodermatium sp. Here, we report the draft genome sequence of this strain, with a genome size of approx. 4.39 Mb and a G+C content of 44.01%. The results will aid deep-sea microbial ecology, evolution, and sponge-microbe association studies. Copyright © 2017 Wang et al.
Draft Genome Sequence of Cellulolytic and Xylanolytic Cellulomonas sp. Strain B6 Isolated from Subtropical Forest Soil.

PubMed

Piccinni, Florencia; Murua, Yanina; Ghio, Silvina; Talia, Paola; Rivarola, Máximo; Campos, Eleonora

2016-08-25

Cellulomonas sp. strain B6 was isolated from a subtropical forest soil sample and presented (hemi)cellulose-degrading activity. We report here its draft genome sequence, with an estimated genome size of 4 Mb, a G+C content of 75.1%, and 3,443 predicted protein-coding sequences, 92 of which are glycosyl hydrolases involved in polysaccharide degradation. Copyright © 2016 Piccinni et al.
Draft Genome Sequence of a Copper-Resistant Marine Bacterium, Pantoea agglomerans Strain LMAE-2, a Bacterial Strain with Potential Use in Bioremediation.

PubMed

Corsini, Gino; Valdés, Natalia; Pradel, Paulina; Tello, Mario; Cottet, Luis; Muiño, Laura; Karahanian, Eduardo; Castillo, Antonio; Gonzalez, Alex R

2016-06-16

Pantoea agglomerans LMAE-2 was isolated from seabed sediment moderately contaminated with Cu(2+) Here, we report its draft genome sequence, which has a size of 4.98 Mb. The presence of cop genes related with copper homeostasis in its genome may explain the resistance and strengthen its potential for use as bioremediation agent. Copyright © 2016 Corsini et al.
Draft genome sequence of an aflatoxigenic Aspergillus species, A. bombycis

USDA-ARS?s Scientific Manuscript database

The genome of the A. bombycis Type strain was sequenced using a Personal Genome Machine, followed by annotation of its predicted genes. The genome size for A. bombycis was found to be approximately 37 Mb and contained 12,266 genes. This announcement introduces a sequenced genome for an aflatoxigenic...
Draft genome sequence of Trametes villosa (Sw.) Kreisel CCMB561, a tropical white-rot Basidiomycota from the semiarid region of Brazil.

PubMed

Ferreira, Dalila Souza Santos; Kato, Rodrigo Bentes; Miranda, Fábio Malcher; da Costa Pinheiro, Kenny; Fonseca, Paula Luize Camargos; Tomé, Luiz Marcelo Ribeiro; Vaz, Aline Bruna Martins; Badotti, Fernanda; Ramos, Rommel Thiago Jucá; Brenig, Bertram; Azevedo, Vasco Ariston de Carvalho; Benevides, Raquel Guimarães; Góes-Neto, Aristóteles

2018-06-01

Herein, we present the draft genome of Trametes villosa isolate CCMB561, a wood-decaying Basidiomycota commonly found in tropical semiarid climate. The genome assembly was 57.98 Mb in size with an L50 of 691. A total of 16,711 putative protein-encoding genes was predicted, including 590 genes coding for carbohydrate-active enzymes (CAZy), directly involved in the decomposition of lignocellulosic materials. This is the first genome of this species of high interest in bioenergy research. The draft genome of Trametes villosa isolate CCMB561 will provide an important resource for future investigations in biofuel production, bioremediation and other green technologies.
Draft Genome Sequence of Lutibaculum baratangense Strain AMV1T, Isolated from a Mud Volcano in Andamans, India.

PubMed

Singh, Aditya; Sreenivas, Ara; Sathyanarayana Reddy, Gundlapally; Pinnaka, Anil Kumar; Shivaji, Sisinthy

2014-07-24

The 4.3-Mb genome of Lutibaculum baratangense strain AMV1(T), isolated from a soil sample collected from a mud volcano in Andamans, India, is reported. The draft genome of strain Lutibaculum baratangense AMV1(T) consists of 4,300,776 bp with a G+C content of 66.93 mol% and 4,198 predicted coding regions, including 56 RNAs. Copyright © 2014 Singh et al.
Draft Genome Sequence of Paenibacillus polymyxa Strain Mc5Re-14, an Antagonistic Root Endophyte of Matricaria chamomilla

DOE PAGES

Köberl, Martina; White, Richard A.; Erschen, Sabine; ...

2015-08-06

Paenibacillus polymyxa strain Mc5Re-14 was isolated from the inner root tissue of Matricaria chamomilla (German chamomile). Mc5Re-14 revealed promising in vitro antagonistic activity against plant and opportunistic human pathogens. The 6.0-Mb draft genome reveals genes putatively involved in pathogen suppression and direct and indirect plant growth promotion.

Draft Genome Sequence of Streptomyces sp. Strain Wb2n-11, a Desert Isolate with Broad-Spectrum Antagonism against Soilborne Phytopathogens

DOE PAGES

Köberl, Martina; White, Richard A.; Erschen, Sabine; ...

2015-08-06

Streptomyces sp. strain Wb2n-11, isolated from native desert soil, exhibited broad-spectrum antagonism against plant pathogenic fungi, bacteria, and nematodes. The 8.2-Mb draft genome reveals genes putatively responsible for its promising biocontrol activity and genes which enable the soil bacterium to directly interact beneficially with plants.
Draft genome sequence of the fungus associated with oak-wilt mortality in South Korea, Raffaelea quercus-mongolicae KACC44405

Treesearch

Jongbum Jeon; Ki-Tae Kim; Hyeunjeong Song; Gir-Won Lee; Kyeongchae Cheong; Hyunbin Kim; Gobong Choi; Yong-Hwan Lee; Jane E. Stewart; Ned B. Klopfenstein; Mee-Sook Kim

2017-01-01

The fungus Raffaelea quercus-mongolicae is the causal agent of Korean oak wilt, a disease associated with mass mortality of oak trees (e.g., Quercus spp.). The fungus is vectored and dispersed by the ambrosia beetle, Platypus koryoensis. Here, we present the 27.0-Mb draft genome sequence of R. quercus-mongolicae strain KACC44405.
Draft Genome Sequence of Bacillus stratosphericus LAMA 585, Isolated from the Atlantic Deep Sea

PubMed Central

Cabral, Alencar; Andreote, Fernando Dini; Cavalett, Angélica; Pessatti, Marcos Luiz; Dini-Andreote, Francisco; da Silva, Marcus Adonai Castro

2013-01-01

Bacillus stratosphericus LAMA 585 was isolated from the Mid-Atlantic-Ridge seafloor (5,500-m depth). This bacterium presents the capacity for cellulase, xylanase, and lipase production when growing aerobically in marine-broth media. Genes involved in the tolerance of oligotrophic and extreme conditions and prospection of biotechnological products were annotated in the draft genome (3.7 Mb). PMID:23640380
Draft Genome Sequence of Caenibacillus caldisaponilyticus B157T, a Thermophilic and Phospholipase-Producing Bacterium Isolated from Acidulocompost

PubMed Central

Tsujimoto, Yoshiyuki; Saito, Ryo; Sahara, Takehiko; Kimura, Nobutada; Tsuruoka, Naoki; Shigeri, Yasushi

2017-01-01

ABSTRACT Caenibacillus caldisaponilyticus B157T (= NBRC 111400T = DSM 101100T), in the family Sporolactobacillaceae, was isolated from acidulocompost as a thermophilic and phospholipid-degrading bacterium. Here, we report the 3.36-Mb draft genome sequence, with a G+C content of 51.8%, to provide the genetic information coding for phospholipases. PMID:28360164
Draft genome sequence of a strictly anaerobic dichloromethane-degrading bacterium

DOE PAGES

Kleindienst, Sara; Higgins, Steven A.; Tsementzi, Despina; ...

2016-03-03

Here, an anaerobic, dichloromethane-degrading bacterium affiliated with novel Peptococcaceae was maintained in a microbial consortium. The organism originated from pristine freshwater sediment collected from Rio Mameyes in Luquillo, Puerto Rico, in October 2009 (latitude 18°21'43.9", longitude –65°46'8.4"). The draft genome sequence is 2.1 Mb and has a G+C content of 43.5%.
Draft Genome Sequence of the Obligately Alkaliphilic Sulfate-Reducing Bacterium Desulfonatronum thiodismutans Strain MLF1.

PubMed

Trubitsyn, Denis; Geurink, Corey; Pikuta, Elena; Lefèvre, Christopher T; McShan, W Michael; Gillaspy, Allison F; Bazylinski, Dennis A

2014-07-31

Desulfonatronum thiodismutans strain MLF1, an alkaliphilic bacterium capable of sulfate reduction, was isolated from Mono Lake, California. Here we report the 3.92-Mb draft genome sequence comprising 34 contigs and some results of its automated annotation. These data will improve our knowledge of mechanisms by which bacteria withstand extreme environments. Copyright © 2014 Trubitsyn et al.
Draft Genome Sequence of the Extremely Halophilic Bacterium Halomonas salina Strain CIFRI1, Isolated from the East Coast of India

PubMed Central

Das, Priyanka; Maharana, Jitendra; Paria, Prasenjit; Mandal, Shambhu Nath; Meena, Dharmendra Kumar; Sharma, Anil Prakash; Jayarajan, Rijith; Dixit, Vishal; Verma, Ankit; Vellarikkal, Shamsudheen Karuthedath; Scaria, Vinod; Sivasubbu, Sridhar; Rao, Atmakuri Ramakrishna; Mohapatra, Trilochan

2015-01-01

Halomonas salina strain CIFRI1 is an extremely salt-stress-tolerant bacterium isolated from the salt crystals of the east coast of India. Here we report the annotated 3.45-Mb draft genome sequence of strain CIFRI1 having 86 contigs with 3,139 protein coding loci, including 62 RNA genes. PMID:25573926
Draft Genome Sequence of Exiguobacterium sp. Strain BMC-KP, an Environmental Isolate from Bryn Mawr, Pennsylvania.

PubMed

Hyson, Peter; Shapiro, Joshua A; Wien, Michelle W

2015-10-08

Exiguobacterium sp. strain BMC-KP was isolated as part of a student environmental sampling project at Bryn Mawr College, PA. Sequencing of bacterial DNA assembled a 3.32-Mb draft genome. Analysis suggests the presence of genes for tolerance to cold and toxic metals, broad carbohydrate metabolism, and genes derived from phage. Copyright © 2015 Hyson et al.
Draft Genome Sequence of Sphingobium lactosutens Strain DS20T, Isolated from a Hexachlorocyclohexane Dumpsite

PubMed Central

Kumar, Roshan; Dwivedi, Vatsala; Negi, Vivek; Khurana, J. P.

2013-01-01

Sphingobium lactosutens DS20T has been isolated from the hexachlorocyclohexane (HCH) dumpsite in Lucknow, India, but does not degrade any of the HCH isomers. Here, we present the ~5.36-Mb draft genome sequence of strain DS20T, which consists of 110 contigs and 5,288 coding sequences, with a G+C content of 63.1%. PMID:24051323
Draft Genome Sequence of Grammothele lineata SDL-CO-2015-1, a Jute Endophyte with a Potential for Paclitaxel Biosynthesis

PubMed Central

Das, Avizit; Ahmed, Oly; Baten, A. K. M. Abdul; Bushra, Samira; Islam, M. Tariqul; Ferdous, Ahlan Sabah; Islam, Mohammad Riazul

2017-01-01

ABSTRACT Grammothele lineata strain SDL-CO-2015-1, a basidiomycete fungus, was identified as an endophyte from a jute species, Corchorus olitorius var. 2015, and found to produce paclitaxel, a diterpenic polyoxygenated pseudoalkaloid with antitumor activity. Here, we report the draft genome sequence (42.8 Mb with 9,395 genes) of this strain. PMID:28818909
Draft Genome Sequence of Salmonella enterica subsp. enterica Serovar Infantis Strain SPE101, Isolated from a Chronic Human Infection.

PubMed

Iriarte, Andrés; Giner-Lamia, Joaquín; Silva, Claudia; Betancor, Laura; Astocondor, Lizeth; Cestero, Juan J; Ochoa, Theresa; García, Coralith; Puente, José L; Chabalgoity, José A; García-Del Portillo, Francisco

2017-07-20

We report a 4.99-Mb draft genome sequence of Salmonella enterica subsp. enterica serovar Infantis strain SPE101, isolated from feces of a 5-month-old breast-fed female showing diarrhea associated with severe dehydration and malnutrition. The infection prolonged for 6 months despite antibiotic treatment. Copyright © 2017 Iriarte et al.
Draft Genome Sequence of Acinetobacter calcoaceticus Strain GK1, a Hydrocarbon-Degrading Plant Growth-Promoting Rhizospheric Bacterium.

PubMed

Gkorezis, Panagiotis; Bottos, Eric M; Van Hamme, Jonathan D; Franzetti, Andrea; Abbamondi, Gennaro Roberto; Balseiro-Romero, Maria; Weyens, Nele; Rineau, Francois; Vangronsveld, Jaco

2015-08-13

The 3.94-Mb draft genome of Acinetobacter calcoaceticus GK1, a hydrocarbonoclastic plant growth-promoting Gram-negative rhizospheric bacterium, is presented here. Isolated at the Ford Motor Company site in Genk, Belgium, from poplar trees planted on a diesel-contaminated plume, GK1 is useful for enhancing hydrocarbon phytoremediation. Copyright © 2015 Gkorezis et al.
Draft Genome of the Pearl Oyster Pinctada fucata: A Platform for Understanding Bivalve Biology

PubMed Central

Takeuchi, Takeshi; Kawashima, Takeshi; Koyanagi, Ryo; Gyoja, Fuki; Tanaka, Makiko; Ikuta, Tetsuro; Shoguchi, Eiichi; Fujiwara, Mayuki; Shinzato, Chuya; Hisata, Kanako; Fujie, Manabu; Usami, Takeshi; Nagai, Kiyohito; Maeyama, Kaoru; Okamoto, Kikuhiko; Aoki, Hideo; Ishikawa, Takashi; Masaoka, Tetsuji; Fujiwara, Atushi; Endo, Kazuyoshi; Endo, Hirotoshi; Nagasawa, Hiromichi; Kinoshita, Shigeharu; Asakawa, Shuichi; Watabe, Shugo; Satoh, Nori

2012-01-01

The study of the pearl oyster Pinctada fucata is key to increasing our understanding of the molecular mechanisms involved in pearl biosynthesis and biology of bivalve molluscs. We sequenced ∼1150-Mb genome at ∼40-fold coverage using the Roche 454 GS-FLX and Illumina GAIIx sequencers. The sequences were assembled into contigs with N50 = 1.6 kb (total contig assembly reached to 1024 Mb) and scaffolds with N50 = 14.5 kb. The pearl oyster genome is AT-rich, with a GC content of 34%. DNA transposons, retrotransposons, and tandem repeat elements occupied 0.4, 1.5, and 7.9% of the genome, respectively (a total of 9.8%). Version 1.0 of the P. fucata draft genome contains 23 257 complete gene models, 70% of which are supported by the corresponding expressed sequence tags. The genes include those reported to have an association with bio-mineralization. Genes encoding transcription factors and signal transduction molecules are present in numbers comparable with genomes of other metazoans. Genome-wide molecular phylogeny suggests that the lophotrochozoan represents a distinct clade from ecdysozoans. Our draft genome of the pearl oyster thus provides a platform for the identification of selection markers and genes for calcification, knowledge of which will be important in the pearl industry. PMID:22315334
Draft genome of the medaka fish: a comprehensive resource for medaka developmental genetics and vertebrate evolutionary biology.

PubMed

Takeda, Hiroyuki

2008-06-01

The medaka Oryzias latipes is a small egg-laying freshwater teleost, and has become an excellent model system for developmental genetics and evolutionary biology. The medaka genome is relatively small in size, approximately 800 Mb, and the genome sequencing project was recently completed by Japanese research groups, providing a high-quality draft genome sequence of the inbred Hd-rR strain of medaka. In this review, I present an overview of the medaka genome project including genome resources, followed by specific findings obtained with the medaka draft genome. In particular, I focus on the analysis that was done by taking advantage of the medaka system, such as the sex chromosome differentiation and the regional history of medaka species using single nucleotide polymorphisms as genomic markers.
Draft genome sequence of bitter gourd (Momordica charantia), a vegetable and medicinal plant in tropical and subtropical regions.

PubMed

Urasaki, Naoya; Takagi, Hiroki; Natsume, Satoshi; Uemura, Aiko; Taniai, Naoki; Miyagi, Norimichi; Fukushima, Mai; Suzuki, Shouta; Tarora, Kazuhiko; Tamaki, Moritoshi; Sakamoto, Moriaki; Terauchi, Ryohei; Matsumura, Hideo

2017-02-01

Bitter gourd (Momordica charantia) is an important vegetable and medicinal plant in tropical and subtropical regions globally. In this study, the draft genome sequence of a monoecious bitter gourd inbred line, OHB3-1, was analyzed. Through Illumina sequencing and de novo assembly, scaffolds of 285.5 Mb in length were generated, corresponding to ∼84% of the estimated genome size of bitter gourd (339 Mb). In this draft genome sequence, 45,859 protein-coding gene loci were identified, and transposable elements accounted for 15.3% of the whole genome. According to synteny mapping and phylogenetic analysis of conserved genes, bitter gourd was more related to watermelon (Citrullus lanatus) than to cucumber (Cucumis sativus) or melon (C. melo). Using RAD-seq analysis, 1507 marker loci were genotyped in an F2 progeny of two bitter gourd lines, resulting in an improved linkage map, comprising 11 linkage groups. By anchoring RAD tag markers, 255 scaffolds were assigned to the linkage map. Comparative analysis of genome sequences and predicted genes determined that putative trypsin-inhibitor and ribosome-inactivating genes were distinctive in the bitter gourd genome. These genes could characterize the bitter gourd as a medicinal plant. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
A high-coverage draft genome of the mycalesine butterfly Bicyclus anynana.

PubMed

Nowell, Reuben W; Elsworth, Ben; Oostra, Vicencio; Zwaan, Bas J; Wheat, Christopher W; Saastamoinen, Marjo; Saccheri, Ilik J; Van't Hof, Arjen E; Wasik, Bethany R; Connahs, Heidi; Aslam, Muhammad L; Kumar, Sujai; Challis, Richard J; Monteiro, Antónia; Brakefield, Paul M; Blaxter, Mark

2017-07-01

The mycalesine butterfly Bicyclus anynana, the "Squinting bush brown," is a model organism in the study of lepidopteran ecology, development, and evolution. Here, we present a draft genome sequence for B. anynana to serve as a genomics resource for current and future studies of this important model species. Seven libraries with insert sizes ranging from 350 bp to 20 kb were constructed using DNA from an inbred female and sequenced using both Illumina and PacBio technology; 128 Gb of raw Illumina data was filtered to 124 Gb and assembled to a final size of 475 Mb (∼×260 assembly coverage). Contigs were scaffolded using mate-pair, transcriptome, and PacBio data into 10 800 sequences with an N50 of 638 kb (longest scaffold 5 Mb). The genome is comprised of 26% repetitive elements and encodes a total of 22 642 predicted protein-coding genes. Recovery of a BUSCO set of core metazoan genes was almost complete (98%). Overall, these metrics compare well with other recently published lepidopteran genomes. We report a high-quality draft genome sequence for Bicyclus anynana. The genome assembly and annotated gene models are available at LepBase (http://ensembl.lepbase.org/index.html). © The Authors 2017. Published by Oxford University Press.
A high-coverage draft genome of the mycalesine butterfly Bicyclus anynana

PubMed Central

Elsworth, Ben; Oostra, Vicencio; Zwaan, Bas J.; Wheat, Christopher W.; Saastamoinen, Marjo; Saccheri, Ilik J.; van’t Hof, Arjen E.; Wasik, Bethany R.; Connahs, Heidi; Aslam, Muhammad L.; Kumar, Sujai; Challis, Richard J.; Monteiro, Antónia; Brakefield, Paul M.

2017-01-01

Abstract The mycalesine butterfly Bicyclus anynana, the “Squinting bush brown,” is a model organism in the study of lepidopteran ecology, development, and evolution. Here, we present a draft genome sequence for B. anynana to serve as a genomics resource for current and future studies of this important model species. Seven libraries with insert sizes ranging from 350 bp to 20 kb were constructed using DNA from an inbred female and sequenced using both Illumina and PacBio technology; 128 Gb of raw Illumina data was filtered to 124 Gb and assembled to a final size of 475 Mb (∼×260 assembly coverage). Contigs were scaffolded using mate-pair, transcriptome, and PacBio data into 10 800 sequences with an N50 of 638 kb (longest scaffold 5 Mb). The genome is comprised of 26% repetitive elements and encodes a total of 22 642 predicted protein-coding genes. Recovery of a BUSCO set of core metazoan genes was almost complete (98%). Overall, these metrics compare well with other recently published lepidopteran genomes. We report a high-quality draft genome sequence for Bicyclus anynana. The genome assembly and annotated gene models are available at LepBase (http://ensembl.lepbase.org/index.html). PMID:28486658
Draft Genome Sequence of Curtobacterium sp. Strain ER1/6, an Endophytic Strain Isolated from Citrus sinensis with Potential To Be Used as a Biocontrol Agent.

PubMed

Garrido, Leandro Maza; Alves, João Marcelo Pereira; Oliveira, Liliane Santana; Gruber, Arthur; Padilla, Gabriel; Araújo, Welington Luiz

2016-11-17

Herein, we report a draft genome sequence of the endophytic Curtobacterium sp. strain ER1/6, isolated from a surface-sterilized Citrus sinensis branch, and it presented the capability to control phytopathogens. Functional annotation of the ~3.4-Mb genome revealed 3,100 protein-coding genes, with many products related to known ecological and biotechnological aspects of this bacterium. Copyright © 2016 Garrido et al.
Improved High-Quality Draft Genome Sequence of the Eurypsychrophile Rhodotorula sp. JG1b, Isolated from Permafrost in the Hyperarid Upper-Elevation McMurdo Dry Valleys, Antarctica

DOE Office of Scientific and Technical Information (OSTI.GOV)

Goordial, Jacqueline; Raymond-Bouchard, Isabelle; Riley, Robert

Here, we report the draft genome sequence of Rhodotorula sp. strain JG1b, a yeast that was isolated from ice-cemented permafrost in the upper-elevation McMurdo Dry Valleys, Antarctica. The sequenced genome size is 19.39 Mb, consisting of 156 scaffolds and containing a total of 5,625 predicted genes. This is the first known cold-adapted Rhodotorula sp. sequenced to date.
Improved High-Quality Draft Genome Sequence of the Eurypsychrophile Rhodotorula sp. JG1b, Isolated from Permafrost in the Hyperarid Upper-Elevation McMurdo Dry Valleys, Antarctica

DOE PAGES

Goordial, Jacqueline; Raymond-Bouchard, Isabelle; Riley, Robert; ...

2016-03-17

Here, we report the draft genome sequence of Rhodotorula sp. strain JG1b, a yeast that was isolated from ice-cemented permafrost in the upper-elevation McMurdo Dry Valleys, Antarctica. The sequenced genome size is 19.39 Mb, consisting of 156 scaffolds and containing a total of 5,625 predicted genes. This is the first known cold-adapted Rhodotorula sp. sequenced to date.

Draft Genome Sequence of a Tetrabromobisphenol A–Degrading Strain, Ochrobactrum sp. T, Isolated from an Electronic Waste Recycling Site

PubMed Central

Liang, Zhishu; Li, Guiying; Zhang, Guoxia; Das, Ranjit

2016-01-01

Ochrobactrum sp. T was previously isolated from a sludge sample collected from an electronic waste recycling site and characterized as a unique tetrabromobisphenol A (TBBPA)–degrading bacterium. Here, the draft genome sequence (3.9 Mb) of Ochrobactrum sp. T is reported to provide insights into its diversity and its TBBPA biodegradation mechanism in polluted environments. PMID:27445374
Draft Genome Sequence of Pseudomonas oceani DSM 100277T, a Deep-Sea Bacterium

PubMed Central

2018-01-01

ABSTRACT Pseudomonas oceani DSM 100277T was isolated from deep seawater in the Okinawa Trough at 1390 m. P. oceani belongs to the Pseudomonas pertucinogena group. Here, we report the draft genome sequence of P. oceani, which has an estimated size of 4.1 Mb and exhibits 3,790 coding sequences, with a G+C content of 59.94 mol%. PMID:29650573
Draft Genome Sequence of Lactobacillus sp. Strain TCF032-E4, Isolated from Fermented Radish.

PubMed

Mao, Yuejian; Chen, Meng; Horvath, Philippe

2015-07-30

Here, we report the draft genome sequence of Lactobacillus sp. strain TCF032-E4 (= CCTCC AB2015090 = DSM 100358), isolated from a Chinese fermented radish. The total length of the 57 contigs is about 2.9 Mb, with a G+C content of 43.5 mol% and 2,797 predicted coding sequences (CDSs). Copyright © 2015 Mao et al.
Draft Genome Sequence of Grammothele lineata SDL-CO-2015-1, a Jute Endophyte with a Potential for Paclitaxel Biosynthesis.

PubMed

Das, Avizit; Ahmed, Oly; Baten, A K M Abdul; Bushra, Samira; Islam, M Tariqul; Ferdous, Ahlan Sabah; Islam, Mohammad Riazul; Khan, Haseena

2017-08-17

Grammothele lineata strain SDL-CO-2015-1, a basidiomycete fungus, was identified as an endophyte from a jute species, Corchorus olitorius var. 2015, and found to produce paclitaxel, a diterpenic polyoxygenated pseudoalkaloid with antitumor activity. Here, we report the draft genome sequence (42.8 Mb with 9,395 genes) of this strain. Copyright © 2017 Das et al.
A manually annotated Actinidia chinensis var. chinensis (kiwifruit) genome highlights the challenges associated with draft genomes and gene prediction in plants.

PubMed

Pilkington, Sarah M; Crowhurst, Ross; Hilario, Elena; Nardozza, Simona; Fraser, Lena; Peng, Yongyan; Gunaseelan, Kularajathevan; Simpson, Robert; Tahir, Jibran; Deroles, Simon C; Templeton, Kerry; Luo, Zhiwei; Davy, Marcus; Cheng, Canhong; McNeilage, Mark; Scaglione, Davide; Liu, Yifei; Zhang, Qiong; Datson, Paul; De Silva, Nihal; Gardiner, Susan E; Bassett, Heather; Chagné, David; McCallum, John; Dzierzon, Helge; Deng, Cecilia; Wang, Yen-Yi; Barron, Lorna; Manako, Kelvina; Bowen, Judith; Foster, Toshi M; Erridge, Zoe A; Tiffin, Heather; Waite, Chethi N; Davies, Kevin M; Grierson, Ella P; Laing, William A; Kirk, Rebecca; Chen, Xiuyin; Wood, Marion; Montefiori, Mirco; Brummell, David A; Schwinn, Kathy E; Catanach, Andrew; Fullerton, Christina; Li, Dawei; Meiyalaghan, Sathiyamoorthy; Nieuwenhuizen, Niels; Read, Nicola; Prakash, Roneel; Hunter, Don; Zhang, Huaibi; McKenzie, Marian; Knäbel, Mareike; Harris, Alastair; Allan, Andrew C; Gleave, Andrew; Chen, Angela; Janssen, Bart J; Plunkett, Blue; Ampomah-Dwamena, Charles; Voogd, Charlotte; Leif, Davin; Lafferty, Declan; Souleyre, Edwige J F; Varkonyi-Gasic, Erika; Gambi, Francesco; Hanley, Jenny; Yao, Jia-Long; Cheung, Joey; David, Karine M; Warren, Ben; Marsh, Ken; Snowden, Kimberley C; Lin-Wang, Kui; Brian, Lara; Martinez-Sanchez, Marcela; Wang, Mindy; Ileperuma, Nadeesha; Macnee, Nikolai; Campin, Robert; McAtee, Peter; Drummond, Revel S M; Espley, Richard V; Ireland, Hilary S; Wu, Rongmei; Atkinson, Ross G; Karunairetnam, Sakuntala; Bulley, Sean; Chunkath, Shayhan; Hanley, Zac; Storey, Roy; Thrimawithana, Amali H; Thomson, Susan; David, Charles; Testolin, Raffaele; Huang, Hongwen; Hellens, Roger P; Schaffer, Robert J

2018-04-16

Most published genome sequences are drafts, and most are dominated by computational gene prediction. Draft genomes typically incorporate considerable sequence data that are not assigned to chromosomes, and predicted genes without quality confidence measures. The current Actinidia chinensis (kiwifruit) 'Hongyang' draft genome has 164 Mb of sequences unassigned to pseudo-chromosomes, and omissions have been identified in the gene models. A second genome of an A. chinensis (genotype Red5) was fully sequenced. This new sequence resulted in a 554.0 Mb assembly with all but 6 Mb assigned to pseudo-chromosomes. Pseudo-chromosomal comparisons showed a considerable number of translocation events have occurred following a whole genome duplication (WGD) event some consistent with centromeric Robertsonian-like translocations. RNA sequencing data from 12 tissues and ab initio analysis informed a genome-wide manual annotation, using the WebApollo tool. In total, 33,044 gene loci represented by 33,123 isoforms were identified, named and tagged for quality of evidential support. Of these 3114 (9.4%) were identical to a protein within 'Hongyang' The Kiwifruit Information Resource (KIR v2). Some proportion of the differences will be varietal polymorphisms. However, as most computationally predicted Red5 models required manual re-annotation this proportion is expected to be small. The quality of the new gene models was tested by fully sequencing 550 cloned 'Hort16A' cDNAs and comparing with the predicted protein models for Red5 and both the original 'Hongyang' assembly and the revised annotation from KIR v2. Only 48.9% and 63.5% of the cDNAs had a match with 90% identity or better to the original and revised 'Hongyang' annotation, respectively, compared with 90.9% to the Red5 models. Our study highlights the need to take a cautious approach to draft genomes and computationally predicted genes. Our use of the manual annotation tool WebApollo facilitated manual checking and correction of gene models enabling improvement of computational prediction. This utility was especially relevant for certain types of gene families such as the EXPANSIN like genes. Finally, this high quality gene set will supply the kiwifruit and general plant community with a new tool for genomics and other comparative analysis.
Draft genome sequences of two closely related aflatoxigenic Aspergillus species obtained from the Ivory Coast

USDA-ARS?s Scientific Manuscript database

The genomes of the A. ochraceoroseus and A. rambellii type strains were sequenced using a personal genome machine, followed by annotation of their genes. The genome size for A. ochraceoroseus was found to be approximately 23 Mb and contained 7,837 genes, while the A. rambellii genome was found to be...
Draft genome analysis provides insights into the fiber yield, crude protein biosynthesis, and vegetative growth of domesticated ramie (Boehmeria nivea L. Gaud).

PubMed

Liu, Chan; Zeng, Liangbin; Zhu, Siyuan; Wu, Lingqing; Wang, Yanzhou; Tang, Shouwei; Wang, Hongwu; Zheng, Xia; Zhao, Jian; Chen, Xiaorong; Dai, Qiuzhong; Liu, Touming

2017-11-15

Plentiful bast fiber, a high crude protein content, and vigorous vegetative growth make ramie a popular fiber and forage crop. Here, we report the draft genome of ramie, along with a genomic comparison and evolutionary analysis. The draft genome contained a sequence of approximately 335.6 Mb with 42,463 predicted genes. A high-density genetic map with 4,338 single nucleotide polymorphisms (SNPs) was developed and used to anchor the genome sequence, thus, creating an integrated genetic and physical map containing a 58.2-Mb genome sequence and 4,304 molecular markers. A genomic comparison identified 1,075 unique gene families in ramie, containing 4,082 genes. Among these unique genes, five were cellulose synthase genes that were specifically expressed in stem bark, and 3 encoded a WAT1-related protein, suggesting that they are probably related to high bast fiber yield. An evolutionary analysis detected 106 positively selected genes, 22 of which were related to nitrogen metabolism, indicating that they are probably responsible for the crude protein content and vegetative growth of domesticated varieties. This study is the first to characterize the genome and develop a high-density genetic map of ramie and provides a basis for the genetic and molecular study of this crop. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Draft Genome Sequence of Arthrobacter sp. Strain SPG23, a Hydrocarbon-Degrading and Plant Growth-Promoting Soil Bacterium.

PubMed

Gkorezis, Panagiotis; Bottos, Eric M; Van Hamme, Jonathan D; Thijs, Sofie; Rineau, Francois; Franzetti, Andrea; Balseiro-Romero, Maria; Weyens, Nele; Vangronsveld, Jaco

2015-12-23

We report here the 4.7-Mb draft genome of Arthrobacter sp. SPG23, a hydrocarbonoclastic Gram-positive bacterium belonging to the Actinobacteria, isolated from diesel-contaminated soil at the Ford Motor Company site in Genk, Belgium. Strain SPG23 is a potent plant growth promoter useful for diesel fuel remediation applications based on plant-bacterium associations. Copyright © 2015 Gkorezis et al.
Draft Genome Sequence of Bacillus licheniformis Strain GB2, a Hydrocarbon-Degrading and Plant Growth-Promoting Soil Bacterium.

PubMed

Gkorezis, Panagiotis; Van Hamme, Jonathan; Bottos, Eric; Thijs, Sofie; Balseiro-Romero, Maria; Monterroso, Carmela; Kidd, Petra Suzan; Rineau, Francois; Weyens, Nele; Sillen, Wouter; Vangronsveld, Jaco

2016-06-23

We report the 4.39 Mb draft genome of Bacillus licheniformis GB2, a hydrocarbonoclastic Gram-positive bacterium of the family Bacillaceae, isolated from diesel-contaminated soil at the Ford Motor Company site in Genk, Belgium. Strain GB2 is an effective plant-growth promoter useful for diesel fuel remediation applications based on plant-bacterium associations. Copyright © 2016 Gkorezis et al.
Draft Genome Sequence of a "Candidatus Brocadia" Bacterium Enriched from Activated Sludge Collected in a Tropical Climate.

PubMed

Liu, Xianghui; Arumugam, Krithika; Natarajan, Gayathri; Seviour, Thomas W; Drautz-Moses, Daniela I; Wuertz, Stefan; Law, Yingyu; Williams, Rohan B H

2018-05-10

Here, we present the draft genome sequence of an anaerobic ammonium-oxidizing bacterium (AnAOB), " Candidatus Brocadia," which was enriched in an anammox reactor. A 3.2-Mb genome sequence comprising 168 contigs was assembled, in which 2,765 protein-coding genes, 47 tRNAs, and one each of 5S, 16S, and 23S rRNAs were annotated. No evidence for the presence of a nitric oxide-forming nitrite reductase was found. Copyright © 2018 Liu et al.
Draft Genome Sequence of Pseudomonas pachastrellae Strain CCUG 46540T, a Deep-Sea Bacterium.

PubMed

Gomila, Margarita; Mulet, Magdalena; Lalucat, Jorge; García-Valdés, Elena

2017-04-06

Pseudomonas pachastrellae strain CCUG 46540 T (KMM 330 T ) was isolated from a deep-sea sponge specimen collected in the Philippine Sea at a depth of 750 m. The draft genome has an estimated size of 4.0 Mb, exhibits a G+C content of 61.2 mol%, and is predicted to encode 3,592 proteins, including pathways for the degradation of aromatic compounds. Copyright © 2017 Gomila et al.
Draft Genome Sequence of Pseudomonas oceani DSM 100277T, a Deep-Sea Bacterium.

PubMed

García-Valdés, Elena; Gomila, Margarita; Mulet, Magdalena; Lalucat, Jorge

2018-04-12

Pseudomonas oceani DSM 100277 T was isolated from deep seawater in the Okinawa Trough at 1390 m. P. oceani belongs to the Pseudomonas pertucinogena group. Here, we report the draft genome sequence of P. oceani , which has an estimated size of 4.1 Mb and exhibits 3,790 coding sequences, with a G+C content of 59.94 mol%. Copyright © 2018 García-Valdés et al.
Draft Genome Sequence of Pseudomonas pachastrellae Strain CCUG 46540T, a Deep-Sea Bacterium

PubMed Central

2017-01-01

ABSTRACT Pseudomonas pachastrellae strain CCUG 46540T (KMM 330T) was isolated from a deep-sea sponge specimen collected in the Philippine Sea at a depth of 750 m. The draft genome has an estimated size of 4.0 Mb, exhibits a G+C content of 61.2 mol%, and is predicted to encode 3,592 proteins, including pathways for the degradation of aromatic compounds. PMID:28385850
Draft Genome Sequence of Pantoea ananatis GB1, a Plant-Growth-Promoting Hydrocarbonoclastic Root Endophyte, Isolated at a Diesel Fuel Phytoremediation Site Planted with Populus.

PubMed

Gkorezis, Panagiotis; Van Hamme, Jonathan D; Bottos, Eric M; Thijs, Sofie; Balseiro-Romero, Maria; Monterroso, Carmela; Kidd, Petra Suzan; Rineau, Francois; Weyens, Nele; Vangronsveld, Jaco

2016-02-25

We report the 4.76-Mb draft genome of Pantoea ananatis GB1, a Gram-negative bacterium of the family Enterobacteriaceae, isolated from the roots of poplars planted for phytoremediation of a diesel-contaminated plume at the Ford Motor Company site in Genk, Belgium. Strain GB1 promotes plant growth in various hosts and metabolizes hydrocarbons. Copyright © 2016 Gkorezis et al.
Draft Genome Sequence of a Hexachlorocyclohexane-Degrading Bacterium, Sphingobium baderi Strain LL03T

PubMed Central

Kaur, Jasvinder; Verma, Helianthous; Tripathi, Charu; Khurana, J. P.

2013-01-01

Sphingobium baderi strain LL03T was isolated from hexachlorocyclohexane (HCH)-contaminated soil from Spolana, Czech Republic. Strain LL03T is a mutant that is deficient in linB and linC (genes that encode hexachlorocyclohexane haloalkane dehalogenase and dehydrogenase, respectively). The draft genome sequence of LL03T (~4.85 Mb) consists of 92 contigs and 4,914 coding sequences, with a G+C content of 63.5%. PMID:24051322
Draft Genome Sequence of Janthinobacterium sp. Ant5-2-1, Isolated from Proglacial Lake Podprudnoye in the Schirmacher Oasis of East Antarctica.

PubMed

Koo, Hyunmin; Strope, Bailey M; Kim, Eddy H; Shabani, Adel M; Kumar, Ranjit; Crowley, Michael R; Andersen, Dale T; Bej, Asim K

2016-01-21

Janthinobacterium sp. Ant5-2-1, isolated from the Schirmacher Oasis of East Antarctica, produces a purple-violet pigment, manifests diverse energy metabolism abilities, and tolerates cold, ultraviolet radiation, and other environmental stressors. We report here the 6.19-Mb draft genome of strain Ant5-2-1, which will help understand its survival mechanisms in extreme Antarctic ecosystems. Copyright © 2016 Koo et al.
Draft Genome Sequence of Hymenobacter sp. Strain IS2118, Isolated from a Freshwater Lake in Schirmacher Oasis, Antarctica, Reveals Diverse Genes for Adaptation to Cold Ecosystems

PubMed Central

Ptacek, Travis; Crowley, Michael; Swain, Ashit K.; Osborne, John D.; Bej, Asim K.; Andersen, Dale T.

2014-01-01

Hymenobacter sp. IS2118, isolated from a freshwater lake in Schirmacher Oasis, Antarctica, produces extracellular polymeric substance (EPS) and manifests tolerance to cold, UV radiation (UVR), and oxidative stress. We report the 5.26-Mb draft genome of strain IS2118, which will help us to understand its adaptation and survival mechanisms in Antarctic extreme ecosystems. PMID:25103756
High-quality genome of the peach scab pathogen, Venturia carpophila

USDA-ARS?s Scientific Manuscript database

Venturia carpophila causes peach scab, a disease that renders peach (Prunus persica) fruit unmarketable. We report a high-quality draft genome (36.9 Mb) of V. carpophila from an isolate collected from a peach tree in central Georgia. The genome was sequenced by MiSeq using an Illumina paired-end lib...
Genome Sequence of a Chromium-Reducing Strain, Bacillus cereus S612

DOE PAGES

Wang, Dongping; Boukhalfa, Hakim; Ware, Doug S.; ...

2015-12-10

We report here the genome sequence of an effective chromium-reducing bacterium,Bacillus cereusstrain S612. We found that the size of the draft genome sequence is approximately 5.4 Mb, with a G+C content of 35%, and it is predicted to contain 5,450 protein-coding genes.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Chauhan, Archana; Layton, Alice; Williams, Daniel W

Pseudomonas fluorescens strain HK44 (DSM 6700) is a genetically engineered lux-based bioluminescent bioreporter. Here we report the draft genome sequence of strain HK44. Annotation of {approx}6.1 Mb sequence indicates that 30% of the traits are unique and distributed over 5 genomic islands, a prophage and two plasmids.

Draft genome of the protandrous Chinese black porgy, Acanthopagrus schlegelii.

PubMed

Zhang, Zhiyong; Zhang, Kai; Chen, Shuyin; Zhang, Zhiwei; Zhang, Jinyong; You, Xinxin; Bian, Chao; Xu, Jin; Jia, Chaofeng; Qiang, Jun; Zhu, Fei; Li, Hongxia; Liu, Hailin; Shen, Dehua; Ren, Zhonghong; Chen, Jieming; Li, Jia; Gao, Tianheng; Gu, Ruobo; Xu, Junmin; Shi, Qiong; Xu, Pao

2018-04-01

As one of the most popular and valuable commercial marine fishes in China and East Asian countries, the Chinese black porgy (Acanthopagrus schlegelii), also known as the blackhead seabream, has some attractive characteristics such as fast growth rate, good meat quality, resistance to diseases, and excellent adaptability to various environments. Furthermore, the black porgy is a good model for investigating sex changes in fish due to its protandrous hermaphroditism. Here, we obtained a high-quality genome assembly of this interesting teleost species and performed a genomic survey on potential genes associated with the sex-change phenomenon. We generated 175.4 gigabases (Gb) of clean sequence reads using a whole-genome shotgun sequencing strategy. The final genome assembly is approximately 688.1 megabases (Mb), accounting for 93% of the estimated genome size (739.6 Mb). The achieved scaffold N50 is 7.6 Mb, reaching a relatively high level among sequenced fish species. We identified 19 465 protein-coding genes, which had an average transcript length of 17.3 kb. By performing a comparative genomic analysis, we found 3 types of genes potentially associated with sex change, which are useful for studying the genetic basis of the protandrous hermaphroditism. We provide a draft genome assembly of the Chinese black porgy and discuss the potential genetic mechanisms of sex change. These data are also an important resource for studying the biology and for facilitating breeding of this economically important fish.
Draft Genome sequence of Frankia sp. strains CN3 , an atypical, non-infective (Nod-) ineffective (Fix-) isolate from Coriaria nepalensis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ghodhbane-Gtari, Faten; Beauchemin, Nicholas; Bruce, David

2013-01-01

We report here the genome sequence of Frankia sp. strain CN3, which was isolated from Coriaria nepalensis. This genome sequence is the first from the fourth lineage of Frankia, that are unable to re-infect actinorhizal plants. At 10 Mb, it represents the largest Frankia genome sequenced to date.
Genome Sequence of Enterohemorrhagic Escherichia coli NCCP15658

PubMed Central

Song, Ju Yeon; Yoo, Ran Hee; Jang, Song Yee; Seong, Won-Keun; Kim, Seon-Young; Jeong, Haeyoung; Kang, Sung Gyun; Kim, Byung Kwon; Kwon, Soon-Kyeong; Lee, Choong Hoon; Yu, Dong Su; Park, Mi-Sun

2012-01-01

Enterohemorrhagic Escherichia coli causes severe food-borne disease in the guts of humans and animals. Here, we report the high-quality draft genome sequence of E. coli NCCP15658 isolated from a patient in the Republic of Korea. Its genome size was determined to be 5.46 Mb, and its genomic features, including genes encoding virulence factors, were analyzed. PMID:22740673
Draft Genome Sequence of Hymenobacter sp. Strain IS2118, Isolated from a Freshwater Lake in Schirmacher Oasis, Antarctica, Reveals Diverse Genes for Adaptation to Cold Ecosystems.

PubMed

Koo, Hyunmin; Ptacek, Travis; Crowley, Michael; Swain, Ashit K; Osborne, John D; Bej, Asim K; Andersen, Dale T

2014-08-07

Hymenobacter sp. IS2118, isolated from a freshwater lake in Schirmacher Oasis, Antarctica, produces extracellular polymeric substance (EPS) and manifests tolerance to cold, UV radiation (UVR), and oxidative stress. We report the 5.26-Mb draft genome of strain IS2118, which will help us to understand its adaptation and survival mechanisms in Antarctic extreme ecosystems. Copyright © 2014 Koo et al.
Draft Genome Sequence of a Multidrug- and Colistin-Resistant mcr-1-Producing Escherichia coli Isolate from a Swine Farm in Mexico

PubMed Central

Garza-Ramos, Ulises; Tamayo-Legorreta, Elsa; Arellano-Quintanilla, Doris María; Rodriguez-Medina, Nadia; Silva-Sanchez, Jesús; Catalan-Najera, Juan; Rocha-Martínez, Marisol Karina; Bravo-Díaz, María Asunción

2018-01-01

ABSTRACT A colistin-resistant mcr-1-carrying Escherichia coli strain, RC2-007, was isolated from a swine farm in Mexico. This extraintestinal and uropathogenic strain of E. coli belongs to serotype O89:H9 and sequence type 744. Assembly and annotation resulted in a 4.9-Mb draft genome that revealed the presence of plasmid-mediated mcr-1-ISApI1 genes as part of a prophage. PMID:29519827
Draft Genome Sequence of Acinetobacter oleivorans PF1, a Diesel-Degrading and Plant-Growth-Promoting Endophytic Strain Isolated from Poplar Trees Growing on a Diesel-Contaminated Plume.

PubMed

Gkorezis, Panagiotis; Rineau, Francois; Van Hamme, Jonathan; Franzetti, Andrea; Daghio, Matteo; Thijs, Sofie; Weyens, Nele; Vangronsveld, Jaco

2015-02-05

We report the 3.7-Mb draft genome of Acinetobacter oleivorans strain PF1, a hydrocarbonoclastic Gram-negative bacterium in the class Gammaproteobacteria, isolated from poplar trees growing on a diesel-contaminated plume at the Ford Motor Company site in Genk, Belgium. Strain PF1 is a potent plant-growth promoter, useful for diesel fuel phytoremediation applications. Copyright © 2015 Gkorezis et al.
Genome Sequence of Actinobacillus seminis Strain ATCC 15768, a Reference Strain of Ovine Pathogens That Causes Infections in Reproductive Organs

PubMed Central

Negrete-Abascal, Erasmo; Montes-Garcia, Fernando; Vaca-Pacheco, Sergio; Leyto-Gil, Abraham M.; Fragoso-Garcia, Edgar; Carvente-Garcia, Roberto; Perez-Agueros, Sandra; Castelan-Sanchez, Hugo G.; Garcia-Molina, Alejandra; Villamar, Tomas E.; Sánchez-Alonso, Patricia

2018-01-01

ABSTRACT The draft genome sequence of Actinobacillus seminis strain ATCC 15768 is reported here. The genome comprises 22 contigs corresponding to 2.36 Mb with 40.7% G+C content and contains several genes related to virulence, including a putative RTX protein. PMID:29326222
Deciphering the Genome Sequences of the Hydrophobic Cyanobacterium Scytonema tolypothrichoides VB-61278

PubMed Central

Das, Abhishek; Panda, Arijit; Singh, Deeksha; Chandrababunaidu, Mathu Malar; Mishra, Gyan Prakash; Bhan, Sushma

2015-01-01

Scytonema tolypothrichoides VB-61278, a terrestrial cyanobacterium, can be exploited to produce commercially important products. Here, we report for the first time a 10-Mb draft genome assembly of S. tolypothrichoides VB-61278, with 214 scaffolds and 7,148 putative protein-coding genes. PMID:25838486
The genome of black raspberry (Rubus occidentalis)

USDA-ARS?s Scientific Manuscript database

Black raspberry (Rubus occidentalis) is an important specialty fruit crop in the U.S. Pacific Northwest that can hybridize with the globally commercialized red raspberry (R. idaeus). Here we report a 243 Mb draft genome of black raspberry that will serve as a useful reference for the Rosaceae and Ru...
IMA Genome-F 3: Draft genomes of Amanita jacksonii, Ceratocystis albifundus, Fusarium circinatum, Huntiella omanensis, Leptographium procerum, Rutstroemia sydowiana, and Sclerotinia echinophila.

PubMed

van der Nest, Magriet A; Beirn, Lisa A; Crouch, Jo Anne; Demers, Jill E; de Beer, Z Wilhelm; De Vos, Lieschen; Gordon, Thomas R; Moncalvo, Jean-Marc; Naidoo, Kershney; Sanchez-Ramirez, Santiago; Roodt, Danielle; Santana, Quentin C; Slinski, Stephanie L; Stata, Matt; Taerum, Stephen J; Wilken, P Markus; Wilson, Andrea M; Wingfield, Michael J; Wingfield, Brenda D

2014-12-01

The genomes of fungi provide an important resource to resolve issues pertaining to their taxonomy, biology, and evolution. The genomes of Amanita jacksonii, Ceratocystis albifundus, a Fusarium circinatum variant, Huntiella omanensis, Leptographium procerum, Sclerotinia echinophila, and Rutstroemia sydowiana are presented in this genome announcement. These seven genomes are from a number of fungal pathogens and economically important species. The genome sizes range from 27 Mb in the case of Ceratocystis albifundus to 51.9 Mb for Rutstroemia sydowiana. The latter also encodes for a predicted 17 350 genes, more than double that of Ceratocystis albifundus. These genomes will add to the growing body of knowledge of these fungi and provide a value resource to researchers studying these fungi.
Elucidating the triplicated ancestral genome structure of radish based on chromosome-level comparison with the Brassica genomes.

PubMed

Jeong, Young-Min; Kim, Namshin; Ahn, Byung Ohg; Oh, Mijin; Chung, Won-Hyong; Chung, Hee; Jeong, Seongmun; Lim, Ki-Byung; Hwang, Yoon-Jung; Kim, Goon-Bo; Baek, Seunghoon; Choi, Sang-Bong; Hyung, Dae-Jin; Lee, Seung-Won; Sohn, Seong-Han; Kwon, Soo-Jin; Jin, Mina; Seol, Young-Joo; Chae, Won Byoung; Choi, Keun Jin; Park, Beom-Seok; Yu, Hee-Ju; Mun, Jeong-Hwan

2016-07-01

This study presents a chromosome-scale draft genome sequence of radish that is assembled into nine chromosomal pseudomolecules. A comprehensive comparative genome analysis with the Brassica genomes provides genomic evidences on the evolution of the mesohexaploid radish genome. Radish (Raphanus sativus L.) is an agronomically important root vegetable crop and its origin and phylogenetic position in the tribe Brassiceae is controversial. Here we present a comprehensive analysis of the radish genome based on the chromosome sequences of R. sativus cv. WK10039. The radish genome was sequenced and assembled into 426.2 Mb spanning >98 % of the gene space, of which 344.0 Mb were integrated into nine chromosome pseudomolecules. Approximately 36 % of the genome was repetitive sequences and 46,514 protein-coding genes were predicted and annotated. Comparative mapping of the tPCK-like ancestral genome revealed that the radish genome has intermediate characteristics between the Brassica A/C and B genomes in the triplicated segments, suggesting an internal origin from the genus Brassica. The evolutionary characteristics shared between radish and other Brassica species provided genomic evidences that the current form of nine chromosomes in radish was rearranged from the chromosomes of hexaploid progenitor. Overall, this study provides a chromosome-scale draft genome sequence of radish as well as novel insight into evolution of the mesohexaploid genomes in the tribe Brassiceae.
Draft genome sequence of Sugiyamaella xylanicola UFMG-CM-Y1884T, a xylan-degrading yeast species isolated from rotting wood samples in Brazil.

PubMed

Batista, Thiago M; Moreira, Rennan G; Hilário, Heron O; Morais, Camila G; Franco, Glória R; Rosa, Luiz H; Rosa, Carlos A

2017-03-01

We present the draft genome sequence of the type strain of the yeast Sugiyamaella xylanicola UFMG-CM-Y1884 T (= UFMG-CA-32.1 T = CBS 12683 T ), a xylan-degrading species capable of fermenting d-xylose to ethanol. The assembled genome has a size of ~ 13.7 Mb and a GC content of 33.8% and contains 5971 protein-coding genes. We identified 15 genes with significant similarity to the d-xylose reductase gene from several other fungal species. The draft genome assembled from whole-genome shotgun sequencing of the yeast Sugiyamaella xylanicola UFMG-CM-Y1884 T (= UFMG-CA-32.1 T = CBS 12683 T ) has been deposited at DDBJ/ENA/GenBank under the accession number MQSX00000000 under version MQSX01000000.
The draft genome sequence of cork oak

PubMed Central

Ramos, António Marcos; Usié, Ana; Barbosa, Pedro; Barros, Pedro M.; Capote, Tiago; Chaves, Inês; Simões, Fernanda; Abreu, Isabl; Carrasquinho, Isabel; Faro, Carlos; Guimarães, Joana B.; Mendonça, Diogo; Nóbrega, Filomena; Rodrigues, Leandra; Saibo, Nelson J. M.; Varela, Maria Carolina; Egas, Conceição; Matos, José; Miguel, Célia M.; Oliveira, M. Margarida; Ricardo, Cândido P.; Gonçalves, Sónia

2018-01-01

Cork oak (Quercus suber) is native to southwest Europe and northwest Africa where it plays a crucial environmental and economical role. To tackle the cork oak production and industrial challenges, advanced research is imperative but dependent on the availability of a sequenced genome. To address this, we produced the first draft version of the cork oak genome. We followed a de novo assembly strategy based on high-throughput sequence data, which generated a draft genome comprising 23,347 scaffolds and 953.3 Mb in size. A total of 79,752 genes and 83,814 transcripts were predicted, including 33,658 high-confidence genes. An InterPro signature assignment was detected for 69,218 transcripts, which represented 82.6% of the total. Validation studies demonstrated the genome assembly and annotation completeness and highlighted the usefulness of the draft genome for read mapping of high-throughput sequence data generated using different protocols. All data generated is available through the public databases where it was deposited, being therefore ready to use by the academic and industry communities working on cork oak and/or related species. PMID:29786699
The draft genome sequence of cork oak.

PubMed

Ramos, António Marcos; Usié, Ana; Barbosa, Pedro; Barros, Pedro M; Capote, Tiago; Chaves, Inês; Simões, Fernanda; Abreu, Isabl; Carrasquinho, Isabel; Faro, Carlos; Guimarães, Joana B; Mendonça, Diogo; Nóbrega, Filomena; Rodrigues, Leandra; Saibo, Nelson J M; Varela, Maria Carolina; Egas, Conceição; Matos, José; Miguel, Célia M; Oliveira, M Margarida; Ricardo, Cândido P; Gonçalves, Sónia

2018-05-22

Cork oak (Quercus suber) is native to southwest Europe and northwest Africa where it plays a crucial environmental and economical role. To tackle the cork oak production and industrial challenges, advanced research is imperative but dependent on the availability of a sequenced genome. To address this, we produced the first draft version of the cork oak genome. We followed a de novo assembly strategy based on high-throughput sequence data, which generated a draft genome comprising 23,347 scaffolds and 953.3 Mb in size. A total of 79,752 genes and 83,814 transcripts were predicted, including 33,658 high-confidence genes. An InterPro signature assignment was detected for 69,218 transcripts, which represented 82.6% of the total. Validation studies demonstrated the genome assembly and annotation completeness and highlighted the usefulness of the draft genome for read mapping of high-throughput sequence data generated using different protocols. All data generated is available through the public databases where it was deposited, being therefore ready to use by the academic and industry communities working on cork oak and/or related species.
The Genome Sequence of Avibacterium paragallinarum Strain CL Has a Large Repertoire of Insertion Sequence Elements.

PubMed

Horta-Valerdi, Guillermo; Sanchez-Alonso, Maria Patricia; Perez-Marquez, Victor M; Negrete-Abascal, Erasmo; Vaca-Pacheco, Sergio; Hernandez-Gonzalez, Ismael; Gomez-Lunar, Zulema; Olmedo-Álvarez, Gabriela; Vázquez-Cruz, Candelario

2017-04-13

The draft genome sequence of Avibacterium paragallinarum strain CL serovar C is reported here. The genome comprises 154 contigs corresponding to 2.4 Mb with 41% G+C content and many insertion sequence (IS) elements, a characteristic not previously reported in A. paragallinarum . Copyright © 2017 Horta-Valerdi et al.
Genome Sequence of Actinobacillus seminis Strain ATCC 15768, a Reference Strain of Ovine Pathogens That Causes Infections in Reproductive Organs.

PubMed

Negrete-Abascal, Erasmo; Montes-Garcia, Fernando; Vaca-Pacheco, Sergio; Leyto-Gil, Abraham M; Fragoso-Garcia, Edgar; Carvente-Garcia, Roberto; Perez-Agueros, Sandra; Castelan-Sanchez, Hugo G; Garcia-Molina, Alejandra; Villamar, Tomas E; Sánchez-Alonso, Patricia; Vazquez-Cruz, Candelario

2018-01-11

The draft genome sequence of Actinobacillus seminis strain ATCC 15768 is reported here. The genome comprises 22 contigs corresponding to 2.36 Mb with 40.7% G+C content and contains several genes related to virulence, including a putative RTX protein. Copyright © 2018 Negrete-Abascal et al.
Draft genome of neurotropic nematode parasite Angiostrongylus cantonensis, causative agent of human eosinophilic meningitis.

PubMed

Yong, Hoi-Sen; Eamsobhana, Praphathip; Lim, Phaik-Eem; Razali, Rozaimi; Aziz, Farhanah Abdul; Rosli, Nurul Shielawati Mohamed; Poole-Johnson, Johan; Anwar, Arif

2015-08-01

Angiostrongylus cantonensis is a bursate nematode parasite that causes eosinophilic meningitis (or meningoencephalitis) in humans in many parts of the world. The genomic data from A. cantonensis will form a useful resource for comparative genomic and chemogenomic studies to aid the development of diagnostics and therapeutics. We have sequenced, assembled and annotated the genome of A. cantonensis. The genome size is estimated to be ∼260 Mb, with 17,280 genomic scaffolds, 91X coverage, 81.45% for complete and 93.95% for partial score based on CEGMA analysis of genome completeness. The number of predicted genes of ≥300 bp was 17,482. A total of 7737 predicted protein-coding genes of ≥50 amino acids were identified in the assembled genome. Among the proteins of known function, kinases are the most abundant followed by transferases. The draft genome contains 34 excretory-secretory proteins (ES), a minimum of 44 Nematode Astacin (NAS) metalloproteases, 12 Homeobox (HOX) genes, and 30 neurotransmitters. The assembled genome size (260 Mb) is larger than those of Pristionchus pacificus, Caenorhabditis elegans, Necator americanus, Caenorhabditis briggsae, Trichinella spiralis, Brugia malayi and Loa loa, but smaller than Haemonchus contortus and Ascaris suum. The repeat content (25%) is similar to H. contortus. The GC content (41.17%) is lower compared to P. pacificus (42.7%) and H. contortus (43.1%) but higher compared to C. briggsae (37.69%), A. suum (37.9%) and N. americanus (40.2%) while the scaffold N50 is 42,191. This draft genome will facilitate the understanding of many unresolved issues on the parasite and the disorder it causes. Copyright © 2015 Elsevier B.V. All rights reserved.
Deciphering the Genome Sequences of the Hydrophobic Cyanobacterium Scytonema tolypothrichoides VB-61278.

PubMed

Das, Abhishek; Panda, Arijit; Singh, Deeksha; Chandrababunaidu, Mathu Malar; Mishra, Gyan Prakash; Bhan, Sushma; Adhikary, Siba Prasad; Tripathy, Sucheta

2015-04-02

Scytonema tolypothrichoides VB-61278, a terrestrial cyanobacterium, can be exploited to produce commercially important products. Here, we report for the first time a 10-Mb draft genome assembly of S. tolypothrichoides VB-61278, with 214 scaffolds and 7,148 putative protein-coding genes. Copyright © 2015 Das et al.
Draft genome sequence of chickpea (Cicer arietinum) provides a resource for trait improvement

USDA-ARS?s Scientific Manuscript database

Chickpea (Cicer arietinum) is the world’s second most important grain legume crop, accounting for a significant proportion of human dietary protein and playing a critical role in food security in developing countries. We report the sequence of the ~738 Mb kabuli (CDC Frontier) chickpea genome, which...
Draft Sequences of the Radish (Raphanus sativus L.) Genome

PubMed Central

Kitashiba, Hiroyasu; Li, Feng; Hirakawa, Hideki; Kawanabe, Takahiro; Zou, Zhongwei; Hasegawa, Yoichi; Tonosaki, Kaoru; Shirasawa, Sachiko; Fukushima, Aki; Yokoi, Shuji; Takahata, Yoshihito; Kakizaki, Tomohiro; Ishida, Masahiko; Okamoto, Shunsuke; Sakamoto, Koji; Shirasawa, Kenta; Tabata, Satoshi; Nishio, Takeshi

2014-01-01

Radish (Raphanus sativus L., n = 9) is one of the major vegetables in Asia. Since the genomes of Brassica and related species including radish underwent genome rearrangement, it is quite difficult to perform functional analysis based on the reported genomic sequence of Brassica rapa. Therefore, we performed genome sequencing of radish. Short reads of genomic sequences of 191.1 Gb were obtained by next-generation sequencing (NGS) for a radish inbred line, and 76,592 scaffolds of ≥300 bp were constructed along with the bacterial artificial chromosome-end sequences. Finally, the whole draft genomic sequence of 402 Mb spanning 75.9% of the estimated genomic size and containing 61,572 predicted genes was obtained. Subsequently, 221 single nucleotide polymorphism markers and 768 PCR-RFLP markers were used together with the 746 markers produced in our previous study for the construction of a linkage map. The map was combined further with another radish linkage map constructed mainly with expressed sequence tag-simple sequence repeat markers into a high-density integrated map of 1,166 cM with 2,553 DNA markers. A total of 1,345 scaffolds were assigned to the linkage map, spanning 116.0 Mb. Bulked PCR products amplified by 2,880 primer pairs were sequenced by NGS, and SNPs in eight inbred lines were identified. PMID:24848699

Draft genome of the lined seahorse, Hippocampus erectus.

PubMed

Lin, Qiang; Qiu, Ying; Gu, Ruobo; Xu, Meng; Li, Jia; Bian, Chao; Zhang, Huixian; Qin, Geng; Zhang, Yanhong; Luo, Wei; Chen, Jieming; You, Xinxin; Fan, Mingjun; Sun, Min; Xu, Pao; Venkatesh, Byrappa; Xu, Junming; Fu, Hongtuo; Shi, Qiong

2017-06-01

The lined seahorse, Hippocampus erectus , is an Atlantic species and mainly inhabits shallow sea beds or coral reefs. It has become very popular in China for its wide use in traditional Chinese medicine. In order to improve the aquaculture yield of this valuable fish species, we are trying to develop genomic resources for assistant selection in genetic breeding. Here, we provide whole genome sequencing, assembly, and gene annotation of the lined seahorse, which can enrich genome resource and further application for its molecular breeding. A total of 174.6 Gb (Gigabase) raw DNA sequences were generated by the Illumina Hiseq2500 platform. The final assembly of the lined seahorse genome is around 458 Mb, representing 94% of the estimated genome size (489 Mb by k-mer analysis). The contig N50 and scaffold N50 reached 14.57 kb and 1.97 Mb, respectively. Quality of the assembled genome was assessed by BUSCO with prediction of 85% of the known vertebrate genes and evaluated using the de novo assembled RNA-seq transcripts to prove a high mapping ratio (more than 99% transcripts could be mapped to the assembly). Using homology-based, de novo and transcriptome-based prediction methods, we predicted 20 788 protein-coding genes in the generated assembly, which is less than our previously reported gene number (23 458) of the tiger tail seahorse ( H. comes ). We report a draft genome of the lined seahorse. These generated genomic data are going to enrich genome resource of this economically important fish, and also provide insights into the genetic mechanisms of its iconic morphology and male pregnancy behavior. © The Authors 2017. Published by Oxford University Press.
Draft genome of the lined seahorse, Hippocampus erectus

PubMed Central

Lin, Qiang; Qiu, Ying; Gu, Ruobo; Xu, Meng; Li, Jia; Bian, Chao; Zhang, Huixian; Qin, Geng; Zhang, Yanhong; Luo, Wei; Chen, Jieming; You, Xinxin; Fan, Mingjun; Sun, Min; Xu, Pao; Venkatesh, Byrappa

2017-01-01

Abstract Background: The lined seahorse, Hippocampus erectus, is an Atlantic species and mainly inhabits shallow sea beds or coral reefs. It has become very popular in China for its wide use in traditional Chinese medicine. In order to improve the aquaculture yield of this valuable fish species, we are trying to develop genomic resources for assistant selection in genetic breeding. Here, we provide whole genome sequencing, assembly, and gene annotation of the lined seahorse, which can enrich genome resource and further application for its molecular breeding. Findings: A total of 174.6 Gb (Gigabase) raw DNA sequences were generated by the Illumina Hiseq2500 platform. The final assembly of the lined seahorse genome is around 458 Mb, representing 94% of the estimated genome size (489 Mb by k-mer analysis). The contig N50 and scaffold N50 reached 14.57 kb and 1.97 Mb, respectively. Quality of the assembled genome was assessed by BUSCO with prediction of 85% of the known vertebrate genes and evaluated using the de novo assembled RNA-seq transcripts to prove a high mapping ratio (more than 99% transcripts could be mapped to the assembly). Using homology-based, de novo and transcriptome-based prediction methods, we predicted 20 788 protein-coding genes in the generated assembly, which is less than our previously reported gene number (23 458) of the tiger tail seahorse (H. comes). Conclusion: We report a draft genome of the lined seahorse. These generated genomic data are going to enrich genome resource of this economically important fish, and also provide insights into the genetic mechanisms of its iconic morphology and male pregnancy behavior. PMID:28444302
Draft Genome Sequences of Lactobacillus equicursoris CIP 110162T and Lactobacillus sp. Strain CRBIP 24.137, Isolated from Thoroughbred Racehorse Feces and Human Urine, Respectively.

PubMed

Cousin, Sylvie; Loux, Valentin; Ma, Laurence; Creno, Sophie; Clermont, Dominique; Bizet, Chantal; Bouchier, Christiane

2013-08-22

We report the draft genome sequences of strain Lactobacillus equicursoris CIP 110162(T), isolated from racehorse breed feces, and Lactobacillus sp. strain CRBIP 24.137, isolated from human urine; the two strains are closely related. The total lengths of the 116 and 62 scaffolds are about 2.157 and 2.358 Mb, with G+C contents of 46 and 45% and 2,279 and 2,342 coding sequences (CDSs), respectively.
Draft Genome Sequence of a Multidrug- and Colistin-Resistant mcr-1-Producing Escherichia coli Isolate from a Swine Farm in Mexico.

PubMed

Garza-Ramos, Ulises; Tamayo-Legorreta, Elsa; Arellano-Quintanilla, Doris María; Rodriguez-Medina, Nadia; Silva-Sanchez, Jesús; Catalan-Najera, Juan; Rocha-Martínez, Marisol Karina; Bravo-Díaz, María Asunción; Alpuche-Aranda, Celia

2018-03-08

A colistin-resistant mcr-1 -carrying Escherichia coli strain, RC2-007, was isolated from a swine farm in Mexico. This extraintestinal and uropathogenic strain of E. coli belongs to serotype O89:H9 and sequence type 744. Assembly and annotation resulted in a 4.9-Mb draft genome that revealed the presence of plasmid-mediated mcr-1 -IS ApI1 genes as part of a prophage. Copyright © 2018 Garza-Ramos et al.
A reference genome of the European beech (Fagus sylvatica L.).

PubMed

Mishra, Bagdevi; Gupta, Deepak K; Pfenninger, Markus; Hickler, Thomas; Langer, Ewald; Nam, Bora; Paule, Juraj; Sharma, Rahul; Ulaszewski, Bartosz; Warmbier, Joanna; Burczyk, Jaroslaw; Thines, Marco

2018-06-01

The European beech is arguably the most important climax broad-leaved tree species in Central Europe, widely planted for its valuable wood. Here, we report the 542 Mb draft genome sequence of an up to 300-year-old individual (Bhaga) from an undisturbed stand in the Kellerwald-Edersee National Park in central Germany. Using a hybrid assembly approach, Illumina reads with short- and long-insert libraries, coupled with long Pacific Biosciences reads, we obtained an assembled genome size of 542 Mb, in line with flow cytometric genome size estimation. The largest scaffold was of 1.15 Mb, the N50 length was 145 kb, and the L50 count was 983. The assembly contained 0.12% of Ns. A Benchmarking with Universal Single-Copy Orthologs (BUSCO) analysis retrieved 94% complete BUSCO genes, well in the range of other high-quality draft genomes of trees. A total of 62,012 protein-coding genes were predicted, assisted by transcriptome sequencing. In addition, we are reporting an efficient method for extracting high-molecular-weight DNA from dormant buds, by which contamination by environmental bacteria and fungi was kept at a minimum. The assembled genome will be a valuable resource and reference for future population genomics studies on the evolution and past climate change adaptation of beech and will be helpful for identifying genes, e.g., involved in drought tolerance, in order to select and breed individuals to adapt forestry to climate change in Europe. A continuously updated genome browser and download page can be accessed from beechgenome.net, which will include future genome versions of the reference individual Bhaga, as new sequencing approaches develop.
Whole-Genome Sequences of Cronobacter sakazakii Isolates Obtained from Foods of Plant Origin and Dried-Food Manufacturing Environments.

PubMed

Jang, Hyein; Addy, Nicole; Ewing, Laura; Jean-Gilles Beaubrun, Junia; Lee, YouYoung; Woo, JungHa; Negrete, Flavia; Finkelstein, Samantha; Tall, Ben D; Lehner, Angelika; Eshwar, Athmanya; Gopinath, Gopal R

2018-04-12

Here, we present draft genome sequences of 29 Cronobacter sakazakii isolates obtained from foods of plant origin and dried-food manufacturing facilities. Assemblies and annotations resulted in genome sizes ranging from 4.3 to 4.5 Mb and 3,977 to 4,256 gene-coding sequences with G+C contents of ∼57.0%.
Genome Sequence of a Heterotrophic Nitrifier and Aerobic Denitrifier, Paracoccus denitrificans Strain ISTOD1, Isolated from Wastewater

PubMed Central

Medhi, Kristina; Mishra, Arti

2018-01-01

ABSTRACT We report here the draft genome sequence of Paracoccus denitrificans strain ISTOD1 of 4.9 Mb, isolated from wastewater. It has been identified as a heterotrophic nitrifying and aerobic denitrifying bacterium. Genomic analysis revealed genes related to nitrogen and phosphorus removal, showing that the strain holds potential for bioremediation and biorefinery uses. PMID:29650568
The Draft Genome Sequence of a Novel High-Efficient Butanol-Producing Bacterium Clostridium Diolis Strain WST.

PubMed

Chen, Chaoyang; Sun, Chongran; Wu, Yi-Rui

2018-03-21

A wild-type solventogenic strain Clostridium diolis WST, isolated from mangrove sediments, was characterized to produce high amount of butanol and acetone with negligible level of ethanol and acids from glucose via a unique acetone-butanol (AB) fermentation pathway. Through the genomic sequencing, the assembled draft genome of strain WST is calculated to be 5.85 Mb with a GC content of 29.69% and contains 5263 genes that contribute to the annotation of 5049 protein-coding sequences. Within these annotated genes, the butanol dehydrogenase gene (bdh) was determined to be in a higher amount from strain WST compared to other Clostridial strains, which is positively related to its high-efficient production of butanol. Therefore, we present a draft genome sequence analysis of strain WST in this article that should facilitate to further understand the solventogenic mechanism of this special microorganism.
Genome Sequence of Pseudomonas sp. Strain S9, an Extracellular Arylsulfatase-Producing Bacterium Isolated from Mangrove Soil ▿

PubMed Central

Long, Mengxian; Ruan, Lingwei; Yu, Ziniu; Xu, Xun

2011-01-01

Pseudomonas sp. strain S9 was originally isolated from mangrove soil in Xiamen, China. It is an aerobic bacterium which shows extracellular arylsulfatase activity. Here, we describe the 4.8-Mb draft genome sequence of Pseudomonas sp. S9, which exhibits novel cysteine-type sulfatases. PMID:21622746
High genome heterozygosity and endemic genetic recombination in the wheat stripe rust fungus

USDA-ARS?s Scientific Manuscript database

Stripe rust, caused by Puccinia striiformis f. sp. tritici (Pst), is one of the most destructive diseases of wheat. Here we report a 110-Mb draft sequence of Pst isolate CY32, obtained using a ‘fosmid-to-fosmid’ strategy, to better understand its race evolution and pathogenesis. The Pst genome is hi...
Genome Sequence of the Thermotolerant Yeast Kluyveromyces marxianus var. marxianus KCTC 17555

PubMed Central

Jeong, Haeyoung; Lee, Dae-Hee; Kim, Sun Hong; Kim, Hyun-Jin; Lee, Kyusang; Song, Ju Yeon; Kim, Byung Kwon; Sung, Bong Hyun; Sohn, Jung Hoon; Koo, Hyun Min

2012-01-01

Kluyveromyces marxianus is a thermotolerant yeast that has been explored for potential use in biotechnological applications, such as production of biofuels, single-cell proteins, enzymes, and other heterologous proteins. Here, we present the high-quality draft of the 10.9-Mb genome of K. marxianus var. marxianus KCTC 17555 (= CBS 6556 = ATCC 26548). PMID:23193140
Draft Genome Sequence of Nafulsella turpanensis ZLM-10T, a Novel Member of the Family Flammeovirgaceae

PubMed Central

Zhang, Lei; Si, Meiru; Zhu, Lingfang; Li, Changfu; Wei, Yahong

2014-01-01

Nafulsella turpanensis ZLM-10T is a slightly halophilic, Gram-negative, rod-shaped, gliding, pale-pink-pigmented bacterium in the family Flammeovirgaceae, and it shows resistance to gentamicin, kanamycin, neomycin, and streptomycin. Here, we report the genome sequence of N. turpanensis strain ZLM-10T, which has a 4.8-Mb genome and a G+C content of 45.67%. PMID:24699960
Whole-Genome Sequences of Cronobacter sakazakii Isolates Obtained from Foods of Plant Origin and Dried-Food Manufacturing Environments

PubMed Central

Addy, Nicole; Ewing, Laura; Jean-Gilles Beaubrun, Junia; Lee, YouYoung; Woo, JungHa; Negrete, Flavia; Finkelstein, Samantha; Tall, Ben D.; Lehner, Angelika; Eshwar, Athmanya; Gopinath, Gopal R.

2018-01-01

ABSTRACT Here, we present draft genome sequences of 29 Cronobacter sakazakii isolates obtained from foods of plant origin and dried-food manufacturing facilities. Assemblies and annotations resulted in genome sizes ranging from 4.3 to 4.5 Mb and 3,977 to 4,256 gene-coding sequences with G+C contents of ∼57.0%. PMID:29650569
Genome Sequence of a Heterotrophic Nitrifier and Aerobic Denitrifier, Paracoccus denitrificans Strain ISTOD1, Isolated from Wastewater.

PubMed

Medhi, Kristina; Mishra, Arti; Thakur, Indu Shekhar

2018-04-12

We report here the draft genome sequence of Paracoccus denitrificans strain ISTOD1 of 4.9 Mb, isolated from wastewater. It has been identified as a heterotrophic nitrifying and aerobic denitrifying bacterium. Genomic analysis revealed genes related to nitrogen and phosphorus removal, showing that the strain holds potential for bioremediation and biorefinery uses. Copyright © 2018 Medhi et al.
Draft Genome Sequence of Nafulsella turpanensis ZLM-10T, a Novel Member of the Family Flammeovirgaceae.

PubMed

Zhang, Lei; Si, Meiru; Zhu, Lingfang; Li, Changfu; Wei, Yahong; Shen, Xihui

2014-04-03

Nafulsella turpanensis ZLM-10(T) is a slightly halophilic, Gram-negative, rod-shaped, gliding, pale-pink-pigmented bacterium in the family Flammeovirgaceae, and it shows resistance to gentamicin, kanamycin, neomycin, and streptomycin. Here, we report the genome sequence of N. turpanensis strain ZLM-10(T), which has a 4.8-Mb genome and a G+C content of 45.67%.
A post-assembly genome-improvement toolkit (PAGIT) to obtain annotated genomes from contigs.

PubMed

Swain, Martin T; Tsai, Isheng J; Assefa, Samual A; Newbold, Chris; Berriman, Matthew; Otto, Thomas D

2012-06-07

Genome projects now produce draft assemblies within weeks owing to advanced high-throughput sequencing technologies. For milestone projects such as Escherichia coli or Homo sapiens, teams of scientists were employed to manually curate and finish these genomes to a high standard. Nowadays, this is not feasible for most projects, and the quality of genomes is generally of a much lower standard. This protocol describes software (PAGIT) that is used to improve the quality of draft genomes. It offers flexible functionality to close gaps in scaffolds, correct base errors in the consensus sequence and exploit reference genomes (if available) in order to improve scaffolding and generating annotations. The protocol is most accessible for bacterial and small eukaryotic genomes (up to 300 Mb), such as pathogenic bacteria, malaria and parasitic worms. Applying PAGIT to an E. coli assembly takes ∼24 h: it doubles the average contig size and annotates over 4,300 gene models.
De novo Assembly of a 40 Mb Eukaryotic Genome from Short Sequence Reads: Sordaria macrospora, a Model Organism for Fungal Morphogenesis

PubMed Central

Nowrousian, Minou; Stajich, Jason E.; Chu, Meiling; Engh, Ines; Espagne, Eric; Halliday, Karen; Kamerewerd, Jens; Kempken, Frank; Knab, Birgit; Kuo, Hsiao-Che; Osiewacz, Heinz D.; Pöggeler, Stefanie; Read, Nick D.; Seiler, Stephan; Smith, Kristina M.; Zickler, Denise; Kück, Ulrich; Freitag, Michael

2010-01-01

Filamentous fungi are of great importance in ecology, agriculture, medicine, and biotechnology. Thus, it is not surprising that genomes for more than 100 filamentous fungi have been sequenced, most of them by Sanger sequencing. While next-generation sequencing techniques have revolutionized genome resequencing, e.g. for strain comparisons, genetic mapping, or transcriptome and ChIP analyses, de novo assembly of eukaryotic genomes still presents significant hurdles, because of their large size and stretches of repetitive sequences. Filamentous fungi contain few repetitive regions in their 30–90 Mb genomes and thus are suitable candidates to test de novo genome assembly from short sequence reads. Here, we present a high-quality draft sequence of the Sordaria macrospora genome that was obtained by a combination of Illumina/Solexa and Roche/454 sequencing. Paired-end Solexa sequencing of genomic DNA to 85-fold coverage and an additional 10-fold coverage by single-end 454 sequencing resulted in ∼4 Gb of DNA sequence. Reads were assembled to a 40 Mb draft version (N50 of 117 kb) with the Velvet assembler. Comparative analysis with Neurospora genomes increased the N50 to 498 kb. The S. macrospora genome contains even fewer repeat regions than its closest sequenced relative, Neurospora crassa. Comparison with genomes of other fungi showed that S. macrospora, a model organism for morphogenesis and meiosis, harbors duplications of several genes involved in self/nonself-recognition. Furthermore, S. macrospora contains more polyketide biosynthesis genes than N. crassa. Phylogenetic analyses suggest that some of these genes may have been acquired by horizontal gene transfer from a distantly related ascomycete group. Our study shows that, for typical filamentous fungi, de novo assembly of genomes from short sequence reads alone is feasible, that a mixture of Solexa and 454 sequencing substantially improves the assembly, and that the resulting data can be used for comparative studies to address basic questions of fungal biology. PMID:20386741
De novo assembly of a 40 Mb eukaryotic genome from short sequence reads: Sordaria macrospora, a model organism for fungal morphogenesis.

PubMed

Nowrousian, Minou; Stajich, Jason E; Chu, Meiling; Engh, Ines; Espagne, Eric; Halliday, Karen; Kamerewerd, Jens; Kempken, Frank; Knab, Birgit; Kuo, Hsiao-Che; Osiewacz, Heinz D; Pöggeler, Stefanie; Read, Nick D; Seiler, Stephan; Smith, Kristina M; Zickler, Denise; Kück, Ulrich; Freitag, Michael

2010-04-08

Filamentous fungi are of great importance in ecology, agriculture, medicine, and biotechnology. Thus, it is not surprising that genomes for more than 100 filamentous fungi have been sequenced, most of them by Sanger sequencing. While next-generation sequencing techniques have revolutionized genome resequencing, e.g. for strain comparisons, genetic mapping, or transcriptome and ChIP analyses, de novo assembly of eukaryotic genomes still presents significant hurdles, because of their large size and stretches of repetitive sequences. Filamentous fungi contain few repetitive regions in their 30-90 Mb genomes and thus are suitable candidates to test de novo genome assembly from short sequence reads. Here, we present a high-quality draft sequence of the Sordaria macrospora genome that was obtained by a combination of Illumina/Solexa and Roche/454 sequencing. Paired-end Solexa sequencing of genomic DNA to 85-fold coverage and an additional 10-fold coverage by single-end 454 sequencing resulted in approximately 4 Gb of DNA sequence. Reads were assembled to a 40 Mb draft version (N50 of 117 kb) with the Velvet assembler. Comparative analysis with Neurospora genomes increased the N50 to 498 kb. The S. macrospora genome contains even fewer repeat regions than its closest sequenced relative, Neurospora crassa. Comparison with genomes of other fungi showed that S. macrospora, a model organism for morphogenesis and meiosis, harbors duplications of several genes involved in self/nonself-recognition. Furthermore, S. macrospora contains more polyketide biosynthesis genes than N. crassa. Phylogenetic analyses suggest that some of these genes may have been acquired by horizontal gene transfer from a distantly related ascomycete group. Our study shows that, for typical filamentous fungi, de novo assembly of genomes from short sequence reads alone is feasible, that a mixture of Solexa and 454 sequencing substantially improves the assembly, and that the resulting data can be used for comparative studies to address basic questions of fungal biology.
Draft Genome Sequence of Pseudomonas sp. Strain LFM046, a Producer of Medium-Chain-Length Polyhydroxyalkanoate

PubMed Central

Cardinali-Rezende, Juliana; Alexandrino, Paulo Moises Raduan; Nahat, Rafael Augusto Theodoro Pereira de Souza; Sant’Ana, Débora Parrine Vieira; Silva, Luiziana Ferreira; Gomez, José Gregório Cabrera

2015-01-01

Pseudomonas sp. LFM046 is a medium-chain-length polyhydroxyalkanoate (PHAMCL) producer capable of using various carbon sources (carbohydrates, organic acids, and vegetable oils) and was first isolated from sugarcane cultivation soil in Brazil. The genome sequence was found to be 5.97 Mb long with a G+C content of 66%. PMID:26294616
Draft Genome Sequence of Two Marine Plantactinospora spp. from the Gulf of California.

PubMed

Contreras-Castro, Luis; Maldonado, Luis A; Quintana, Erika T; Raggi, Luciana; Sánchez-Flores, Alejandro

2018-05-24

Plantactinospora sp. strains BB1 and BC1 were isolated in 2009 from sediment samples of the Gulf of California from among almost 300 actinobacteria. Genome mining of their ∼8.5-Mb sequences showed the bioprospecting potential of these rare actinomycetes, providing an insight to their ecological and biotechnological importance. Copyright © 2018 Contreras-Castro et al.

Draft Genome Sequence of Bacillus amyloliquefaciens EBL11, a New Strain of Plant Growth-Promoting Bacterium Isolated from Rice Rhizosphere

PubMed Central

Wang, Yinghuan; Greenfield, Paul; Jin, Decai

2014-01-01

Bacillus amyloliquefaciens strain EBL11 is a bacterium that can promote plant growth by inhibiting the growth of fungi on plant surfaces and providing nutrients as a nonchemical biofertilizer. The estimated genome of this strain is 4.05 Mb in size and harbors 3,683 coding genes (CDSs). PMID:25059875
Genome Sequence of the Moderately Acidophilic Sulfate-Reducing Firmicute Desulfosporosinus acididurans (Strain M1T)

PubMed Central

Petzsch, Patrick; Poehlein, Anja; Johnson, D. Barrie; Daniel, Rolf; Schlömann, Michael

2015-01-01

Microbial dissimilatory sulfate reduction is commonplace in many anaerobic environments, though few acidophilic bacteria are known to mediate this process. We report the 4.64-Mb draft genome of the type strain of the moderate acidophile Desulfosporosinus acididurans, which was isolated from acidic sediment in a river draining the Soufrière volcano, Montserrat. PMID:26251501
Genome Sequence of Herbaspirillum sp. Strain GW103, a Plant Growth-Promoting Bacterium

PubMed Central

Lee, Gun Woong; Lee, Kui-Jae

2012-01-01

Herbaspirillum sp. strain GW103 was isolated from rhizosphere soil of the reed Phragmites australis on reclaimed land. Here we report the 5.05-Mb draft genome sequence of the strain, providing bioinformation about the agronomic benefits of this strain, such as multiple traits relevant to plant root colonization and plant growth promotion. PMID:22815460
Mining of the Uncharacterized Cytochrome P450 Genes Involved in Alkaloid Biosynthesis in California Poppy Using a Draft Genome Sequence

PubMed Central

Hori, Kentaro; Yamada, Yasuyuki; Purwanto, Ratmoyo; Minakuchi, Yohei; Toyoda, Atsushi; Hirakawa, Hideki

2018-01-01

Abstract Land plants produce specialized low molecular weight metabolites to adapt to various environmental stressors, such as UV radiation, pathogen infection, wounding and animal feeding damage. Due to the large variety of stresses, plants produce various chemicals, particularly plant species-specific alkaloids, through specialized biosynthetic pathways. In this study, using a draft genome sequence and querying known biosynthetic cytochrome P450 (P450) enzyme-encoding genes, we characterized the P450 genes involved in benzylisoquinoline alkaloid (BIA) biosynthesis in California poppy (Eschscholzia californica), as P450s are key enzymes involved in the diversification of specialized metabolism. Our in silico studies showed that all identified enzyme-encoding genes involved in BIA biosynthesis were found in the draft genome sequence of approximately 489 Mb, which covered approximately 97% of the whole genome (502 Mb). Further analyses showed that some P450 families involved in BIA biosynthesis, i.e. the CYP80, CYP82 and CYP719 families, were more enriched in the genome of E. californica than in the genome of Arabidopsis thaliana, a plant that does not produce BIAs. CYP82 family genes were highly abundant, so we measured the expression of CYP82 genes with respect to alkaloid accumulation in different plant tissues and two cell lines whose BIA production differs to estimate the functions of the genes. Further characterization revealed two highly homologous P450s (CYP82P2 and CYP82P3) that exhibited 10-hydroxylase activities with different substrate specificities. Here, we discuss the evolution of the P450 genes and the potential for further genome mining of the genes encoding the enzymes involved in BIA biosynthesis. PMID:29301019
Genome Sequence of the Necrotrophic Plant Pathogen Alternaria brassicicola Abra43

PubMed Central

Belmas, Elodie; Briand, Martial; Kwasiborski, Anthony; Colou, Justine; N’Guyen, Guillaume; Iacomi, Béatrice; Grappin, Philippe; Campion, Claire; Simoneau, Philippe; Barret, Matthieu

2018-01-01

ABSTRACT Alternaria brassicicola causes dark spot (or black spot) disease, which is one of the most common and destructive fungal diseases of Brassicaceae spp. worldwide. Here, we report the draft genome sequence of strain Abra43. The assembly comprises 29 scaffolds, with an N50 value of 2.1 Mb. The assembled genome was 31,036,461 bp in length, with a G+C content of 50.85%. PMID:29439047
Genome Sequence of an Endophytic Fungus, Fusarium solani JS-169, Which Has Antifungal Activity.

PubMed

Kim, Jung A; Jeon, Jongbum; Park, Sook-Young; Kim, Ki-Tae; Choi, Gobong; Lee, Hyun-Jung; Kim, Yangsun; Yang, Hee-Sun; Yeo, Joo-Hong; Lee, Yong-Hwan; Kim, Soonok

2017-10-19

An endophytic fungus, Fusarium solani strain JS-169, isolated from a mulberry twig, showed considerable antifungal activity. Here, we report the draft genome sequence of this strain. The assembly comprises 17 scaffolds, with an N 50 value of 4.93 Mb. The assembled genome was 45,813,297 bp in length, with a G+C content of 49.91%. Copyright © 2017 Kim et al.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Köberl, Martina; White, Richard A.; Erschen, Sabine

Streptomyces sp. strain Wb2n-11, isolated from native desert soil, exhibited broad-spectrum antagonism against plant pathogenic fungi, bacteria and nematodes. The 8.2 Mb draft genome reveals genes putatively responsible for its promising biocontrol activity and genes which enable the soil bacterium to directly interact beneficially with plants.
Draft sequencing and analysis of the genome of pufferfish Takifugu flavidus.

PubMed

Gao, Yang; Gao, Qiang; Zhang, Huan; Wang, Lingling; Zhang, Fuchong; Yang, Chuanyan; Song, Linsheng

2014-12-01

The pufferfish Takifugu flavidus is an important economic species due to its outstanding flavour and high market value. It has been regarded as an excellent model of genetic study for decades as well. In the present study, three mate-pair libraries of T. flavidus genome were sequenced by the SOLiD 4 next-generation sequencing platform, and the draft genome was constructed with the short reads using an assisted assembly strategy. The draft consists of 50,947 scaffolds with an N50 value of 305.7 kb, and the average GC content was 45.2%. The combined length of repetitive sequences was 26.5 Mb, which accounted for 6.87% of the genome, indicating that the compactness of T. flavidus genome was approximative with that of T. rubripes genome. A total of 1,253 non-coding RNA genes and 30,285 protein-encoding genes were assigned to the genome. There were 132,775 and 394 presumptive genes playing roles in the colour pattern variation, the relatively slow growth and the lipid metabolism, respectively. Among them, genes involved in the microtubule-dependent transport system, angiogenesis, decapentaplegic pathway and lipid mobilization were significantly expanded in the T. flavidus genome. This draft genome provides a valuable resource for understanding and improving both fundamental and applied research with pufferfish in the future. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Draft genome sequence of Halorubrum tropicale strain V5, a novel halophilic archaeon isolated from the solar salterns of Cabo Rojo, Puerto Rico.

PubMed

Sánchez-Nieves, Rubén; Facciotti, Marc T; Saavedra-Collado, Sofía; Dávila-Santiago, Lizbeth; Rodríguez-Carrero, Roy; Montalvo-Rodríguez, Rafael

2016-03-01

The genus Halorubrum is a member of the family Halobacteriaceae which currently has the highest number of described species (31) of all the haloarchaea. Here we report the draft genome sequence of strain V5, a new species within this genus that was isolated from the solar salterns of Cabo Rojo, Puerto Rico. Assembly was performed and rendered the genome into 17 contigs (N50 = 515,834 bp), the largest of which contains 1,031,026 bp. The genome consists of 3.57 MB in length with G + C content of 67.6%. In general, the genome includes 4 rRNAs, 52 tRNAs, and 3246 protein-coding sequences. The NCBI accession number for this genome is LIST00000000 and the strain deposit number is CECT9000.
Genome Sequence of Salt-Tolerant Bacillus safensis Strain VK, Isolated from Saline Desert Area of Gujarat, India.

PubMed

Kothari, V V; Kothari, R K; Kothari, C R; Bhatt, V D; Nathani, N M; Koringa, P G; Joshi, C G; Vyas, B R M

2013-09-05

Bacillus safensis strain VK was isolated from the rhizosphere of a cumin plant growing in the saline desert of Radhanpar, Gujarat, India. Here, we provide the 3.68-Mb draft genome sequence of B. safensis VK, which might provide information about the salt tolerance and genes encoding enzymes for the strain's plant growth-promoting potential.
Genome Sequence of Streptomyces wadayamensis Strain A23, an Endophytic Actinobacterium from Citrus reticulata

PubMed Central

Tormet Gonzalez, Gabriela D.; Samborsky, Markyian; Marcon, Joelma; Araujo, Welington L.; de Azevedo, João Lucio

2014-01-01

The actinobacterium Streptomyces wadayamensis A23 is an endophyte of Citrus reticulata that produces the antimycin and mannopeptimycin antibiotics, among others. The strain has the capability to inhibit Xylella fastidiosa growth. The draft genome of S. wadayamensis A23 has ~7.0 Mb and 6,006 protein-coding sequences, with a 73.5% G+C content. PMID:24994795
Draft Genome Sequence of the Marine Bacterium Pseudomonas aestusnigri VGXO14T.

PubMed

Gomila, Margarita; Mulet, Magdalena; Lalucat, Jorge; García-Valdés, Elena

2017-08-10

The type strain of Pseudomonas aestusnigri (VGXO14), isolated from a crude oil-polluted marine sand sample, is a member of the P. pertucinogena phylogenetic group. Here, we report the genome sequence (3.83 Mb) of P. aestusnigri to gain insights into the biology and taxonomy of marine Pseudomonas spp. adapted to polluted marine habitats. Copyright © 2017 Gomila et al.
Draft Genome Sequence of the Marine Bacterium Pseudomonas aestusnigri VGXO14T

PubMed Central

2017-01-01

ABSTRACT The type strain of Pseudomonas aestusnigri (VGXO14), isolated from a crude oil-polluted marine sand sample, is a member of the P. pertucinogena phylogenetic group. Here, we report the genome sequence (3.83 Mb) of P. aestusnigri to gain insights into the biology and taxonomy of marine Pseudomonas spp. adapted to polluted marine habitats. PMID:28798177
Genome Sequence of the Moderately Acidophilic Sulfate-Reducing Firmicute Desulfosporosinus acididurans (Strain M1T).

PubMed

Petzsch, Patrick; Poehlein, Anja; Johnson, D Barrie; Daniel, Rolf; Schlömann, Michael; Mühling, Martin

2015-08-06

Microbial dissimilatory sulfate reduction is commonplace in many anaerobic environments, though few acidophilic bacteria are known to mediate this process. We report the 4.64-Mb draft genome of the type strain of the moderate acidophile Desulfosporosinus acididurans, which was isolated from acidic sediment in a river draining the Soufrière volcano, Montserrat. Copyright © 2015 Petzsch et al.
A High Quality Draft Consensus Sequence of the Genome of a Heterozygous Grapevine Variety

PubMed Central

Cartwright, Dustin A.; Cestaro, Alessandro; Pruss, Dmitry; Pindo, Massimo; FitzGerald, Lisa M.; Vezzulli, Silvia; Reid, Julia; Malacarne, Giulia; Iliev, Diana; Coppola, Giuseppina; Wardell, Bryan; Micheletti, Diego; Macalma, Teresita; Facci, Marco; Mitchell, Jeff T.; Perazzolli, Michele; Eldredge, Glenn; Gatto, Pamela; Oyzerski, Rozan; Moretto, Marco; Gutin, Natalia; Stefanini, Marco; Chen, Yang; Segala, Cinzia; Davenport, Christine; Demattè, Lorenzo; Mraz, Amy; Battilana, Juri; Stormo, Keith; Costa, Fabrizio; Tao, Quanzhou; Si-Ammour, Azeddine; Harkins, Tim; Lackey, Angie; Perbost, Clotilde; Taillon, Bruce; Stella, Alessandra; Solovyev, Victor; Fawcett, Jeffrey A.; Sterck, Lieven; Vandepoele, Klaas; Grando, Stella M.; Toppo, Stefano; Moser, Claudio; Lanchbury, Jerry; Bogden, Robert; Skolnick, Mark; Sgaramella, Vittorio; Bhatnagar, Satish K.; Fontana, Paolo; Gutin, Alexander; Van de Peer, Yves; Salamini, Francesco; Viola, Roberto

2007-01-01

Background Worldwide, grapes and their derived products have a large market. The cultivated grape species Vitis vinifera has potential to become a model for fruit trees genetics. Like many plant species, it is highly heterozygous, which is an additional challenge to modern whole genome shotgun sequencing. In this paper a high quality draft genome sequence of a cultivated clone of V. vinifera Pinot Noir is presented. Principal Findings We estimate the genome size of V. vinifera to be 504.6 Mb. Genomic sequences corresponding to 477.1 Mb were assembled in 2,093 metacontigs and 435.1 Mb were anchored to the 19 linkage groups (LGs). The number of predicted genes is 29,585, of which 96.1% were assigned to LGs. This assembly of the grape genome provides candidate genes implicated in traits relevant to grapevine cultivation, such as those influencing wine quality, via secondary metabolites, and those connected with the extreme susceptibility of grape to pathogens. Single nucleotide polymorphism (SNP) distribution was consistent with a diffuse haplotype structure across the genome. Of around 2,000,000 SNPs, 1,751,176 were mapped to chromosomes and one or more of them were identified in 86.7% of anchored genes. The relative age of grape duplicated genes was estimated and this made possible to reveal a relatively recent Vitis-specific large scale duplication event concerning at least 10 chromosomes (duplication not reported before). Conclusions Sanger shotgun sequencing and highly efficient sequencing by synthesis (SBS), together with dedicated assembly programs, resolved a complex heterozygous genome. A consensus sequence of the genome and a set of mapped marker loci were generated. Homologous chromosomes of Pinot Noir differ by 11.2% of their DNA (hemizygous DNA plus chromosomal gaps). SNP markers are offered as a tool with the potential of introducing a new era in the molecular breeding of grape. PMID:18094749
Draft genome of the most devastating insect pest of coffee worldwide: the coffee berry borer, Hypothenemus hampei

DOE Office of Scientific and Technical Information (OSTI.GOV)

Vega, Fernando E.; Brown, Stuart M.; Chen, Hao

The coffee berry borer, Hypothenemus hampei, is the most economically important insect pest of coffee worldwide. We present an analysis of the draft genome of the coffee berry borer, the third genome for a Coleopteran species. The genome size is ca. 163 Mb with 19,222 predicted protein-coding genes. Analysis was focused on genes involved in primary digestion as well as gene families involved in detoxification of plant defense molecules and insecticides, such as carboxylesterases, cytochrome P450, gluthathione S-transferases, ATP-binding cassette transporters, and a gene that confers resistance to the insecticide dieldrin. A broad range of enzymes capable of degrading complexmore » polysaccharides were identified. We also evaluated the pathogen defense system and found homologs to antimicrobial genes reported in the Drosophila genome. Ten cases of horizontal gene transfer were identified with evidence for expression, integration into the H. hampei genome, and phylogenetic evidence that the sequences are more closely related to bacterial rather than eukaryotic genes. We find the draft genome analysis broadly expands our knowledge on the biology of a devastating tropical insect pest and suggests new pest management strategies.« less
Draft genome of the most devastating insect pest of coffee worldwide: the coffee berry borer, Hypothenemus hampei

PubMed Central

Vega, Fernando E.; Brown, Stuart M.; Chen, Hao; Shen, Eric; Nair, Mridul B.; Ceja-Navarro, Javier A.; Brodie, Eoin L.; Infante, Francisco; Dowd, Patrick F.; Pain, Arnab

2015-01-01

The coffee berry borer, Hypothenemus hampei, is the most economically important insect pest of coffee worldwide. We present an analysis of the draft genome of the coffee berry borer, the third genome for a Coleopteran species. The genome size is ca. 163 Mb with 19,222 predicted protein-coding genes. Analysis was focused on genes involved in primary digestion as well as gene families involved in detoxification of plant defense molecules and insecticides, such as carboxylesterases, cytochrome P450, gluthathione S-transferases, ATP-binding cassette transporters, and a gene that confers resistance to the insecticide dieldrin. A broad range of enzymes capable of degrading complex polysaccharides were identified. We also evaluated the pathogen defense system and found homologs to antimicrobial genes reported in the Drosophila genome. Ten cases of horizontal gene transfer were identified with evidence for expression, integration into the H. hampei genome, and phylogenetic evidence that the sequences are more closely related to bacterial rather than eukaryotic genes. The draft genome analysis broadly expands our knowledge on the biology of a devastating tropical insect pest and suggests new pest management strategies. PMID:26228545
Draft genome of the most devastating insect pest of coffee worldwide: the coffee berry borer, Hypothenemus hampei

DOE PAGES

Vega, Fernando E.; Brown, Stuart M.; Chen, Hao; ...

2015-07-31

The coffee berry borer, Hypothenemus hampei, is the most economically important insect pest of coffee worldwide. We present an analysis of the draft genome of the coffee berry borer, the third genome for a Coleopteran species. The genome size is ca. 163 Mb with 19,222 predicted protein-coding genes. Analysis was focused on genes involved in primary digestion as well as gene families involved in detoxification of plant defense molecules and insecticides, such as carboxylesterases, cytochrome P450, gluthathione S-transferases, ATP-binding cassette transporters, and a gene that confers resistance to the insecticide dieldrin. A broad range of enzymes capable of degrading complexmore » polysaccharides were identified. We also evaluated the pathogen defense system and found homologs to antimicrobial genes reported in the Drosophila genome. Ten cases of horizontal gene transfer were identified with evidence for expression, integration into the H. hampei genome, and phylogenetic evidence that the sequences are more closely related to bacterial rather than eukaryotic genes. We find the draft genome analysis broadly expands our knowledge on the biology of a devastating tropical insect pest and suggests new pest management strategies.« less
Draft genome sequence and genetic transformation of the oleaginous alga Nannochloropis gaditana

PubMed Central

Radakovits, Randor; Jinkerson, Robert E.; Fuerstenberg, Susan I.; Tae, Hongseok; Settlage, Robert E.; Boore, Jeffrey L.; Posewitz, Matthew C.

2012-01-01

The potential use of algae in biofuels applications is receiving significant attention. However, none of the current algal model species are competitive production strains. Here we present a draft genome sequence and a genetic transformation method for the marine microalga Nannochloropsis gaditana CCMP526. We show that N. gaditana has highly favourable lipid yields, and is a promising production organism. The genome assembly includes nuclear (~29 Mb) and organellar genomes, and contains 9,052 gene models. We define the genes required for glycerolipid biogenesis and detail the differential regulation of genes during nitrogen-limited lipid biosynthesis. Phylogenomic analysis identifies genetic attributes of this organism, including unique stramenopile photosynthesis genes and gene expansions that may explain the distinguishing photoautotrophic phenotypes observed. The availability of a genome sequence and transformation methods will facilitate investigations into N. gaditana lipid biosynthesis and permit genetic engineering strategies to further improve this naturally productive alga. PMID:22353717
Draft genome sequence and genetic transformation of the oleaginous alga Nannochloropis gaditana.

PubMed

Radakovits, Randor; Jinkerson, Robert E; Fuerstenberg, Susan I; Tae, Hongseok; Settlage, Robert E; Boore, Jeffrey L; Posewitz, Matthew C

2012-02-21

The potential use of algae in biofuels applications is receiving significant attention. However, none of the current algal model species are competitive production strains. Here we present a draft genome sequence and a genetic transformation method for the marine microalga Nannochloropsis gaditana CCMP526. We show that N. gaditana has highly favourable lipid yields, and is a promising production organism. The genome assembly includes nuclear (~29 Mb) and organellar genomes, and contains 9,052 gene models. We define the genes required for glycerolipid biogenesis and detail the differential regulation of genes during nitrogen-limited lipid biosynthesis. Phylogenomic analysis identifies genetic attributes of this organism, including unique stramenopile photosynthesis genes and gene expansions that may explain the distinguishing photoautotrophic phenotypes observed. The availability of a genome sequence and transformation methods will facilitate investigations into N. gaditana lipid biosynthesis and permit genetic engineering strategies to further improve this naturally productive alga.

Remarkably Divergent Regions Punctuate the Genome Assembly of the Caenorhabditis elegans Hawaiian Strain CB4856

PubMed Central

Thompson, Owen A.; Snoek, L. Basten; Nijveen, Harm; Sterken, Mark G.; Volkers, Rita J. M.; Brenchley, Rachel; van’t Hof, Arjen; Bevers, Roel P. J.; Cossins, Andrew R.; Yanai, Itai; Hajnal, Alex; Schmid, Tobias; Perkins, Jaryn D.; Spencer, David; Kruglyak, Leonid; Andersen, Erik C.; Moerman, Donald G.; Hillier, LaDeana W.; Kammenga, Jan E.; Waterston, Robert H.

2015-01-01

The Hawaiian strain (CB4856) of Caenorhabditis elegans is one of the most divergent from the canonical laboratory strain N2 and has been widely used in developmental, population, and evolutionary studies. To enhance the utility of the strain, we have generated a draft sequence of the CB4856 genome, exploiting a variety of resources and strategies. When compared against the N2 reference, the CB4856 genome has 327,050 single nucleotide variants (SNVs) and 79,529 insertion–deletion events that result in a total of 3.3 Mb of N2 sequence missing from CB4856 and 1.4 Mb of sequence present in CB4856 but not present in N2. As previously reported, the density of SNVs varies along the chromosomes, with the arms of chromosomes showing greater average variation than the centers. In addition, we find 61 regions totaling 2.8 Mb, distributed across all six chromosomes, which have a greatly elevated SNV density, ranging from 2 to 16% SNVs. A survey of other wild isolates show that the two alternative haplotypes for each region are widely distributed, suggesting they have been maintained by balancing selection over long evolutionary times. These divergent regions contain an abundance of genes from large rapidly evolving families encoding F-box, MATH, BATH, seven-transmembrane G-coupled receptors, and nuclear hormone receptors, suggesting that they provide selective advantages in natural environments. The draft sequence makes available a comprehensive catalog of sequence differences between the CB4856 and N2 strains that will facilitate the molecular dissection of their phenotypic differences. Our work also emphasizes the importance of going beyond simple alignment of reads to a reference genome when assessing differences between genomes. PMID:25995208
Draft Genome Sequence of Zobellia sp. Strain OII3, Isolated from the Coastal Zone of the Baltic Sea.

PubMed

Harms, Henrik; Poehlein, Anja; Thürmer, Andrea; König, Gabriele M; Schäberle, Till F

2017-09-07

Zobellia sp. strain OII3 was isolated from a marine environmental sample due to its heterotrophic lifestyle, i.e., using Escherichia coli cells as prey. It shows strong agar-lytic activity. The genome was assembled into 41 contigs with a total size of 5.4 Mb, revealing the genetic basis for natural product biosynthesis. Copyright © 2017 Harms et al.
Draft Genome Sequence of Pseudomonas sp. Strain LFM046, a Producer of Medium-Chain-Length Polyhydroxyalkanoate.

PubMed

Cardinali-Rezende, Juliana; Alexandrino, Paulo Moises Raduan; Nahat, Rafael Augusto Theodoro Pereira de Souza; Sant'Ana, Débora Parrine Vieira; Silva, Luiziana Ferreira; Gomez, José Gregório Cabrera; Taciro, Marilda Keico

2015-08-20

Pseudomonas sp. LFM046 is a medium-chain-length polyhydroxyalkanoate (PHAMCL) producer capable of using various carbon sources (carbohydrates, organic acids, and vegetable oils) and was first isolated from sugarcane cultivation soil in Brazil. The genome sequence was found to be 5.97 Mb long with a G+C content of 66%. Copyright © 2015 Cardinali-Rezende et al.
Whole genome de novo sequencing and genome annotation of the world popular cultivated edible mushroom, Lentinula edodes.

PubMed

Shim, Donghwan; Park, Sin-Gi; Kim, Kangmin; Bae, Wonsil; Lee, Gir Won; Ha, Byeong-Suk; Ro, Hyeon-Su; Kim, Myungkil; Ryoo, Rhim; Rhee, Sung-Keun; Nou, Ill-Sup; Koo, Chang-Duck; Hong, Chang Pyo; Ryu, Hojin

2016-04-10

Lentinula edodes, the popular shiitake mushroom, is one of the most important cultivated edible mushrooms. It is used as a food and for medicinal purposes. Here, we present the 46.1 Mb draft genome of L. edodes, comprising 13,028 predicted gene models. The genome assembly consists of 31 scaffolds. Gene annotation provides key information about various signaling pathways and secondary metabolites. This genomic information should help establish the molecular genetic markers for MAS/MAB and increase our understanding of the genome structure and function. Copyright © 2016 Elsevier B.V. All rights reserved.
A draft genome assembly of the army worm, Spodoptera frugiperda.

PubMed

Kakumani, Pavan Kumar; Malhotra, Pawan; Mukherjee, Sunil K; Bhatnagar, Raj K

2014-08-01

Spodoptera is an agriculturally important pest insect and studies in understanding its biology have been limited by the unavailability of its genome. In the present study, the genomic DNA was sequenced and assembled into 37,243 scaffolds of size, 358 Mb with N50 of 53.7 kb. Based on degree of identity, we could anchor 305 Mb of the genome onto all the 28 chromosomes of Bombyx mori. Repeat elements were identified, which accounts for 20.28% of the total genome. Further, we predicted 11,595 genes, with an average intron length of 726 bp. The genes were annotated and domain analysis revealed that Sf genes share a significant homology and expression pattern with B. mori, despite differences in KOG gene categories and representation of certain protein families. The present study on Sf genome would help in the characterization of cellular pathways to understand its biology and comparative evolutionary studies among lepidopteran family members to help annotate their genomes. Copyright © 2014 Elsevier Inc. All rights reserved.
The draft genome sequence of the ascomycete fungus Penicillium subrubescens reveals a highly enriched content of plant biomass related CAZymes compared to related fungi.

PubMed

Peng, Mao; Dilokpimol, Adiphol; Mäkelä, Miia R; Hildén, Kristiina; Bervoets, Sander; Riley, Robert; Grigoriev, Igor V; Hainaut, Matthieu; Henrissat, Bernard; de Vries, Ronald P; Granchi, Zoraide

2017-03-20

Here we report the genome sequence of the ascomycete saprobic fungus Penicillium subrubescens FBCC1632/CBS132785 isolated from a Jerusalem artichoke field in Finland. The 39.75Mb genome containing 14,188 gene models is highly similar for that reported for other Penicillium species, but contains a significantly higher number of putative carbohydrate active enzyme (CAZyme) encoding genes. Copyright © 2017 Elsevier B.V. All rights reserved.
Draft Genome Sequence of Highly Virulent Race 4/Biovar 3 of Ralstonia solanacearum CaRs_Mep Causing Bacterial Wilt in Zingiberaceae Plants in India.

PubMed

Kumar, Aundy; Munjal, Vibhuti; Sheoran, Neelam; Prameela, Thekkan Puthiyaveedu; Suseelabhai, Rajamma; Aggarwal, Rashmi; Jain, Rakesh Kumar; Eapen, Santhosh J

2017-01-05

The genome of Ralstonia solanacearum CaRs_Mep, a race 4/biovar 3/phylotype I bacterium causing wilt in small cardamom and other Zingiberaceae plants, was sequenced. Analysis of the 5.7-Mb genome sequence will aid in better understanding of the genetic determinants of host range, host jump, survival, pathogenicity, and virulence of race 4 of R. solanacearum. Copyright © 2017 Kumar et al.
Genome sequence of Bradyrhizobium sp. LMTR 3, a diazotrophic symbiont of Lima bean (Phaseolus lunatus).

PubMed

Ormeño-Orrillo, Ernesto; Rey, Luis; Durán, David; Canchaya, Carlos A; Zúñiga-Dávila, Doris; Imperial, Juan; Martínez-Romero, Esperanza; Ruiz-Argüeso, Tomás

2017-09-01

Bradyrhizobium sp. LMTR 3 is a representative strain of one of the geno(species) of diazotrophic symbionts associated with Lima bean ( Phaseolus lunatus ) in Peru. Its 7.83 Mb genome was sequenced using the Illumina technology and found to encode a complete set of genes required for nodulation and nitrogen fixation, and additional genes putatively involved in root colonization. Its draft genome sequence and annotation have been deposited at GenBank under the accession number MAXC00000000.
Whole-genome scan identifies quantitative trait loci for chronic pastern dermatitis in German draft horses.

PubMed

Mittmann, E Henrike; Mömke, Stefanie; Distl, Ottmar

2010-02-01

Chronic pastern dermatitis (CPD), also known as chronic progressive lymphedema (CPL), is a skin disease that affects draft horses. This disease causes painful lower-leg swelling, nodule formation, and skin ulceration, interfering with movement. The aim of this whole-genome scan was to identify quantitative trait loci (QTL) for CPD in German draft horses. We recorded clinical data for CPD in 917 German draft horses and collected blood samples from these horses. Of these 917 horses, 31 paternal half-sib families comprising 378 horses from the breeds Rhenish German, Schleswig, Saxon-Thuringian, and South German were chosen for genotyping. Each half-sib family was constituted by only one draft horse breed. Genotyping was done for 318 polymorphic microsatellites evenly distributed on all equine autosomes and the X chromosome with a mean distance of 7.5 Mb. An across-breed multipoint linkage analysis revealed chromosome-wide significant QTL on horse chromosomes (ECA) 1, 9, 16, and 17. Analyses by breed confirmed the QTL on ECA1 in South German and the QTL on ECA9, 16, and 17 in Saxon-Thuringian draft horses. For the Rhenish German and Schleswig draft horses, additional QTL on ECA4 and 10 and for the South German draft horses an additional QTL on ECA7 were found. This is the first whole-genome scan for CPD in draft horses and it is an important step toward the identification of candidate genes.
Draft genome sequence of chickpea (Cicer arietinum) provides a resource for trait improvement.

PubMed

Varshney, Rajeev K; Song, Chi; Saxena, Rachit K; Azam, Sarwar; Yu, Sheng; Sharpe, Andrew G; Cannon, Steven; Baek, Jongmin; Rosen, Benjamin D; Tar'an, Bunyamin; Millan, Teresa; Zhang, Xudong; Ramsay, Larissa D; Iwata, Aiko; Wang, Ying; Nelson, William; Farmer, Andrew D; Gaur, Pooran M; Soderlund, Carol; Penmetsa, R Varma; Xu, Chunyan; Bharti, Arvind K; He, Weiming; Winter, Peter; Zhao, Shancen; Hane, James K; Carrasquilla-Garcia, Noelia; Condie, Janet A; Upadhyaya, Hari D; Luo, Ming-Cheng; Thudi, Mahendar; Gowda, C L L; Singh, Narendra P; Lichtenzveig, Judith; Gali, Krishna K; Rubio, Josefa; Nadarajan, N; Dolezel, Jaroslav; Bansal, Kailash C; Xu, Xun; Edwards, David; Zhang, Gengyun; Kahl, Guenter; Gil, Juan; Singh, Karam B; Datta, Swapan K; Jackson, Scott A; Wang, Jun; Cook, Douglas R

2013-03-01

Chickpea (Cicer arietinum) is the second most widely grown legume crop after soybean, accounting for a substantial proportion of human dietary nitrogen intake and playing a crucial role in food security in developing countries. We report the ∼738-Mb draft whole genome shotgun sequence of CDC Frontier, a kabuli chickpea variety, which contains an estimated 28,269 genes. Resequencing and analysis of 90 cultivated and wild genotypes from ten countries identifies targets of both breeding-associated genetic sweeps and breeding-associated balancing selection. Candidate genes for disease resistance and agronomic traits are highlighted, including traits that distinguish the two main market classes of cultivated chickpea--desi and kabuli. These data comprise a resource for chickpea improvement through molecular breeding and provide insights into both genome diversity and domestication.
Draft genome of Haloarcula rubripromontorii strain SL3, a novel halophilic archaeon isolated from the solar salterns of Cabo Rojo, Puerto Rico.

PubMed

Sánchez-Nieves, Rubén; Facciotti, Marc; Saavedra-Collado, Sofía; Dávila-Santiago, Lizbeth; Rodríguez-Carrero, Roy; Montalvo-Rodríguez, Rafael

2016-03-01

The genus Haloarcula belongs to the family Halobacteriaceae which currently has 10 valid species. Here we report the draft genome sequence of strain SL3, a new species within this genus, isolated from the Solar Salterns of Cabo Rojo, Puerto Rico. Genome assembly performed using NGEN Assembler resulted in 18 contigs (N50 = 601,911 bp), the largest of which contains 1,023,775 bp. The genome consists of 3.97 MB and has a GC content of 61.97%. Like all species of Haloarcula, the genome encodes heterogeneous copies of the small subunit ribosomal RNA. In addition, the genome includes 6 rRNAs, 48 tRNAs, and 3797 protein coding sequences. Several carbohydrate-active enzymes genes were found, as well as enzymes involved in the dihydroxyacetone processing pathway which are not found in other Haloarcula species. The NCBI accession number for this genome is LIUF00000000 and the strain deposit number is CECT9001.
The Draft Genome Sequence of Actinokineospora bangkokensis 44EHWT Reveals the Biosynthetic Pathway of the Antifungal Thailandin Compounds with Unusual Butylmalonyl-CoA Extender Units.

PubMed

Greule, Anja; Intra, Bungonsiri; Flemming, Stephan; Rommel, Marcel G E; Panbangred, Watanalai; Bechthold, Andreas

2016-11-23

We report the draft genome sequence of Actinokineospora bangkokensis 44EHW T , the producer of the antifungal polyene compounds, thailandins A and B. The sequence contains 7.45 Mb, 74.1% GC content and 35 putative gene clusters for the biosynthesis of secondary metabolites. There are three gene clusters encoding large polyketide synthases of type I. Annotation of the ORF functions and targeted gene disruption enabled us to identify the cluster for thailandin biosynthesis. We propose a plausible biosynthetic pathway for thailandin, where the unusual butylmalonyl-CoA extender unit is incorporated and results in an untypical side chain.
Draft Genome Sequence of Bacillus urumqiensis BZ-SZ-XJ18T, a Moderately Haloalkaliphilic Bacterium Isolated from a Saline-Alkaline Lake.

PubMed

Liao, Ziya; Ren, Chao; Guo, Xiaomeng; Yan, Yanchun; Li, Jun; Zhao, Baisuo

2018-05-31

The moderately haloalkaliphilic bacterium Bacillus urumqiensis BZ-SZ-XJ18 T was isolated from a saline-alkaline lake located in the Xinjiang Uyghur Autonomous Region of China. Optimum growth occurred at the total Na + concentration of 1.08 M, with a broad optimum pH of 8.5 to 9.5. The draft genome consists of approximately 3.28 Mb and contains 3,228 predicted genes. A number of genes associated with adaptation strategies for osmotic balance and alkaline pH homeostasis were identified, providing pertinent insight into specific adaptations to the double-extreme environment. Copyright © 2018 Liao et al.
Whole genome sequencing of Chinese clearhead icefish, Protosalanx hyalocranius.

PubMed

Liu, Kai; Xu, Dongpo; Li, Jia; Bian, Chao; Duan, Jinrong; Zhou, Yanfeng; Zhang, Minying; You, Xinxin; You, Yang; Chen, Jieming; Yu, Hui; Xu, Gangchun; Fang, Di-An; Qiang, Jun; Jiang, Shulun; He, Jie; Xu, Junmin; Shi, Qiong; Zhang, Zhiyong; Xu, Pao

2017-04-01

Chinese clearhead icefish, Protosalanx hyalocranius , is a representative icefish species with economic importance and special appearance. Due to its great economic value in China, the fish was introduced into Lake Dianchi and several other lakes from the Lake Taihu half a century ago. Similar to the Sinocyclocheilus cavefish, the clearhead icefish has certain cavefish-like traits, such as transparent body and nearly scaleless skin. Here, we provide the whole genome sequence of this surface-dwelling fish and generated a draft genome assembly, aiming at exploring molecular mechanisms for the biological interests. A total of 252.1 Gb of raw reads were sequenced. Subsequently, a novel draft genome assembly was generated, with the scaffold N50 reaching 1.163 Mb. The genome completeness was estimated to be 98.39 % by using the CEGMA evaluation. Finally, we annotated 19 884 protein-coding genes and observed that repeat sequences account for 24.43 % of the genome assembly. We report the first draft genome of the Chinese clearhead icefish. The genome assembly will provide a solid foundation for further molecular breeding and germplasm resource protection in Chinese clearhead icefish, as well as other icefishes. It is also a valuable genetic resource for revealing the molecular mechanisms for the cavefish-like characters. © The Authors 2017. Published by Oxford University Press.
Draft genome sequence of Cicer reticulatum L., the wild progenitor of chickpea provides a resource for agronomic trait improvement.

PubMed

Gupta, Sonal; Nawaz, Kashif; Parween, Sabiha; Roy, Riti; Sahu, Kamlesh; Kumar Pole, Anil; Khandal, Hitaishi; Srivastava, Rishi; Kumar Parida, Swarup; Chattopadhyay, Debasis

2017-02-01

Cicer reticulatum L. is the wild progenitor of the fourth most important legume crop chickpea (C. arietinum L.). We assembled short-read sequences into 416 Mb draft genome of C. reticulatum and anchored 78% (327 Mb) of this assembly to eight linkage groups. Genome annotation predicted 25,680 protein-coding genes covering more than 90% of predicted gene space. The genome assembly shared a substantial synteny and conservation of gene orders with the genome of the model legume Medicago truncatula. Resistance gene homologs of wild and domesticated chickpeas showed high sequence homology and conserved synteny. Comparison of gene sequences and nucleotide diversity using 66 wild and domesticated chickpea accessions suggested that the desi type chickpea was genetically closer to the wild species than the kabuli type. Comparative analyses predicted gene flow between the wild and the cultivated species during domestication. Molecular diversity and population genetic structure determination using 15,096 genome-wide single nucleotide polymorphisms revealed an admixed domestication pattern among cultivated (desi and kabuli) and wild chickpea accessions belonging to three population groups reflecting significant influence of parentage or geographical origin for their cultivar-specific population classification. The assembly and the polymorphic sequence resources presented here would facilitate the study of chickpea domestication and targeted use of wild Cicer germplasms for agronomic trait improvement in chickpea. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Genome-wide sequencing of longan (Dimocarpus longan Lour.) provides insights into molecular basis of its polyphenol-rich characteristics

PubMed Central

Lin, Yuling; Min, Jiumeng; Lai, Ruilian; Wu, Zhangyan; Chen, Yukun; Yu, Lili; Cheng, Chunzhen; Jin, Yuanchun; Tian, Qilin; Liu, Qingfeng; Liu, Weihua; Zhang, Chengguang; Lin, Lixia; Hu, Yan; Zhang, Dongmin; Thu, Minkyaw; Zhang, Zihao; Liu, Shengcai; Zhong, Chunshui; Fang, Xiaodong; Wang, Jian; Yang, Huanming

2017-01-01

Abstract Longan (Dimocarpus longan Lour.), an important subtropical fruit in the family Sapindaceae, is grown in more than 10 countries. Longan is an edible drupe fruit and a source of traditional medicine with polyphenol-rich traits. Tree size, alternate bearing, and witches' broom disease still pose serious problems. To gain insights into the genomic basis of longan traits, a draft genome sequence was assembled. The draft genome (about 471.88 Mb) of a Chinese longan cultivar, “Honghezi,” was estimated to contain 31 007 genes and 261.88 Mb of repetitive sequences. No recent whole-genome-wide duplication event was detected in the genome. Whole-genome resequencing and analysis of 13 cultivated D. longan accessions revealed the extent of genetic diversity. Comparative transcriptome studies combined with genome-wide analysis revealed polyphenol-rich and pathogen resistance characteristics. Genes involved in secondary metabolism, especially those from significantly expanded (DHS, SDH, F3΄H, ANR, and UFGT) and contracted (PAL, CHS, and F3΄5΄H) gene families with tissue-specific expression, may be important contributors to the high accumulation levels of polyphenolic compounds observed in longan fruit. The high number of genes encoding nucleotide-binding site leucine-rich repeat (NBS-LRR) and leucine-rich repeat receptor-like kinase proteins, as well as the recent expansion and contraction of the NBS-LRR family, suggested a genomic basis for resistance to insects, fungus, and bacteria in this fruit tree. These data provide insights into the evolution and diversity of the longan genome. The comparative genomic and transcriptome analyses provided information about longan-specific traits, particularly genes involved in its polyphenol-rich and pathogen resistance characteristics. PMID:28368449
Draft genome sequence of marine-derived Streptomyces sp. TP-A0598, a producer of anti-MRSA antibiotic lydicamycins.

PubMed

Komaki, Hisayuki; Ichikawa, Natsuko; Hosoyama, Akira; Fujita, Nobuyuki; Igarashi, Yasuhiro

2015-01-01

Streptomyces sp. TP-A0598, isolated from seawater, produces lydicamycin, structurally unique type I polyketide bearing two nitrogen-containing five-membered rings, and four congeners TPU-0037-A, -B, -C, and -D. We herein report the 8 Mb draft genome sequence of this strain, together with classification and features of the organism and generation, annotation and analysis of the genome sequence. The genome encodes 7,240 putative ORFs, of which 4,450 ORFs were assigned with COG categories. Also, 66 tRNA genes and one rRNA operon were identified. The genome contains eight gene clusters involved in the production of polyketides and nonribosomal peptides. Among them, a PKS/NRPS gene cluster was assigned to be responsible for lydicamycin biosynthesis and a plausible biosynthetic pathway was proposed on the basis of gene function prediction. This genome sequence data will facilitate to probe the potential of secondary metabolism in marine-derived Streptomyces.
The draft genome of the pest tephritid fruit fly Bactrocera tryoni: resources for the genomic analysis of hybridising species.

PubMed

Gilchrist, Anthony Stuart; Shearman, Deborah C A; Frommer, Marianne; Raphael, Kathryn A; Deshpande, Nandan P; Wilkins, Marc R; Sherwin, William B; Sved, John A

2014-12-20

The tephritid fruit flies include a number of economically important pests of horticulture, with a large accumulated body of research on their biology and control. Amongst the Tephritidae, the genus Bactrocera, containing over 400 species, presents various species groups of potential utility for genetic studies of speciation, behaviour or pest control. In Australia, there exists a triad of closely-related, sympatric Bactrocera species which do not mate in the wild but which, despite distinct morphologies and behaviours, can be force-mated in the laboratory to produce fertile hybrid offspring. To exploit the opportunities offered by genomics, such as the efficient identification of genetic loci central to pest behaviour and to the earliest stages of speciation, investigators require genomic resources for future investigations. We produced a draft de novo genome assembly of Australia's major tephritid pest species, Bactrocera tryoni. The male genome (650-700 Mbp) includes approximately 150 Mb of interspersed repetitive DNA sequences and 60 Mb of satellite DNA. Assessment using conserved core eukaryotic sequences indicated 98% completeness. Over 16,000 MAKER-derived gene models showed a large degree of overlap with other Dipteran reference genomes. The sequence of the ribosomal RNA transcribed unit was also determined. Unscaffolded assemblies of B. neohumeralis and B. jarvisi were then produced; comparison with B. tryoni showed that the species are more closely related than any Drosophila species pair. The similarity of the genomes was exploited to identify 4924 potentially diagnostic indels between the species, all of which occur in non-coding regions. This first draft B. tryoni genome resembles other dipteran genomes in terms of size and putative coding sequences. For all three species included in this study, we have identified a comprehensive set of non-redundant repetitive sequences, including the ribosomal RNA unit, and have quantified the major satellite DNA families. These genetic resources will facilitate the further investigations of genetic mechanisms responsible for the behavioural and morphological differences between these three species and other tephritids. We have also shown how whole genome sequence data can be used to generate simple diagnostic tests between very closely-related species where only one of the species is scaffolded.
Polymorphic SSR markers for Plasmopara obducens (Peronosporaceae), the newly emergent downy mildew pathogen of Impatiens (Balsaminaceae)

USDA-ARS?s Scientific Manuscript database

Premise of the study: Microsatellite markers were developed for Plasmopara obducens, the causal agent of the newly emergent downy mildew disease of Impatiens walleriana. Methods and Results: A 151.2 Mb draft genome assembly was generated from P. obducens using Illumina technology and mined to identi...
Transcriptome analysis of root response to citrus blight based on the newly assembled Swingle citrumelo draft genome.

PubMed

Zhang, Yunzeng; Barthe, Gary; Grosser, Jude W; Wang, Nian

2016-07-08

Citrus blight is a citrus tree overall decline disease and causes serious losses in the citrus industry worldwide. Although it was described more than one hundred years ago, its causal agent remains unknown and its pathophysiology is not well determined, which hampers our understanding of the disease and design of suitable disease management. In this study, we sequenced and assembled the draft genome for Swingle citrumelo, one important citrus rootstock. The draft genome is approximately 280 Mb, which covers 74 % of the estimated Swingle citrumelo genome and the average coverage is around 15X. The draft genome of Swingle citrumelo enabled us to conduct transcriptome analysis of roots of blight and healthy Swingle citrumelo using RNA-seq. The RNA-seq was reliable as evidenced by the high consistence of RNA-seq analysis and quantitative reverse transcription PCR results (R(2) = 0.966). Comparison of the gene expression profiles between blight and healthy root samples revealed the molecular mechanism underneath the characteristic blight phenotypes including decline, starch accumulation, and drought stress. The JA and ET biosynthesis and signaling pathways showed decreased transcript abundance, whereas SA-mediated defense-related genes showed increased transcript abundance in blight trees, suggesting unclassified biotrophic pathogen was involved in this disease. Overall, the Swingle citrumelo draft genome generated in this study will advance our understanding of plant biology and contribute to the citrus breeding. Transcriptome analysis of blight and healthy trees deepened our understanding of the pathophysiology of citrus blight.

Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop of resource-poor farmers.

PubMed

Varshney, Rajeev K; Chen, Wenbin; Li, Yupeng; Bharti, Arvind K; Saxena, Rachit K; Schlueter, Jessica A; Donoghue, Mark T A; Azam, Sarwar; Fan, Guangyi; Whaley, Adam M; Farmer, Andrew D; Sheridan, Jaime; Iwata, Aiko; Tuteja, Reetu; Penmetsa, R Varma; Wu, Wei; Upadhyaya, Hari D; Yang, Shiaw-Pyng; Shah, Trushar; Saxena, K B; Michael, Todd; McCombie, W Richard; Yang, Bicheng; Zhang, Gengyun; Yang, Huanming; Wang, Jun; Spillane, Charles; Cook, Douglas R; May, Gregory D; Xu, Xun; Jackson, Scott A

2011-11-06

Pigeonpea is an important legume food crop grown primarily by smallholder farmers in many semi-arid tropical regions of the world. We used the Illumina next-generation sequencing platform to generate 237.2 Gb of sequence, which along with Sanger-based bacterial artificial chromosome end sequences and a genetic map, we assembled into scaffolds representing 72.7% (605.78 Mb) of the 833.07 Mb pigeonpea genome. Genome analysis predicted 48,680 genes for pigeonpea and also showed the potential role that certain gene families, for example, drought tolerance-related genes, have played throughout the domestication of pigeonpea and the evolution of its ancestors. Although we found a few segmental duplication events, we did not observe the recent genome-wide duplication events observed in soybean. This reference genome sequence will facilitate the identification of the genetic basis of agronomically important traits, and accelerate the development of improved pigeonpea varieties that could improve food security in many developing countries.
Draft genome sequence of a thermostable, alkaliphilic α-amylase and protease producing Bacillus amyloliquefaciens strain KCP2.

PubMed

Prajapati, Vimalkumar S; Ray, Sanket; Narayan, Jitendra; Joshi, Chaitanya C; Patel, Kamlesh C; Trivedi, Ujjval B; Patel, R M

2017-12-01

Bacillus amyloliquefaciens strain KCP2 was isolated from municipal food waste samples collected in Vallabh Vidyanagar, Gujarat, India. Strain KCP2 is noteworthy due to its ability to produce a thermostable, alkaliphilic α-amylase and a protease. These enzymes have importance in several industrial processes including bread making, brewing, starch processing, pharmacy, and textile industries. Whole genome sequencing of strain KCP2 showed that the estimated genome size was 3.9 Mb, the G + C content was 46%, and it coded for 4113 genes.
Genome Sequence of Novosphingobium lindaniclasticum LE124T, Isolated from a Hexachlorocyclohexane Dumpsite

PubMed Central

Saxena, Anjali; Nayyar, Namita; Sangwan, Naseer; Kumari, Rashmi; Khurana, J. P.

2013-01-01

Novosphingobium lindaniclasticum LE124T is a hexachlorocyclohexane (HCH)-degrading bacterium isolated from a high-dosage-point HCH dumpsite (450 mg HCH/g soil) located in Lucknow, India (27°00′N and 81°09′E). Here, we present the annotated draft genome sequence of strain LE124T, which has an estimated size of 4.86 Mb and is comprised of 4,566 coding sequences. PMID:24029761
Genome Sequence of the Freshwater Yangtze Finless Porpoise.

PubMed

Yuan, Yuan; Zhang, Peijun; Wang, Kun; Liu, Mingzhong; Li, Jing; Zheng, Jingsong; Wang, Ding; Xu, Wenjie; Lin, Mingli; Dong, Lijun; Zhu, Chenglong; Qiu, Qiang; Li, Songhai

2018-04-16

The Yangtze finless porpoise ( Neophocaena asiaeorientalis ssp. asiaeorientalis ) is a subspecies of the narrow-ridged finless porpoise ( N. asiaeorientalis ). In total, 714.28 gigabases (Gb) of raw reads were generated by whole-genome sequencing of the Yangtze finless porpoise, using an Illumina HiSeq 2000 platform. After filtering the low-quality and duplicated reads, we assembled a draft genome of 2.22 Gb, with contig N50 and scaffold N50 values of 46.69 kilobases (kb) and 1.71 megabases (Mb), respectively. We identified 887.63 Mb of repetitive sequences and predicted 18,479 protein-coding genes in the assembled genome. The phylogenetic tree showed a relationship between the Yangtze finless porpoise and the Yangtze River dolphin, which diverged approximately 20.84 million years ago. In comparisons with the genomes of 10 other mammals, we detected 44 species-specific gene families, 164 expanded gene families, and 313 positively selected genes in the Yangtze finless porpoise genome. The assembled genome sequence and underlying sequence data are available at the National Center for Biotechnology Information under BioProject accession number PRJNA433603.
Genome Sequence of the Freshwater Yangtze Finless Porpoise

PubMed Central

Yuan, Yuan; Zhang, Peijun; Wang, Kun; Liu, Mingzhong; Li, Jing; Zheng, Jinsong; Wang, Ding; Xu, Wenjie; Lin, Mingli; Dong, Lijun; Zhu, Chenglong; Qiu, Qiang

2018-01-01

The Yangtze finless porpoise (Neophocaena asiaeorientalis ssp. asiaeorientalis) is a subspecies of the narrow-ridged finless porpoise (N. asiaeorientalis). In total, 714.28 gigabases (Gb) of raw reads were generated by whole-genome sequencing of the Yangtze finless porpoise, using an Illumina HiSeq 2000 platform. After filtering the low-quality and duplicated reads, we assembled a draft genome of 2.22 Gb, with contig N50 and scaffold N50 values of 46.69 kilobases (kb) and 1.71 megabases (Mb), respectively. We identified 887.63 Mb of repetitive sequences and predicted 18,479 protein-coding genes in the assembled genome. The phylogenetic tree showed a relationship between the Yangtze finless porpoise and the Yangtze River dolphin, which diverged approximately 20.84 million years ago. In comparisons with the genomes of 10 other mammals, we detected 44 species-specific gene families, 164 expanded gene families, and 313 positively selected genes in the Yangtze finless porpoise genome. The assembled genome sequence and underlying sequence data are available at the National Center for Biotechnology Information under BioProject accession number PRJNA433603. PMID:29659530
The genome and developmental transcriptome of the strongylid nematode Haemonchus contortus

PubMed Central

2013-01-01

Background The barber's pole worm, Haemonchus contortus, is one of the most economically important parasites of small ruminants worldwide. Although this parasite can be controlled using anthelmintic drugs, resistance against most drugs in common use has become a widespread problem. We provide a draft of the genome and the transcriptomes of all key developmental stages of H. contortus to support biological and biotechnological research areas of this and related parasites. Results The draft genome of H. contortus is 320 Mb in size and encodes 23,610 protein-coding genes. On a fundamental level, we elucidate transcriptional alterations taking place throughout the life cycle, characterize the parasite's gene silencing machinery, and explore molecules involved in development, reproduction, host-parasite interactions, immunity, and disease. The secretome of H. contortus is particularly rich in peptidases linked to blood-feeding activity and interactions with host tissues, and a diverse array of molecules is involved in complex immune responses. On an applied level, we predict drug targets and identify vaccine molecules. Conclusions The draft genome and developmental transcriptome of H. contortus provide a major resource to the scientific community for a wide range of genomic, genetic, proteomic, metabolomic, evolutionary, biological, ecological, and epidemiological investigations, and a solid foundation for biotechnological outcomes, including new anthelmintics, vaccines and diagnostic tests. This first draft genome of any strongylid nematode paves the way for a rapid acceleration in our understanding of a wide range of socioeconomically important parasites of one of the largest nematode orders. PMID:23985341
Draft Genome Sequence of a Cellulase-Producing Psychrotrophic Paenibacillus Strain, IHB B 3415, Isolated from the Cold Environment of the Western Himalayas, India.

PubMed

Dhar, Hena; Swarnkar, Mohit Kumar; Gulati, Arvind; Singh, Anil Kumar; Kasana, Ramesh Chand

2015-02-19

Paenibacillus sp. strain IHB B 3415 is a cellulase-producing psychrotrophic bacterium isolated from a soil sample from the cold deserts of Himachal Pradesh, India. Here, we report an 8.44-Mb assembly of its genome sequence with a G+C content of 50.77%. The data presented here will provide insights into the mechanisms of cellulose degradation at low temperature. Copyright © 2015 Dhar et al.
Genetic blueprint of the zoonotic pathogen Toxocara canis

PubMed Central

Zhu, Xing-Quan; Korhonen, Pasi K.; Cai, Huimin; Young, Neil D.; Nejsum, Peter; von Samson-Himmelstjerna, Georg; Boag, Peter R.; Tan, Patrick; Li, Qiye; Min, Jiumeng; Yang, Yulan; Wang, Xiuhua; Fang, Xiaodong; Hall, Ross S.; Hofmann, Andreas; Sternberg, Paul W.; Jex, Aaron R.; Gasser, Robin B.

2015-01-01

Toxocara canis is a zoonotic parasite of major socioeconomic importance worldwide. In humans, this nematode causes disease (toxocariasis) mainly in the under-privileged communities in developed and developing countries. Although relatively well studied from clinical and epidemiological perspectives, to date, there has been no global investigation of the molecular biology of this parasite. Here we use next-generation sequencing to produce a draft genome and transcriptome of T. canis to support future biological and biotechnological investigations. This genome is 317 Mb in size, has a repeat content of 13.5% and encodes at least 18,596 protein-coding genes. We study transcription in a larval, as well as adult female and male stages, characterize the parasite’s gene-silencing machinery, explore molecules involved in development or host–parasite interactions and predict intervention targets. The draft genome of T. canis should provide a useful resource for future molecular studies of this and other, related parasites. PMID:25649139
Draft genome sequence of Paraburkholderia tropica Ppe8 strain, a sugarcane endophytic diazotrophic bacterium.

PubMed

Silva, Paula Renata Alves da; Simões-Araújo, Jean Luiz; Vidal, Márcia Soares; Cruz, Leonardo Magalhães; Souza, Emanuel Maltempi de; Baldani, José Ivo

Paraburkholderia tropica (syn Burkholderia tropica) are nitrogen-fixing bacteria commonly found in sugarcane. The Paraburkholderia tropica strain Ppe8 is part of the sugarcane inoculant consortium that has a beneficial effect on yield. Here, we report a draft genome sequence of this strain elucidating the mechanisms involved in its interaction mainly with Poaceae. A genome size of approximately 8.75Mb containing 7844 protein coding genes distributed in 526 subsystems was de novo assembled with ABySS and annotated by RAST. Genes related to the nitrogen fixation process, the secretion systems (I, II, III, IV, and VI), and related to a variety of metabolic traits, such as metabolism of carbohydrates, amino acids, vitamins, and proteins, were detected, suggesting a broad metabolic capacity and possible adaptation to plant association. Copyright © 2017 Sociedade Brasileira de Microbiologia. Published by Elsevier Editora Ltda. All rights reserved.
A high-density genetic map reveals variation in recombination rate across the genome of Daphnia magna.

PubMed

Dukić, Marinela; Berner, Daniel; Roesti, Marius; Haag, Christoph R; Ebert, Dieter

2016-10-13

Recombination rate is an essential parameter for many genetic analyses. Recombination rates are highly variable across species, populations, individuals and different genomic regions. Due to the profound influence that recombination can have on intraspecific diversity and interspecific divergence, characterization of recombination rate variation emerges as a key resource for population genomic studies and emphasises the importance of high-density genetic maps as tools for studying genome biology. Here we present such a high-density genetic map for Daphnia magna, and analyse patterns of recombination rate across the genome. A F2 intercross panel was genotyped by Restriction-site Associated DNA sequencing to construct the third-generation linkage map of D. magna. The resulting high-density map included 4037 markers covering 813 scaffolds and contigs that sum up to 77 % of the currently available genome draft sequence (v2.4) and 55 % of the estimated genome size (238 Mb). Total genetic length of the map presented here is 1614.5 cM and the genome-wide recombination rate is estimated to 6.78 cM/Mb. Merging genetic and physical information we consistently found that recombination rate estimates are high towards the peripheral parts of the chromosomes, while chromosome centres, harbouring centromeres in D. magna, show very low recombination rate estimates. Due to its high-density, the third-generation linkage map for D. magna can be coupled with the draft genome assembly, providing an essential tool for genome investigation in this model organism. Thus, our linkage map can be used for the on-going improvements of the genome assembly, but more importantly, it has enabled us to characterize variation in recombination rate across the genome of D. magna for the first time. These new insights can provide a valuable assistance in future studies of the genome evolution, mapping of quantitative traits and population genetic studies.
The pomegranate (Punica granatum L.) genome provides insights into fruit quality and ovule developmental biology.

PubMed

Yuan, Zhaohe; Fang, Yanming; Zhang, Taikui; Fei, Zhangjun; Han, Fengming; Liu, Cuiyu; Liu, Min; Xiao, Wei; Zhang, Wenjing; Wu, Shan; Zhang, Mengwei; Ju, Youhui; Xu, Huili; Dai, He; Liu, Yujun; Chen, Yanhui; Wang, Lili; Zhou, Jianqing; Guan, Dian; Yan, Ming; Xia, Yanhua; Huang, Xianbin; Liu, Dongyuan; Wei, Hongmin; Zheng, Hongkun

2017-12-22

Pomegranate (Punica granatum L.) has an ancient cultivation history and has become an emerging profitable fruit crop due to its attractive features such as the bright red appearance and the high abundance of medicinally valuable ellagitannin-based compounds in its peel and aril. However, the limited genomic resources have restricted further elucidation of genetics and evolution of these interesting traits. Here, we report a 274-Mb high-quality draft pomegranate genome sequence, which covers approximately 81.5% of the estimated 336-Mb genome, consists of 2177 scaffolds with an N50 size of 1.7 Mb and contains 30 903 genes. Phylogenomic analysis supported that pomegranate belongs to the Lythraceae family rather than the monogeneric Punicaceae family, and comparative analyses showed that pomegranate and Eucalyptus grandis share the paleotetraploidy event. Integrated genomic and transcriptomic analyses provided insights into the molecular mechanisms underlying the biosynthesis of ellagitannin-based compounds, the colour formation in both peels and arils during pomegranate fruit development, and the unique ovule development processes that are characteristic of pomegranate. This genome sequence provides an important resource to expand our understanding of some unique biological processes and to facilitate both comparative biology studies and crop breeding. © 2017 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Lineage-Specific Biology Revealed by a Finished Genome Assembly of the Mouse

PubMed Central

Hillier, LaDeana W.; Zody, Michael C.; Goldstein, Steve; She, Xinwe; Bult, Carol J.; Agarwala, Richa; Cherry, Joshua L.; DiCuccio, Michael; Hlavina, Wratko; Kapustin, Yuri; Meric, Peter; Maglott, Donna; Birtle, Zoë; Marques, Ana C.; Graves, Tina; Zhou, Shiguo; Teague, Brian; Potamousis, Konstantinos; Churas, Christopher; Place, Michael; Herschleb, Jill; Runnheim, Ron; Forrest, Daniel; Amos-Landgraf, James; Schwartz, David C.; Cheng, Ze; Lindblad-Toh, Kerstin; Eichler, Evan E.; Ponting, Chris P.

2009-01-01

The mouse (Mus musculus) is the premier animal model for understanding human disease and development. Here we show that a comprehensive understanding of mouse biology is only possible with the availability of a finished, high-quality genome assembly. The finished clone-based assembly of the mouse strain C57BL/6J reported here has over 175,000 fewer gaps and over 139 Mb more of novel sequence, compared with the earlier MGSCv3 draft genome assembly. In a comprehensive analysis of this revised genome sequence, we are now able to define 20,210 protein-coding genes, over a thousand more than predicted in the human genome (19,042 genes). In addition, we identified 439 long, non–protein-coding RNAs with evidence for transcribed orthologs in human. We analyzed the complex and repetitive landscape of 267 Mb of sequence that was missing or misassembled in the previously published assembly, and we provide insights into the reasons for its resistance to sequencing and assembly by whole-genome shotgun approaches. Duplicated regions within newly assembled sequence tend to be of more recent ancestry than duplicates in the published draft, correcting our initial understanding of recent evolution on the mouse lineage. These duplicates appear to be largely composed of sequence regions containing transposable elements and duplicated protein-coding genes; of these, some may be fixed in the mouse population, but at least 40% of segmentally duplicated sequences are copy number variable even among laboratory mouse strains. Mouse lineage-specific regions contain 3,767 genes drawn mainly from rapidly-changing gene families associated with reproductive functions. The finished mouse genome assembly, therefore, greatly improves our understanding of rodent-specific biology and allows the delineation of ancestral biological functions that are shared with human from derived functions that are not. PMID:19468303
Draft genome sequence of the silver pomfret fish, Pampus argenteus.

PubMed

AlMomin, Sabah; Kumar, Vinod; Al-Amad, Sami; Al-Hussaini, Mohsen; Dashti, Talal; Al-Enezi, Khaznah; Akbar, Abrar

2016-01-01

Silver pomfret, Pampus argenteus, is a fish species from coastal waters. Despite its high commercial value, this edible fish has not been sequenced. Hence, its genetic and genomic studies have been limited. We report the first draft genome sequence of the silver pomfret obtained using a Next Generation Sequencing (NGS) technology. We assembled 38.7 Gb of nucleotides into scaffolds of 350 Mb with N50 of about 1.5 kb, using high quality paired end reads. These scaffolds represent 63.7% of the estimated silver pomfret genome length. The newly sequenced and assembled genome has 11.06% repetitive DNA regions, and this percentage is comparable to that of the tilapia genome. The genome analysis predicted 16 322 genes. About 91% of these genes showed homology with known proteins. Many gene clusters were annotated to protein and fatty-acid metabolism pathways that may be important in the context of the meat texture and immune system developmental processes. The reference genome can pave the way for the identification of many other genomic features that could improve breeding and population-management strategies, and it can also help characterize the genetic diversity of P. argenteus.
Rapid construction of genome map for large yellow croaker (Larimichthys crocea) by the whole-genome mapping in BioNano Genomics Irys system.

PubMed

Xiao, Shijun; Li, Jiongtang; Ma, Fengshou; Fang, Lujing; Xu, Shuangbin; Chen, Wei; Wang, Zhi Yong

2015-09-03

Large yellow croaker (Larimichthys crocea) is an important commercial fish in China and East-Asia. The annual product of the species from the aqua-farming industry is about 90 thousand tons. In spite of its economic importance, genetic studies of economic traits and genomic selections of the species are hindered by the lack of genomic resources. Specifically, a whole-genome physical map of large yellow croaker is still missing. The traditional BAC-based fingerprint method is extremely time- and labour-consuming. Here we report the first genome map construction using the high-throughput whole-genome mapping technique by nanochannel arrays in BioNano Genomics Irys system. For an optimal marker density of ~10 per 100 kb, the nicking endonuclease Nt.BspQ1 was chosen for the genome map generation. 645,305 DNA molecules with a total length of ~112 Gb were labelled and detected, covering more than 160X of the large yellow croaker genome. Employing IrysView package and signature patterns in raw DNA molecules, a whole-genome map of large yellow croaker was assembled into 686 maps with a total length of 727 Mb, which was consistent with the estimated genome size. The N50 length of the whole-genome map, including 126 maps, was up to 1.7 Mb. The excellent hybrid alignment with large yellow croaker draft genome validated the consensus genome map assembly and highlighted a promising application of whole-genome mapping on draft genome sequence super-scaffolding. The genome map data of large yellow croaker are accessible on lycgenomics.jmu.edu.cn/pm. Using the state-of-the-art whole-genome mapping technique in Irys system, the first whole-genome map for large yellow croaker has been constructed and thus highly facilitates the ongoing genomic and evolutionary studies for the species. To our knowledge, this is the first public report on genome map construction by the whole-genome mapping for aquatic-organisms. Our study demonstrates a promising application of the whole-genome mapping on genome maps construction for other non-model organisms in a fast and reliable manner.
Draft genome sequence of Micrococcus luteus strain O'Kane implicates metabolic versatility and the potential to degrade polyhydroxybutyrates.

PubMed

Hanafy, Radwa A; Couger, M B; Baker, Kristina; Murphy, Chelsea; O'Kane, Shannon D; Budd, Connie; French, Donald P; Hoff, Wouter D; Youssef, Noha

2016-09-01

Micrococcus luteus is a predominant member of skin microbiome. We here report on the genomic analysis of Micrococcus luteus strain O'Kane that was isolated from an elevator. The partial genome assembly of Micrococcus luteus strain O'Kane is 2.5 Mb with 2256 protein-coding genes and 62 RNA genes. Genomic analysis revealed metabolic versatility with genes involved in the metabolism and transport of glucose, galactose, fructose, mannose, alanine, aspartate, asparagine, glutamate, glutamine, glycine, serine, cysteine, methionine, arginine, proline, histidine, phenylalanine, and fatty acids. Genomic comparison to other M. luteus representatives identified the potential to degrade polyhydroxybutyrates, as well as several antibiotic resistance genes absent from other genomes.
Genome sequencing of adzuki bean (Vigna angularis) provides insight into high starch and low fat accumulation and domestication.

PubMed

Yang, Kai; Tian, Zhixi; Chen, Chunhai; Luo, Longhai; Zhao, Bo; Wang, Zhuo; Yu, Lili; Li, Yisong; Sun, Yudong; Li, Weiyu; Chen, Yan; Li, Yongqiang; Zhang, Yueyang; Ai, Danjiao; Zhao, Jinyang; Shang, Cheng; Ma, Yong; Wu, Bin; Wang, Mingli; Gao, Li; Sun, Dongjing; Zhang, Peng; Guo, Fangfang; Wang, Weiwei; Li, Yuan; Wang, Jinlong; Varshney, Rajeev K; Wang, Jun; Ling, Hong-Qing; Wan, Ping

2015-10-27

Adzuki bean (Vigna angularis), an important legume crop, is grown in more than 30 countries of the world. The seed of adzuki bean, as an important source of starch, digestible protein, mineral elements, and vitamins, is widely used foods for at least a billion people. Here, we generated a high-quality draft genome sequence of adzuki bean by whole-genome shotgun sequencing. The assembled contig sequences reached to 450 Mb (83% of the genome) with an N50 of 38 kb, and the total scaffold sequences were 466.7 Mb with an N50 of 1.29 Mb. Of them, 372.9 Mb of scaffold sequences were assigned to the 11 chromosomes of adzuki bean by using a single nucleotide polymorphism genetic map. A total of 34,183 protein-coding genes were predicted. Functional analysis revealed that significant differences in starch and fat content between adzuki bean and soybean were likely due to transcriptional abundance, rather than copy number variations, of the genes related to starch and oil synthesis. We detected strong selection signals in domestication by the population analysis of 50 accessions including 11 wild, 11 semiwild, 17 landraces, and 11 improved varieties. In addition, the semiwild accessions were illuminated to have a closer relationship to the cultigen accessions than the wild type, suggesting that the semiwild adzuki bean might be a preliminary landrace and play some roles in the adzuki bean domestication. The genome sequence of adzuki bean will facilitate the identification of agronomically important genes and accelerate the improvement of adzuki bean.
Genome sequencing of adzuki bean (Vigna angularis) provides insight into high starch and low fat accumulation and domestication

PubMed Central

Yang, Kai; Tian, Zhixi; Chen, Chunhai; Luo, Longhai; Zhao, Bo; Wang, Zhuo; Yu, Lili; Li, Yisong; Sun, Yudong; Li, Weiyu; Chen, Yan; Li, Yongqiang; Zhang, Yueyang; Ai, Danjiao; Zhao, Jinyang; Shang, Cheng; Ma, Yong; Wu, Bin; Wang, Mingli; Gao, Li; Sun, Dongjing; Zhang, Peng; Guo, Fangfang; Wang, Weiwei; Li, Yuan; Wang, Jinlong; Varshney, Rajeev K.; Wang, Jun; Ling, Hong-Qing; Wan, Ping

2015-01-01

Adzuki bean (Vigna angularis), an important legume crop, is grown in more than 30 countries of the world. The seed of adzuki bean, as an important source of starch, digestible protein, mineral elements, and vitamins, is widely used foods for at least a billion people. Here, we generated a high-quality draft genome sequence of adzuki bean by whole-genome shotgun sequencing. The assembled contig sequences reached to 450 Mb (83% of the genome) with an N50 of 38 kb, and the total scaffold sequences were 466.7 Mb with an N50 of 1.29 Mb. Of them, 372.9 Mb of scaffold sequences were assigned to the 11 chromosomes of adzuki bean by using a single nucleotide polymorphism genetic map. A total of 34,183 protein-coding genes were predicted. Functional analysis revealed that significant differences in starch and fat content between adzuki bean and soybean were likely due to transcriptional abundance, rather than copy number variations, of the genes related to starch and oil synthesis. We detected strong selection signals in domestication by the population analysis of 50 accessions including 11 wild, 11 semiwild, 17 landraces, and 11 improved varieties. In addition, the semiwild accessions were illuminated to have a closer relationship to the cultigen accessions than the wild type, suggesting that the semiwild adzuki bean might be a preliminary landrace and play some roles in the adzuki bean domestication. The genome sequence of adzuki bean will facilitate the identification of agronomically important genes and accelerate the improvement of adzuki bean. PMID:26460024
High quality draft genome sequences of Pseudomonas fulva DSM 17717 T, Pseudomonas parafulva DSM 17004 T and Pseudomonas cremoricolorata DSM 17059 T type strains

DOE PAGES

Peña, Arantxa; Busquets, Antonio; Gomila, Margarita; ...

2016-09-01

Pseudomonas has the highest number of species out of any genus of Gram-negative bacteria and is phylogenetically divided into several groups. The Pseudomonas putida phylogenetic branch includes at least 13 species of environmental and industrial interest, plant-associated bacteria, insect pathogens, and even some members that have been found in clinical specimens. In the context of the Genomic Encyclopedia of Bacteria and Archaea project, we present the permanent, high-quality draft genomes of the type strains of 3 taxonomically and ecologically closely related species in the Pseudomonas putida phylogenetic branch: Pseudomonas fulva DSM 17717 T, Pseudomonas parafulva DSM 17004 T and Pseudomonasmore » cremoricolorata DSM 17059T. All three genomes are comparable in size (4.6-4.9Mb), with 4,119-4,459 protein-coding genes. Average nucleotide identity based on BLAST comparisons and digital genome-to-genome distance calculations are in good agreement with experimental DNA-DNA hybridization results. The genome sequences presented here will be very helpful in elucidating the taxonomy, phylogeny and evolution of the Pseudomonas putida species complex.« less
High quality draft genome sequences of Pseudomonas fulva DSM 17717 T, Pseudomonas parafulva DSM 17004 T and Pseudomonas cremoricolorata DSM 17059 T type strains

DOE Office of Scientific and Technical Information (OSTI.GOV)

Peña, Arantxa; Busquets, Antonio; Gomila, Margarita

Pseudomonas has the highest number of species out of any genus of Gram-negative bacteria and is phylogenetically divided into several groups. The Pseudomonas putida phylogenetic branch includes at least 13 species of environmental and industrial interest, plant-associated bacteria, insect pathogens, and even some members that have been found in clinical specimens. In the context of the Genomic Encyclopedia of Bacteria and Archaea project, we present the permanent, high-quality draft genomes of the type strains of 3 taxonomically and ecologically closely related species in the Pseudomonas putida phylogenetic branch: Pseudomonas fulva DSM 17717 T, Pseudomonas parafulva DSM 17004 T and Pseudomonasmore » cremoricolorata DSM 17059T. All three genomes are comparable in size (4.6-4.9Mb), with 4,119-4,459 protein-coding genes. Average nucleotide identity based on BLAST comparisons and digital genome-to-genome distance calculations are in good agreement with experimental DNA-DNA hybridization results. The genome sequences presented here will be very helpful in elucidating the taxonomy, phylogeny and evolution of the Pseudomonas putida species complex.« less
The draft genome of blunt snout bream (Megalobrama amblycephala) reveals the development of intermuscular bone and adaptation to herbivorous diet

PubMed Central

Liu, Han; Chen, Chunhai; Gao, Zexia; Min, Jiumeng; Gu, Yongming; Jian, Jianbo; Jiang, Xiewu; Cai, Huimin; Ebersberger, Ingo; Xu, Meng; Zhang, Xinhui; Chen, Jianwei; Luo, Wei; Chen, Boxiang; Chen, Junhui; Liu, Hong; Li, Jiang; Lai, Ruifang; Bai, Mingzhou; Wei, Jin; Yi, Shaokui; Wang, Huanling; Cao, Xiaojuan; Zhou, Xiaoyun; Zhao, Yuhua; Wei, Kaijian; Yang, Ruibin; Liu, Bingnan; Zhao, Shancen; Fang, Xiaodong

2017-01-01

Abstract The blunt snout bream Megalobrama amblycephala is the economically most important cyprinid fish species. As an herbivore, it can be grown by eco-friendly and resource-conserving aquaculture. However, the large number of intermuscular bones in the trunk musculature is adverse to fish meat processing and consumption. As a first towards optimizing this aquatic livestock, we present a 1.116-Gb draft genome of M. amblycephala, with 779.54 Mb anchored on 24 linkage groups. Integrating spatiotemporal transcriptome analyses, we show that intermuscular bone is formed in the more basal teleosts by intramembranous ossification and may be involved in muscle contractibility and coordinating cellular events. Comparative analysis revealed that olfactory receptor genes, especially of the beta type, underwent an extensive expansion in herbivorous cyprinids, whereas the gene for the umami receptor T1R1 was specifically lost in M. amblycephala. The composition of gut microflora, which contributes to the herbivorous adaptation of M. amblycephala, was found to be similar to that of other herbivores. As a valuable resource for the improvement of M. amblycephala livestock, the draft genome sequence offers new insights into the development of intermuscular bone and herbivorous adaptation. PMID:28535200

Draft Genome Sequence of Methanoculleus sediminis S3FaT, a Hydrogenotrophic Methanogen Isolated from a Submarine Mud Volcano in Taiwan.

PubMed

Chen, Sheng-Chung; Chen, Mei-Fei; Weng, Chieh-Yin; Lai, Mei-Chin; Wu, Sue-Yao

2016-04-21

Here, we announce the genome sequence of ITALIC! Methanoculleus sediminisS3Fa(T)(DSM 29354(T)), a strict anaerobic methanoarchaeon, which was isolated from sediments near the submarine mud volcano MV4 located offshore in southwestern Taiwan. The 2.49-Mb genome consists of 2,459 predicted genes, 3 rRNAs, 48 tRNAs, and 1 ncRNA. The sequence of this novel strain may provide more information for species delineation and the roles that this strain plays in the unique marine mud volcano habitat. Copyright © 2016 Chen et al.
A High-Resolution SNP Array-Based Linkage Map Anchors a New Domestic Cat Draft Genome Assembly and Provides Detailed Patterns of Recombination.

PubMed

Li, Gang; Hillier, LaDeana W; Grahn, Robert A; Zimin, Aleksey V; David, Victor A; Menotti-Raymond, Marilyn; Middleton, Rondo; Hannah, Steven; Hendrickson, Sher; Makunin, Alex; O'Brien, Stephen J; Minx, Pat; Wilson, Richard K; Lyons, Leslie A; Warren, Wesley C; Murphy, William J

2016-06-01

High-resolution genetic and physical maps are invaluable tools for building accurate genome assemblies, and interpreting results of genome-wide association studies (GWAS). Previous genetic and physical maps anchored good quality draft assemblies of the domestic cat genome, enabling the discovery of numerous genes underlying hereditary disease and phenotypes of interest to the biomedical science and breeding communities. However, these maps lacked sufficient marker density to order thousands of shorter scaffolds in earlier assemblies, which instead relied heavily on comparative mapping with related species. A high-resolution map would aid in validating and ordering chromosome scaffolds from existing and new genome assemblies. Here, we describe a high-resolution genetic linkage map of the domestic cat genome based on genotyping 453 domestic cats from several multi-generational pedigrees on the Illumina 63K SNP array. The final maps include 58,055 SNP markers placed relative to 6637 markers with unique positions, distributed across all autosomes and the X chromosome. Our final sex-averaged maps span a total autosomal length of 4464 cM, the longest described linkage map for any mammal, confirming length estimates from a previous microsatellite-based map. The linkage map was used to order and orient the scaffolds from a substantially more contiguous domestic cat genome assembly (Felis catus v8.0), which incorporated ∼20 × coverage of Illumina fragment reads. The new genome assembly shows substantial improvements in contiguity, with a nearly fourfold increase in N50 scaffold size to 18 Mb. We use this map to report probable structural errors in previous maps and assemblies, and to describe features of the recombination landscape, including a massive (∼50 Mb) recombination desert (of virtually zero recombination) on the X chromosome that parallels a similar desert on the porcine X chromosome in both size and physical location. Copyright © 2016 Li et al.
Host-Associated Genomic Features of the Novel Uncultured Intracellular Pathogen Ca. Ichthyocystis Revealed by Direct Sequencing of Epitheliocysts

PubMed Central

Qi, Weihong; Vaughan, Lloyd; Katharios, Pantelis; Schlapbach, Ralph; Seth-Smith, Helena M.B.

2016-01-01

Advances in single-cell and mini-metagenome sequencing have enabled important investigations into uncultured bacteria. In this study, we applied the mini-metagenome sequencing method to assemble genome drafts of the uncultured causative agents of epitheliocystis, an emerging infectious disease in the Mediterranean aquaculture species gilthead seabream. We sequenced multiple cyst samples and constructed 11 genome drafts from a novel beta-proteobacterial lineage, Candidatus Ichthyocystis. The draft genomes demonstrate features typical of pathogenic bacteria with an obligate intracellular lifestyle: a reduced genome of up to 2.6 Mb, reduced G + C content, and reduced metabolic capacity. Reconstruction of metabolic pathways reveals that Ca. Ichthyocystis genomes lack all amino acid synthesis pathways, compelling them to scavenge from the fish host. All genomes encode type II, III, and IV secretion systems, a large repertoire of predicted effectors, and a type IV pilus. These are all considered to be virulence factors, required for adherence, invasion, and host manipulation. However, no evidence of lipopolysaccharide synthesis could be found. Beyond the core functions shared within the genus, alignments showed distinction into different species, characterized by alternative large gene families. These comprise up to a third of each genome, appear to have arisen through duplication and diversification, encode many effector proteins, and are seemingly critical for virulence. Thus, Ca. Ichthyocystis represents a novel obligatory intracellular pathogenic beta-proteobacterial lineage. The methods used: mini-metagenome analysis and manual annotation, have generated important insights into the lifestyle and evolution of the novel, uncultured pathogens, elucidating many putative virulence factors including an unprecedented array of novel gene families. PMID:27190004
Single-molecule sequencing and optical mapping yields an improved genome of woodland strawberry (Fragaria vesca) with chromosome-scale contiguity.

PubMed

Edger, Patrick P; VanBuren, Robert; Colle, Marivi; Poorten, Thomas J; Wai, Ching Man; Niederhuth, Chad E; Alger, Elizabeth I; Ou, Shujun; Acharya, Charlotte B; Wang, Jie; Callow, Pete; McKain, Michael R; Shi, Jinghua; Collier, Chad; Xiong, Zhiyong; Mower, Jeffrey P; Slovin, Janet P; Hytönen, Timo; Jiang, Ning; Childs, Kevin L; Knapp, Steven J

2018-02-01

Although draft genomes are available for most agronomically important plant species, the majority are incomplete, highly fragmented, and often riddled with assembly and scaffolding errors. These assembly issues hinder advances in tool development for functional genomics and systems biology. Here we utilized a robust, cost-effective approach to produce high-quality reference genomes. We report a near-complete genome of diploid woodland strawberry (Fragaria vesca) using single-molecule real-time sequencing from Pacific Biosciences (PacBio). This assembly has a contig N50 length of ∼7.9 million base pairs (Mb), representing a ∼300-fold improvement of the previous version. The vast majority (>99.8%) of the assembly was anchored to 7 pseudomolecules using 2 sets of optical maps from Bionano Genomics. We obtained ∼24.96 Mb of sequence not present in the previous version of the F. vesca genome and produced an improved annotation that includes 1496 new genes. Comparative syntenic analyses uncovered numerous, large-scale scaffolding errors present in each chromosome in the previously published version of the F. vesca genome. Our results highlight the need to improve existing short-read based reference genomes. Furthermore, we demonstrate how genome quality impacts commonly used analyses for addressing both fundamental and applied biological questions. © The Authors 2017. Published by Oxford University Press.
Use of a draft genome of coffee (Coffea arabica) to identify SNPs associated with caffeine content.

PubMed

Tran, Hue T M; Ramaraj, Thiruvarangan; Furtado, Agnelo; Lee, Leonard Slade; Henry, Robert J

2018-03-07

Arabica coffee (Coffea arabica) has a small gene pool limiting genetic improvement. Selection for caffeine content within this gene pool would be assisted by identification of the genes controlling this important trait. Sequencing of DNA bulks from 18 genotypes with extreme high- or low-caffeine content from a population of 232 genotypes was used to identify linked polymorphisms. To obtain a reference genome, a whole genome assembly of arabica coffee (variety K7) was achieved by sequencing using short read (Illumina) and long-read (PacBio) technology. Assembly was performed using a range of assembly tools resulting in 76 409 scaffolds with a scaffold N50 of 54 544 bp and a total scaffold length of 1448 Mb. Validation of the genome assembly using different tools showed high completeness of the genome. More than 99% of transcriptome sequences mapped to the C. arabica draft genome, and 89% of BUSCOs were present. The assembled genome annotated using AUGUSTUS yielded 99 829 gene models. Using the draft arabica genome as reference in mapping and variant calling allowed the detection of 1444 nonsynonymous single nucleotide polymorphisms (SNPs) associated with caffeine content. Based on Kyoto Encyclopaedia of Genes and Genomes pathway-based analysis, 65 caffeine-associated SNPs were discovered, among which 11 SNPs were associated with genes encoding enzymes involved in the conversion of substrates, which participate in the caffeine biosynthesis pathways. This analysis demonstrated the complex genetic control of this key trait in coffee. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Whole-genome sequencing of Aspergillus tubingensis G131 and overview of its secondary metabolism potential.

PubMed

Choque, Elodie; Klopp, Christophe; Valiere, Sophie; Raynal, José; Mathieu, Florence

2018-03-15

Black Aspergilli represent one of the most important fungal resources of primary and secondary metabolites for biotechnological industry. Having several black Aspergilli sequenced genomes should allow targeting the production of certain metabolites with bioactive properties. In this study, we report the draft genome of a black Aspergilli, A. tubingensis G131, isolated from a French Mediterranean vineyard. This 35 Mb genome includes 10,994 predicted genes. A genomic-based discovery identifies 80 secondary metabolites biosynthetic gene clusters. Genomic sequences of these clusters were blasted on 3 chosen black Aspergilli genomes: A. tubingensis CBS 134.48, A. niger CBS 513.88 and A. kawachii IFO 4308. This comparison highlights different levels of clusters conservation between the four strains. It also allows identifying seven unique clusters in A. tubingensis G131. Moreover, the putative secondary metabolites clusters for asperazine and naphtho-gamma-pyrones production were proposed based on this genomic analysis. Key biosynthetic genes required for the production of 2 mycotoxins, ochratoxin A and fumonisin, are absent from this draft genome. Even if intergenic sequences of these mycotoxins biosynthetic pathways are present, this could not lead to the production of those mycotoxins by A. tubingensis G131. Functional and bioinformatics analyses of A. tubingensis G131 genome highlight its potential for metabolites production in particular for TAN-1612, asperazine and naphtho-gamma-pyrones presenting antioxidant, anticancer or antibiotic properties.
An SNP resource for rice genetics and breeding based on subspecies indica and japonica genome alignments.

PubMed

Feltus, F Alex; Wan, Jun; Schulze, Stefan R; Estill, James C; Jiang, Ning; Paterson, Andrew H

2004-09-01

Dense coverage of the rice genome with polymorphic DNA markers is an invaluable tool for DNA marker-assisted breeding, positional cloning, and a wide range of evolutionary studies. We have aligned drafts of two rice subspecies, indica and japonica, and analyzed levels and patterns of genetic diversity. After filtering multiple copy and low quality sequence, 408,898 candidate DNA polymorphisms (SNPs/INDELs) were discerned between the two subspecies. These filters have the consequence that our data set includes only a subset of the available SNPs (in particular excluding large numbers of SNPs that may occur between repetitive DNA alleles) but increase the likelihood that this subset is useful: Direct sequencing suggests that 79.8% +/- 7.5% of the in silico SNPs are real. The SNP sample in our database is not randomly distributed across the genome. In fact, 566 rice genomic regions had unusually high (328 contigs/48.6 Mb/13.6% of genome) or low (237 contigs/64.7 Mb/18.1% of genome) polymorphism rates. Many SNP-poor regions were substantially longer than most SNP-rich regions, covering up to 4 Mb, and possibly reflecting introgression between the respective gene pools that may have occurred hundreds of years ago. Although 46.2% +/- 8.3% of the SNPs differentiate other pairs of japonica and indica genotypes, SNP rates in rice were not predictive of evolutionary rates for corresponding genes in another grass species, sorghum. The data set is freely available at http://www.plantgenome.uga.edu/snp.
An SNP Resource for Rice Genetics and Breeding Based on Subspecies Indica and Japonica Genome Alignments

PubMed Central

Feltus, F. Alex; Wan, Jun; Schulze, Stefan R.; Estill, James C.; Jiang, Ning; Paterson, Andrew H.

2004-01-01

Dense coverage of the rice genome with polymorphic DNA markers is an invaluable tool for DNA marker-assisted breeding, positional cloning, and a wide range of evolutionary studies. We have aligned drafts of two rice subspecies, indica and japonica, and analyzed levels and patterns of genetic diversity. After filtering multiple copy and low quality sequence, 408,898 candidate DNA polymorphisms (SNPs/INDELs) were discerned between the two subspecies. These filters have the consequence that our data set includes only a subset of the available SNPs (in particular excluding large numbers of SNPs that may occur between repetitive DNA alleles) but increase the likelihood that this subset is useful: Direct sequencing suggests that 79.8% ± 7.5% of the in silico SNPs are real. The SNP sample in our database is not randomly distributed across the genome. In fact, 566 rice genomic regions had unusually high (328 contigs/48.6 Mb/13.6% of genome) or low (237 contigs/64.7 Mb/18.1% of genome) polymorphism rates. Many SNP-poor regions were substantially longer than most SNP-rich regions, covering up to 4 Mb, and possibly reflecting introgression between the respective gene pools that may have occurred hundreds of years ago. Although 46.2% ± 8.3% of the SNPs differentiate other pairs of japonica and indica genotypes, SNP rates in rice were not predictive of evolutionary rates for corresponding genes in another grass species, sorghum. The data set is freely available at http://www.plantgenome.uga.edu/snp. PMID:15342564
Draft genome assembly of the Bengalese finch, Lonchura striata domestica, a model for motor skill variability and learning

PubMed Central

Mets, David G; Brainard, Michael S

2018-01-01

Abstract Background Vocal learning in songbirds has emerged as a powerful model for sensorimotor learning. Neurobehavioral studies of Bengalese finch (Lonchura striata domestica) song, naturally more variable and plastic than songs of other finch species, have demonstrated the importance of behavioral variability for initial learning, maintenance, and plasticity of vocalizations. However, the molecular and genetic underpinnings of this variability and the learning it supports are poorly understood. Findings To establish a platform for the molecular analysis of behavioral variability and plasticity, we generated an initial draft assembly of the Bengalese finch genome from a single male animal to 151× coverage and an N50 of 3.0 MB. Furthermore, we developed an initial set of gene models using RNA-seq data from 8 samples that comprise liver, muscle, cerebellum, brainstem/midbrain, and forebrain tissue from juvenile and adult Bengalese finches of both sexes. Conclusions We provide a draft Bengalese finch genome and gene annotation to facilitate the study of the molecular-genetic influences on behavioral variability and the process of vocal learning. These data will directly support many avenues for the identification of genes involved in learning, including differential expression analysis, comparative genomic analysis (through comparison to existing avian genome assemblies), and derivation of genetic maps for linkage analysis. Bengalese finch gene models and sequences will be essential for subsequent manipulation (molecular or genetic) of genes and gene products, enabling novel mechanistic investigations into the role of variability in learned behavior. PMID:29618046
Draft genome assembly of the Bengalese finch, Lonchura striata domestica, a model for motor skill variability and learning.

PubMed

Colquitt, Bradley M; Mets, David G; Brainard, Michael S

2018-03-01

Vocal learning in songbirds has emerged as a powerful model for sensorimotor learning. Neurobehavioral studies of Bengalese finch (Lonchura striata domestica) song, naturally more variable and plastic than songs of other finch species, have demonstrated the importance of behavioral variability for initial learning, maintenance, and plasticity of vocalizations. However, the molecular and genetic underpinnings of this variability and the learning it supports are poorly understood. To establish a platform for the molecular analysis of behavioral variability and plasticity, we generated an initial draft assembly of the Bengalese finch genome from a single male animal to 151× coverage and an N50 of 3.0 MB. Furthermore, we developed an initial set of gene models using RNA-seq data from 8 samples that comprise liver, muscle, cerebellum, brainstem/midbrain, and forebrain tissue from juvenile and adult Bengalese finches of both sexes. We provide a draft Bengalese finch genome and gene annotation to facilitate the study of the molecular-genetic influences on behavioral variability and the process of vocal learning. These data will directly support many avenues for the identification of genes involved in learning, including differential expression analysis, comparative genomic analysis (through comparison to existing avian genome assemblies), and derivation of genetic maps for linkage analysis. Bengalese finch gene models and sequences will be essential for subsequent manipulation (molecular or genetic) of genes and gene products, enabling novel mechanistic investigations into the role of variability in learned behavior.
Draft Genome Sequence of Eggplant (Solanum melongena L.): the Representative Solanum Species Indigenous to the Old World

PubMed Central

Hirakawa, Hideki; Shirasawa, Kenta; Miyatake, Koji; Nunome, Tsukasa; Negoro, Satomi; Ohyama, Akio; Yamaguchi, Hirotaka; Sato, Shusei; Isobe, Sachiko; Tabata, Satoshi; Fukuoka, Hiroyuki

2014-01-01

Unlike other important Solanaceae crops such as tomato, potato, chili pepper, and tobacco, all of which originated in South America and are cultivated worldwide, eggplant (Solanum melongena L.) is indigenous to the Old World and in this respect it is phylogenetically unique. To broaden our knowledge of the genomic nature of solanaceous plants further, we dissected the eggplant genome and built a draft genome dataset with 33,873 scaffolds termed SME_r2.5.1 that covers 833.1 Mb, ca. 74% of the eggplant genome. Approximately 90% of the gene space was estimated to be covered by SME_r2.5.1 and 85,446 genes were predicted in the genome. Clustering analysis of the predicted genes of eggplant along with the genes of three other solanaceous plants as well as Arabidopsis thaliana revealed that, of the 35,000 clusters generated, 4,018 were exclusively composed of eggplant genes that would perhaps confer eggplant-specific traits. Between eggplant and tomato, 16,573 pairs of genes were deduced to be orthologous, and 9,489 eggplant scaffolds could be mapped onto the tomato genome. Furthermore, 56 conserved synteny blocks were identified between the two species. The detailed comparative analysis of the eggplant and tomato genomes will facilitate our understanding of the genomic architecture of solanaceous plants, which will contribute to cultivation and further utilization of these crops. PMID:25233906
Draft genome and reference transcriptomic resources for the urticating pine defoliator Thaumetopoea pityocampa (Lepidoptera: Notodontidae).

PubMed

Gschloessl, B; Dorkeld, F; Berges, H; Beydon, G; Bouchez, O; Branco, M; Bretaudeau, A; Burban, C; Dubois, E; Gauthier, P; Lhuillier, E; Nichols, J; Nidelet, S; Rocha, S; Sauné, L; Streiff, R; Gautier, M; Kerdelhué, C

2018-05-01

The pine processionary moth Thaumetopoea pityocampa (Lepidoptera: Notodontidae) is the main pine defoliator in the Mediterranean region. Its urticating larvae cause severe human and animal health concerns in the invaded areas. This species shows a high phenotypic variability for various traits, such as phenology, fecundity and tolerance to extreme temperatures. This study presents the construction and analysis of extensive genomic and transcriptomic resources, which are an obligate prerequisite to understand their underlying genetic architecture. Using a well-studied population from Portugal with peculiar phenological characteristics, the karyotype was first determined and a first draft genome of 537 Mb total length was assembled into 68,292 scaffolds (N50 = 164 kb). From this genome assembly, 29,415 coding genes were predicted. To circumvent some limitations for fine-scale physical mapping of genomic regions of interest, a 3X coverage BAC library was also developed. In particular, 11 BACs from this library were individually sequenced to assess the assembly quality. Additionally, de novo transcriptomic resources were generated from various developmental stages sequenced with HiSeq and MiSeq Illumina technologies. The reads were de novo assembled into 62,376 and 63,175 transcripts, respectively. Then, a robust subset of the genome-predicted coding genes, the de novo transcriptome assemblies and previously published 454/Sanger data were clustered to obtain a high-quality and comprehensive reference transcriptome consisting of 29,701 bona fide unigenes. These sequences covered 99% of the cegma and 88% of the busco highly conserved eukaryotic genes and 84% of the busco arthropod gene set. Moreover, 90% of these transcripts could be localized on the draft genome. The described information is available via a genome annotation portal (http://bipaa.genouest.org/sp/thaumetopoea_pityocampa/). © 2018 John Wiley & Sons Ltd.
Draft genome sequence of non-shiga toxin-producing Escherichia coli O157 NCCP15738.

PubMed

Kwon, Taesoo; Kim, Jung-Beom; Bak, Young-Seok; Yu, Young-Bin; Kwon, Ki Sung; Kim, Won; Cho, Seung-Hak

2016-01-01

The non-shiga toxin-producing Escherichia coli (non-STEC) O157 is a pathogenic strain that cause diarrhea but does not cause hemolytic-uremic syndrome, or hemorrhagic colitis. Here, we present the 5-Mb draft genome sequence of non-STEC O157 NCCP15738, which was isolated from the feces of a Korean patient with diarrhea, and describe its features and the structural basis for its genome evolution. A total of 565-Mbp paired-end reads were generated using the Illumina-HiSeq 2000 platform. The reads were assembled into 135 scaffolds throughout the de novo assembly. The assembled genome size of NCCP15738 was 5,005,278 bp with an N50 value of 142,450 bp and 50.65 % G+C content. Using Rapid Annotation using Subsystem Technology analysis, we predicted 4780 ORFs and 31 RNA genes. The evolutionary tree was inferred from multiple sequence alignment of 45 E. coli species. The most closely related neighbor of NCCP15738 indicated by whole-genome phylogeny was E. coli UMNK88, but that indicated by multilocus sequence analysis was E. coli DH1(ME8569). A comparison between the NCCP15738 genome and those of reference strains, E. coli K-12 substr. MG1655 and EHEC O157:H7 EDL933 by bioinformatics analyses revealed unique genes in NCCP15738 associated with lysis protein S, two-component signal transduction system, conjugation, the flagellum, nucleotide-binding proteins, and metal-ion binding proteins. Notably, NCCP15738 has a dual flagella system like that in Vibrio parahaemolyticus, Aeromonas spp., and Rhodospirillum centenum. The draft genome sequence and the results of bioinformatics analysis of NCCP15738 provide the basis for understanding the genomic evolution of this strain.
Genome and transcriptome of the porcine whipworm Trichuris suis

PubMed Central

Jex, Aaron R.; Nejsum, Peter; Schwarz, Erich M.; Hu, Li; Young, Neil D.; Hall, Ross S.; Korhonen, Pasi K.; Liao, Shengguang; Thamsborg, Stig; Xia, Jinquan; Xu, Pengwei; Wang, Shaowei; Scheerlinck, Jean-Pierre Y.; Hofmann, Andreas; Sternberg, Paul W.; Wang, Jun; Gasser, Robin B.

2014-01-01

Trichuris (whipworm) infects 1 billion people worldwide, and causes a disease (trichuriasis) that results in major socioeconomic losses in both humans and pigs. Trichuriasis relates to an inflammation of the large intestine manifested in bloody diarrhoea, and chronic disease can cause malnourishment and stunting in children. Paradoxically, Trichuris of pigs has shown substantial promise as a treatment for human autoimmune disorders, including inflammatory bowel disease (IBD) and multiple sclerosis (MS). Here, we report ~80 megabase (Mb) draft assemblies of the genomes of adult male and female T. suis, and explore stage-, sex- and tissue-specific transcription of messenger and small non-coding RNAs. PMID:24929829
Genome and transcriptome of the porcine whipworm Trichuris suis.

PubMed

Jex, Aaron R; Nejsum, Peter; Schwarz, Erich M; Hu, Li; Young, Neil D; Hall, Ross S; Korhonen, Pasi K; Liao, Shengguang; Thamsborg, Stig; Xia, Jinquan; Xu, Pengwei; Wang, Shaowei; Scheerlinck, Jean-Pierre Y; Hofmann, Andreas; Sternberg, Paul W; Wang, Jun; Gasser, Robin B

2014-07-01

Trichuris (whipworm) infects 1 billion people worldwide and causes a disease (trichuriasis) that results in major socioeconomic losses in both humans and pigs. Trichuriasis relates to an inflammation of the large intestine manifested in bloody diarrhea, and chronic disease can cause malnourishment and stunting in children. Paradoxically, Trichuris of pigs has shown substantial promise as a treatment for human autoimmune disorders, including inflammatory bowel disease (IBD) and multiple sclerosis. Here we report whole-genome sequencing at ∼140-fold coverage of adult male and female T. suis and ∼80-Mb draft assemblies. We explore stage-, sex- and tissue-specific transcription of mRNAs and small noncoding RNAs.
Towards Positional Isolation of Three Quantitative Trait Loci Conferring Resistance to Powdery Mildew in Two Spanish Barley Landraces

PubMed Central

Silvar, Cristina; Perovic, Dragan; Nussbaumer, Thomas; Spannagl, Manuel; Usadel, Björn; Casas, Ana; Igartua, Ernesto; Ordon, Frank

2013-01-01

Three quantitative trait loci (QTL) conferring broad spectrum resistance to powdery mildew, caused by the fungus Blumeria graminis f. sp. hordei, were previously identified on chromosomes 7HS, 7HL and 6HL in the Spanish barley landrace-derived lines SBCC097 and SBCC145. In the present work, a genome-wide putative linear gene index of barley (Genome Zipper) and the first draft of the physical, genetic and functional sequence of the barley genome were used to go one step further in the shortening and explicit demarcation on the barley genome of these regions conferring resistance to powdery mildew as well as in the identification of candidate genes. First, a comparative analysis of the target regions to the barley Genome Zippers of chromosomes 7H and 6H allowed the development of 25 new gene-based molecular markers, which slightly better delimit the QTL intervals. These new markers provided the framework for anchoring of genetic and physical maps, figuring out the outline of the barley genome at the target regions in SBCC097 and SBCC145. The outermost flanking markers of QTLs on 7HS, 7HL and 6HL defined a physical area of 4 Mb, 3.7 Mb and 3.2 Mb, respectively. In total, 21, 10 and 16 genes on 7HS, 7HL and 6HL, respectively, could be interpreted as potential candidates to explain the resistance to powdery mildew, as they encode proteins of related functions with respect to the known pathogen defense-related processes. The majority of these were annotated as belonging to the NBS-LRR class or protein kinase family. PMID:23826271
Lessons learned from the initial sequencing of the pig genome: comparative analysis of an 8 Mb region of pig chromosome 17

PubMed Central

Hart, Elizabeth A; Caccamo, Mario; Harrow, Jennifer L; Humphray, Sean J; Gilbert, James GR; Trevanion, Steve; Hubbard, Tim; Rogers, Jane; Rothschild, Max F

2007-01-01

Background We describe here the sequencing, annotation and comparative analysis of an 8 Mb region of pig chromosome 17, which provides a useful test region to assess coverage and quality for the pig genome sequencing project. We report our findings comparing the annotation of draft sequence assembled at different depths of coverage. Results Within this region we annotated 71 loci, of which 53 are orthologous to human known coding genes. When compared to the syntenic regions in human (20q13.13-q13.33) and mouse (chromosome 2, 167.5 Mb-178.3 Mb), this region was found to be highly conserved with respect to gene order. The most notable difference between the three species is the presence of a large expansion of zinc finger coding genes and pseudogenes on mouse chromosome 2 between Edn3 and Phactr3 that is absent from pig and human. All of our annotation has been made publicly available in the Vertebrate Genome Annotation browser, VEGA. We assessed the impact of coverage on sequence assembly across this region and found, as expected, that increased sequence depth resulted in fewer, longer contigs. One-third of our annotated loci could not be fully re-aligned back to the low coverage version of the sequence, principally because the transcripts are fragmented over several contigs. Conclusion We have demonstrated the considerable advantages of sequencing at increased read depths and discuss the implications that lower coverage sequence may have on subsequent comparative and functional studies, particularly those involving complex loci such as GNAS. PMID:17705864
Sequencing and comparative analyses of the genomes of zoysiagrasses

PubMed Central

Tanaka, Hidenori; Hirakawa, Hideki; Kosugi, Shunichi; Nakayama, Shinobu; Ono, Akiko; Watanabe, Akiko; Hashiguchi, Masatsugu; Gondo, Takahiro; Ishigaki, Genki; Muguerza, Melody; Shimizu, Katsuya; Sawamura, Noriko; Inoue, Takayasu; Shigeki, Yuichi; Ohno, Naoki; Tabata, Satoshi; Akashi, Ryo; Sato, Shusei

2016-01-01

Zoysia is a warm-season turfgrass, which comprises 11 allotetraploid species (2n = 4x = 40), each possessing different morphological and physiological traits. To characterize the genetic systems of Zoysia plants and to analyse their structural and functional differences in individual species and accessions, we sequenced the genomes of Zoysia species using HiSeq and MiSeq platforms. As a reference sequence of Zoysia species, we generated a high-quality draft sequence of the genome of Z. japonica accession ‘Nagirizaki’ (334 Mb) in which 59,271 protein-coding genes were predicted. In parallel, draft genome sequences of Z. matrella ‘Wakaba’ and Z. pacifica ‘Zanpa’ were also generated for comparative analyses. To investigate the genetic diversity among the Zoysia species, genome sequence reads of three additional accessions, Z. japonica ‘Kyoto’, Z. japonica ‘Miyagi’ and Z. matrella ‘Chiba Fair Green’, were accumulated, and aligned against the reference genome of ‘Nagirizaki’ along with those from ‘Wakaba’ and ‘Zanpa’. As a result, we detected 7,424,163 single-nucleotide polymorphisms and 852,488 short indels among these species. The information obtained in this study will be valuable for basic studies on zoysiagrass evolution and genetics as well as for the breeding of zoysiagrasses, and is made available in the ‘Zoysia Genome Database’ at http://zoysia.kazusa.or.jp. PMID:26975196
Sequencing and comparative analyses of the genomes of zoysiagrasses.

PubMed

Tanaka, Hidenori; Hirakawa, Hideki; Kosugi, Shunichi; Nakayama, Shinobu; Ono, Akiko; Watanabe, Akiko; Hashiguchi, Masatsugu; Gondo, Takahiro; Ishigaki, Genki; Muguerza, Melody; Shimizu, Katsuya; Sawamura, Noriko; Inoue, Takayasu; Shigeki, Yuichi; Ohno, Naoki; Tabata, Satoshi; Akashi, Ryo; Sato, Shusei

2016-04-01

Zoysiais a warm-season turfgrass, which comprises 11 allotetraploid species (2n= 4x= 40), each possessing different morphological and physiological traits. To characterize the genetic systems of Zoysia plants and to analyse their structural and functional differences in individual species and accessions, we sequenced the genomes of Zoysia species using HiSeq and MiSeq platforms. As a reference sequence of Zoysia species, we generated a high-quality draft sequence of the genome of Z. japonica accession 'Nagirizaki' (334 Mb) in which 59,271 protein-coding genes were predicted. In parallel, draft genome sequences of Z. matrella 'Wakaba' and Z. pacifica 'Zanpa' were also generated for comparative analyses. To investigate the genetic diversity among the Zoysia species, genome sequence reads of three additional accessions, Z. japonica'Kyoto', Z. japonica'Miyagi' and Z. matrella'Chiba Fair Green', were accumulated, and aligned against the reference genome of 'Nagirizaki' along with those from 'Wakaba' and 'Zanpa'. As a result, we detected 7,424,163 single-nucleotide polymorphisms and 852,488 short indels among these species. The information obtained in this study will be valuable for basic studies on zoysiagrass evolution and genetics as well as for the breeding of zoysiagrasses, and is made available in the 'Zoysia Genome Database' at http://zoysia.kazusa.or.jp. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Draft whole genome sequence of groundnut stem rot fungus Athelia rolfsii revealing genetic architect of its pathogenicity and virulence.

PubMed

Iquebal, M A; Tomar, Rukam S; Parakhia, M V; Singla, Deepak; Jaiswal, Sarika; Rathod, V M; Padhiyar, S M; Kumar, Neeraj; Rai, Anil; Kumar, Dinesh

2017-07-13

Groundnut (Arachis hypogaea L.) is an important oil seed crop having major biotic constraint in production due to stem rot disease caused by fungus, Athelia rolfsii causing 25-80% loss in productivity. As chemical and biological combating strategies of this fungus are not very effective, thus genome sequencing can reveal virulence and pathogenicity related genes for better understanding of the host-parasite interaction. We report draft assembly of Athelia rolfsii genome of ~73 Mb having 8919 contigs. Annotation analysis revealed 16830 genes which are involved in fungicide resistance, virulence and pathogenicity along with putative effector and lethal genes. Secretome analysis revealed CAZY genes representing 1085 enzymatic genes, glycoside hydrolases, carbohydrate esterases, carbohydrate-binding modules, auxillary activities, glycosyl transferases and polysaccharide lyases. Repeat analysis revealed 11171 SSRs, LTR, GYPSY and COPIA elements. Comparative analysis with other existing ascomycotina genome predicted conserved domain family of WD40, CYP450, Pkinase and ABC transporter revealing insight of evolution of pathogenicity and virulence. This study would help in understanding pathogenicity and virulence at molecular level and development of new combating strategies. Such approach is imperative in endeavour of genome based solution in stem rot disease management leading to better productivity of groundnut crop in tropical region of world.

A first genetic map of date palm (Phoenix dactylifera) reveals long-range genome structure conservation in the palms.

PubMed

Mathew, Lisa S; Spannagl, Manuel; Al-Malki, Ameena; George, Binu; Torres, Maria F; Al-Dous, Eman K; Al-Azwani, Eman K; Hussein, Emad; Mathew, Sweety; Mayer, Klaus F X; Mohamoud, Yasmin Ali; Suhre, Karsten; Malek, Joel A

2014-04-15

The date palm is one of the oldest cultivated fruit trees. It is critical in many ways to cultures in arid lands by providing highly nutritious fruit while surviving extreme heat and environmental conditions. Despite its importance from antiquity, few genetic resources are available for improving the productivity and development of the dioecious date palm. To date there has been no genetic map and no sex chromosome has been identified. Here we present the first genetic map for date palm and identify the putative date palm sex chromosome. We placed ~4000 markers on the map using nearly 1200 framework markers spanning a total of 1293 cM. We have integrated the genetic map, derived from the Khalas cultivar, with the draft genome and placed up to 19% of the draft genome sequence scaffolds onto linkage groups for the first time. This analysis revealed approximately ~1.9 cM/Mb on the map. Comparison of the date palm linkage groups revealed significant long-range synteny to oil palm. Analysis of the date palm sex-determination region suggests it is telomeric on linkage group 12 and recombination is not suppressed in the full chromosome. Based on a modified genotyping-by-sequencing approach we have overcome challenges due to lack of genetic resources and provide the first genetic map for date palm. Combined with the recent draft genome sequence of the same cultivar, this resource offers a critical new tool for date palm biotechnology, palm comparative genomics and a better understanding of sex chromosome development in the palms.
Draft genome of the Antarctic dragonfish, Parachaenichthys charcoti.

PubMed

Ahn, Do-Hwan; Shin, Seung Chul; Kim, Bo-Mi; Kang, Seunghyun; Kim, Jin-Hyoung; Ahn, Inhye; Park, Joonho; Park, Hyun

2017-08-01

The Antarctic bathydraconid dragonfish, Parachaenichthys charcoti, is an Antarctic notothenioid teleost endemic to the Southern Ocean. The Southern Ocean has cooled to -1.8ºC over the past 30 million years, and the seawater had retained this cold temperature and isolated oceanic environment because of the Antarctic Circumpolar Current. Notothenioids dominate Antarctic fish, making up 90% of the biomass, and all notothenioids have undergone molecular and ecological diversification to survive in this cold environment. Therefore, they are considered an attractive Antarctic fish model for evolutionary and ancestral genomic studies. Bathydraconidae is a speciose family of the Notothenioidei, the dominant taxonomic component of Antarctic teleosts. To understand the process of evolution of Antarctic fish, we select a typical Antarctic bathydraconid dragonfish, P. charcoti. Here, we have sequenced, de novo assembled, and annotated a comprehensive genome from P. charcoti. The draft genome of P. charcoti is 709 Mb in size. The N50 contig length is 6145 bp, and its N50 scaffold length 178 362 kb. The genome of P. charcoti is predicted to contain 32 712 genes, 18 455 of which have been assigned preliminary functions. A total of 8951 orthologous groups common to 7 species of fish were identified, while 333 genes were identified in P. charcoti only; 2519 orthologous groups were also identified in both P. charcoti and N. coriiceps, another Antarctic fish. Four gene ontology terms were statistically overrepresented among the 333 genes unique to P. charcoti, according to gene ontology enrichment analysis. The draft P. charcoti genome will broaden our understanding of the evolution of Antarctic fish in their extreme environment. It will provide a basis for further investigating the unusual characteristics of Antarctic fishes. © The Author 2017. Published by Oxford University Press.
Comparative genomics of maize ear rot pathogens reveals expansion of carbohydrate-active enzymes and secondary metabolism backbone genes in Stenocarpella maydis.

PubMed

Zaccaron, Alex Z; Woloshuk, Charles P; Bluhm, Burton H

2017-11-01

Stenocarpella maydis is a plant pathogenic fungus that causes Diplodia ear rot, one of the most destructive diseases of maize. To date, little information is available regarding the molecular basis of pathogenesis in this organism, in part due to limited genomic resources. In this study, a 54.8 Mb draft genome assembly of S. maydis was obtained with Illumina and PacBio sequencing technologies, and analyzed. Comparative genomic analyses with the predominant maize ear rot pathogens Aspergillus flavus, Fusarium verticillioides, and Fusarium graminearum revealed an expanded set of carbohydrate-active enzymes for cellulose and hemicellulose degradation in S. maydis. Analyses of predicted genes involved in starch degradation revealed six putative α-amylases, four extracellular and two intracellular, and two putative γ-amylases, one of which appears to have been acquired from bacteria via horizontal transfer. Additionally, 87 backbone genes involved in secondary metabolism were identified, which represents one of the largest known assemblages among Pezizomycotina species. Numerous secondary metabolite gene clusters were identified, including two clusters likely involved in the biosynthesis of diplodiatoxin and chaetoglobosins. The draft genome of S. maydis presented here will serve as a useful resource for molecular genetics, functional genomics, and analyses of population diversity in this organism. Copyright © 2017 British Mycological Society. Published by Elsevier Ltd. All rights reserved.
Draft genome sequence of ramie, Boehmeria nivea (L.) Gaudich.

PubMed

Luan, Ming-Bao; Jian, Jian-Bo; Chen, Ping; Chen, Jun-Hui; Chen, Jian-Hua; Gao, Qiang; Gao, Gang; Zhou, Ju-Hong; Chen, Kun-Mei; Guang, Xuan-Min; Chen, Ji-Kang; Zhang, Qian-Qian; Wang, Xiao-Fei; Fang, Long; Sun, Zhi-Min; Bai, Ming-Zhou; Fang, Xiao-Dong; Zhao, Shan-Cen; Xiong, He-Ping; Yu, Chun-Ming; Zhu, Ai-Guo

2018-05-01

Ramie, Boehmeria nivea (L.) Gaudich, family Urticaceae, is a plant native to eastern Asia, and one of the world's oldest fibre crops. It is also used as animal feed and for the phytoremediation of heavy metal-contaminated farmlands. Thus, the genome sequence of ramie was determined to explore the molecular basis of its fibre quality, protein content and phytoremediation. For further understanding ramie genome, different paired-end and mate-pair libraries were combined to generate 134.31 Gb of raw DNA sequences using the Illumina whole-genome shotgun sequencing approach. The highly heterozygous B. nivea genome was assembled using the Platanus Genome Assembler, which is an effective tool for the assembly of highly heterozygous genome sequences. The final length of the draft genome of this species was approximately 341.9 Mb (contig N50 = 22.62 kb, scaffold N50 = 1,126.36 kb). Based on ramie genome annotations, 30,237 protein-coding genes were predicted, and the repetitive element content was 46.3%. The completeness of the final assembly was evaluated by benchmarking universal single-copy orthologous genes (BUSCO); 90.5% of the 1,440 expected embryophytic genes were identified as complete, and 4.9% were identified as fragmented. Phylogenetic analysis based on single-copy gene families and one-to-one orthologous genes placed ramie with mulberry and cannabis, within the clade of urticalean rosids. Genome information of ramie will be a valuable resource for the conservation of endangered Boehmeria species and for future studies on the biogeography and characteristic evolution of members of Urticaceae. © 2018 John Wiley & Sons Ltd.
Florida-specific NTCIP management information base (MIB) for closed-circuit television (CCTV) camera : final draft.

DOT National Transportation Integrated Search

2009-01-01

Description: This following MIB has been developed for use by FDOT. This : proposed Florida-Specific NTCIP Management Information Base (MIB) For : Closed-Circuit Television (CCTV) Camera MIB is based on the following : documentations: : NTCIP 120...
A first genetic map of date palm (Phoenix dactylifera) reveals long-range genome structure conservation in the palms

PubMed Central

2014-01-01

Background The date palm is one of the oldest cultivated fruit trees. It is critical in many ways to cultures in arid lands by providing highly nutritious fruit while surviving extreme heat and environmental conditions. Despite its importance from antiquity, few genetic resources are available for improving the productivity and development of the dioecious date palm. To date there has been no genetic map and no sex chromosome has been identified. Results Here we present the first genetic map for date palm and identify the putative date palm sex chromosome. We placed ~4000 markers on the map using nearly 1200 framework markers spanning a total of 1293 cM. We have integrated the genetic map, derived from the Khalas cultivar, with the draft genome and placed up to 19% of the draft genome sequence scaffolds onto linkage groups for the first time. This analysis revealed approximately ~1.9 cM/Mb on the map. Comparison of the date palm linkage groups revealed significant long-range synteny to oil palm. Analysis of the date palm sex-determination region suggests it is telomeric on linkage group 12 and recombination is not suppressed in the full chromosome. Conclusions Based on a modified gentoyping-by-sequencing approach we have overcome challenges due to lack of genetic resources and provide the first genetic map for date palm. Combined with the recent draft genome sequence of the same cultivar, this resource offers a critical new tool for date palm biotechnology, palm comparative genomics and a better understanding of sex chromosome development in the palms. PMID:24735434
The draft genome of Ruellia speciosa (Beautiful Wild Petunia: Acanthaceae).

PubMed

Zhuang, Yongbin; Tripp, Erin A

2017-04-01

The genus Ruellia (Wild Petunias; Acanthaceae) is characterized by an enormous diversity of floral shapes and colours manifested among closely related species. Using Illumina platform, we reconstructed the draft genome of Ruellia speciosa, with a scaffold size of 1,021 Mb (or ∼1.02 Gb) and an N50 size of 17,908 bp, spanning ∼93% of the estimated genome (∼1.1 Gb). The draft assembly predicted 40,124 gene models and phylogenetic analyses of four key enzymes involved in anthocyanin colour production [flavanone 3-hydroxylase (F3H), flavonoid 3'-hydroxylase (F3'H), flavonoid 3',5'-hydroxylase (F3'5'H), and dihydroflavonol 4-reductase (DFR)] found that most angiosperms here sampled harboured at least one copy of F3H, F3'H, and DFR. In contrast, fewer than one-half (but including R. speciosa) harboured a copy of F3'5'H, supporting observations that blue flowers and/or fruits, which this enzyme is required for, are less common among flowering plants. Ka/Ks analyses of duplicated copies of F3'H and DFR in R. speciosa suggested purifying selection in the former but detected evidence of positive selection in the latter. The genome sequence and annotation of R. speciosa represents only one of only four families sequenced in the large and important Asterid clade of flowering plants and, as such, will facilitate extensive future research on this diverse group, particularly with respect to floral evolution. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
The draft genome of Ruellia speciosa (Beautiful Wild Petunia: Acanthaceae)

PubMed Central

Zhuang, Yongbin

2017-01-01

Abstract The genus Ruellia (Wild Petunias; Acanthaceae) is characterized by an enormous diversity of floral shapes and colours manifested among closely related species. Using Illumina platform, we reconstructed the draft genome of Ruellia speciosa, with a scaffold size of 1,021 Mb (or ∼1.02 Gb) and an N50 size of 17,908 bp, spanning ∼93% of the estimated genome (∼1.1 Gb). The draft assembly predicted 40,124 gene models and phylogenetic analyses of four key enzymes involved in anthocyanin colour production [flavanone 3-hydroxylase (F3H), flavonoid 3′-hydroxylase (F3′H), flavonoid 3′,5′-hydroxylase (F3′5′H), and dihydroflavonol 4-reductase (DFR)] found that most angiosperms here sampled harboured at least one copy of F3H, F3′H, and DFR. In contrast, fewer than one-half (but including R. speciosa) harboured a copy of F3′5′H, supporting observations that blue flowers and/or fruits, which this enzyme is required for, are less common among flowering plants. Ka/Ks analyses of duplicated copies of F3′H and DFR in R. speciosa suggested purifying selection in the former but detected evidence of positive selection in the latter. The genome sequence and annotation of R. speciosa represents only one of only four families sequenced in the large and important Asterid clade of flowering plants and, as such, will facilitate extensive future research on this diverse group, particularly with respect to floral evolution. PMID:28431014
Genome sequence of the Japanese oak silk moth, Antheraea yamamai: the first draft genome in the family Saturniidae

PubMed Central

Kim, Seong-Ryul; Kwak, Woori; Kim, Hyaekang; Kim, Kee-Young; Kim, Su-Bae; Choi, Kwang-Ho; Kim, Seong-Wan; Hwang, Jae-Sam; Kim, Minjee; Kim, Iksoo; Goo, Tae-Won

2018-01-01

Abstract Background Antheraea yamamai, also known as the Japanese oak silk moth, is a wild species of silk moth. Silk produced by A. yamamai, referred to as tensan silk, shows different characteristics such as thickness, compressive elasticity, and chemical resistance compared with common silk produced from the domesticated silkworm, Bombyx mori. Its unique characteristics have led to its use in many research fields including biotechnology and medical science, and the scientific as well as economic importance of the wild silk moth continues to gradually increase. However, no genomic information for the wild silk moth, including A. yamamai, is currently available. Findings In order to construct the A. yamamai genome, a total of 147G base pairs using Illumina and Pacbio sequencing platforms were generated, providing 210-fold coverage based on the 700-Mb estimated genome size of A. yamamai. The assembled genome of A. yamamai was 656 Mb (>2 kb) with 3675 scaffolds, and the N50 length of assembly was 739 Kb with a 34.07% GC ratio. Identified repeat elements covered 37.33% of the total genome, and the completeness of the constructed genome assembly was estimated to be 96.7% by Benchmarking Universal Single-Copy Orthologs v2 analysis. A total of 15 481 genes were identified using Evidence Modeler based on the gene prediction results obtained from 3 different methods (ab initio, RNA-seq-based, known-gene-based) and manual curation. Conclusions Here we present the genome sequence of A. yamamai, the first genome sequence of the wild silk moth. These results provide valuable genomic information, which will help enrich our understanding of the molecular mechanisms relating to not only specific phenotypes such as wild silk itself but also the genomic evolution of Saturniidae. PMID:29186418
De novo genome assembly of the soil-borne fungus and tomato pathogen Pyrenochaeta lycopersici

PubMed Central

2014-01-01

Background Pyrenochaeta lycopersici is a soil-dwelling ascomycete pathogen that causes corky root rot disease in tomato (Solanum lycopersicum) and other Solanaceous crops, reducing fruit yields by up to 75%. Fungal pathogens that infect roots receive less attention than those infecting the aerial parts of crops despite their significant impact on plant growth and fruit production. Results We assembled a 54.9Mb P. lycopersici draft genome sequence based on Illumina short reads, and annotated approximately 17,000 genes. The P. lycopersici genome is closely related to hemibiotrophs and necrotrophs, in agreement with the phenotypic characteristics of the fungus and its lifestyle. Several gene families related to host–pathogen interactions are strongly represented, including those responsible for nutrient absorption, the detoxification of fungicides and plant cell wall degradation, the latter confirming that much of the genome is devoted to the pathogenic activity of the fungus. We did not find a MAT gene, which is consistent with the classification of P. lycopersici as an imperfect fungus, but we observed a significant expansion of the gene families associated with heterokaryon incompatibility (HI). Conclusions The P. lycopersici draft genome sequence provided insight into the molecular and genetic basis of the fungal lifestyle, characterizing previously unknown pathogenic behaviors and defining strategies that allow this asexual fungus to increase genetic diversity and to acquire new pathogenic traits. PMID:24767544
Strategies for optimizing BioNano and Dovetail explored through a second reference quality assembly for the legume model, Medicago truncatula.

PubMed

Moll, Karen M; Zhou, Peng; Ramaraj, Thiruvarangan; Fajardo, Diego; Devitt, Nicholas P; Sadowsky, Michael J; Stupar, Robert M; Tiffin, Peter; Miller, Jason R; Young, Nevin D; Silverstein, Kevin A T; Mudge, Joann

2017-08-04

Third generation sequencing technologies, with sequencing reads in the tens- of kilo-bases, facilitate genome assembly by spanning ambiguous regions and improving continuity. This has been critical for plant genomes, which are difficult to assemble due to high repeat content, gene family expansions, segmental and tandem duplications, and polyploidy. Recently, high-throughput mapping and scaffolding strategies have further improved continuity. Together, these long-range technologies enable quality draft assemblies of complex genomes in a cost-effective and timely manner. Here, we present high quality genome assemblies of the model legume plant, Medicago truncatula (R108) using PacBio, Dovetail Chicago (hereafter, Dovetail) and BioNano technologies. To test these technologies for plant genome assembly, we generated five assemblies using all possible combinations and ordering of these three technologies in the R108 assembly. While the BioNano and Dovetail joins overlapped, they also showed complementary gains in continuity and join numbers. Both technologies spanned repetitive regions that PacBio alone was unable to bridge. Combining technologies, particularly Dovetail followed by BioNano, resulted in notable improvements compared to Dovetail or BioNano alone. A combination of PacBio, Dovetail, and BioNano was used to generate a high quality draft assembly of R108, a M. truncatula accession widely used in studies of functional genomics. As a test for the usefulness of the resulting genome sequence, the new R108 assembly was used to pinpoint breakpoints and characterize flanking sequence of a previously identified translocation between chromosomes 4 and 8, identifying more than 22.7 Mb of novel sequence not present in the earlier A17 reference assembly. Adding Dovetail followed by BioNano data yielded complementary improvements in continuity over the original PacBio assembly. This strategy proved efficient and cost-effective for developing a quality draft assembly compared to traditional reference assemblies.
The draft genome of the carcinogenic human liver fluke Clonorchis sinensis

PubMed Central

2011-01-01

Background Clonorchis sinensis is a carcinogenic human liver fluke that is widespread in Asian countries. Increasing infection rates of this neglected tropical disease are leading to negative economic and public health consequences in affected regions. Experimental and epidemiological studies have shown a strong association between the incidence of cholangiocarcinoma and the infection rate of C. sinensis. To aid research into this organism, we have sequenced its genome. Results We combined de novo sequencing with computational techniques to provide new information about the biology of this liver fluke. The assembled genome has a total size of 516 Mb with a scaffold N50 length of 42 kb. Approximately 16,000 reliable protein-coding gene models were predicted. Genes for the complete pathways for glycolysis, the Krebs cycle and fatty acid metabolism were found, but key genes involved in fatty acid biosynthesis are missing from the genome, reflecting the parasitic lifestyle of a liver fluke that receives lipids from the bile of its host. We also identified pathogenic molecules that may contribute to liver fluke-induced hepatobiliary diseases. Large proteins such as multifunctional secreted proteases and tegumental proteins were identified as potential targets for the development of drugs and vaccines. Conclusions This study provides valuable genomic information about the human liver fluke C. sinensis and adds to our knowledge on the biology of the parasite. The draft genome will serve as a platform to develop new strategies for parasite control. PMID:22023798
The cacao Criollo genome v2.0: an improved version of the genome for genetic and functional genomic studies.

PubMed

Argout, X; Martin, G; Droc, G; Fouet, O; Labadie, K; Rivals, E; Aury, J M; Lanaud, C

2017-09-15

Theobroma cacao L., native to the Amazonian basin of South America, is an economically important fruit tree crop for tropical countries as a source of chocolate. The first draft genome of the species, from a Criollo cultivar, was published in 2011. Although a useful resource, some improvements are possible, including identifying misassemblies, reducing the number of scaffolds and gaps, and anchoring un-anchored sequences to the 10 chromosomes. We used a NGS-based approach to significantly improve the assembly of the Belizian Criollo B97-61/B2 genome. We combined four Illumina large insert size mate paired libraries with 52x of Pacific Biosciences long reads to correct misassembled regions and reduced the number of scaffolds. We then used genotyping by sequencing (GBS) methods to increase the proportion of the assembly anchored to chromosomes. The scaffold number decreased from 4,792 in assembly V1 to 554 in V2 while the scaffold N50 size has increased from 0.47 Mb in V1 to 6.5 Mb in V2. A total of 96.7% of the assembly was anchored to the 10 chromosomes compared to 66.8% in the previous version. Unknown sites (Ns) were reduced from 10.8% to 5.7%. In addition, we updated the functional annotations and performed a new RefSeq structural annotation based on RNAseq evidence. Theobroma cacao Criollo genome version 2 will be a valuable resource for the investigation of complex traits at the genomic level and for future comparative genomics and genetics studies in cacao tree. New functional tools and annotations are available on the Cocoa Genome Hub ( http://cocoa-genome-hub.southgreen.fr ).
The Draft Assembly of the Radically Organized Stylonychia lemnae Macronuclear Genome

PubMed Central

Aeschlimann, Samuel H.; Jönsson, Franziska; Postberg, Jan; Stover, Nicholas A.; Petera, Robert L.; Lipps, Hans-Joachim; Nowacki, Mariusz; Swart, Estienne C.

2014-01-01

Stylonychia lemnae is a classical model single-celled eukaryote, and a quintessential ciliate typified by dimorphic nuclei: A small, germline micronucleus and a massive, vegetative macronucleus. The genome within Stylonychia’s macronucleus has a very unusual architecture, comprised variably and highly amplified “nanochromosomes,” each usually encoding a single gene with a minimal amount of surrounding noncoding DNA. As only a tiny fraction of the Stylonychia genes has been sequenced, and to promote research using this organism, we sequenced its macronuclear genome. We report the analysis of the 50.2-Mb draft S. lemnae macronuclear genome assembly, containing in excess of 16,000 complete nanochromosomes, assembled as less than 20,000 contigs. We found considerable conservation of fundamental genomic properties between S. lemnae and its close relative, Oxytricha trifallax, including nanochromosomal gene synteny, alternative fragmentation, and copy number. Protein domain searches in Stylonychia revealed two new telomere-binding protein homologs and the presence of linker histones. Among the diverse histone variants of S. lemnae and O. trifallax, we found divergent, coexpressed variants corresponding to four of the five core nucleosomal proteins (H1.2, H2A.6, H2B.4, and H3.7) suggesting that these ciliates may possess specialized nucleosomes involved in genome processing during nuclear differentiation. The assembly of the S. lemnae macronuclear genome demonstrates that largely complete, well-assembled highly fragmented genomes of similar size and complexity may be produced from one library and lane of Illumina HiSeq 2000 shotgun sequencing. The provision of the S. lemnae macronuclear genome sets the stage for future detailed experimental studies of chromatin-mediated, RNA-guided developmental genome rearrangements. PMID:24951568
SNP Identification from RNA Sequencing and Linkage Map Construction of Rubber Tree for Anchoring the Draft Genome

PubMed Central

Shearman, Jeremy R.; Sangsrakru, Duangjai; Jomchai, Nukoon; Ruang-areerate, Panthita; Sonthirod, Chutima; Naktang, Chaiwat; Theerawattanasuk, Kanikar; Tragoonrung, Somvong; Tangphatsornruang, Sithichoke

2015-01-01

Hevea brasiliensis, or rubber tree, is an important crop species that accounts for the majority of natural latex production. The rubber tree nuclear genome consists of 18 chromosomes and is roughly 2.15 Gb. The current rubber tree reference genome assembly consists of 1,150,326 scaffolds ranging from 200 to 531,465 bp and totalling 1.1 Gb. Only 143 scaffolds, totalling 7.6 Mb, have been placed into linkage groups. We have performed RNA-seq on 6 varieties of rubber tree to identify SNPs and InDels and used this information to perform target sequence enrichment and high throughput sequencing to genotype a set of SNPs in 149 rubber tree offspring from a cross between RRIM 600 and RRII 105 rubber tree varieties. We used this information to generate a linkage map allowing for the anchoring of 24,424 contigs from 3,009 scaffolds, totalling 115 Mb or 10.4% of the published sequence, into 18 linkage groups. Each linkage group contains between 319 and 1367 SNPs, or 60 to 194 non-redundant marker positions, and ranges from 156 to 336 cM in length. This linkage map includes 20,143 of the 69,300 predicted genes from rubber tree and will be useful for mapping studies and improving the reference genome assembly. PMID:25831195
SNP identification from RNA sequencing and linkage map construction of rubber tree for anchoring the draft genome.

PubMed

Shearman, Jeremy R; Sangsrakru, Duangjai; Jomchai, Nukoon; Ruang-Areerate, Panthita; Sonthirod, Chutima; Naktang, Chaiwat; Theerawattanasuk, Kanikar; Tragoonrung, Somvong; Tangphatsornruang, Sithichoke

2015-01-01

Hevea brasiliensis, or rubber tree, is an important crop species that accounts for the majority of natural latex production. The rubber tree nuclear genome consists of 18 chromosomes and is roughly 2.15 Gb. The current rubber tree reference genome assembly consists of 1,150,326 scaffolds ranging from 200 to 531,465 bp and totalling 1.1 Gb. Only 143 scaffolds, totalling 7.6 Mb, have been placed into linkage groups. We have performed RNA-seq on 6 varieties of rubber tree to identify SNPs and InDels and used this information to perform target sequence enrichment and high throughput sequencing to genotype a set of SNPs in 149 rubber tree offspring from a cross between RRIM 600 and RRII 105 rubber tree varieties. We used this information to generate a linkage map allowing for the anchoring of 24,424 contigs from 3,009 scaffolds, totalling 115 Mb or 10.4% of the published sequence, into 18 linkage groups. Each linkage group contains between 319 and 1367 SNPs, or 60 to 194 non-redundant marker positions, and ranges from 156 to 336 cM in length. This linkage map includes 20,143 of the 69,300 predicted genes from rubber tree and will be useful for mapping studies and improving the reference genome assembly.
The monarch butterfly genome yields insights into long-distance migration

PubMed Central

Zhan, Shuai; Merlin, Christine; Boore, Jeffrey L.; Reppert, Steven M.

2011-01-01

SUMMARY We present the draft 273 Mb genome of the migratory monarch butterfly (Danaus plexippus) and a set of 16, 866 protein-coding genes. Orthology properties suggest that the Lepidoptera are the fastest evolving insect order yet examined. Compared to the silkmoth Bombyx mori, the monarch genome shares prominent similarity in orthology content, microsynteny, and protein family sizes. The monarch genome reveals: a vertebrate-like opsin whose existence in insects is widespread; a full repertoire of molecular components for the monarch circadian clockwork; all members of the juvenile hormone biosynthetic pathway whose regulation shows unexpected sexual dimorphism; additional molecular signatures of oriented flight behavior; microRNAs that are differentially expressed between summer and migratory butterflies; monarch-specific expansions of chemoreceptors potentially important for long-distance migration; and a variant of the sodium/potassium pump that underlies a valuable chemical defense mechanism. The monarch genome enhances our ability to better understand the genetic and molecular basis of long-distance migration. PMID:22118469
The genome of woodland strawberry (Fragaria vesca)

PubMed Central

Shulaev, Vladimir; Sargent, Daniel J; Crowhurst, Ross N; Mockler, Todd C; Folkerts, Otto; Delcher, Arthur L; Jaiswal, Pankaj; Mockaitis, Keithanne; Liston, Aaron; Mane, Shrinivasrao P; Burns, Paul; Davis, Thomas M; Slovin, Janet P; Bassil, Nahla; Hellens, Roger P; Evans, Clive; Harkins, Tim; Kodira, Chinnappa; Desany, Brian; Crasta, Oswald R; Jensen, Roderick V; Allan, Andrew C; Michael, Todd P; Setubal, Joao Carlos; Celton, Jean-Marc; Rees, D Jasper G; Williams, Kelly P; Holt, Sarah H; Ruiz Rojas, Juan Jairo; Chatterjee, Mithu; Liu, Bo; Silva, Herman; Meisel, Lee; Adato, Avital; Filichkin, Sergei A; Troggio, Michela; Viola, Roberto; Ashman, Tia-Lynn; Wang, Hao; Dharmawardhana, Palitha; Elser, Justin; Raja, Rajani; Priest, Henry D; Bryant, Douglas W; Fox, Samuel E; Givan, Scott A; Wilhelm, Larry J; Naithani, Sushma; Christoffels, Alan; Salama, David Y; Carter, Jade; Girona, Elena Lopez; Zdepski, Anna; Wang, Wenqin; Kerstetter, Randall A; Schwab, Wilfried; Korban, Schuyler S; Davik, Jahn; Monfort, Amparo; Denoyes-Rothan, Beatrice; Arus, Pere; Mittler, Ron; Flinn, Barry; Aharoni, Asaph; Bennetzen, Jeffrey L; Salzberg, Steven L; Dickerman, Allan W; Velasco, Riccardo; Borodovsky, Mark; Veilleux, Richard E; Folta, Kevin M

2012-01-01

The woodland strawberry, Fragaria vesca (2n = 2x = 14), is a versatile experimental plant system. This diminutive herbaceous perennial has a small genome (240 Mb), is amenable to genetic transformation and shares substantial sequence identity with the cultivated strawberry (Fragaria × ananassa) and other economically important rosaceous plants. Here we report the draft F. vesca genome, which was sequenced to ×39 coverage using second-generation technology, assembled de novo and then anchored to the genetic linkage map into seven pseudochromosomes. This diploid strawberry sequence lacks the large genome duplications seen in other rosids. Gene prediction modeling identified 34,809 genes, with most being supported by transcriptome mapping. Genes critical to valuable horticultural traits including flavor, nutritional value and flowering time were identified. Macrosyntenic relationships between Fragaria and Prunus predict a hypothetical ancestral Rosaceae genome that had nine chromosomes. New phylogenetic analysis of 154 protein-coding genes suggests that assignment of Populus to Malvidae, rather than Fabidae, is warranted. PMID:21186353
A comprehensive draft genome sequence for lupin (Lupinus angustifolius), an emerging health food: insights into plant-microbe interactions and legume evolution.

PubMed

Hane, James K; Ming, Yao; Kamphuis, Lars G; Nelson, Matthew N; Garg, Gagan; Atkins, Craig A; Bayer, Philipp E; Bravo, Armando; Bringans, Scott; Cannon, Steven; Edwards, David; Foley, Rhonda; Gao, Ling-Ling; Harrison, Maria J; Huang, Wei; Hurgobin, Bhavna; Li, Sean; Liu, Cheng-Wu; McGrath, Annette; Morahan, Grant; Murray, Jeremy; Weller, James; Jian, Jianbo; Singh, Karam B

2017-03-01

Lupins are important grain legume crops that form a critical part of sustainable farming systems, reducing fertilizer use and providing disease breaks. It has a basal phylogenetic position relative to other crop and model legumes and a high speciation rate. Narrow-leafed lupin (NLL; Lupinus angustifolius L.) is gaining popularity as a health food, which is high in protein and dietary fibre but low in starch and gluten-free. We report the draft genome assembly (609 Mb) of NLL cultivar Tanjil, which has captured >98% of the gene content, sequences of additional lines and a dense genetic map. Lupins are unique among legumes and differ from most other land plants in that they do not form mycorrhizal associations. Remarkably, we find that NLL has lost all mycorrhiza-specific genes, but has retained genes commonly required for mycorrhization and nodulation. In addition, the genome also provided candidate genes for key disease resistance and domestication traits. We also find evidence of a whole-genome triplication at around 25 million years ago in the genistoid lineage leading to Lupinus. Our results will support detailed studies of legume evolution and accelerate lupin breeding programmes. © 2016 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Complete genome sequence of the phenanthrene-degrading soil bacterium Delftia acidovorans Cs1-4

DOE Office of Scientific and Technical Information (OSTI.GOV)

Shetty, Ameesha R.; de Gannes, Vidya; Obi, Chioma C.

Polycyclic aromatic hydrocarbons (PAH) are ubiquitous environmental pollutants and microbial biodegradation is an important means of remediation of PAH-contaminated soil. Delftia acidovorans Cs1-4 (formerly Delftia sp. Cs1-4) was isolated by using phenanthrene as the sole carbon source from PAH contaminated soil in Wisconsin. Its full genome sequence was determined to gain insights into a mechanisms underlying biodegradation of PAH. Three genomic libraries were constructed and sequenced: an Illumina GAii shotgun library (916,416,493 reads), a 454 Titanium standard library (770,171 reads) and one paired-end 454 library (average insert size of 8 kb, 508,092 reads). The initial assembly contained 40 contigs inmore » two scaffolds. The 454 Titanium standard data and the 454 paired end data were assembled together and the consensus sequences were computationally shredded into 2 kb overlapping shreds. Illumina sequencing data was assembled, and the consensus sequence was computationally shredded into 1.5 kb overlapping shreds. Gaps between contigs were closed by editing in Consed, by PCR and by Bubble PCR primer walks. A total of 182 additional reactions were needed to close gaps and to raise the quality of the finished sequence. The final assembly is based on 253.3 Mb of 454 draft data (averaging 38.4 X coverage) and 590.2 Mb of Illumina draft data (averaging 89.4 X coverage). The genome of strain Cs1-4 consists of a single circular chromosome of 6,685,842 bp (66.7 %G+C) containing 6,028 predicted genes; 5,931 of these genes were protein-encoding and 4,425 gene products were assigned to a putative function. Genes encoding phenanthrene degradation were localized to a 232 kb genomic island (termed the phn island), which contained near its 3’ end a bacteriophage P4-like integrase, an enzyme often associated with chromosomal integration of mobile genetic elements. Other biodegradation pathways reconstructed from the genome sequence included: benzoate (by the acetyl-CoA pathway), styrene, nicotinic acid (by the maleamate pathway) and the pesticides Dicamba and Fenitrothion. Lastly, determination of the complete genome sequence of D. acidovorans Cs1-4 has provided new insights the microbial mechanisms of PAH biodegradation that may shape the process in the environment.« less

Complete genome sequence of the phenanthrene-degrading soil bacterium Delftia acidovorans Cs1-4

DOE PAGES

Shetty, Ameesha R.; de Gannes, Vidya; Obi, Chioma C.; ...

2015-08-15

Polycyclic aromatic hydrocarbons (PAH) are ubiquitous environmental pollutants and microbial biodegradation is an important means of remediation of PAH-contaminated soil. Delftia acidovorans Cs1-4 (formerly Delftia sp. Cs1-4) was isolated by using phenanthrene as the sole carbon source from PAH contaminated soil in Wisconsin. Its full genome sequence was determined to gain insights into a mechanisms underlying biodegradation of PAH. Three genomic libraries were constructed and sequenced: an Illumina GAii shotgun library (916,416,493 reads), a 454 Titanium standard library (770,171 reads) and one paired-end 454 library (average insert size of 8 kb, 508,092 reads). The initial assembly contained 40 contigs inmore » two scaffolds. The 454 Titanium standard data and the 454 paired end data were assembled together and the consensus sequences were computationally shredded into 2 kb overlapping shreds. Illumina sequencing data was assembled, and the consensus sequence was computationally shredded into 1.5 kb overlapping shreds. Gaps between contigs were closed by editing in Consed, by PCR and by Bubble PCR primer walks. A total of 182 additional reactions were needed to close gaps and to raise the quality of the finished sequence. The final assembly is based on 253.3 Mb of 454 draft data (averaging 38.4 X coverage) and 590.2 Mb of Illumina draft data (averaging 89.4 X coverage). The genome of strain Cs1-4 consists of a single circular chromosome of 6,685,842 bp (66.7 %G+C) containing 6,028 predicted genes; 5,931 of these genes were protein-encoding and 4,425 gene products were assigned to a putative function. Genes encoding phenanthrene degradation were localized to a 232 kb genomic island (termed the phn island), which contained near its 3’ end a bacteriophage P4-like integrase, an enzyme often associated with chromosomal integration of mobile genetic elements. Other biodegradation pathways reconstructed from the genome sequence included: benzoate (by the acetyl-CoA pathway), styrene, nicotinic acid (by the maleamate pathway) and the pesticides Dicamba and Fenitrothion. Lastly, determination of the complete genome sequence of D. acidovorans Cs1-4 has provided new insights the microbial mechanisms of PAH biodegradation that may shape the process in the environment.« less
Draft Genome of the Scarab Beetle Oryctes borbonicus on La Réunion Island

PubMed Central

Meyer, Jan M.; Markov, Gabriel V.; Baskaran, Praveen; Herrmann, Matthias; Sommer, Ralf J.; Rödelsperger, Christian

2016-01-01

Beetles represent the largest insect order and they display extreme morphological, ecological and behavioral diversity, which makes them ideal models for evolutionary studies. Here, we present the draft genome of the scarab beetle Oryctes borbonicus, which has a more basal phylogenetic position than the two previously sequenced pest species Tribolium castaneum and Dendroctonus ponderosae providing the potential for sequence polarization. Oryctes borbonicus is endemic to La Réunion, an island located in the Indian Ocean, and is the host of the nematode Pristionchus pacificus, a well-established model organism for integrative evolutionary biology. At 518 Mb, the O. borbonicus genome is substantially larger and encodes more genes than T. castaneum and D. ponderosae. We found that only 25% of the predicted genes of O. borbonicus are conserved as single copy genes across the nine investigated insect genomes, suggesting substantial gene turnover within insects. Even within beetles, up to 21% of genes are restricted to only one species, whereas most other genes have undergone lineage-specific duplications and losses. We illustrate lineage-specific duplications using detailed phylogenetic analysis of two gene families. This study serves as a reference point for insect/coleopteran genomics, although its original motivation was to find evidence for potential horizontal gene transfer (HGT) between O. borbonicus and P. pacificus. The latter was previously shown to be the recipient of multiple horizontally transferred genes including some genes from insect donors. However, our study failed to provide any clear evidence for additional HGTs between the two species. PMID:27289092
Draft genome sequence and transcriptional analysis of Rosellinia necatrix infected with a virulent mycovirus.

PubMed

Shimizu, Takeo; Kanematsu, Satoko; Yaegashi, Hajime

2018-04-24

Understanding the molecular mechanisms of pathogenesis is useful in developing effective control methods for fungal diseases. The white root rot fungus Rosellinia necatrix is a soil-borne pathogen that causes serious economic losses in various crops, including fruit trees, worldwide. Here, using next-generation sequencing techniques, we first produced a 44-Mb draft genome sequence of R. necatrix strain W97, an isolate from Japan, in which 12,444 protein-coding genes were predicted. To survey differentially expressed genes (DEGs) associated with the pathogenesis of the fungus, the hypovirulent W97 strain infected with Rosellinia necatrix megabirnavirus 1 (RnMBV1) was used for a comprehensive transcriptome analysis. In total, 545 and 615 genes are up- and down-regulated, respectively, in R. necatrix infected with RnMBV1. Gene ontology and Kyoto Encyclopedia of Genes and Genomes pathway analyses of the DEGs suggested that primary and secondary metabolism would be greatly disturbed in R. necatrix infected with RnMBV1. The genes encoding transcriptional regulators, plant cell wall-degrading enzymes, and toxin production, such as cytochalasin E, were also found in the DEGs. The genetic resources provided in this study will accelerate the discovery of genes associated with pathogenesis and other biological characteristics of R. necatrix, thus contributing to disease control.
Genome sequencing of herb Tulsi (Ocimum tenuiflorum) unravels key genes behind its strong medicinal properties.

PubMed

Upadhyay, Atul K; Chacko, Anita R; Gandhimathi, A; Ghosh, Pritha; Harini, K; Joseph, Agnel P; Joshi, Adwait G; Karpe, Snehal D; Kaushik, Swati; Kuravadi, Nagesh; Lingu, Chandana S; Mahita, J; Malarini, Ramya; Malhotra, Sony; Malini, Manoharan; Mathew, Oommen K; Mutt, Eshita; Naika, Mahantesha; Nitish, Sathyanarayanan; Pasha, Shaik Naseer; Raghavender, Upadhyayula S; Rajamani, Anantharamanan; Shilpa, S; Shingate, Prashant N; Singh, Heikham Russiachand; Sukhwal, Anshul; Sunitha, Margaret S; Sumathi, Manojkumar; Ramaswamy, S; Gowda, Malali; Sowdhamini, Ramanathan

2015-08-28

Krishna Tulsi, a member of Lamiaceae family, is a herb well known for its spiritual, religious and medicinal importance in India. The common name of this plant is 'Tulsi' (or 'Tulasi' or 'Thulasi') and is considered sacred by Hindus. We present the draft genome of Ocimum tenuiflurum L (subtype Krishna Tulsi) in this report. The paired-end and mate-pair sequence libraries were generated for the whole genome sequenced with the Illumina Hiseq 1000, resulting in an assembled genome of 374 Mb, with a genome coverage of 61 % (612 Mb estimated genome size). We have also studied transcriptomes (RNA-Seq) of two subtypes of O. tenuiflorum, Krishna and Rama Tulsi and report the relative expression of genes in both the varieties. The pathways leading to the production of medicinally-important specialized metabolites have been studied in detail, in relation to similar pathways in Arabidopsis thaliana and other plants. Expression levels of anthocyanin biosynthesis-related genes in leaf samples of Krishna Tulsi were observed to be relatively high, explaining the purple colouration of Krishna Tulsi leaves. The expression of six important genes identified from genome data were validated by performing q-RT-PCR in different tissues of five different species, which shows the high extent of urosolic acid-producing genes in young leaves of the Rama subtype. In addition, the presence of eugenol and ursolic acid, implied as potential drugs in the cure of many diseases including cancer was confirmed using mass spectrometry. The availability of the whole genome of O.tenuiflorum and our sequence analysis suggests that small amino acid changes at the functional sites of genes involved in metabolite synthesis pathways confer special medicinal properties to this herb.
Palaeosymbiosis Revealed by Genomic Fossils of Wolbachia in a Strongyloidean Nematode

PubMed Central

Koutsovoulos, Georgios; Makepeace, Benjamin; Tanya, Vincent N.; Blaxter, Mark

2014-01-01

Wolbachia are common endosymbionts of terrestrial arthropods, and are also found in nematodes: the animal-parasitic filaria, and the plant-parasite Radopholus similis. Lateral transfer of Wolbachia DNA to the host genome is common. We generated a draft genome sequence for the strongyloidean nematode parasite Dictyocaulus viviparus, the cattle lungworm. In the assembly, we identified nearly 1 Mb of sequence with similarity to Wolbachia. The fragments were unlikely to derive from a live Wolbachia infection: most were short, and the genes were disabled through inactivating mutations. Many fragments were co-assembled with definitively nematode-derived sequence. We found limited evidence of expression of the Wolbachia-derived genes. The D. viviparus Wolbachia genes were most similar to filarial strains and strains from the host-promiscuous clade F. We conclude that D. viviparus was infected by Wolbachia in the past, and that clade F-like symbionts may have been the source of filarial Wolbachia infections. PMID:24901418
Dictyocaulus viviparus genome, variome and transcriptome elucidate lungworm biology and support future intervention

PubMed Central

McNulty, Samantha N.; Strübe, Christina; Rosa, Bruce A.; Martin, John C.; Tyagi, Rahul; Choi, Young-Jun; Wang, Qi; Hallsworth Pepin, Kymberlie; Zhang, Xu; Ozersky, Philip; Wilson, Richard K.; Sternberg, Paul W.; Gasser, Robin B.; Mitreva, Makedonka

2016-01-01

The bovine lungworm, Dictyocaulus viviparus (order Strongylida), is an important parasite of livestock that causes substantial economic and production losses worldwide. Here we report the draft genome, variome, and developmental transcriptome of D. viviparus. The genome (161 Mb) is smaller than those of related bursate nematodes and encodes fewer proteins (14,171 total). In the first genome-wide assessment of genomic variation in any parasitic nematode, we found a high degree of sequence variability in proteins predicted to be involved host-parasite interactions. Next, we used extensive RNA sequence data to track gene transcription across the life cycle of D. viviparus, and identified genes that might be important in nematode development and parasitism. Finally, we predicted genes that could be vital in host-parasite interactions, genes that could serve as drug targets, and putative RNAi effectors with a view to developing functional genomic tools. This extensive, well-curated dataset should provide a basis for developing new anthelmintics, vaccines, and improved diagnostic tests and serve as a platform for future investigations of drug resistance and epidemiology of the bovine lungworm and related nematodes. PMID:26856411
Radiation hybrid maps of the D-genome of Aegilops tauschii and their application in sequence assembly of large and complex plant genomes.

PubMed

Kumar, Ajay; Seetan, Raed; Mergoum, Mohamed; Tiwari, Vijay K; Iqbal, Muhammad J; Wang, Yi; Al-Azzam, Omar; Šimková, Hana; Luo, Ming-Cheng; Dvorak, Jan; Gu, Yong Q; Denton, Anne; Kilian, Andrzej; Lazo, Gerard R; Kianian, Shahryar F

2015-10-16

The large and complex genome of bread wheat (Triticum aestivum L., ~17 Gb) requires high resolution genome maps with saturated marker scaffolds to anchor and orient BAC contigs/ sequence scaffolds for whole genome assembly. Radiation hybrid (RH) mapping has proven to be an excellent tool for the development of such maps for it offers much higher and more uniform marker resolution across the length of the chromosome compared to genetic mapping and does not require marker polymorphism per se, as it is based on presence (retention) vs. absence (deletion) marker assay. In this study, a 178 line RH panel was genotyped with SSRs and DArT markers to develop the first high resolution RH maps of the entire D-genome of Ae. tauschii accession AL8/78. To confirm map order accuracy, the AL8/78-RH maps were compared with:1) a DArT consensus genetic map constructed using more than 100 bi-parental populations, 2) a RH map of the D-genome of reference hexaploid wheat 'Chinese Spring', and 3) two SNP-based genetic maps, one with anchored D-genome BAC contigs and another with anchored D-genome sequence scaffolds. Using marker sequences, the RH maps were also anchored with a BAC contig based physical map and draft sequence of the D-genome of Ae. tauschii. A total of 609 markers were mapped to 503 unique positions on the seven D-genome chromosomes, with a total map length of 14,706.7 cR. The average distance between any two marker loci was 29.2 cR which corresponds to 2.1 cM or 9.8 Mb. The average mapping resolution across the D-genome was estimated to be 0.34 Mb (Mb/cR) or 0.07 cM (cM/cR). The RH maps showed almost perfect agreement with several published maps with regard to chromosome assignments of markers. The mean rank correlations between the position of markers on AL8/78 maps and the four published maps, ranged from 0.75 to 0.92, suggesting a good agreement in marker order. With 609 mapped markers, a total of 2481 deletions for the whole D-genome were detected with an average deletion size of 42.0 Mb. A total of 520 markers were anchored to 216 Ae. tauschii sequence scaffolds, 116 of which were not anchored earlier to the D-genome. This study reports the development of first high resolution RH maps for the D-genome of Ae. tauschii accession AL8/78, which were then used for the anchoring of unassigned sequence scaffolds. This study demonstrates how RH mapping, which offered high and uniform resolution across the length of the chromosome, can facilitate the complete sequence assembly of the large and complex plant genomes.
A draft fur seal genome provides insights into factors affecting SNP validation and how to mitigate them.

PubMed

Humble, E; Martinez-Barrio, A; Forcada, J; Trathan, P N; Thorne, M A S; Hoffmann, M; Wolf, J B W; Hoffman, J I

2016-07-01

Custom genotyping arrays provide a flexible and accurate means of genotyping single nucleotide polymorphisms (SNPs) in a large number of individuals of essentially any organism. However, validation rates, defined as the proportion of putative SNPs that are verified to be polymorphic in a population, are often very low. A number of potential causes of assay failure have been identified, but none have been explored systematically. In particular, as SNPs are often developed from transcriptomes, parameters relating to the genomic context are rarely taken into account. Here, we assembled a draft Antarctic fur seal (Arctocephalus gazella) genome (assembly size: 2.41 Gb; scaffold/contig N50 : 3.1 Mb/27.5 kb). We then used this resource to map the probe sequences of 144 putative SNPs genotyped in 480 individuals. The number of probe-to-genome mappings and alignment length together explained almost a third of the variation in validation success, indicating that sequence uniqueness and proximity to intron-exon boundaries play an important role. The same pattern was found after mapping the probe sequences to the Walrus and Weddell seal genomes, suggesting that the genomes of species divergent by as much as 23 million years can hold information relevant to SNP validation outcomes. Additionally, reanalysis of genotyping data from seven previous studies found the same two variables to be significantly associated with SNP validation success across a variety of taxa. Finally, our study reveals considerable scope for validation rates to be improved, either by simply filtering for SNPs whose flanking sequences align uniquely and completely to a reference genome, or through predictive modelling. © 2015 John Wiley & Sons Ltd.
Comparisons with Caenorhabditis (approximately 100 Mb) and Drosophila (approximately 175 Mb) using flow cytometry show genome size in Arabidopsis to be approximately 157 Mb and thus approximately 25% larger than the Arabidopsis genome initiative estimate of approximately 125 Mb.

PubMed

Bennett, Michael D; Leitch, Ilia J; Price, H James; Johnston, J Spencer

2003-04-01

Recent genome sequencing papers have given genome sizes of 180 Mb for Drosophila melanogaster Iso-1 and 125 Mb for Arabidopsis thaliana Columbia. The former agrees with early cytochemical estimates, but numerous cytometric estimates of around 170 Mb imply that a genome size of 125 Mb for arabidopsis is an underestimate. In this study, nuclei of species pairs were compared directly using flow cytometry. Co-run Columbia and Iso-1 female gave a 2C peak for arabidopsis only approx. 15 % below that for drosophila, and 16C endopolyploid Columbia nuclei had approx. 15 % more DNA than 2C chicken nuclei (with >2280 Mb). Caenorhabditis elegans Bristol N2 (genome size approx. 100 Mb) co-run with Columbia or Iso-1 gave a 2C peak for drosophila approx. 75 % above that for 2C C. elegans, and a 2C peak for arabidopsis approx. 57 % above that for C. elegans. This confirms that 1C in drosophila is approx. 175 Mb and, combined with other evidence, leads us to conclude that the genome size of arabidopsis is not approx. 125 Mb, but probably approx. 157 Mb. It is likely that the discrepancy represents extra repeated sequences in unsequenced gaps in heterochromatic regions. Complete sequencing of the arabidopsis genome until no gaps remain at telomeres, nucleolar organizing regions or centromeres is still needed to provide the first precise angiosperm C-value as a benchmark calibration standard for plant genomes, and to ensure that no genes have been missed in arabidopsis, especially in centromeric regions, which are clearly larger than once imagined.
Improvement of the banana "Musa acuminata" reference sequence using NGS data and semi-automated bioinformatics methods.

PubMed

Martin, Guillaume; Baurens, Franc-Christophe; Droc, Gaëtan; Rouard, Mathieu; Cenci, Alberto; Kilian, Andrzej; Hastie, Alex; Doležel, Jaroslav; Aury, Jean-Marc; Alberti, Adriana; Carreel, Françoise; D'Hont, Angélique

2016-03-16

Recent advances in genomics indicate functional significance of a majority of genome sequences and their long range interactions. As a detailed examination of genome organization and function requires very high quality genome sequence, the objective of this study was to improve reference genome assembly of banana (Musa acuminata). We have developed a modular bioinformatics pipeline to improve genome sequence assemblies, which can handle various types of data. The pipeline comprises several semi-automated tools. However, unlike classical automated tools that are based on global parameters, the semi-automated tools proposed an expert mode for a user who can decide on suggested improvements through local compromises. The pipeline was used to improve the draft genome sequence of Musa acuminata. Genotyping by sequencing (GBS) of a segregating population and paired-end sequencing were used to detect and correct scaffold misassemblies. Long insert size paired-end reads identified scaffold junctions and fusions missed by automated assembly methods. GBS markers were used to anchor scaffolds to pseudo-molecules with a new bioinformatics approach that avoids the tedious step of marker ordering during genetic map construction. Furthermore, a genome map was constructed and used to assemble scaffolds into super scaffolds. Finally, a consensus gene annotation was projected on the new assembly from two pre-existing annotations. This approach reduced the total Musa scaffold number from 7513 to 1532 (i.e. by 80%), with an N50 that increased from 1.3 Mb (65 scaffolds) to 3.0 Mb (26 scaffolds). 89.5% of the assembly was anchored to the 11 Musa chromosomes compared to the previous 70%. Unknown sites (N) were reduced from 17.3 to 10.0%. The release of the Musa acuminata reference genome version 2 provides a platform for detailed analysis of banana genome variation, function and evolution. Bioinformatics tools developed in this work can be used to improve genome sequence assemblies in other species.
Genome sequences and comparative genomics of two Lactobacillus ruminis strains from the bovine and human intestinal tracts

PubMed Central

2011-01-01

Background The genus Lactobacillus is characterized by an extraordinary degree of phenotypic and genotypic diversity, which recent genomic analyses have further highlighted. However, the choice of species for sequencing has been non-random and unequal in distribution, with only a single representative genome from the L. salivarius clade available to date. Furthermore, there is no data to facilitate a functional genomic analysis of motility in the lactobacilli, a trait that is restricted to the L. salivarius clade. Results The 2.06 Mb genome of the bovine isolate Lactobacillus ruminis ATCC 27782 comprises a single circular chromosome, and has a G+C content of 44.4%. In silico analysis identified 1901 coding sequences, including genes for a pediocin-like bacteriocin, a single large exopolysaccharide-related cluster, two sortase enzymes, two CRISPR loci and numerous IS elements and pseudogenes. A cluster of genes related to a putative pilin was identified, and shown to be transcribed in vitro. A high quality draft assembly of the genome of a second L. ruminis strain, ATCC 25644 isolated from humans, suggested a slightly larger genome of 2.138 Mb, that exhibited a high degree of synteny with the ATCC 27782 genome. In contrast, comparative analysis of L. ruminis and L. salivarius identified a lack of long-range synteny between these closely related species. Comparison of the L. salivarius clade core proteins with those of nine other Lactobacillus species distributed across 4 major phylogenetic groups identified the set of shared proteins, and proteins unique to each group. Conclusions The genome of L. ruminis provides a comparative tool for directing functional analyses of other members of the L. salivarius clade, and it increases understanding of the divergence of this distinct Lactobacillus lineage from other commensal lactobacilli. The genome sequence provides a definitive resource to facilitate investigation of the genetics, biochemistry and host interactions of these motile intestinal lactobacilli. PMID:21995554
Identification and Characterization of Microsatellite Markers Derived from the Whole Genome Analysis of Taenia solium.

PubMed

Pajuelo, Mónica J; Eguiluz, María; Dahlstrom, Eric; Requena, David; Guzmán, Frank; Ramirez, Manuel; Sheen, Patricia; Frace, Michael; Sammons, Scott; Cama, Vitaliano; Anzick, Sarah; Bruno, Dan; Mahanty, Siddhartha; Wilkins, Patricia; Nash, Theodore; Gonzalez, Armando; García, Héctor H; Gilman, Robert H; Porcella, Steve; Zimic, Mirko

2015-12-01

Infections with Taenia solium are the most common cause of adult acquired seizures worldwide, and are the leading cause of epilepsy in developing countries. A better understanding of the genetic diversity of T. solium will improve parasite diagnostics and transmission pathways in endemic areas thereby facilitating the design of future control measures and interventions. Microsatellite markers are useful genome features, which enable strain typing and identification in complex pathogen genomes. Here we describe microsatellite identification and characterization in T. solium, providing information that will assist in global efforts to control this important pathogen. For genome sequencing, T. solium cysts and proglottids were collected from Huancayo and Puno in Peru, respectively. Using next generation sequencing (NGS) and de novo assembly, we assembled two draft genomes and one hybrid genome. Microsatellite sequences were identified and 36 of them were selected for further analysis. Twenty T. solium isolates were collected from Tumbes in the northern region, and twenty from Puno in the southern region of Peru. The size-polymorphism of the selected microsatellites was determined with multi-capillary electrophoresis. We analyzed the association between microsatellite polymorphism and the geographic origin of the samples. The predicted size of the hybrid (proglottid genome combined with cyst genome) T. solium genome was 111 MB with a GC content of 42.54%. A total of 7,979 contigs (>1,000 nt) were obtained. We identified 9,129 microsatellites in the Puno-proglottid genome and 9,936 in the Huancayo-cyst genome, with 5 or more repeats, ranging from mono- to hexa-nucleotide. Seven microsatellites were polymorphic and 29 were monomorphic within the analyzed isolates. T. solium tapeworms were classified into two genetic groups that correlated with the North/South geographic origin of the parasites. The availability of draft genomes for T. solium represents a significant step towards the understanding the biology of the parasite. We report here a set of T. solium polymorphic microsatellite markers that appear promising for genetic epidemiology studies.
A Multi-Platform Draft de novo Genome Assembly and Comparative Analysis for the Scarlet Macaw (Ara macao)

PubMed Central

Seabury, Christopher M.; Dowd, Scot E.; Seabury, Paul M.; Raudsepp, Terje; Brightsmith, Donald J.; Liboriussen, Poul; Halley, Yvette; Fisher, Colleen A.; Owens, Elaine; Viswanathan, Ganesh; Tizard, Ian R.

2013-01-01

Data deposition to NCBI Genomes This Whole Genome Shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession AMXX00000000 (SMACv1.0, unscaffolded genome assembly). The version described in this paper is the first version (AMXX01000000). The scaffolded assembly (SMACv1.1) has been deposited at DDBJ/EMBL/GenBank under the accession AOUJ00000000, and is also the first version (AOUJ01000000). Strong biological interest in traits such as the acquisition and utilization of speech, cognitive abilities, and longevity catalyzed the utilization of two next-generation sequencing platforms to provide the first-draft de novo genome assembly for the large, new world parrot Ara macao (Scarlet Macaw). Despite the challenges associated with genome assembly for an outbred avian species, including 951,507 high-quality putative single nucleotide polymorphisms, the final genome assembly (>1.035 Gb) includes more than 997 Mb of unambiguous sequence data (excluding N’s). Cytogenetic analyses including ZooFISH revealed complex rearrangements associated with two scarlet macaw macrochromosomes (AMA6, AMA7), which supports the hypothesis that translocations, fusions, and intragenomic rearrangements are key factors associated with karyotype evolution among parrots. In silico annotation of the scarlet macaw genome provided robust evidence for 14,405 nuclear gene annotation models, their predicted transcripts and proteins, and a complete mitochondrial genome. Comparative analyses involving the scarlet macaw, chicken, and zebra finch genomes revealed high levels of nucleotide-based conservation as well as evidence for overall genome stability among the three highly divergent species. Application of a new whole-genome analysis of divergence involving all three species yielded prioritized candidate genes and noncoding regions for parrot traits of interest (i.e., speech, intelligence, longevity) which were independently supported by the results of previous human GWAS studies. We also observed evidence for genes and noncoding loci that displayed extreme conservation across the three avian lineages, thereby reflecting their likely biological and developmental importance among birds. PMID:23667475
Phase V of Early Restoration | NOAA Gulf Spill Restoration

Science.gov Websites

Phase V Early Restoration Plan and Environmental Assessment. The project will acquire land along Florida million. Phase V Early Restoration Plan and Environmental Assessment (pdf, 10 MB) Draft Phase V Early Restoration Plan and Environmental Assessment (Executive Summary) (2 MB) Phase V Fact Sheet (pdf, 2 MB) Gulf
Genome and Proteome Analysis of Rhodococcus erythropolis MI2: Elucidation of the 4,4´-Dithiodibutyric Acid Catabolism

PubMed Central

Khairy, Heba; Meinert, Christina; Wübbeler, Jan Hendrik; Poehlein, Anja; Daniel, Rolf; Voigt, Birgit; Riedel, Katharina; Steinbüchel, Alexander

2016-01-01

Rhodococcus erythropolis MI2 has the extraordinary ability to utilize the xenobiotic 4,4´-dithiodibutyric acid (DTDB). Cleavage of DTDB by the disulfide-reductase Nox, which is the only verified enzyme involved in DTDB-degradation, raised 4-mercaptobutyric acid (4MB). 4MB could act as building block of a novel polythioester with unknown properties. To completely unravel the catabolism of DTDB, the genome of R. erythropolis MI2 was sequenced, and subsequently the proteome was analyzed. The draft genome sequence consists of approximately 7.2 Mbp with an overall G+C content of 62.25% and 6,859 predicted protein-encoding genes. The genome of strain MI2 is composed of three replicons: one chromosome and two megaplasmids with sizes of 6.45, 0.4 and 0.35 Mbp, respectively. When cells of strain MI2 were cultivated with DTDB as sole carbon source and compared to cells grown with succinate, several interesting proteins with significantly higher expression levels were identified using 2D-PAGE and MALDI-TOF mass spectrometry. A putative luciferase-like monooxygenase-class F420-dependent oxidoreductase (RERY_05640), which is encoded by one of the 126 monooxygenase-encoding genes of the MI2-genome, showed a 3-fold increased expression level. This monooxygenase could oxidize the intermediate 4MB into 4-oxo-4-sulfanylbutyric acid. Next, a desulfurization step, which forms succinic acid and volatile hydrogen sulfide, is proposed. One gene coding for a putative desulfhydrase (RERY_06500) was identified in the genome of strain MI2. However, the gene product was not recognized in the proteome analyses. But, a significant expression level with a ratio of up to 7.3 was determined for a putative sulfide:quinone oxidoreductase (RERY_02710), which could also be involved in the abstraction of the sulfur group. As response to the toxicity of the intermediates, several stress response proteins were strongly expressed, including a superoxide dismutase (RERY_05600) and an osmotically induced protein (RERY_02670). Accordingly, novel insights in the catabolic pathway of DTDB were gained. PMID:27977722
Genome and Proteome Analysis of Rhodococcus erythropolis MI2: Elucidation of the 4,4´-Dithiodibutyric Acid Catabolism.

PubMed

Khairy, Heba; Meinert, Christina; Wübbeler, Jan Hendrik; Poehlein, Anja; Daniel, Rolf; Voigt, Birgit; Riedel, Katharina; Steinbüchel, Alexander

2016-01-01

Rhodococcus erythropolis MI2 has the extraordinary ability to utilize the xenobiotic 4,4´-dithiodibutyric acid (DTDB). Cleavage of DTDB by the disulfide-reductase Nox, which is the only verified enzyme involved in DTDB-degradation, raised 4-mercaptobutyric acid (4MB). 4MB could act as building block of a novel polythioester with unknown properties. To completely unravel the catabolism of DTDB, the genome of R. erythropolis MI2 was sequenced, and subsequently the proteome was analyzed. The draft genome sequence consists of approximately 7.2 Mbp with an overall G+C content of 62.25% and 6,859 predicted protein-encoding genes. The genome of strain MI2 is composed of three replicons: one chromosome and two megaplasmids with sizes of 6.45, 0.4 and 0.35 Mbp, respectively. When cells of strain MI2 were cultivated with DTDB as sole carbon source and compared to cells grown with succinate, several interesting proteins with significantly higher expression levels were identified using 2D-PAGE and MALDI-TOF mass spectrometry. A putative luciferase-like monooxygenase-class F420-dependent oxidoreductase (RERY_05640), which is encoded by one of the 126 monooxygenase-encoding genes of the MI2-genome, showed a 3-fold increased expression level. This monooxygenase could oxidize the intermediate 4MB into 4-oxo-4-sulfanylbutyric acid. Next, a desulfurization step, which forms succinic acid and volatile hydrogen sulfide, is proposed. One gene coding for a putative desulfhydrase (RERY_06500) was identified in the genome of strain MI2. However, the gene product was not recognized in the proteome analyses. But, a significant expression level with a ratio of up to 7.3 was determined for a putative sulfide:quinone oxidoreductase (RERY_02710), which could also be involved in the abstraction of the sulfur group. As response to the toxicity of the intermediates, several stress response proteins were strongly expressed, including a superoxide dismutase (RERY_05600) and an osmotically induced protein (RERY_02670). Accordingly, novel insights in the catabolic pathway of DTDB were gained.
Annotated Draft Genome Assemblies for the Northern Bobwhite (Colinus virginianus) and the Scaled Quail (Callipepla squamata) Reveal Disparate Estimates of Modern Genome Diversity and Historic Effective Population Size.

PubMed

Oldeschulte, David L; Halley, Yvette A; Wilson, Miranda L; Bhattarai, Eric K; Brashear, Wesley; Hill, Joshua; Metz, Richard P; Johnson, Charles D; Rollins, Dale; Peterson, Markus J; Bickhart, Derek M; Decker, Jared E; Sewell, John F; Seabury, Christopher M

2017-09-07

Northern bobwhite ( Colinus virginianus ; hereafter bobwhite) and scaled quail ( Callipepla squamata ) populations have suffered precipitous declines across most of their US ranges. Illumina-based first- (v1.0) and second- (v2.0) generation draft genome assemblies for the scaled quail and the bobwhite produced N50 scaffold sizes of 1.035 and 2.042 Mb, thereby producing a 45-fold improvement in contiguity over the existing bobwhite assembly, and ≥90% of the assembled genomes were captured within 1313 and 8990 scaffolds, respectively. The scaled quail assembly (v1.0 = 1.045 Gb) was ∼20% smaller than the bobwhite (v2.0 = 1.254 Gb), which was supported by kmer-based estimates of genome size. Nevertheless, estimates of GC content (41.72%; 42.66%), genome-wide repetitive content (10.40%; 10.43%), and MAKER-predicted protein coding genes (17,131; 17,165) were similar for the scaled quail (v1.0) and bobwhite (v2.0) assemblies, respectively. BUSCO analyses utilizing 3023 single-copy orthologs revealed a high level of assembly completeness for the scaled quail (v1.0; 84.8%) and the bobwhite (v2.0; 82.5%), as verified by comparison with well-established avian genomes. We also detected 273 putative segmental duplications in the scaled quail genome (v1.0), and 711 in the bobwhite genome (v2.0), including some that were shared among both species. Autosomal variant prediction revealed ∼2.48 and 4.17 heterozygous variants per kilobase within the scaled quail (v1.0) and bobwhite (v2.0) genomes, respectively, and estimates of historic effective population size were uniformly higher for the bobwhite across all time points in a coalescent model. However, large-scale declines were predicted for both species beginning ∼15-20 KYA. Copyright © 2017 Oldeschulte et al.
Exceptionally high levels of recombination across the honey bee genome.

PubMed

Beye, Martin; Gattermeier, Irene; Hasselmann, Martin; Gempe, Tanja; Schioett, Morten; Baines, John F; Schlipalius, David; Mougel, Florence; Emore, Christine; Rueppell, Olav; Sirviö, Anu; Guzmán-Novoa, Ernesto; Hunt, Greg; Solignac, Michel; Page, Robert E

2006-11-01

The first draft of the honey bee genome sequence and improved genetic maps are utilized to analyze a genome displaying 10 times higher levels of recombination (19 cM/Mb) than previously analyzed genomes of higher eukaryotes. The exceptionally high recombination rate is distributed genome-wide, but varies by two orders of magnitude. Analysis of chromosome, sequence, and gene parameters with respect to recombination showed that local recombination rate is associated with distance to the telomere, GC content, and the number of simple repeats as described for low-recombining genomes. Recombination rate does not decrease with chromosome size. On average 5.7 recombination events per chromosome pair per meiosis are found in the honey bee genome. This contrasts with a wide range of taxa that have a uniform recombination frequency of about 1.6 per chromosome pair. The excess of recombination activity does not support a mechanistic role of recombination in stabilizing pairs of homologous chromosome during chromosome pairing. Recombination rate is associated with gene size, suggesting that introns are larger in regions of low recombination and may improve the efficacy of selection in these regions. Very few transposons and no retrotransposons are present in the high-recombining genome. We propose evolutionary explanations for the exceptionally high genome-wide recombination rate.
Whole genome sequence analysis of Geitlerinema sp. FC II unveils competitive edge of the strain in marine cultivation system for biofuel production.

PubMed

Batchu, Navish Kumar; Khater, Shradha; Patil, Sonal; Nagle, Vinod; Das, Gautam; Bhadra, Bhaskar; Sapre, Ajit; Dasgupta, Santanu

2018-03-05

A filamentous cyanobacteria, Geitlerinema sp. FC II, was isolated from marine algae culture pond at Reliance Industries Limited (RIL), India. The 6.7 Mb draft genome of FC II encodes for 6697 protein coding genes. Analysis of the whole genome sequence revealed presence of nif gene cluster, supporting its capability to fix atmospheric nitrogen. FC II genome contains two variants of sulfide:quinone oxidoreductases (SQR), which is a crucial elector donor in cyanobacterial metabolic processes. FC II is characterized by the presence of multiple CRISPR- Cas (Clustered Regularly Interspaced Short Palindrome Repeats - CRISPR associated proteins) clusters, multiple variants of genes encoding photosystem reaction centres, biosynthetic gene clusters of alkane, polyketides and non-ribosomal peptides. Presence of these pathways will help FC II in gaining an ecological advantage over other strains for biomass production in large scale cultivation system. Hence, FC II may be used for production of biofuel and other industrially important metabolites. Copyright © 2018 Elsevier Inc. All rights reserved.
The draft genome and transcriptome of Cannabis sativa

PubMed Central

2011-01-01

Background Cannabis sativa has been cultivated throughout human history as a source of fiber, oil and food, and for its medicinal and intoxicating properties. Selective breeding has produced cannabis plants for specific uses, including high-potency marijuana strains and hemp cultivars for fiber and seed production. The molecular biology underlying cannabinoid biosynthesis and other traits of interest is largely unexplored. Results We sequenced genomic DNA and RNA from the marijuana strain Purple Kush using shortread approaches. We report a draft haploid genome sequence of 534 Mb and a transcriptome of 30,000 genes. Comparison of the transcriptome of Purple Kush with that of the hemp cultivar 'Finola' revealed that many genes encoding proteins involved in cannabinoid and precursor pathways are more highly expressed in Purple Kush than in 'Finola'. The exclusive occurrence of Δ9-tetrahydrocannabinolic acid synthase in the Purple Kush transcriptome, and its replacement by cannabidiolic acid synthase in 'Finola', may explain why the psychoactive cannabinoid Δ9-tetrahydrocannabinol (THC) is produced in marijuana but not in hemp. Resequencing the hemp cultivars 'Finola' and 'USO-31' showed little difference in gene copy numbers of cannabinoid pathway enzymes. However, single nucleotide variant analysis uncovered a relatively high level of variation among four cannabis types, and supported a separation of marijuana and hemp. Conclusions The availability of the Cannabis sativa genome enables the study of a multifunctional plant that occupies a unique role in human culture. Its availability will aid the development of therapeutic marijuana strains with tailored cannabinoid profiles and provide a basis for the breeding of hemp with improved agronomic characteristics. PMID:22014239

The draft genome and transcriptome of Cannabis sativa.

PubMed

van Bakel, Harm; Stout, Jake M; Cote, Atina G; Tallon, Carling M; Sharpe, Andrew G; Hughes, Timothy R; Page, Jonathan E

2011-10-20

Cannabis sativa has been cultivated throughout human history as a source of fiber, oil and food, and for its medicinal and intoxicating properties. Selective breeding has produced cannabis plants for specific uses, including high-potency marijuana strains and hemp cultivars for fiber and seed production. The molecular biology underlying cannabinoid biosynthesis and other traits of interest is largely unexplored. We sequenced genomic DNA and RNA from the marijuana strain Purple Kush using shortread approaches. We report a draft haploid genome sequence of 534 Mb and a transcriptome of 30,000 genes. Comparison of the transcriptome of Purple Kush with that of the hemp cultivar 'Finola' revealed that many genes encoding proteins involved in cannabinoid and precursor pathways are more highly expressed in Purple Kush than in 'Finola'. The exclusive occurrence of Δ9-tetrahydrocannabinolic acid synthase in the Purple Kush transcriptome, and its replacement by cannabidiolic acid synthase in 'Finola', may explain why the psychoactive cannabinoid Δ9-tetrahydrocannabinol (THC) is produced in marijuana but not in hemp. Resequencing the hemp cultivars 'Finola' and 'USO-31' showed little difference in gene copy numbers of cannabinoid pathway enzymes. However, single nucleotide variant analysis uncovered a relatively high level of variation among four cannabis types, and supported a separation of marijuana and hemp. The availability of the Cannabis sativa genome enables the study of a multifunctional plant that occupies a unique role in human culture. Its availability will aid the development of therapeutic marijuana strains with tailored cannabinoid profiles and provide a basis for the breeding of hemp with improved agronomic characteristics.
Draft Genome Sequence of Caloramator australicus Strain RC3T, a Thermoanaerobe from the Great Artesian Basin of Australia ▿

PubMed Central

Ogg, Christopher D.; Patel, Bharat K. C.

2011-01-01

Caloramator australicus strain RC3T (JCM 15081T = KCTC 5601T) is the type strain of a newly identified thermophilic species, which was isolated from red microbial mats that thrive at 66°C in the runoff channel of a Great Artesian Basin bore (New Lorne bore, registered number 17263) in outback Queensland, Australia. The ability of the C. australicus strain to use metals as terminal electron acceptors has led to concerns that it could colonize and enhance corrosion of the metal casing of Great Artesian Basin bore well pipes and that this could subsequently lead to bore failure and loss of water availability for the community which is so reliant on it. The genome of the C. australicus strain has been sequenced, and annotation of the ∼2.65-Mb sequence indicates that the attributes are consistent with physiological and phenotypic traits. PMID:21421756
Genome sequence of foxtail millet (Setaria italica) provides insights into grass evolution and biofuel potential.

PubMed

Zhang, Gengyun; Liu, Xin; Quan, Zhiwu; Cheng, Shifeng; Xu, Xun; Pan, Shengkai; Xie, Min; Zeng, Peng; Yue, Zhen; Wang, Wenliang; Tao, Ye; Bian, Chao; Han, Changlei; Xia, Qiuju; Peng, Xiaohua; Cao, Rui; Yang, Xinhua; Zhan, Dongliang; Hu, Jingchu; Zhang, Yinxin; Li, Henan; Li, Hua; Li, Ning; Wang, Junyi; Wang, Chanchan; Wang, Renyi; Guo, Tao; Cai, Yanjie; Liu, Chengzhang; Xiang, Haitao; Shi, Qiuxiang; Huang, Ping; Chen, Qingchun; Li, Yingrui; Wang, Jun; Zhao, Zhihai; Wang, Jian

2012-05-13

Foxtail millet (Setaria italica), a member of the Poaceae grass family, is an important food and fodder crop in arid regions and has potential for use as a C(4) biofuel. It is a model system for other biofuel grasses, including switchgrass and pearl millet. We produced a draft genome (∼423 Mb) anchored onto nine chromosomes and annotated 38,801 genes. Key chromosome reshuffling events were detected through collinearity identification between foxtail millet, rice and sorghum including two reshuffling events fusing rice chromosomes 7 and 9, 3 and 10 to foxtail millet chromosomes 2 and 9, respectively, that occurred after the divergence of foxtail millet and rice, and a single reshuffling event fusing rice chromosome 5 and 12 to foxtail millet chromosome 3 that occurred after the divergence of millet and sorghum. Rearrangements in the C(4) photosynthesis pathway were also identified.
SCRaMbLE generates designed combinatorial stochastic diversity in synthetic chromosomes

PubMed Central

Shen, Yue; Stracquadanio, Giovanni; Wang, Yun; Yang, Kun; Mitchell, Leslie A.; Xue, Yaxin; Cai, Yizhi; Chen, Tai; Dymond, Jessica S.; Kang, Kang; Gong, Jianhui; Zeng, Xiaofan; Zhang, Yongfen; Li, Yingrui; Feng, Qiang; Xu, Xun; Wang, Jun; Wang, Jian; Yang, Huanming; Boeke, Jef D.; Bader, Joel S.

2016-01-01

Synthetic chromosome rearrangement and modification by loxP-mediated evolution (SCRaMbLE) generates combinatorial genomic diversity through rearrangements at designed recombinase sites. We applied SCRaMbLE to yeast synthetic chromosome arm synIXR (43 recombinase sites) and then used a computational pipeline to infer or unscramble the sequence of recombinations that created the observed genomes. Deep sequencing of 64 synIXR SCRaMbLE strains revealed 156 deletions, 89 inversions, 94 duplications, and 55 additional complex rearrangements; several duplications are consistent with a double rolling circle mechanism. Every SCRaMbLE strain was unique, validating the capability of SCRaMbLE to explore a diverse space of genomes. Rearrangements occurred exclusively at designed loxPsym sites, with no significant evidence for ectopic rearrangements or mutations involving synthetic regions, the 99% nonsynthetic nuclear genome, or the mitochondrial genome. Deletion frequencies identified genes required for viability or fast growth. Replacement of 3′ UTR by non-UTR sequence had surprisingly little effect on fitness. SCRaMbLE generates genome diversity in designated regions, reveals fitness constraints, and should scale to simultaneous evolution of multiple synthetic chromosomes. PMID:26566658
Sequencing, de novo assembling, and annotating the genome of the endangered Chinese crocodile lizard Shinisaurus crocodilurus.

PubMed

Gao, Jian; Li, Qiye; Wang, Zongji; Zhou, Yang; Martelli, Paolo; Li, Fang; Xiong, Zijun; Wang, Jian; Yang, Huanming; Zhang, Guojie

2017-07-01

The Chinese crocodile lizard, Shinisaurus crocodilurus, is the only living representative of the monotypic family Shinisauridae under the order Squamata. It is an obligate semi-aquatic, viviparous, diurnal species restricted to specific portions of mountainous locations in southwestern China and northeastern Vietnam. However, in the past several decades, this species has undergone a rapid decrease in population size due to illegal poaching and habitat disruption, making this unique reptile species endangered and listed in the Convention on International Trade in Endangered Species of Wild Fauna and Flora Appendix II since 1990. A proposal to uplist it to Appendix I was passed at the Convention on International Trade in Endangered Species of Wild Fauna and Flora Seventeenth meeting of the Conference of the Parties in 2016. To promote the conservation of this species, we sequenced the genome of a male Chinese crocodile lizard using a whole-genome shotgun strategy on the Illumina HiSeq 2000 platform. In total, we generated ∼291 Gb of raw sequencing data (×149 depth) from 13 libraries with insert sizes ranging from 250 bp to 40 kb. After filtering for polymerase chain reaction-duplicated and low-quality reads, ∼137 Gb of clean data (×70 depth) were obtained for genome assembly. We yielded a draft genome assembly with a total length of 2.24 Gb and an N50 scaffold size of 1.47 Mb. The assembled genome was predicted to contain 20 150 protein-coding genes and up to 1114 Mb (49.6%) of repetitive elements. The genomic resource of the Chinese crocodile lizard will contribute to deciphering the biology of this organism and provides an essential tool for conservation efforts. It also provides a valuable resource for future study of squamate evolution. © The Authors 2017. Published by Oxford University Press.
Genome assembly and transcriptome resource for river buffalo, Bubalus bubalis (2n = 50)

PubMed Central

Iamartino, Daniela; Pruitt, Kim D; Sonstegard, Tad; Smith, Timothy P L; Low, Wai Yee; Biagini, Tommaso; Bomba, Lorenzo; Capomaccio, Stefano; Castiglioni, Bianca; Coletta, Angelo; Corrado, Federica; Ferré, Fabrizio; Iannuzzi, Leopoldo; Lawley, Cynthia; Macciotta, Nicolò; McClure, Matthew; Mancini, Giordano; Matassino, Donato; Mazza, Raffaele; Milanesi, Marco; Moioli, Bianca; Morandi, Nicola; Ramunno, Luigi; Peretti, Vincenzo; Pilla, Fabio; Ramelli, Paola; Schroeder, Steven; Strozzi, Francesco; Thibaud-Nissen, Francoise; Zicarelli, Luigi; Ajmone-Marsan, Paolo; Valentini, Alessio; Chillemi, Giovanni; Zimin, Aleksey

2017-01-01

Abstract Water buffalo is a globally important species for agriculture and local economies. A de novo assembled, well-annotated reference sequence for the water buffalo is an important prerequisite for studying the biology of this species, and is necessary to manage genetic diversity and to use modern breeding and genomic selection techniques. However, no such genome assembly has been previously reported. There are 2 species of domestic water buffalo, the river (2n = 50) and the swamp (2n = 48) buffalo. Here we describe a draft quality reference sequence for the river buffalo created from Illumina GA and Roche 454 short read sequences using the MaSuRCA assembler. The assembled sequence is 2.83 Gb, consisting of 366 983 scaffolds with a scaffold N50 of 1.41 Mb and contig N50 of 21 398 bp. Annotation of the genome was supported by transcriptome data from 30 tissues and identified 21 711 predicted protein coding genes. Searches for complete mammalian BUSCO gene groups found 98.6% of curated single copy orthologs present among predicted genes, which suggests a high level of completeness of the genome. The annotated sequence is available from NCBI at accession GCA_000471725.1. PMID:29048578
Whole-Genome Sequence Analysis of Bombella intestini LMG 28161T, a Novel Acetic Acid Bacterium Isolated from the Crop of a Red-Tailed Bumble Bee, Bombus lapidarius.

PubMed

Li, Leilei; Illeghems, Koen; Van Kerrebroeck, Simon; Borremans, Wim; Cleenwerck, Ilse; Smagghe, Guy; De Vuyst, Luc; Vandamme, Peter

2016-01-01

The whole-genome sequence of Bombella intestini LMG 28161T, an endosymbiotic acetic acid bacterium (AAB) occurring in bumble bees, was determined to investigate the molecular mechanisms underlying its metabolic capabilities. The draft genome sequence of B. intestini LMG 28161T was 2.02 Mb. Metabolic carbohydrate pathways were in agreement with the metabolite analyses of fermentation experiments and revealed its oxidative capacity towards sucrose, D-glucose, D-fructose and D-mannitol, but not ethanol and glycerol. The results of the fermentation experiments also demonstrated that the lack of effective aeration in small-scale carbohydrate consumption experiments may be responsible for the lack of reproducibility of such results in taxonomic studies of AAB. Finally, compared to the genome sequences of its nearest phylogenetic neighbor and of three other insect associated AAB strains, the B. intestini LMG 28161T genome lost 69 orthologs and included 89 unique genes. Although many of the latter were hypothetical they also included several type IV secretion system proteins, amino acid transporter/permeases and membrane proteins which might play a role in the interaction with the bumble bee host.
Dominant ectosymbiotic bacteria of cellulolytic protists in the termite gut also have the potential to digest lignocellulose.

PubMed

Yuki, Masahiro; Kuwahara, Hirokazu; Shintani, Masaki; Izawa, Kazuki; Sato, Tomoyuki; Starns, David; Hongoh, Yuichi; Ohkuma, Moriya

2015-12-01

Wood-feeding lower termites harbour symbiotic gut protists that support the termite nutritionally by degrading recalcitrant lignocellulose. These protists themselves host specific endo- and ectosymbiotic bacteria, functions of which remain largely unknown. Here, we present draft genomes of a dominant, uncultured ectosymbiont belonging to the order Bacteroidales, 'Candidatus Symbiothrix dinenymphae', which colonizes the cell surface of the cellulolytic gut protists Dinenympha spp. We analysed four single-cell genomes of Ca. S. dinenymphae, the highest genome completeness was estimated to be 81.6-82.3% with a predicted genome size of 4.28-4.31 Mb. The genome retains genes encoding large parts of the amino acid, cofactor and nucleotide biosynthetic pathways. In addition, the genome contains genes encoding various glycoside hydrolases such as endoglucanases and hemicellulases. The genome indicates that Ca. S. dinenymphae ferments lignocellulose-derived monosaccharides to acetate, a major carbon and energy source of the host termite. We suggest that the ectosymbiont digests lignocellulose and provides nutrients to the host termites, and hypothesize that the hydrolytic activity might also function as a pretreatment for the host protist to effectively decompose the crystalline cellulose components. © 2015 Society for Applied Microbiology and John Wiley & Sons Ltd.
Identification and Characterization of Microsatellite Markers Derived from the Whole Genome Analysis of Taenia solium

PubMed Central

Pajuelo, Mónica J.; Eguiluz, María; Dahlstrom, Eric; Requena, David; Guzmán, Frank; Ramirez, Manuel; Sheen, Patricia; Frace, Michael; Sammons, Scott; Cama, Vitaliano; Anzick, Sarah; Bruno, Dan; Mahanty, Siddhartha; Wilkins, Patricia; Nash, Theodore; Gonzalez, Armando; García, Héctor H.; Gilman, Robert H.; Porcella, Steve; Zimic, Mirko

2015-01-01

Background Infections with Taenia solium are the most common cause of adult acquired seizures worldwide, and are the leading cause of epilepsy in developing countries. A better understanding of the genetic diversity of T. solium will improve parasite diagnostics and transmission pathways in endemic areas thereby facilitating the design of future control measures and interventions. Microsatellite markers are useful genome features, which enable strain typing and identification in complex pathogen genomes. Here we describe microsatellite identification and characterization in T. solium, providing information that will assist in global efforts to control this important pathogen. Methods For genome sequencing, T. solium cysts and proglottids were collected from Huancayo and Puno in Peru, respectively. Using next generation sequencing (NGS) and de novo assembly, we assembled two draft genomes and one hybrid genome. Microsatellite sequences were identified and 36 of them were selected for further analysis. Twenty T. solium isolates were collected from Tumbes in the northern region, and twenty from Puno in the southern region of Peru. The size-polymorphism of the selected microsatellites was determined with multi-capillary electrophoresis. We analyzed the association between microsatellite polymorphism and the geographic origin of the samples. Results The predicted size of the hybrid (proglottid genome combined with cyst genome) T. solium genome was 111 MB with a GC content of 42.54%. A total of 7,979 contigs (>1,000 nt) were obtained. We identified 9,129 microsatellites in the Puno-proglottid genome and 9,936 in the Huancayo-cyst genome, with 5 or more repeats, ranging from mono- to hexa-nucleotide. Seven microsatellites were polymorphic and 29 were monomorphic within the analyzed isolates. T. solium tapeworms were classified into two genetic groups that correlated with the North/South geographic origin of the parasites. Conclusions/Significance The availability of draft genomes for T. solium represents a significant step towards the understanding the biology of the parasite. We report here a set of T. solium polymorphic microsatellite markers that appear promising for genetic epidemiology studies. PMID:26697878
The Jujube Genome Provides Insights into Genome Evolution and the Domestication of Sweetness/Acidity Taste in Fruit Trees

PubMed Central

Wan, KangKang; Zhang, Zhong; Pang, Xiaoming; Yin, Xiao; Bai, Yang; Sun, Xiaoqing; Gao, Lizhi; Li, Ruiqiang; Zhang, Jinbo

2016-01-01

Jujube (Ziziphus jujuba Mill.) belongs to the Rhamnaceae family and is a popular fruit tree species with immense economic and nutritional value. Here, we report a draft genome of the dry jujube cultivar ‘Junzao’ and the genome resequencing of 31 geographically diverse accessions of cultivated and wild jujubes (Ziziphus jujuba var. spinosa). Comparative analysis revealed that the genome of ‘Dongzao’, a fresh jujube, was ~86.5 Mb larger than that of the ‘Junzao’, partially due to the recent insertions of transposable elements in the ‘Dongzao’ genome. We constructed eight proto-chromosomes of the common ancestor of Rhamnaceae and Rosaceae, two sister families in the order Rosales, and elucidated the evolutionary processes that have shaped the genome structures of modern jujubes. Population structure analysis revealed the complex genetic background of jujubes resulting from extensive hybridizations between jujube and its wild relatives. Notably, several key genes that control fruit organic acid metabolism and sugar content were identified in the selective sweep regions. We also identified S-locus genes controlling gametophytic self-incompatibility and investigated haplotype patterns of the S locus in the jujube genomes, which would provide a guideline for parent selection for jujube crossbreeding. This study provides valuable genomic resources for jujube improvement, and offers insights into jujube genome evolution and its population structure and domestication. PMID:28005948
The Jujube Genome Provides Insights into Genome Evolution and the Domestication of Sweetness/Acidity Taste in Fruit Trees.

PubMed

Huang, Jian; Zhang, Chunmei; Zhao, Xing; Fei, Zhangjun; Wan, KangKang; Zhang, Zhong; Pang, Xiaoming; Yin, Xiao; Bai, Yang; Sun, Xiaoqing; Gao, Lizhi; Li, Ruiqiang; Zhang, Jinbo; Li, Xingang

2016-12-01

Jujube (Ziziphus jujuba Mill.) belongs to the Rhamnaceae family and is a popular fruit tree species with immense economic and nutritional value. Here, we report a draft genome of the dry jujube cultivar 'Junzao' and the genome resequencing of 31 geographically diverse accessions of cultivated and wild jujubes (Ziziphus jujuba var. spinosa). Comparative analysis revealed that the genome of 'Dongzao', a fresh jujube, was ~86.5 Mb larger than that of the 'Junzao', partially due to the recent insertions of transposable elements in the 'Dongzao' genome. We constructed eight proto-chromosomes of the common ancestor of Rhamnaceae and Rosaceae, two sister families in the order Rosales, and elucidated the evolutionary processes that have shaped the genome structures of modern jujubes. Population structure analysis revealed the complex genetic background of jujubes resulting from extensive hybridizations between jujube and its wild relatives. Notably, several key genes that control fruit organic acid metabolism and sugar content were identified in the selective sweep regions. We also identified S-locus genes controlling gametophytic self-incompatibility and investigated haplotype patterns of the S locus in the jujube genomes, which would provide a guideline for parent selection for jujube crossbreeding. This study provides valuable genomic resources for jujube improvement, and offers insights into jujube genome evolution and its population structure and domestication.
SCRaMbLE generates designed combinatorial stochastic diversity in synthetic chromosomes.

PubMed

Shen, Yue; Stracquadanio, Giovanni; Wang, Yun; Yang, Kun; Mitchell, Leslie A; Xue, Yaxin; Cai, Yizhi; Chen, Tai; Dymond, Jessica S; Kang, Kang; Gong, Jianhui; Zeng, Xiaofan; Zhang, Yongfen; Li, Yingrui; Feng, Qiang; Xu, Xun; Wang, Jun; Wang, Jian; Yang, Huanming; Boeke, Jef D; Bader, Joel S

2016-01-01

Synthetic chromosome rearrangement and modification by loxP-mediated evolution (SCRaMbLE) generates combinatorial genomic diversity through rearrangements at designed recombinase sites. We applied SCRaMbLE to yeast synthetic chromosome arm synIXR (43 recombinase sites) and then used a computational pipeline to infer or unscramble the sequence of recombinations that created the observed genomes. Deep sequencing of 64 synIXR SCRaMbLE strains revealed 156 deletions, 89 inversions, 94 duplications, and 55 additional complex rearrangements; several duplications are consistent with a double rolling circle mechanism. Every SCRaMbLE strain was unique, validating the capability of SCRaMbLE to explore a diverse space of genomes. Rearrangements occurred exclusively at designed loxPsym sites, with no significant evidence for ectopic rearrangements or mutations involving synthetic regions, the 99% nonsynthetic nuclear genome, or the mitochondrial genome. Deletion frequencies identified genes required for viability or fast growth. Replacement of 3' UTR by non-UTR sequence had surprisingly little effect on fitness. SCRaMbLE generates genome diversity in designated regions, reveals fitness constraints, and should scale to simultaneous evolution of multiple synthetic chromosomes. © 2016 Shen et al.; Published by Cold Spring Harbor Laboratory Press.
The reconstruction of 2,631 draft metagenome-assembled genomes from the global oceans.

PubMed

Tully, Benjamin J; Graham, Elaina D; Heidelberg, John F

2018-01-16

Microorganisms play a crucial role in mediating global biogeochemical cycles in the marine environment. By reconstructing the genomes of environmental organisms through metagenomics, researchers are able to study the metabolic potential of Bacteria and Archaea that are resistant to isolation in the laboratory. Utilizing the large metagenomic dataset generated from 234 samples collected during the Tara Oceans circumnavigation expedition, we were able to assemble 102 billion paired-end reads into 562 million contigs, which in turn were co-assembled and consolidated in to 7.2 million contigs ≥2 kb in length. Approximately 1 million of these contigs were binned to reconstruct draft genomes. In total, 2,631 draft genomes with an estimated completion of ≥50% were generated (1,491 draft genomes >70% complete; 603 genomes >90% complete). A majority of the draft genomes were manually assigned phylogeny based on sets of concatenated phylogenetic marker genes and/or 16S rRNA gene sequences. The draft genomes are now publically available for the research community at-large.
Whole genome sequencing and analysis of plant growth promoting bacteria isolated from the rhizosphere of plantation crops coconut, cocoa and arecanut.

PubMed

Gupta, Alka; Gopal, Murali; Thomas, George V; Manikandan, Vinu; Gajewski, John; Thomas, George; Seshagiri, Somasekar; Schuster, Stephan C; Rajesh, Preeti; Gupta, Ravi

2014-01-01

Coconut, cocoa and arecanut are commercial plantation crops that play a vital role in the Indian economy while sustaining the livelihood of more than 10 million Indians. According to 2012 Food and Agricultural organization's report, India is the third largest producer of coconut and it dominates the production of arecanut worldwide. In this study, three Plant Growth Promoting Rhizobacteria (PGPR) from coconut (CPCRI-1), cocoa (CPCRI-2) and arecanut (CPCRI-3) characterized for the PGP activities have been sequenced. The draft genome sizes were 4.7 Mb (56% GC), 5.9 Mb (63.6% GC) and 5.1 Mb (54.8% GB) for CPCRI-1, CPCRI-2, CPCRI-3, respectively. These genomes encoded 4056 (CPCRI-1), 4637 (CPCRI-2) and 4286 (CPCRI-3) protein-coding genes. Phylogenetic analysis revealed that both CPCRI-1 and CPCRI-3 belonged to Enterobacteriaceae family, while, CPCRI-2 was a Pseudomonadaceae family member. Functional annotation of the genes predicted that all three bacteria encoded genes needed for mineral phosphate solubilization, siderophores, acetoin, butanediol, 1-aminocyclopropane-1-carboxylate (ACC) deaminase, chitinase, phenazine, 4-hydroxybenzoate, trehalose and quorum sensing molecules supportive of the plant growth promoting traits observed in the course of their isolation and characterization. Additionally, in all the three CPCRI PGPRs, we identified genes involved in synthesis of hydrogen sulfide (H2S), which recently has been proposed to aid plant growth. The PGPRs also carried genes for central carbohydrate metabolism indicating that the bacteria can efficiently utilize the root exudates and other organic materials as energy source. Genes for production of peroxidases, catalases and superoxide dismutases that confer resistance to oxidative stresses in plants were identified. Besides these, genes for heat shock tolerance, cold shock tolerance and glycine-betaine production that enable bacteria to survive abiotic stress were also identified.
Use of low-coverage, large-insert, short-read data for rapid and accurate generation of enhanced-quality draft Pseudomonas genome sequences.

PubMed

O'Brien, Heath E; Gong, Yunchen; Fung, Pauline; Wang, Pauline W; Guttman, David S

2011-01-01

Next-generation genomic technology has both greatly accelerated the pace of genome research as well as increased our reliance on draft genome sequences. While groups such as the Genomics Standards Consortium have made strong efforts to promote genome standards there is a still a general lack of uniformity among published draft genomes, leading to challenges for downstream comparative analyses. This lack of uniformity is a particular problem when using standard draft genomes that frequently have large numbers of low-quality sequencing tracts. Here we present a proposal for an "enhanced-quality draft" genome that identifies at least 95% of the coding sequences, thereby effectively providing a full accounting of the genic component of the genome. Enhanced-quality draft genomes are easily attainable through a combination of small- and large-insert next-generation, paired-end sequencing. We illustrate the generation of an enhanced-quality draft genome by re-sequencing the plant pathogenic bacterium Pseudomonas syringae pv. phaseolicola 1448A (Pph 1448A), which has a published, closed genome sequence of 5.93 Mbp. We use a combination of Illumina paired-end and mate-pair sequencing, and surprisingly find that de novo assemblies with 100x paired-end coverage and mate-pair sequencing with as low as low as 2-5x coverage are substantially better than assemblies based on higher coverage. The rapid and low-cost generation of large numbers of enhanced-quality draft genome sequences will be of particular value for microbial diagnostics and biosecurity, which rely on precise discrimination of potentially dangerous clones from closely related benign strains.
Three draft genomes of Vibrio coralliilyticus strains isolated from bivalve hatcheries

USDA-ARS?s Scientific Manuscript database

Reported here are the draft genomes of three Vibrio coralliilyticus isolates RE87, AIC-7, and 080116A. Each strain was isolated in association with diseased oyster larvae in commercial aquaculture systems. These draft genomes will be useful for further studies in understanding the genomic features...
Phylogenomic and MALDI-TOF MS Analysis of Streptococcus sinensis HKU4T Reveals a Distinct Phylogenetic Clade in the Genus Streptococcus

PubMed Central

Tse, Herman; Chen, Jonathan H.K.; Tang, Ying; Lau, Susanna K.P.; Woo, Patrick C.Y.

2014-01-01

Streptococcus sinensis is a recently discovered human pathogen isolated from blood cultures of patients with infective endocarditis. Its phylogenetic position, as well as those of its closely related species, remains inconclusive when single genes were used for phylogenetic analysis. For example, S. sinensis branched out from members of the anginosus, mitis, and sanguinis groups in the 16S ribosomal RNA gene phylogenetic tree, but it was clustered with members of the anginosus and sanguinis groups when groEL gene sequences used for analysis. In this study, we sequenced the draft genome of S. sinensis and used a polyphasic approach, including concatenated genes, whole genomes, and matrix-assisted laser desorption ionization-time of flight mass spectrometry to analyze the phylogeny of S. sinensis. The size of the S. sinensis draft genome is 2.06 Mb, with GC content of 42.2%. Phylogenetic analysis using 50 concatenated genes or whole genomes revealed that S. sinensis formed a distinct cluster with Streptococcus oligofermentans and Streptococcus cristatus, and these three streptococci were clustered with the “sanguinis group.” As for phylogenetic analysis using hierarchical cluster analysis of the mass spectra of streptococci, S. sinensis also formed a distinct cluster with S. oligofermentans and S. cristatus, but these three streptococci were clustered with the “mitis group.” On the basis of the findings, we propose a novel group, named “sinensis group,” to include S. sinensis, S. oligofermentans, and S. cristatus, in the Streptococcus genus. Our study also illustrates the power of phylogenomic analyses for resolving ambiguities in bacterial taxonomy. PMID:25331233
Phylogenomic and MALDI-TOF MS analysis of Streptococcus sinensis HKU4T reveals a distinct phylogenetic clade in the genus Streptococcus.

PubMed

Teng, Jade L L; Huang, Yi; Tse, Herman; Chen, Jonathan H K; Tang, Ying; Lau, Susanna K P; Woo, Patrick C Y

2014-10-20

Streptococcus sinensis is a recently discovered human pathogen isolated from blood cultures of patients with infective endocarditis. Its phylogenetic position, as well as those of its closely related species, remains inconclusive when single genes were used for phylogenetic analysis. For example, S. sinensis branched out from members of the anginosus, mitis, and sanguinis groups in the 16S ribosomal RNA gene phylogenetic tree, but it was clustered with members of the anginosus and sanguinis groups when groEL gene sequences used for analysis. In this study, we sequenced the draft genome of S. sinensis and used a polyphasic approach, including concatenated genes, whole genomes, and matrix-assisted laser desorption ionization-time of flight mass spectrometry to analyze the phylogeny of S. sinensis. The size of the S. sinensis draft genome is 2.06 Mb, with GC content of 42.2%. Phylogenetic analysis using 50 concatenated genes or whole genomes revealed that S. sinensis formed a distinct cluster with Streptococcus oligofermentans and Streptococcus cristatus, and these three streptococci were clustered with the "sanguinis group." As for phylogenetic analysis using hierarchical cluster analysis of the mass spectra of streptococci, S. sinensis also formed a distinct cluster with S. oligofermentans and S. cristatus, but these three streptococci were clustered with the "mitis group." On the basis of the findings, we propose a novel group, named "sinensis group," to include S. sinensis, S. oligofermentans, and S. cristatus, in the Streptococcus genus. Our study also illustrates the power of phylogenomic analyses for resolving ambiguities in bacterial taxonomy. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
The Draft Genome and Transcriptome of Amaranthus hypochondriacus: A C4 Dicot Producing High-Lysine Edible Pseudo-Cereal

PubMed Central

Sunil, Meeta; Hariharan, Arun K.; Nayak, Soumya; Gupta, Saurabh; Nambisan, Suran R.; Gupta, Ravi P.; Panda, Binay; Choudhary, Bibha; Srinivasan, Subhashini

2014-01-01

Grain amaranths, edible C4 dicots, produce pseudo-cereals high in lysine. Lysine being one of the most limiting essential amino acids in cereals and C4 photosynthesis being one of the most sought-after phenotypes in protein-rich legume crops, the genome of one of the grain amaranths is likely to play a critical role in crop research. We have sequenced the genome and transcriptome of Amaranthus hypochondriacus, a diploid (2n = 32) belonging to the order Caryophyllales with an estimated genome size of 466 Mb. Of the 411 linkage single-nucleotide polymorphisms (SNPs) reported for grain amaranths, 355 SNPs (86%) are represented in the scaffolds and 74% of the 8.6 billion bases of the sequenced transcriptome map to the genomic scaffolds. The genome of A. hypochondriacus, codes for at least 24,829 proteins, shares the paleohexaploidy event with species under the superorders Rosids and Asterids, harbours 1 SNP in 1,000 bases, and contains 13.76% of repeat elements. Annotation of all the genes in the lysine biosynthetic pathway using comparative genomics and expression analysis offers insights into the high-lysine phenotype. As the first grain species under Caryophyllales and the first C4 dicot genome reported, the work presented here will be beneficial in improving crops and in expanding our understanding of angiosperm evolution. PMID:25071079
Whole-genome sequencing of the efficient industrial fuel-ethanol fermentative Saccharomyces cerevisiae strain CAT-1.

PubMed

Babrzadeh, Farbod; Jalili, Roxana; Wang, Chunlin; Shokralla, Shadi; Pierce, Sarah; Robinson-Mosher, Avi; Nyren, Pål; Shafer, Robert W; Basso, Luiz C; de Amorim, Henrique V; de Oliveira, Antonio J; Davis, Ronald W; Ronaghi, Mostafa; Gharizadeh, Baback; Stambuk, Boris U

2012-06-01

The Saccharomyces cerevisiae strains widely used for industrial fuel-ethanol production have been developed by selection, but their underlying beneficial genetic polymorphisms remain unknown. Here, we report the draft whole-genome sequence of the S. cerevisiae strain CAT-1, which is a dominant fuel-ethanol fermentative strain from the sugarcane industry in Brazil. Our results indicate that strain CAT-1 is a highly heterozygous diploid yeast strain, and the ~12-Mb genome of CAT-1, when compared with the reference S228c genome, contains ~36,000 homozygous and ~30,000 heterozygous single nucleotide polymorphisms, exhibiting an uneven distribution among chromosomes due to large genomic regions of loss of heterozygosity (LOH). In total, 58 % of the 6,652 predicted protein-coding genes of the CAT-1 genome constitute different alleles when compared with the genes present in the reference S288c genome. The CAT-1 genome contains a reduced number of transposable elements, as well as several gene deletions and duplications, especially at telomeric regions, some correlated with several of the physiological characteristics of this industrial fuel-ethanol strain. Phylogenetic analyses revealed that some genes were likely associated with traits important for bioethanol production. Identifying and characterizing the allelic variations controlling traits relevant to industrial fermentation should provide the basis for a forward genetics approach for developing better fermenting yeast strains.

The reconstruction of 2,631 draft metagenome-assembled genomes from the global oceans

PubMed Central

Tully, Benjamin J.; Graham, Elaina D.; Heidelberg, John F.

2018-01-01

Microorganisms play a crucial role in mediating global biogeochemical cycles in the marine environment. By reconstructing the genomes of environmental organisms through metagenomics, researchers are able to study the metabolic potential of Bacteria and Archaea that are resistant to isolation in the laboratory. Utilizing the large metagenomic dataset generated from 234 samples collected during the Tara Oceans circumnavigation expedition, we were able to assemble 102 billion paired-end reads into 562 million contigs, which in turn were co-assembled and consolidated in to 7.2 million contigs ≥2 kb in length. Approximately 1 million of these contigs were binned to reconstruct draft genomes. In total, 2,631 draft genomes with an estimated completion of ≥50% were generated (1,491 draft genomes >70% complete; 603 genomes >90% complete). A majority of the draft genomes were manually assigned phylogeny based on sets of concatenated phylogenetic marker genes and/or 16S rRNA gene sequences. The draft genomes are now publically available for the research community at-large. PMID:29337314
Genome Sequence of the Pea Aphid Acyrthosiphon pisum

PubMed Central

2010-01-01

Aphids are important agricultural pests and also biological models for studies of insect-plant interactions, symbiosis, virus vectoring, and the developmental causes of extreme phenotypic plasticity. Here we present the 464 Mb draft genome assembly of the pea aphid Acyrthosiphon pisum. This first published whole genome sequence of a basal hemimetabolous insect provides an outgroup to the multiple published genomes of holometabolous insects. Pea aphids are host-plant specialists, they can reproduce both sexually and asexually, and they have coevolved with an obligate bacterial symbiont. Here we highlight findings from whole genome analysis that may be related to these unusual biological features. These findings include discovery of extensive gene duplication in more than 2000 gene families as well as loss of evolutionarily conserved genes. Gene family expansions relative to other published genomes include genes involved in chromatin modification, miRNA synthesis, and sugar transport. Gene losses include genes central to the IMD immune pathway, selenoprotein utilization, purine salvage, and the entire urea cycle. The pea aphid genome reveals that only a limited number of genes have been acquired from bacteria; thus the reduced gene count of Buchnera does not reflect gene transfer to the host genome. The inventory of metabolic genes in the pea aphid genome suggests that there is extensive metabolite exchange between the aphid and Buchnera, including sharing of amino acid biosynthesis between the aphid and Buchnera. The pea aphid genome provides a foundation for post-genomic studies of fundamental biological questions and applied agricultural problems. PMID:20186266
Benefits of Genomic Insights and CRISPR-Cas Signatures to Monitor Potential Pathogens across Drinking Water Production and Distribution Systems

PubMed Central

Zhang, Ya; Kitajima, Masaaki; Whittle, Andrew J.; Liu, Wen-Tso

2017-01-01

The occurrence of pathogenic bacteria in drinking water distribution systems (DWDSs) is a major health concern, and our current understanding is mostly related to pathogenic species such as Legionella pneumophila and Mycobacterium avium but not to bacterial species closely related to them. In this study, genomic-based approaches were used to characterize pathogen-related species in relation to their abundance, diversity, potential pathogenicity, genetic exchange, and distribution across an urban drinking water system. Nine draft genomes recovered from 10 metagenomes were identified as Legionella (4 draft genomes), Mycobacterium (3 draft genomes), Parachlamydia (1 draft genome), and Leptospira (1 draft genome). The pathogenicity potential of these genomes was examined by the presence/absence of virulence machinery, including genes belonging to Type III, IV, and VII secretion systems and their effectors. Several virulence factors known to pathogenic species were detected with these retrieved draft genomes except the Leptospira-related genome. Identical clustered regularly interspaced short palindromic repeats-CRISPR-associated proteins (CRISPR-Cas) genetic signatures were observed in two draft genomes recovered at different stages of the studied system, suggesting that the spacers in CRISPR-Cas could potentially be used as a biomarker in the monitoring of Legionella related strains at an evolutionary scale of several years across different drinking water production and distribution systems. Overall, metagenomics approach was an effective and complementary tool of culturing techniques to gain insights into the pathogenic characteristics and the CRISPR-Cas signatures of pathogen-related species in DWDSs. PMID:29097994
Genome scaffolding and annotation for the pathogen vector Ixodes ricinus by ultra-long single molecule sequencing.

PubMed

Cramaro, Wibke J; Hunewald, Oliver E; Bell-Sakyi, Lesley; Muller, Claude P

2017-02-08

Global warming and other ecological changes have facilitated the expansion of Ixodes ricinus tick populations. Ixodes ricinus is the most important carrier of vector-borne pathogens in Europe, transmitting viruses, protozoa and bacteria, in particular Borrelia burgdorferi (sensu lato), the causative agent of Lyme borreliosis, the most prevalent vector-borne disease in humans in the Northern hemisphere. To faster control this disease vector, a better understanding of the I. ricinus tick is necessary. To facilitate such studies, we recently published the first reference genome of this highly prevalent pathogen vector. Here, we further extend these studies by scaffolding and annotating the first reference genome by using ultra-long sequencing reads from third generation single molecule sequencing. In addition, we present the first genome size estimation for I. ricinus ticks and the embryo-derived cell line IRE/CTVM19. 235,953 contigs were integrated into 204,904 scaffolds, extending the currently known genome lengths by more than 30% from 393 to 516 Mb and the N50 contig value by 87% from 1643 bp to a N50 scaffold value of 3067 bp. In addition, 25,263 sequences were annotated by comparison to the tick's North American relative Ixodes scapularis. After (conserved) hypothetical proteins, zinc finger proteins, secreted proteins and P450 coding proteins were the most prevalent protein categories annotated. Interestingly, more than 50% of the amino acid sequences matching the homology threshold had 95-100% identity to the corresponding I. scapularis gene models. The sequence information was complemented by the first genome size estimation for this species. Flow cytometry-based genome size analysis revealed a haploid genome size of 2.65Gb for I. ricinus ticks and 3.80 Gb for the cell line. We present a first draft sequence map of the I. ricinus genome based on a PacBio-Illumina assembly. The I. ricinus genome was shown to be 26% (500 Mb) larger than the genome of its American relative I. scapularis. Based on the genome size of 2.65 Gb we estimated that we covered about 67% of the non-repetitive sequences. Genome annotation will facilitate screening for specific molecular pathways in I. ricinus cells and provides an overview of characteristics and functions.
The draft genome of Mycobacterium aurum, a potential model organism for investigating drugs against Mycobacterium tuberculosis and Mycobacterium leprae.

PubMed

Phelan, Jody; Maitra, Arundhati; McNerney, Ruth; Nair, Mridul; Gupta, Antima; Coll, Francesc; Pain, Arnab; Bhakta, Sanjib; Clark, Taane G

2015-09-01

Mycobacterium aurum (M. aurum) is an environmental mycobacteria that has previously been used in studies of anti-mycobacterial drugs due to its fast growth rate and low pathogenicity. The M. aurum genome has been sequenced and assembled into 46 contigs, with a total length of 6.02Mb containing 5684 annotated protein-coding genes. A phylogenetic analysis using whole genome alignments positioned M. aurum close to Mycobacterium vaccae and Mycobacterium vanbaalenii, within a clade related to fast-growing mycobacteria. Large-scale genomic rearrangements were identified by comparing the M. aurum genome to those of Mycobacterium tuberculosis and Mycobacterium leprae. M. aurum orthologous genes implicated in resistance to anti-tuberculosis drugs in M. tuberculosis were observed. The sequence identity at the DNA level varied from 68.6% for pncA (pyrazinamide drug-related) to 96.2% for rrs (streptomycin, capreomycin). We observed two homologous genes encoding the catalase-peroxidase enzyme (katG) that is associated with resistance to isoniazid. Similarly, two embB homologues were identified in the M. aurum genome. In addition to describing for the first time the genome of M. aurum, this work provides a resource to aid the use of M. aurum in studies to develop improved drugs for the pathogenic mycobacteria M. tuberculosis and M. leprae. Copyright © 2015 Asian-African Society for Mycobacteriology. Published by Elsevier Ltd. All rights reserved.
Genome Sequence and Transcriptome Analyses of Chrysochromulina tobin: Metabolic Tools for Enhanced Algal Fitness in the Prominent Order Prymnesiales (Haptophyceae)

PubMed Central

Hovde, Blake T.; Deodato, Chloe R.; Hunsperger, Heather M.; Ryken, Scott A.; Yost, Will; Jha, Ramesh K.; Patterson, Johnathan; Monnat, Raymond J.; Barlow, Steven B.; Starkenburg, Shawn R.; Cattolico, Rose Ann

2015-01-01

Haptophytes are recognized as seminal players in aquatic ecosystem function. These algae are important in global carbon sequestration, form destructive harmful blooms, and given their rich fatty acid content, serve as a highly nutritive food source to a broad range of eco-cohorts. Haptophyte dominance in both fresh and marine waters is supported by the mixotrophic nature of many taxa. Despite their importance the nuclear genome sequence of only one haptophyte, Emiliania huxleyi (Isochrysidales), is available. Here we report the draft genome sequence of Chrysochromulina tobin (Prymnesiales), and transcriptome data collected at seven time points over a 24-hour light/dark cycle. The nuclear genome of C. tobin is small (59 Mb), compact (∼40% of the genome is protein coding) and encodes approximately 16,777 genes. Genes important to fatty acid synthesis, modification, and catabolism show distinct patterns of expression when monitored over the circadian photoperiod. The C. tobin genome harbors the first hybrid polyketide synthase/non-ribosomal peptide synthase gene complex reported for an algal species, and encodes potential anti-microbial peptides and proteins involved in multidrug and toxic compound extrusion. A new haptophyte xanthorhodopsin was also identified, together with two “red” RuBisCO activases that are shared across many algal lineages. The Chrysochromulina tobin genome sequence provides new information on the evolutionary history, ecology and economic importance of haptophytes. PMID:26397803
Improved hybrid de novo genome assembly of domesticated apple (Malus x domestica).

PubMed

Li, Xuewei; Kui, Ling; Zhang, Jing; Xie, Yinpeng; Wang, Liping; Yan, Yan; Wang, Na; Xu, Jidi; Li, Cuiying; Wang, Wen; van Nocker, Steve; Dong, Yang; Ma, Fengwang; Guan, Qingmei

2016-08-08

Domesticated apple (Malus × domestica Borkh) is a popular temperate fruit with high nutrient levels and diverse flavors. In 2012, global apple production accounted for at least one tenth of all harvested fruits. A high-quality apple genome assembly is crucial for the selection and breeding of new cultivars. Currently, a single reference genome is available for apple, assembled from 16.9 × genome coverage short reads via Sanger and 454 sequencing technologies. Although a useful resource, this assembly covers only ~89 % of the non-repetitive portion of the genome, and has a relatively short (16.7 kb) contig N50 length. These downsides make it difficult to apply this reference in transcriptive or whole-genome re-sequencing analyses. Here we present an improved hybrid de novo genomic assembly of apple (Golden Delicious), which was obtained from 76 Gb (~102 × genome coverage) Illumina HiSeq data and 21.7 Gb (~29 × genome coverage) PacBio data. The final draft genome is approximately 632.4 Mb, representing ~ 90 % of the estimated genome. The contig N50 size is 111,619 bp, representing a 7 fold improvement. Further annotation analyses predicted 53,922 protein-coding genes and 2,765 non-coding RNA genes. The new apple genome assembly will serve as a valuable resource for investigating complex apple traits at the genomic level. It is not only suitable for genome editing and gene cloning, but also for RNA-seq and whole-genome re-sequencing studies.
Genome assembly and transcriptome resource for river buffalo, Bubalus bubalis (2n = 50).

PubMed

Williams, John L; Iamartino, Daniela; Pruitt, Kim D; Sonstegard, Tad; Smith, Timothy P L; Low, Wai Yee; Biagini, Tommaso; Bomba, Lorenzo; Capomaccio, Stefano; Castiglioni, Bianca; Coletta, Angelo; Corrado, Federica; Ferré, Fabrizio; Iannuzzi, Leopoldo; Lawley, Cynthia; Macciotta, Nicolò; McClure, Matthew; Mancini, Giordano; Matassino, Donato; Mazza, Raffaele; Milanesi, Marco; Moioli, Bianca; Morandi, Nicola; Ramunno, Luigi; Peretti, Vincenzo; Pilla, Fabio; Ramelli, Paola; Schroeder, Steven; Strozzi, Francesco; Thibaud-Nissen, Francoise; Zicarelli, Luigi; Ajmone-Marsan, Paolo; Valentini, Alessio; Chillemi, Giovanni; Zimin, Aleksey

2017-10-01

Water buffalo is a globally important species for agriculture and local economies. A de novo assembled, well-annotated reference sequence for the water buffalo is an important prerequisite for studying the biology of this species, and is necessary to manage genetic diversity and to use modern breeding and genomic selection techniques. However, no such genome assembly has been previously reported. There are 2 species of domestic water buffalo, the river (2 n = 50) and the swamp (2 n = 48) buffalo. Here we describe a draft quality reference sequence for the river buffalo created from Illumina GA and Roche 454 short read sequences using the MaSuRCA assembler. The assembled sequence is 2.83 Gb, consisting of 366 983 scaffolds with a scaffold N50 of 1.41 Mb and contig N50 of 21 398 bp. Annotation of the genome was supported by transcriptome data from 30 tissues and identified 21 711 predicted protein coding genes. Searches for complete mammalian BUSCO gene groups found 98.6% of curated single copy orthologs present among predicted genes, which suggests a high level of completeness of the genome. The annotated sequence is available from NCBI at accession GCA_000471725.1. © The Author 2017. Published by Oxford University Press.
Genome analysis of Hibiscus syriacus provides insights of polyploidization and indeterminate flowering in woody plants

PubMed Central

Kim, Yong-Min; Kim, Seungill; Koo, Namjin; Shin, Ah-Young; Yeom, Seon-In; Seo, Eunyoung; Park, Seong-Jin; Kang, Won-Hee; Kim, Myung-Shin; Park, Jieun; Jang, Insu; Kim, Pan-Gyu; Byeon, Iksu; Kim, Min-Seo; Choi, JinHyuk; Ko, Gunhwan; Hwang, JiHye; Yang, Tae-Jin; Choi, Sang-Bong; Lee, Je Min; Lim, Ki-Byung; Lee, Jungho; Choi, Ik-Young; Park, Beom-Seok; Kwon, Suk-Yoon; Choi, Doil

2017-01-01

Abstract Hibiscus syriacus (L.) (rose of Sharon) is one of the most widespread garden shrubs in the world. We report a draft of the H. syriacus genome comprised of a 1.75 Gb assembly that covers 92% of the genome with only 1.7% (33 Mb) gap sequences. Predicted gene modeling detected 87,603 genes, mostly supported by deep RNA sequencing data. To define gene family distribution among relatives of H. syriacus, orthologous gene sets containing 164,660 genes in 21,472 clusters were identified by OrthoMCL analysis of five plant species, including H. syriacus, Arabidopsis thaliana, Gossypium raimondii, Theobroma cacao and Amborella trichopoda. We inferred their evolutionary relationships based on divergence times among Malvaceae plant genes and found that gene families involved in flowering regulation and disease resistance were more highly divergent and expanded in H. syriacus than in its close relatives, G. raimondii (DD) and T. cacao. Clustered gene families and gene collinearity analysis revealed that two recent rounds of whole-genome duplication were followed by diploidization of the H. syriacus genome after speciation. Copy number variation and phylogenetic divergence indicates that WGDs and subsequent diploidization led to unequal duplication and deletion of flowering-related genes in H. syriacus and may affect its unique floral morphology. PMID:28011721
Draft sequencing and comparative genomics of Xylella fastidiosa strains reveal novel biological insights.

PubMed

Bhattacharyya, Anamitra; Stilwagen, Stephanie; Reznik, Gary; Feil, Helene; Feil, William S; Anderson, Iain; Bernal, Axel; D'Souza, Mark; Ivanova, Natalia; Kapatral, Vinayak; Larsen, Niels; Los, Tamara; Lykidis, Athanasios; Selkov, Eugene; Walunas, Theresa L; Purcell, Alexander; Edwards, Rob A; Hawkins, Trevor; Haselkorn, Robert; Overbeek, Ross; Kyrpides, Nikos C; Predki, Paul F

2002-10-01

Draft sequencing is a rapid and efficient method for determining the near-complete sequence of microbial genomes. Here we report a comparative analysis of one complete and two draft genome sequences of the phytopathogenic bacterium, Xylella fastidiosa, which causes serious disease in plants, including citrus, almond, and oleander. We present highlights of an in silico analysis based on a comparison of reconstructions of core biological subsystems. Cellular pathway reconstructions have been used to identify a small number of genes, which are likely to reside within the draft genomes but are not captured in the draft assembly. These represented only a small fraction of all genes and were predominantly large and small ribosomal subunit protein components. By using this approach, some of the inherent limitations of draft sequence can be significantly reduced. Despite the incomplete nature of the draft genomes, it is possible to identify several phage-related genes, which appear to be absent from the draft genomes and not the result of insufficient sequence sampling. This region may therefore identify potential host-specific functions. Based on this first functional reconstruction of a phytopathogenic microbe, we spotlight an unusual respiration machinery as a potential target for biological control. We also predicted and developed a new defined growth medium for Xylella.
CAR: contig assembly of prokaryotic draft genomes using rearrangements.

PubMed

Lu, Chin Lung; Chen, Kun-Tze; Huang, Shih-Yuan; Chiu, Hsien-Tai

2014-11-28

Next generation sequencing technology has allowed efficient production of draft genomes for many organisms of interest. However, most draft genomes are just collections of independent contigs, whose relative positions and orientations along the genome being sequenced are unknown. Although several tools have been developed to order and orient the contigs of draft genomes, more accurate tools are still needed. In this study, we present a novel reference-based contig assembly (or scaffolding) tool, named as CAR, that can efficiently and more accurately order and orient the contigs of a prokaryotic draft genome based on a reference genome of a related organism. Given a set of contigs in multi-FASTA format and a reference genome in FASTA format, CAR can output a list of scaffolds, each of which is a set of ordered and oriented contigs. For validation, we have tested CAR on a real dataset composed of several prokaryotic genomes and also compared its performance with several other reference-based contig assembly tools. Consequently, our experimental results have shown that CAR indeed performs better than all these other reference-based contig assembly tools in terms of sensitivity, precision and genome coverage. CAR serves as an efficient tool that can more accurately order and orient the contigs of a prokaryotic draft genome based on a reference genome. The web server of CAR is freely available at http://genome.cs.nthu.edu.tw/CAR/ and its stand-alone program can also be downloaded from the same website.
Draft Genome Sequences of Seven Thermophilic Spore-Forming Bacteria Isolated from Foods That Produce Highly Heat-Resistant Spores, Comprising Geobacillus spp., Caldibacillus debilis, and Anoxybacillus flavithermus

PubMed Central

Berendsen, Erwin M.; Wells-Bennik, Marjon H. J.; Krawczyk, Antonina O.; de Jong, Anne; van Heel, Auke; Holsappel, Siger; Eijlander, Robyn T.

2016-01-01

Here, we report the draft genomes of five strains of Geobacillus spp., one Caldibacillus debilis strain, and one draft genome of Anoxybacillus flavithermus, all thermophilic spore-forming Gram-positive bacteria. PMID:27151781
Draft genome of the gayal, Bos frontalis

PubMed Central

Wang, Ming-Shan; Zeng, Yan; Wang, Xiao; Nie, Wen-Hui; Wang, Jin-Huan; Su, Wei-Ting; Xiong, Zi-Jun; Wang, Sheng; Qu, Kai-Xing; Yan, Shou-Qing; Yang, Min-Min; Wang, Wen; Dong, Yang; Zhang, Ya-Ping

2017-01-01

Abstract Gayal (Bos frontalis), also known as mithan or mithun, is a large endangered semi-domesticated bovine that has a limited geographical distribution in the hill-forests of China, Northeast India, Bangladesh, Myanmar, and Bhutan. Many questions about the gayal such as its origin, population history, and genetic basis of local adaptation remain largely unresolved. De novo sequencing and assembly of the whole gayal genome provides an opportunity to address these issues. We report a high-depth sequencing, de novo assembly, and annotation of a female Chinese gayal genome. Based on the Illumina genomic sequencing platform, we have generated 350.38 Gb of raw data from 16 different insert-size libraries. A total of 276.86 Gb of clean data is retained after quality control. The assembled genome is about 2.85 Gb with scaffold and contig N50 sizes of 2.74 Mb and 14.41 kb, respectively. Repetitive elements account for 48.13% of the genome. Gene annotation has yielded 26 667 protein-coding genes, of which 97.18% have been functionally annotated. BUSCO assessment shows that our assembly captures 93% (3183 of 4104) of the core eukaryotic genes and 83.1% of vertebrate universal single-copy orthologs. We provide the first comprehensive de novo genome of the gayal. This genetic resource is integral for investigating the origin of the gayal and performing comparative genomic studies to improve understanding of the speciation and divergence of bovine species. The assembled genome could be used as reference in future population genetic studies of gayal. PMID:29048483
First High-Quality Draft Genome Sequence of Pasteurella multocida Sequence Type 128 Isolated from Infected Bone.

PubMed

Kavousi, Niloofar; Eng, Wilhelm Wei Han; Lee, Yin Peng; Tan, Lian Huat; Thuraisingham, Ravindran; Yule, Catherine M; Gan, Han Ming

2016-03-03

We report here the first high-quality draft genome sequence of Pasteurella multocida sequence type 128, which was isolated from the infected finger bone of an adult female who was bitten by a domestic dog. The draft genome will be a valuable addition to the scarce genomic resources available for P. multocida. Copyright © 2016 Kavousi et al.
Draft genome sequence of Streptomyces sp. strain F1, a potential source for glycoside hydrolases isolated from Brazilian soil.

PubMed

Melo, Ricardo Rodrigues de; Persinoti, Gabriela Felix; Paixão, Douglas Antonio Alvaredo; Squina, Fábio Márcio; Ruller, Roberto; Sato, Helia Harumi

Here, we show the draft genome sequence of Streptomyces sp. F1, a strain isolated from soil with great potential for secretion of hydrolytic enzymes used to deconstruct cellulosic biomass. The draft genome assembly of Streptomyces sp. strain F1 has 69 contigs with a total genome size of 8,142,296bp and G+C 72.65%. Preliminary genome analysis identified 175 proteins as Carbohydrate-Active Enzymes, being 85 glycoside hydrolases organized in 33 distinct families. This draft genome information provides new insights on the key genes encoding hydrolytic enzymes involved in biomass deconstruction employed by soil bacteria. Copyright © 2017 Sociedade Brasileira de Microbiologia. Published by Elsevier Editora Ltda. All rights reserved.
Multi-Omics Driven Assembly and Annotation of the Sandalwood (Santalum album) Genome.

PubMed

Mahesh, Hirehally Basavarajegowda; Subba, Pratigya; Advani, Jayshree; Shirke, Meghana Deepak; Loganathan, Ramya Malarini; Chandana, Shankara Lingu; Shilpa, Siddappa; Chatterjee, Oishi; Pinto, Sneha Maria; Prasad, Thottethodi Subrahmanya Keshava; Gowda, Malali

2018-04-01

Indian sandalwood ( Santalum album ) is an important tropical evergreen tree known for its fragrant heartwood-derived essential oil and its valuable carving wood. Here, we applied an integrated genomic, transcriptomic, and proteomic approach to assemble and annotate the Indian sandalwood genome. Our genome sequencing resulted in the establishment of a draft map of the smallest genome for any woody tree species to date (221 Mb). The genome annotation predicted 38,119 protein-coding genes and 27.42% repetitive DNA elements. In-depth proteome analysis revealed the identities of 72,325 unique peptides, which confirmed 10,076 of the predicted genes. The addition of transcriptomic and proteogenomic approaches resulted in the identification of 53 novel proteins and 34 gene-correction events that were missed by genomic approaches. Proteogenomic analysis also helped in reassigning 1,348 potential noncoding RNAs as bona fide protein-coding messenger RNAs. Gene expression patterns at the RNA and protein levels indicated that peptide sequencing was useful in capturing proteins encoded by nuclear and organellar genomes alike. Mass spectrometry-based proteomic evidence provided an unbiased approach toward the identification of proteins encoded by organellar genomes. Such proteins are often missed in transcriptome data sets due to the enrichment of only messenger RNAs that contain poly(A) tails. Overall, the use of integrated omic approaches enhanced the quality of the assembly and annotation of this nonmodel plant genome. The availability of genomic, transcriptomic, and proteomic data will enhance genomics-assisted breeding, germplasm characterization, and conservation of sandalwood trees. © 2018 American Society of Plant Biologists. All Rights Reserved.
Extensive Error in the Number of Genes Inferred from Draft Genome Assemblies

PubMed Central

Denton, James F.; Lugo-Martinez, Jose; Tucker, Abraham E.; Schrider, Daniel R.; Warren, Wesley C.; Hahn, Matthew W.

2014-01-01

Current sequencing methods produce large amounts of data, but genome assemblies based on these data are often woefully incomplete. These incomplete and error-filled assemblies result in many annotation errors, especially in the number of genes present in a genome. In this paper we investigate the magnitude of the problem, both in terms of total gene number and the number of copies of genes in specific families. To do this, we compare multiple draft assemblies against higher-quality versions of the same genomes, using several new assemblies of the chicken genome based on both traditional and next-generation sequencing technologies, as well as published draft assemblies of chimpanzee. We find that upwards of 40% of all gene families are inferred to have the wrong number of genes in draft assemblies, and that these incorrect assemblies both add and subtract genes. Using simulated genome assemblies of Drosophila melanogaster, we find that the major cause of increased gene numbers in draft genomes is the fragmentation of genes onto multiple individual contigs. Finally, we demonstrate the usefulness of RNA-Seq in improving the gene annotation of draft assemblies, largely by connecting genes that have been fragmented in the assembly process. PMID:25474019
Extensive error in the number of genes inferred from draft genome assemblies.

PubMed

Denton, James F; Lugo-Martinez, Jose; Tucker, Abraham E; Schrider, Daniel R; Warren, Wesley C; Hahn, Matthew W

2014-12-01

Current sequencing methods produce large amounts of data, but genome assemblies based on these data are often woefully incomplete. These incomplete and error-filled assemblies result in many annotation errors, especially in the number of genes present in a genome. In this paper we investigate the magnitude of the problem, both in terms of total gene number and the number of copies of genes in specific families. To do this, we compare multiple draft assemblies against higher-quality versions of the same genomes, using several new assemblies of the chicken genome based on both traditional and next-generation sequencing technologies, as well as published draft assemblies of chimpanzee. We find that upwards of 40% of all gene families are inferred to have the wrong number of genes in draft assemblies, and that these incorrect assemblies both add and subtract genes. Using simulated genome assemblies of Drosophila melanogaster, we find that the major cause of increased gene numbers in draft genomes is the fragmentation of genes onto multiple individual contigs. Finally, we demonstrate the usefulness of RNA-Seq in improving the gene annotation of draft assemblies, largely by connecting genes that have been fragmented in the assembly process.
Genomic insights into the uncultured genus 'Candidatus Magnetobacterium' in the phylum Nitrospirae.

PubMed

Lin, Wei; Deng, Aihua; Wang, Zhang; Li, Ying; Wen, Tingyi; Wu, Long-Fei; Wu, Martin; Pan, Yongxin

2014-12-01

Magnetotactic bacteria (MTB) of the genus 'Candidatus Magnetobacterium' in phylum Nitrospirae are of great interest because of the formation of hundreds of bullet-shaped magnetite magnetosomes in multiple bundles of chains per cell. These bacteria are worldwide distributed in aquatic environments and have important roles in the biogeochemical cycles of iron and sulfur. However, except for a few short genomic fragments, no genome data are available for this ecologically important genus, and little is known about their metabolic capacity owing to the lack of pure cultures. Here we report the first draft genome sequence of 3.42 Mb from an uncultivated strain tentatively named 'Ca. Magnetobacterium casensis' isolated from Lake Miyun, China. The genome sequence indicates an autotrophic lifestyle using the Wood-Ljungdahl pathway for CO2 fixation, which has not been described in any previously known MTB or Nitrospirae organisms. Pathways involved in the denitrification, sulfur oxidation and sulfate reduction have been predicted, indicating its considerable capacity for adaptation to variable geochemical conditions and roles in local biogeochemical cycles. Moreover, we have identified a complete magnetosome gene island containing mam, mad and a set of novel genes (named as man genes) putatively responsible for the formation of bullet-shaped magnetite magnetosomes and the arrangement of multiple magnetosome chains. This first comprehensive genomic analysis sheds light on the physiology, ecology and biomineralization of the poorly understood 'Ca. Magnetobacterium' genus.
Genome sequence of Aspergillus luchuensis NBRC 4314

PubMed Central

Yamada, Osamu; Machida, Masayuki; Hosoyama, Akira; Goto, Masatoshi; Takahashi, Toru; Futagami, Taiki; Yamagata, Youhei; Takeuchi, Michio; Kobayashi, Tetsuo; Koike, Hideaki; Abe, Keietsu; Asai, Kiyoshi; Arita, Masanori; Fujita, Nobuyuki; Fukuda, Kazuro; Higa, Ken-ichi; Horikawa, Hiroshi; Ishikawa, Takeaki; Jinno, Koji; Kato, Yumiko; Kirimura, Kohtaro; Mizutani, Osamu; Nakasone, Kaoru; Sano, Motoaki; Shiraishi, Yohei; Tsukahara, Masatoshi; Gomi, Katsuya

2016-01-01

Awamori is a traditional distilled beverage made from steamed Thai-Indica rice in Okinawa, Japan. For brewing the liquor, two microbes, local kuro (black) koji mold Aspergillus luchuensis and awamori yeast Saccharomyces cerevisiae are involved. In contrast, that yeasts are used for ethanol fermentation throughout the world, a characteristic of Japanese fermentation industries is the use of Aspergillus molds as a source of enzymes for the maceration and saccharification of raw materials. Here we report the draft genome of a kuro (black) koji mold, A. luchuensis NBRC 4314 (RIB 2604). The total length of nonredundant sequences was nearly 34.7 Mb, comprising approximately 2,300 contigs with 16 telomere-like sequences. In total, 11,691 genes were predicted to encode proteins. Most of the housekeeping genes, such as transcription factors and N-and O-glycosylation system, were conserved with respect to Aspergillus niger and Aspergillus oryzae. An alternative oxidase and acid-stable α-amylase regarding citric acid production and fermentation at a low pH as well as a unique glutamic peptidase were also found in the genome. Furthermore, key biosynthetic gene clusters of ochratoxin A and fumonisin B were absent when compared with A. niger genome, showing the safety of A. luchuensis for food and beverage production. This genome information will facilitate not only comparative genomics with industrial kuro-koji molds, but also molecular breeding of the molds in improvements of awamori fermentation. PMID:27651094

Genomic prediction and genome-wide association analysis of female longevity in a composite beef cattle breed.

PubMed

Hamidi Hay, E; Roberts, A

2017-04-01

Longevity is a highly important trait to the efficiency of beef cattle production. The objective of this study was to evaluate the genomic prediction of longevity and identify genomic regions associated with this trait. The data used in this study consisted of 547 Composite Gene Combination cows (1/2 Red Angus, 1/4 Charolais, 1/4 Tarentaise) born from 2002 to 2011 genotyped with Illumina BovineSNP50 BeadChip. Three models were used to assess genomic prediction: Bayes A, Bayes B and GBLUP using a genomic relationship matrix. To identify genomic regions associated with longevity 2 approaches were adopted: single marker genome wide association and Bayesian approach using GenSel software. The genomic prediction accuracy was low 0.28, 0.25, and 0.22 for Bayes A, Bayes B and GBLUP, respectively. The single-marker genome wide association study (GWAS)identified 5 loci with -value less than 0.05 after false discovery correction: UA-IFASA-7571 on chromosome 19 (58.03 Mb), ARS-BFGL-BAC-15059 on BTA 1 (28.8 Mb), ARS-BFGL-NGS-104159 on BTA3 (29.4 Mb), ARS-BFGL-NGS-32882 on BTA9 (104.07 Mb) and ARS-BFGL-NGS-32883 on BTA25 (33.77 Mb). The Bayesian GWAS yielded 4 genomic regions overlapping with the single marker GWAS results. The region with the highest percentage of genomic variance (3.73%) was detected on chromosome 19. Both GWAS approaches adopted in this study showed evidence for association with various chromosomal locations.
Comparative genomics of Lactobacillus

PubMed Central

Kant, Ravi; Blom, Jochen; Palva, Airi; Siezen, Roland J.; de Vos, Willem M.

2011-01-01

Summary The genus Lactobacillus includes a diverse group of bacteria consisting of many species that are associated with fermentations of plants, meat or milk. In addition, various lactobacilli are natural inhabitants of the intestinal tract of humans and other animals. Finally, several Lactobacillus strains are marketed as probiotics as their consumption can confer a health benefit to host. Presently, 154 Lactobacillus species are known and a growing fraction of these are subject to draft genome sequencing. However, complete genome sequences are needed to provide a platform for detailed genomic comparisons. Therefore, we selected a total of 20 genomes of various Lactobacillus strains for which complete genomic sequences have been reported. These genomes had sizes varying from 1.8 to 3.3 Mb and other characteristic features, such as G+C content that ranged from 33% to 51%. The Lactobacillus pan genome was found to consist of approximately 14 000 protein‐encoding genes while all 20 genomes shared a total of 383 sets of orthologous genes that defined the Lactobacillus core genome (LCG). Based on advanced phylogeny of the proteins encoded by this LCG, we grouped the 20 strains into three main groups and defined core group genes present in all genomes of a single group, signature group genes shared in all genomes of one group but absent in all other Lactobacillus genomes, and Group‐specific ORFans present in core group genes of one group and absent in all other complete genomes. The latter are of specific value in defining the different groups of genomes. The study provides a platform for present individual comparisons as well as future analysis of new Lactobacillus genomes. PMID:21375712
A CRISPR/molecular beacon hybrid system for live-cell genomic imaging.

PubMed

Wu, Xiaotian; Mao, Shiqi; Yang, Yantao; Rushdi, Muaz N; Krueger, Christopher J; Chen, Antony K

2018-04-30

The clustered regularly interspersed short palindromic repeat (CRISPR) gene-editing system has been repurposed for live-cell genomic imaging, but existing approaches rely on fluorescent protein reporters, making sensitive and continuous imaging difficult. Here, we present a fluorophore-based live-cell genomic imaging system that consists of a nuclease-deactivated mutant of the Cas9 protein (dCas9), a molecular beacon (MB), and an engineered single-guide RNA (sgRNA) harboring a unique MB target sequence (sgRNA-MTS), termed CRISPR/MB. Specifically, dCas9 and sgRNA-MTS are first co-expressed to target a specific locus in cells, followed by delivery of MBs that can then hybridize to MTS to illuminate the target locus. We demonstrated the feasibility of this approach for quantifying genomic loci, for monitoring chromatin dynamics, and for dual-color imaging when using two orthogonal MB/MTS pairs. With flexibility in selecting different combinations of fluorophore/quencher pairs and MB/MTS sequences, our CRISPR/MB hybrid system could be a promising platform for investigating chromatin activities.
Draft versus finished sequence data for DNA and protein diagnostic signature development

PubMed Central

Gardner, Shea N.; Lam, Marisa W.; Smith, Jason R.; Torres, Clinton L.; Slezak, Tom R.

2005-01-01

Sequencing pathogen genomes is costly, demanding careful allocation of limited sequencing resources. We built a computational Sequencing Analysis Pipeline (SAP) to guide decisions regarding the amount of genomic sequencing necessary to develop high-quality diagnostic DNA and protein signatures. SAP uses simulations to estimate the number of target genomes and close phylogenetic relatives (near neighbors or NNs) to sequence. We use SAP to assess whether draft data are sufficient or finished sequencing is required using Marburg and variola virus sequences. Simulations indicate that intermediate to high-quality draft with error rates of 10−3–10−5 (∼8× coverage) of target organisms is suitable for DNA signature prediction. Low-quality draft with error rates of ∼1% (3× to 6× coverage) of target isolates is inadequate for DNA signature prediction, although low-quality draft of NNs is sufficient, as long as the target genomes are of high quality. For protein signature prediction, sequencing errors in target genomes substantially reduce the detection of amino acid sequence conservation, even if the draft is of high quality. In summary, high-quality draft of target and low-quality draft of NNs appears to be a cost-effective investment for DNA signature prediction, but may lead to underestimation of predicted protein signatures. PMID:16243783
Genome-Wide Linkage and Association Analysis Identifies Major Gene Loci for Guttural Pouch Tympany in Arabian and German Warmblood Horses

PubMed Central

Metzger, Julia; Ohnesorge, Bernhard; Distl, Ottmar

2012-01-01

Equine guttural pouch tympany (GPT) is a hereditary condition affecting foals in their first months of life. Complex segregation analyses in Arabian and German warmblood horses showed the involvement of a major gene as very likely. Genome-wide linkage and association analyses including a high density marker set of single nucleotide polymorphisms (SNPs) were performed to map the genomic region harbouring the potential major gene for GPT. A total of 85 Arabian and 373 German warmblood horses were genotyped on the Illumina equine SNP50 beadchip. Non-parametric multipoint linkage analyses showed genome-wide significance on horse chromosomes (ECA) 3 for German warmblood at 16–26 Mb and 34–55 Mb and for Arabian on ECA15 at 64–65 Mb. Genome-wide association analyses confirmed the linked regions for both breeds. In Arabian, genome-wide association was detected at 64 Mb within the region with the highest linkage peak on ECA15. For German warmblood, signals for genome-wide association were close to the peak region of linkage at 52 Mb on ECA3. The odds ratio for the SNP with the highest genome-wide association was 0.12 for the Arabian. In conclusion, the refinement of the regions with the Illumina equine SNP50 beadchip is an important step to unravel the responsible mutations for GPT. PMID:22848553
The human genome contracts again.

PubMed

Pavlichin, Dmitri S; Weissman, Tsachy; Yona, Golan

2013-09-01

The number of human genomes that have been sequenced completely for different individuals has increased rapidly in recent years. Storing and transferring complete genomes between computers for the purpose of applying various applications and analysis tools will soon become a major hurdle, hindering the analysis phase. Therefore, there is a growing need to compress these data efficiently. Here, we describe a technique to compress human genomes based on entropy coding, using a reference genome and known Single Nucleotide Polymorphisms (SNPs). Furthermore, we explore several intrinsic features of genomes and information in other genomic databases to further improve the compression attained. Using these methods, we compress James Watson's genome to 2.5 megabytes (MB), improving on recent work by 37%. Similar compression is obtained for most genomes available from the 1000 Genomes Project. Our biologically inspired techniques promise even greater gains for genomes of lower organisms and for human genomes as more genomic data become available. Code is available at sourceforge.net/projects/genomezip/
Draft Genome Sequence of a Picorna-Like Virus Associated with Gill Tissue in Clinically Normal Brook Trout, Salvelinus fontinalis.

PubMed

Iwanowicz, Luke R; Iwanowicz, Deborah D; Adams, Cynthia R; Galbraith, Heather; Aunins, Aaron; Cornman, Robert S

2017-10-12

Here, we report a draft genome sequence of a picorna-like virus associated with brook trout, Salvelinus fontinalis , gill tissue. The draft genome comprises 8,681 nucleotides, excluding the poly(A) tract, and contains two open reading frames. It is most similar to picorna-like viruses that infect invertebrates.
Draft Genome Sequence of Tolypothrix boutellei Strain VB521301

PubMed Central

Chandrababunaidu, Mathu Malar; Singh, Deeksha; Sen, Diya; Bhan, Sushma; Das, Subhadeep; Gupta, Akash

2015-01-01

We report here the draft genome sequence of the filamentous nitrogen-fixing cyanobacterium Tolypothrix boutellei strain VB521301. The organism is lipid rich and hydrophobic and produces polyunsaturated fatty acids which can be harnessed for industrial purpose. The draft genome sequence assembled into 11,572,263 bp with 70 scaffolds and 7,777 protein coding genes. PMID:25700407
Draft genome sequence of a picorna-like virus associated with gill tissue in clinically normal brook trout, Salvelinus fontinalis

USGS Publications Warehouse

Iwanowicz, Luke R.; Iwanowicz, Deborah; Adams, Cynthia; Galbraith, Heather S.; Aunins, Aaron W.; Cornman, Robert S.

2017-01-01

Here, we report a draft genome sequence of a picorna-like virus associated with brook trout, Salvelinus fontinalis, gill tissue. The draft genome comprises 8,681 nucleotides, excluding the poly(A) tract, and contains two open reading frames. It is most similar to picorna-like viruses that infect invertebrates.
Draft Genome Sequence of Solibacillus kalamii, Isolated from an Air Filter Aboard the International Space Station.

PubMed

Seuylemezian, Arman; Singh, Nitin K; Vaishampayan, Parag; Venkateswaran, Kasthuri

2017-08-31

We report here the draft genome of Solibacillus kalamii ISSFR-015, isolated from a high-energy particulate arrestance filter aboard the International Space Station. The draft genome sequence of this strain contains 3,809,180 bp with an estimated G+C content of 38.61%. Copyright © 2017 Seuylemezian et al.
Genome sequence of Phytophthora ramorum: implications for management

Treesearch

Brett Tyler; Sucheta Tripathy; Nik Grunwald; Kurt Lamour; Kelly Ivors; Matteo Garbelotto; Daniel Rokhsar; Nik Putnam; Igor Grigoriev; Jeffrey Boore

2006-01-01

A draft genome sequence has been determined for Phytophthora ramorum, together with a draft sequence of the soybean pathogen Phytophthora sojae. The P. ramorum genome was sequenced to a depth of 7-fold coverage, while the P. sojae genome was sequenced to a depth of 9-fold coverage. The genome...
Genome Sequence and Transcriptome Analyses of Chrysochromulina tobin: Metabolic Tools for Enhanced Algal Fitness in the Prominent Order Prymnesiales (Haptophyceae)

DOE PAGES

Hovde, Blake T.; Deodato, Chloe R.; Hunsperger, Heather M.; ...

2015-09-23

Haptophytes are recognized as seminal players in aquatic ecosystem function. These algae are important in global carbon sequestration, form destructive harmful blooms, and given their rich fatty acid content, serve as a highly nutritive food source to a broad range of eco-cohorts. Haptophyte dominance in both fresh and marine waters is supported by the mixotrophic nature of many taxa. Despite their importance the nuclear genome sequence of only one haptophyte, Emiliania huxleyi (Isochrysidales), is available. Here we report the draft genome sequence of Chrysochromulina tobin (Prymnesiales), and transcriptome data collected at seven time points over a 24-hour light/dark cycle. Themore » nuclear genome of C. tobin is small (59 Mb), compact (~40% of the genome is protein coding) and encodes approximately 16,777 genes. Genes important to fatty acid synthesis, modification, and catabolism show distinct patterns of expression when monitored over the circadian photoperiod. The C. tobin genome harbors the first hybrid polyketide synthase/non-ribosomal peptide synthase gene complex reported for an algal species, and encodes potential anti-microbial peptides and proteins involved in multidrug and toxic compound extrusion. A new haptophyte xanthorhodopsin was also identified, together with two “red” RuBisCO activases that are shared across many algal lineages. In conclusion, the Chrysochromulina tobin genome sequence provides new information on the evolutionary history, ecology and economic importance of haptophytes.« less
Genome Sequence and Transcriptome Analyses of Chrysochromulina tobin: Metabolic Tools for Enhanced Algal Fitness in the Prominent Order Prymnesiales (Haptophyceae)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hovde, Blake T.; Deodato, Chloe R.; Hunsperger, Heather M.

Haptophytes are recognized as seminal players in aquatic ecosystem function. These algae are important in global carbon sequestration, form destructive harmful blooms, and given their rich fatty acid content, serve as a highly nutritive food source to a broad range of eco-cohorts. Haptophyte dominance in both fresh and marine waters is supported by the mixotrophic nature of many taxa. Despite their importance the nuclear genome sequence of only one haptophyte, Emiliania huxleyi (Isochrysidales), is available. Here we report the draft genome sequence of Chrysochromulina tobin (Prymnesiales), and transcriptome data collected at seven time points over a 24-hour light/dark cycle. Themore » nuclear genome of C. tobin is small (59 Mb), compact (~40% of the genome is protein coding) and encodes approximately 16,777 genes. Genes important to fatty acid synthesis, modification, and catabolism show distinct patterns of expression when monitored over the circadian photoperiod. The C. tobin genome harbors the first hybrid polyketide synthase/non-ribosomal peptide synthase gene complex reported for an algal species, and encodes potential anti-microbial peptides and proteins involved in multidrug and toxic compound extrusion. A new haptophyte xanthorhodopsin was also identified, together with two “red” RuBisCO activases that are shared across many algal lineages. In conclusion, the Chrysochromulina tobin genome sequence provides new information on the evolutionary history, ecology and economic importance of haptophytes.« less
Identification of genomic copy number variations associated with specific clinical features of head and neck cancer.

PubMed

Zagradišnik, Boris; Krgović, Danijela; Herodež, Špela Stangler; Zagorac, Andreja; Ćižmarević, Bogdan; Vokač, Nadja Kokalj

2018-01-01

Copy number variations (CNSs) of large genomic regions are an important mechanism implicated in the development of head and neck cancer, however, for most changes their exact role is not well understood. The aim of this study was to find possible associations between gains/losses of genomic regions and clinically distinct subgroups of head and neck cancer patients. Array comparative genomic hybridization (aCGH) analysis was performed on DNA samples in 64 patients with cancer in oral cavity, oropharynx or hypopharynx. Overlapping genomic regions created from gains and losses were used for statistical analysis. Following regions were overrepresented: in tumors with stage I or II a gain of 2.98 Mb on 6p21.2-p11 and a gain of 7.4 Mb on 8q11.1-q11.23; in tumors with grade I histology a gain of 1.1 Mb on 8q24.13, a loss of a large part of p arm of chromosome 3, a loss of a 1.24 Mb on 6q14.3, and a loss of terminal 32 Mb region of 8p23.3; in cases with affected lymph nodes a gain of 0.75 Mb on 3q24, and a gain of 0.9 Mb on 3q26.32-q26.33; in cases with unaffected lymph nodes a gain of 1.1 Mb on 8q23.3, in patients not treated with surgery a gain of 12.2 Mb on 7q21.3-q22.3 and a gain of 0.33 Mb on 20q11.22. Our study identified several genomic regions of interest which appear to be associated with various clinically distinct subgroups of head and neck cancer. They represent a potentially important source of biomarkers useful for the clinical management of head and neck cancer. In particular, the PIK3CA and AGTR1 genes could be singled out to predict the lymph node involvement.
No evidence for extensive horizontal gene transfer in the genome of the tardigrade Hypsibius dujardini

PubMed Central

Koutsovoulos, Georgios; Laetsch, Dominik R.; Stevens, Lewis; Daub, Jennifer; Conlon, Claire; Maroon, Habib; Thomas, Fran; Aboobaker, Aziz A.

2016-01-01

Tardigrades are meiofaunal ecdysozoans that are key to understanding the origins of Arthropoda. Many species of Tardigrada can survive extreme conditions through cryptobiosis. In a recent paper [Boothby TC, et al. (2015) Proc Natl Acad Sci USA 112(52):15976–15981], the authors concluded that the tardigrade Hypsibius dujardini had an unprecedented proportion (17%) of genes originating through functional horizontal gene transfer (fHGT) and speculated that fHGT was likely formative in the evolution of cryptobiosis. We independently sequenced the genome of H. dujardini. As expected from whole-organism DNA sampling, our raw data contained reads from nontarget genomes. Filtering using metagenomics approaches generated a draft H. dujardini genome assembly of 135 Mb with superior assembly metrics to the previously published assembly. Additional microbial contamination likely remains. We found no support for extensive fHGT. Among 23,021 gene predictions we identified 0.2% strong candidates for fHGT from bacteria and 0.2% strong candidates for fHGT from nonmetazoan eukaryotes. Cross-comparison of assemblies showed that the overwhelming majority of HGT candidates in the Boothby et al. genome derived from contaminants. We conclude that fHGT into H. dujardini accounts for at most 1–2% of genes and that the proposal that one-sixth of tardigrade genes originate from functional HGT events is an artifact of undetected contamination. PMID:27035985
No evidence for extensive horizontal gene transfer in the genome of the tardigrade Hypsibius dujardini.

PubMed

Koutsovoulos, Georgios; Kumar, Sujai; Laetsch, Dominik R; Stevens, Lewis; Daub, Jennifer; Conlon, Claire; Maroon, Habib; Thomas, Fran; Aboobaker, Aziz A; Blaxter, Mark

2016-05-03

Tardigrades are meiofaunal ecdysozoans that are key to understanding the origins of Arthropoda. Many species of Tardigrada can survive extreme conditions through cryptobiosis. In a recent paper [Boothby TC, et al. (2015) Proc Natl Acad Sci USA 112(52):15976-15981], the authors concluded that the tardigrade Hypsibius dujardini had an unprecedented proportion (17%) of genes originating through functional horizontal gene transfer (fHGT) and speculated that fHGT was likely formative in the evolution of cryptobiosis. We independently sequenced the genome of H. dujardini As expected from whole-organism DNA sampling, our raw data contained reads from nontarget genomes. Filtering using metagenomics approaches generated a draft H. dujardini genome assembly of 135 Mb with superior assembly metrics to the previously published assembly. Additional microbial contamination likely remains. We found no support for extensive fHGT. Among 23,021 gene predictions we identified 0.2% strong candidates for fHGT from bacteria and 0.2% strong candidates for fHGT from nonmetazoan eukaryotes. Cross-comparison of assemblies showed that the overwhelming majority of HGT candidates in the Boothby et al. genome derived from contaminants. We conclude that fHGT into H. dujardini accounts for at most 1-2% of genes and that the proposal that one-sixth of tardigrade genes originate from functional HGT events is an artifact of undetected contamination.
Genome analysis of Hibiscus syriacus provides insights of polyploidization and indeterminate flowering in woody plants.

PubMed

Kim, Yong-Min; Kim, Seungill; Koo, Namjin; Shin, Ah-Young; Yeom, Seon-In; Seo, Eunyoung; Park, Seong-Jin; Kang, Won-Hee; Kim, Myung-Shin; Park, Jieun; Jang, Insu; Kim, Pan-Gyu; Byeon, Iksu; Kim, Min-Seo; Choi, JinHyuk; Ko, Gunhwan; Hwang, JiHye; Yang, Tae-Jin; Choi, Sang-Bong; Lee, Je Min; Lim, Ki-Byung; Lee, Jungho; Choi, Ik-Young; Park, Beom-Seok; Kwon, Suk-Yoon; Choi, Doil; Kim, Ryan W

2017-02-01

Hibiscus syriacus (L.) (rose of Sharon) is one of the most widespread garden shrubs in the world. We report a draft of the H. syriacus genome comprised of a 1.75 Gb assembly that covers 92% of the genome with only 1.7% (33 Mb) gap sequences. Predicted gene modeling detected 87,603 genes, mostly supported by deep RNA sequencing data. To define gene family distribution among relatives of H. syriacus, orthologous gene sets containing 164,660 genes in 21,472 clusters were identified by OrthoMCL analysis of five plant species, including H. syriacus, Arabidopsis thaliana, Gossypium raimondii, Theobroma cacao and Amborella trichopoda. We inferred their evolutionary relationships based on divergence times among Malvaceae plant genes and found that gene families involved in flowering regulation and disease resistance were more highly divergent and expanded in H. syriacus than in its close relatives, G. raimondii (DD) and T. cacao. Clustered gene families and gene collinearity analysis revealed that two recent rounds of whole-genome duplication were followed by diploidization of the H. syriacus genome after speciation. Copy number variation and phylogenetic divergence indicates that WGDs and subsequent diploidization led to unequal duplication and deletion of flowering-related genes in H. syriacus and may affect its unique floral morphology. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Resequencing of the common marmoset genome improves genome assemblies and gene-coding sequence analysis.

PubMed

Sato, Kengo; Kuroki, Yoko; Kumita, Wakako; Fujiyama, Asao; Toyoda, Atsushi; Kawai, Jun; Iriki, Atsushi; Sasaki, Erika; Okano, Hideyuki; Sakakibara, Yasubumi

2015-11-20

The first draft of the common marmoset (Callithrix jacchus) genome was published by the Marmoset Genome Sequencing and Analysis Consortium. The draft was based on whole-genome shotgun sequencing, and the current assembly version is Callithrix_jacches-3.2.1, but there still exist 187,214 undetermined gap regions and supercontigs and relatively short contigs that are unmapped to chromosomes in the draft genome. We performed resequencing and assembly of the genome of common marmoset by deep sequencing with high-throughput sequencing technology. Several different sequence runs using Illumina sequencing platforms were executed, and 181 Gbp of high-quality bases including mate-pairs with long insert lengths of 3, 8, 20, and 40 Kbp were obtained, that is, approximately 60× coverage. The resequencing significantly improved the MGSAC draft genome sequence. The N50 of the contigs, which is a statistical measure used to evaluate assembly quality, doubled. As a result, 51% of the contigs (total length: 299 Mbp) that were unmapped to chromosomes in the MGSAC draft were merged with chromosomal contigs, and the improved genome sequence helped to detect 5,288 new genes that are homologous to human cDNAs and the gaps in 5,187 transcripts of the Ensembl gene annotations were completely filled.
Draft Genome Sequence of a Picorna-Like Virus Associated with Gill Tissue in Clinically Normal Brook Trout, Salvelinus fontinalis

PubMed Central

2017-01-01

ABSTRACT Here, we report a draft genome sequence of a picorna-like virus associated with brook trout, Salvelinus fontinalis, gill tissue. The draft genome comprises 8,681 nucleotides, excluding the poly(A) tract, and contains two open reading frames. It is most similar to picorna-like viruses that infect invertebrates. PMID:29025930
Draft Genome Sequence of Tolypothrix boutellei Strain VB521301.

PubMed

Chandrababunaidu, Mathu Malar; Singh, Deeksha; Sen, Diya; Bhan, Sushma; Das, Subhadeep; Gupta, Akash; Adhikary, Siba Prasad; Tripathy, Sucheta

2015-02-19

We report here the draft genome sequence of the filamentous nitrogen-fixing cyanobacterium Tolypothrix boutellei strain VB521301. The organism is lipid rich and hydrophobic and produces polyunsaturated fatty acids which can be harnessed for industrial purpose. The draft genome sequence assembled into 11,572,263 bp with 70 scaffolds and 7,777 protein coding genes. Copyright © 2015 Chandrababunaidu et al.

Draft Genome Sequences of 20 Salmonella enterica subsp. enterica Serovar Typhimurium Strains Isolated from Swine in Santa Catarina, Brazil.

PubMed

Seribelli, Amanda Aparecida; Frazão, Miliane Rodrigues; Gonzales, Júlia Cunha; Cao, Guojie; Leon, Maria Sanchez; Kich, Jalusa Deon; Allard, Marc William; Falcão, Juliana Pfrimer

2018-04-19

Salmonellosis is a disease with a high incidence worldwide, and Salmonella enterica subsp. enterica serovar Typhimurium is one of the most clinically important serovars. We report here the draft genome sequences of 20 S. Typhimurium strains isolated from swine in Santa Catarina, Brazil. These draft genomes will improve our understanding of S. Typhimurium in Brazil.
Draft Genome Sequence of Ideonella sp. Strain A 288, Isolated from an Iron-Precipitating Biofilm

PubMed Central

Künzel, Sven; Szewzyk, Ulrich

2017-01-01

ABSTRACT Here, we report the draft genome sequence of the betaproteobacterium Ideonella sp. strain A_228. This isolate, obtained from a bog iron ore-containing floodplain area in Germany, provides valuable information about the genetic diversity of neutrophilic iron-depositing bacteria. The Illumina NextSeq technique was used to sequence the draft genome sequence of the strain. PMID:28818902
Draft Genome Sequence of Lactobacillus reuteri Strain CRL 1098, an Interesting Candidate for Functional Food Development.

PubMed

Torres, Andrea C; Suárez, Nadia E; Font, Graciela; Saavedra, Lucila; Taranto, María Pía

2016-08-25

We report here the draft genome sequence of Lactobacillus reuteri strain CRL 1098. This strain represents an interesting candidate for functional food development because of its proven probiotic properties. The draft genome sequence is composed of 1,969,471 bp assembled into 45 contigs and an average G+C content of 38.8%. Copyright © 2016 Torres et al.
Draft genome of agar-degrading marine bacterium Gilvimarinus agarilyticus JEA5.

PubMed

Lee, Youngdeuk; Lee, Su-Jin; Park, Gun-Hoo; Heo, Soo-Jin; Umasuthan, Navaneethaiyer; Kang, Do-Hyung; Oh, Chulhong

2015-06-01

Gilvimarinus agarilyticus JEA5, which effectively degrades agar, was isolated from the seawater of Jeju Island, Republic of Korea. Here, we report the draft genome sequence of G. agarilyticus JEA5 with a total genome size of 4,179,438bp from 2 scaffolds (21 contigs) with 53.15% G+C content. Various polysaccharidases including 11 predicted agarases were observed from the draft genome of G. agarilyticus JEA5. Copyright © 2015 Elsevier B.V. All rights reserved.
Draft Genome Sequence of Ezakiella peruensis Strain M6.X2, a Human Gut Gram-Positive Anaerobic Coccus.

PubMed

Diop, Awa; Diop, Khoudia; Tomei, Enora; Raoult, Didier; Fenollar, Florence; Fournier, Pierre-Edouard

2018-03-01

We report here the draft genome sequence of Ezakiella peruensis strain M6.X2 T The draft genome is 1,672,788 bp long and harbors 1,589 predicted protein-encoding genes, including 26 antibiotic resistance genes with 1 gene encoding vancomycin resistance. The genome also exhibits 1 clustered regularly interspaced short palindromic repeat region and 333 genes acquired by horizontal gene transfer. Copyright © 2018 Diop et al.
Genome Sequencing of the Phytoseiid Predatory Mite Metaseiulus occidentalis Reveals Completely Atomized Hox Genes and Superdynamic Intron Evolution

PubMed Central

Hoy, Marjorie A.; Waterhouse, Robert M.; Wu, Ke; Estep, Alden S.; Ioannidis, Panagiotis; Palmer, William J.; Pomerantz, Aaron F.; Simão, Felipe A.; Thomas, Jainy; Jiggins, Francis M.; Murphy, Terence D.; Pritham, Ellen J.; Robertson, Hugh M.; Zdobnov, Evgeny M.; Gibbs, Richard A.; Richards, Stephen

2016-01-01

Metaseiulus occidentalis is an eyeless phytoseiid predatory mite employed for the biological control of agricultural pests including spider mites. Despite appearances, these predator and prey mites are separated by some 400 Myr of evolution and radically different lifestyles. We present a 152-Mb draft assembly of the M. occidentalis genome: Larger than that of its favored prey, Tetranychus urticae, but considerably smaller than those of many other chelicerates, enabling an extremely contiguous and complete assembly to be built—the best arachnid to date. Aided by transcriptome data, genome annotation cataloged 18,338 protein-coding genes and identified large numbers of Helitron transposable elements. Comparisons with other arthropods revealed a particularly dynamic and turbulent genomic evolutionary history. Its genes exhibit elevated molecular evolution, with strikingly high numbers of intron gains and losses, in stark contrast to the deer tick Ixodes scapularis. Uniquely among examined arthropods, this predatory mite’s Hox genes are completely atomized, dispersed across the genome, and it encodes five copies of the normally single-copy RNA processing Dicer-2 gene. Examining gene families linked to characteristic biological traits of this tiny predator provides initial insights into processes of sex determination, development, immune defense, and how it detects, disables, and digests its prey. As the first reference genome for the Phytoseiidae, and for any species with the rare sex determination system of parahaploidy, the genome of the western orchard predatory mite improves genomic sampling of chelicerates and provides invaluable new resources for functional genomic analyses of this family of agriculturally important mites. PMID:26951779
De Novo Genome and Transcriptome Assembly of the Canadian Beaver (Castor canadensis).

PubMed

Lok, Si; Paton, Tara A; Wang, Zhuozhi; Kaur, Gaganjot; Walker, Susan; Yuen, Ryan K C; Sung, Wilson W L; Whitney, Joseph; Buchanan, Janet A; Trost, Brett; Singh, Naina; Apresto, Beverly; Chen, Nan; Coole, Matthew; Dawson, Travis J; Ho, Karen; Hu, Zhizhou; Pullenayegum, Sanjeev; Samler, Kozue; Shipstone, Arun; Tsoi, Fiona; Wang, Ting; Pereira, Sergio L; Rostami, Pirooz; Ryan, Carol Ann; Tong, Amy Hin Yan; Ng, Karen; Sundaravadanam, Yogi; Simpson, Jared T; Lim, Burton K; Engstrom, Mark D; Dutton, Christopher J; Kerr, Kevin C R; Franke, Maria; Rapley, William; Wintle, Richard F; Scherer, Stephen W

2017-02-09

The Canadian beaver ( Castor canadensis ) is the largest indigenous rodent in North America. We report a draft annotated assembly of the beaver genome, the first for a large rodent and the first mammalian genome assembled directly from uncorrected and moderate coverage (< 30 ×) long reads generated by single-molecule sequencing. The genome size is 2.7 Gb estimated by k-mer analysis. We assembled the beaver genome using the new Canu assembler optimized for noisy reads. The resulting assembly was refined using Pilon supported by short reads (80 ×) and checked for accuracy by congruency against an independent short read assembly. We scaffolded the assembly using the exon-gene models derived from 9805 full-length open reading frames (FL-ORFs) constructed from the beaver leukocyte and muscle transcriptomes. The final assembly comprised 22,515 contigs with an N50 of 278,680 bp and an N50-scaffold of 317,558 bp. Maximum contig and scaffold lengths were 3.3 and 4.2 Mb, respectively, with a combined scaffold length representing 92% of the estimated genome size. The completeness and accuracy of the scaffold assembly was demonstrated by the precise exon placement for 91.1% of the 9805 assembled FL-ORFs and 83.1% of the BUSCO (Benchmarking Universal Single-Copy Orthologs) gene set used to assess the quality of genome assemblies. Well-represented were genes involved in dentition and enamel deposition, defining characteristics of rodents with which the beaver is well-endowed. The study provides insights for genome assembly and an important genomics resource for Castoridae and rodent evolutionary biology. Copyright © 2017 Lok et al.
De Novo Genome and Transcriptome Assembly of the Canadian Beaver (Castor canadensis)

PubMed Central

Lok, Si; Paton, Tara A.; Wang, Zhuozhi; Kaur, Gaganjot; Walker, Susan; Yuen, Ryan K. C.; Sung, Wilson W. L.; Whitney, Joseph; Buchanan, Janet A.; Trost, Brett; Singh, Naina; Apresto, Beverly; Chen, Nan; Coole, Matthew; Dawson, Travis J.; Ho, Karen; Hu, Zhizhou; Pullenayegum, Sanjeev; Samler, Kozue; Shipstone, Arun; Tsoi, Fiona; Wang, Ting; Pereira, Sergio L.; Rostami, Pirooz; Ryan, Carol Ann; Tong, Amy Hin Yan; Ng, Karen; Sundaravadanam, Yogi; Simpson, Jared T.; Lim, Burton K.; Engstrom, Mark D.; Dutton, Christopher J.; Kerr, Kevin C. R.; Franke, Maria; Rapley, William; Wintle, Richard F.; Scherer, Stephen W.

2017-01-01

The Canadian beaver (Castor canadensis) is the largest indigenous rodent in North America. We report a draft annotated assembly of the beaver genome, the first for a large rodent and the first mammalian genome assembled directly from uncorrected and moderate coverage (< 30 ×) long reads generated by single-molecule sequencing. The genome size is 2.7 Gb estimated by k-mer analysis. We assembled the beaver genome using the new Canu assembler optimized for noisy reads. The resulting assembly was refined using Pilon supported by short reads (80 ×) and checked for accuracy by congruency against an independent short read assembly. We scaffolded the assembly using the exon–gene models derived from 9805 full-length open reading frames (FL-ORFs) constructed from the beaver leukocyte and muscle transcriptomes. The final assembly comprised 22,515 contigs with an N50 of 278,680 bp and an N50-scaffold of 317,558 bp. Maximum contig and scaffold lengths were 3.3 and 4.2 Mb, respectively, with a combined scaffold length representing 92% of the estimated genome size. The completeness and accuracy of the scaffold assembly was demonstrated by the precise exon placement for 91.1% of the 9805 assembled FL-ORFs and 83.1% of the BUSCO (Benchmarking Universal Single-Copy Orthologs) gene set used to assess the quality of genome assemblies. Well-represented were genes involved in dentition and enamel deposition, defining characteristics of rodents with which the beaver is well-endowed. The study provides insights for genome assembly and an important genomics resource for Castoridae and rodent evolutionary biology. PMID:28087693
Comparative Genomics of Field Isolates of Mycobacterium bovis and M. caprae Provides Evidence for Possible Correlates with Bacterial Viability and Virulence.

PubMed

de la Fuente, José; Díez-Delgado, Iratxe; Contreras, Marinela; Vicente, Joaquín; Cabezas-Cruz, Alejandro; Tobes, Raquel; Manrique, Marina; López, Vladimir; Romero, Beatriz; Bezos, Javier; Dominguez, Lucas; Sevilla, Iker A; Garrido, Joseba M; Juste, Ramón; Madico, Guillermo; Jones-López, Edward; Gortazar, Christian

2015-11-01

Mycobacteria of the Mycobacterium tuberculosis complex (MTBC) greatly affect humans and animals worldwide. The life cycle of mycobacteria is complex and the mechanisms resulting in pathogen infection and survival in host cells are not fully understood. Recently, comparative genomics analyses have provided new insights into the evolution and adaptation of the MTBC to survive inside the host. However, most of this information has been obtained using M. tuberculosis but not other members of the MTBC such as M. bovis and M. caprae. In this study, the genome of three M. bovis (MB1, MB3, MB4) and one M. caprae (MB2) field isolates with different lesion score, prevalence and host distribution phenotypes were sequenced. Genome sequence information was used for whole-genome and protein-targeted comparative genomics analysis with the aim of finding correlates with phenotypic variation with potential implications for tuberculosis (TB) disease risk assessment and control. At the whole-genome level the results of the first comparative genomics study of field isolates of M. bovis including M. caprae showed that as previously reported for M. tuberculosis, sequential chromosomal nucleotide substitutions were the main driver of the M. bovis genome evolution. The phylogenetic analysis provided a strong support for the M. bovis/M. caprae clade, but supported M. caprae as a separate species. The comparison of the MB1 and MB4 isolates revealed differences in genome sequence, including gene families that are important for bacterial infection and transmission, thus highlighting differences with functional implications between isolates otherwise classified with the same spoligotype. Strategic protein-targeted analysis using the ESX or type VII secretion system, proteins linking stress response with lipid metabolism, host T cell epitopes of mycobacteria, antigens and peptidoglycan assembly protein identified new genetic markers and candidate vaccine antigens that warrant further study to develop tools to evaluate risks for TB disease caused by M. bovis/M.caprae and for TB control in humans and animals.
The draft genome sequence of Mangrovibacter sp. strain MP23, an endophyte isolated from the roots of Phragmites karka.

PubMed

Behera, Pratiksha; Vaishampayan, Parag; Singh, Nitin K; Mishra, Samir R; Raina, Vishakha; Suar, Mrutyunjay; Pattnaik, Ajit K; Rastogi, Gurdeep

2016-09-01

Till date, only one draft genome has been reported within the genus Mangrovibacter. Here, we report the second draft genome shotgun sequence of a Mangrovibacter sp. strain MP23 that was isolated from the roots of Phargmites karka (P. karka), an invasive weed growing in the Chilika Lagoon, Odisha, India. Strain MP23 is a facultative anaerobic, nitrogen-fixing endophytic bacteria that grows optimally at 37 °C, 7.0 pH, and 1% NaCl concentration. The draft genome sequence of strain MP23 contains 4,947,475 bp with an estimated G + C content of 49.9% and total 4392 protein coding genes. The genome sequence has provided information on putative genes that code for proteins involved in oxidative stress, uptake of nutrients, and nitrogen fixation that might offer niche specific ecological fitness and explain the invasive success of P. karka in Chilika Lagoon. The draft genome sequence and annotation have been deposited at DDBJ/EMBL/GenBank under the accession number LYRP00000000.
A Rickettsia Genome Overrun by Mobile Genetic Elements Provides Insight into the Acquisition of Genes Characteristic of an Obligate Intracellular Lifestyle

PubMed Central

Joardar, Vinita; Williams, Kelly P.; Driscoll, Timothy; Hostetler, Jessica B.; Nordberg, Eric; Shukla, Maulik; Walenz, Brian; Hill, Catherine A.; Nene, Vishvanath M.; Azad, Abdu F.; Sobral, Bruno W.; Caler, Elisabet

2012-01-01

We present the draft genome for the Rickettsia endosymbiont of Ixodes scapularis (REIS), a symbiont of the deer tick vector of Lyme disease in North America. Among Rickettsia species (Alphaproteobacteria: Rickettsiales), REIS has the largest genome sequenced to date (>2 Mb) and contains 2,309 genes across the chromosome and four plasmids (pREIS1 to pREIS4). The most remarkable finding within the REIS genome is the extraordinary proliferation of mobile genetic elements (MGEs), which contributes to a limited synteny with other Rickettsia genomes. In particular, an integrative conjugative element named RAGE (for Rickettsiales amplified genetic element), previously identified in scrub typhus rickettsiae (Orientia tsutsugamushi) genomes, is present on both the REIS chromosome and plasmids. Unlike the pseudogene-laden RAGEs of O. tsutsugamushi, REIS encodes nine conserved RAGEs that include F-like type IV secretion systems similar to that of the tra genes encoded in the Rickettsia bellii and R. massiliae genomes. An unparalleled abundance of encoded transposases (>650) relative to genome size, together with the RAGEs and other MGEs, comprise ∼35% of the total genome, making REIS one of the most plastic and repetitive bacterial genomes sequenced to date. We present evidence that conserved rickettsial genes associated with an intracellular lifestyle were acquired via MGEs, especially the RAGE, through a continuum of genomic invasions. Robust phylogeny estimation suggests REIS is ancestral to the virulent spotted fever group of rickettsiae. As REIS is not known to invade vertebrate cells and has no known pathogenic effects on I. scapularis, its genome sequence provides insight on the origin of mechanisms of rickettsial pathogenicity. PMID:22056929
Comprehensive definition of genome features in Spirodela polyrhiza by high-depth physical mapping and short-read DNA sequencing strategies.

PubMed

Michael, Todd P; Bryant, Douglas; Gutierrez, Ryan; Borisjuk, Nikolai; Chu, Philomena; Zhang, Hanzhong; Xia, Jing; Zhou, Junfei; Peng, Hai; El Baidouri, Moaine; Ten Hallers, Boudewijn; Hastie, Alex R; Liang, Tiffany; Acosta, Kenneth; Gilbert, Sarah; McEntee, Connor; Jackson, Scott A; Mockler, Todd C; Zhang, Weixiong; Lam, Eric

2017-02-01

Spirodela polyrhiza is a fast-growing aquatic monocot with highly reduced morphology, genome size and number of protein-coding genes. Considering these biological features of Spirodela and its basal position in the monocot lineage, understanding its genome architecture could shed light on plant adaptation and genome evolution. Like many draft genomes, however, the 158-Mb Spirodela genome sequence has not been resolved to chromosomes, and important genome characteristics have not been defined. Here we deployed rapid genome-wide physical maps combined with high-coverage short-read sequencing to resolve the 20 chromosomes of Spirodela and to empirically delineate its genome features. Our data revealed a dramatic reduction in the number of the rDNA repeat units in Spirodela to fewer than 100, which is even fewer than that reported for yeast. Consistent with its unique phylogenetic position, small RNA sequencing revealed 29 Spirodela-specific microRNA, with only two being shared with Elaeis guineensis (oil palm) and Musa balbisiana (banana). Combining DNA methylation data and small RNA sequencing enabled the accurate prediction of 20.5% long terminal repeats (LTRs) that doubled the previous estimate, and revealed a high Solo:Intact LTR ratio of 8.2. Interestingly, we found that Spirodela has the lowest global DNA methylation levels (9%) of any plant species tested. Taken together our results reveal a genome that has undergone reduction, likely through eliminating non-essential protein coding genes, rDNA and LTRs. In addition to delineating the genome features of this unique plant, the methodologies described and large-scale genome resources from this work will enable future evolutionary and functional studies of this basal monocot family. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.
Complete genome sequence of Campylobacter concisus ATCC 33237T and draft genome sequences for an additional eight well-characterized C. concisus strains

USDA-ARS?s Scientific Manuscript database

This report includes the complete genome of the Campylobacter concisus type strain ATCC 33237T and the draft genomes of eight additional well characterized C. concisus genomes. C. concisus has been shown to be a genetically heterogeneous species and these nine genomes provide valuable information re...
Draft genome sequence of Thermoanaerobacterium sp. strain PSU-2 isolated from thermophilic hydrogen producing reactor.

PubMed

O-Thong, Sompong; Khongkliang, Peerawat; Mamimin, Chonticha; Singkhala, Apinya; Prasertsan, Poonsuk; Birkeland, Nils-Kåre

2017-06-01

Thermoanaerobacterium sp. strain PSU-2 was isolated from thermophilic hydrogen producing reactor and subjected to draft genome sequencing on 454 pyrosequencing and annotated on RAST. The draft genome sequence of strain PSU-2 contains 2,552,497 bases with an estimated G + C content of 35.2%, 2555 CDS, 8 rRNAs and 57 tRNAs. The strain had a number of genes responsible for carbohydrates metabolic, amino acids and derivatives, and protein metabolism of 17.7%, 14.39% and 9.81%, respectively. Strain PSU-2 also had gene responsible for hydrogen biosynthesis as well as the genes related to Ni-Fe hydrogenase. Comparative genomic analysis indicates strain PSU-2 shares about 94% genome sequence similarity with Thermoanaerobacterium xylanolyticum LX-11. The nucleotide sequence of this draft genome was deposited into DDBJ/ENA/GenBank under the accession MSQD00000000.
Draft Genome Sequences of Clostridium tyrobutyricum Strains FAM22552 and FAM22553, Isolated from Swiss Semihard Red-Smear Cheese

PubMed Central

Wüthrich, Daniel; Bruggmann, Rémy; Berthoud, Hélène; Arias-Roth, Emmanuelle

2015-01-01

Clostridium tyrobutyricum is the main microorganism responsible for late blowing defect in cheeses. Here, we present the draft genome sequences of two C. tyrobutyricum strains isolated from a Swiss semihard red-smear cheese. The two draft genomes comprise 3.05 and 3.08 Mbp and contain 3,030 and 3,089 putative coding sequences, respectively. PMID:25767226
Draft Genome Sequences of blaKPC-Containing Enterobacter aerogenes, Citrobacter freundii, and Citrobacter koseri Strains

PubMed Central

Hazen, Tracy H.; Mettus, Roberta T.; McElheny, Christi L.; Bowler, Sarah L.

2018-01-01

ABSTRACT We report here the draft genome sequences of four blaKPC-containing bacteria identified as Klebsiella aerogenes, Citrobacter freundii, and Citrobacter koseri. Additionally, we report the draft genome sequence of a K. aerogenes strain that did not contain a blaKPC gene but was isolated from the patient who had the blaKPC-2-containing K. aerogenes strain. PMID:29472325
Draft Genome Sequences of blaKPC-Containing Enterobacter aerogenes, Citrobacter freundii, and Citrobacter koseri Strains.

PubMed

Hazen, Tracy H; Mettus, Roberta T; McElheny, Christi L; Bowler, Sarah L; Doi, Yohei; Rasko, David A

2018-02-22

We report here the draft genome sequences of four bla KPC -containing bacteria identified as Klebsiella aerogenes , Citrobacter freundii , and Citrobacter koseri Additionally, we report the draft genome sequence of a K. aerogenes strain that did not contain a bla KPC gene but was isolated from the patient who had the bla KPC-2 -containing K. aerogenes strain. Copyright © 2018 Hazen et al.
Draft Genome Sequence of Agrobacterium sp. Strain UHFBA-218, Isolated from Rhizosphere Soil of Crown Gall-Infected Cherry Rootstock Colt

PubMed Central

Dua, Ankita; Sangwan, Naseer; Kaur, Jasvinder; Saxena, Anjali; Kohli, Puneet; Gupta, A. K.

2013-01-01

We report here the draft genome sequence of the alphaproteobacterium Agrobacterium sp. strain UHFBA-218, which was isolated from rhizosphere soil of crown gall-infected cherry rootstock Colt. The draft genome of strain UHFBA-218 consists of 112 contigs (5,425,303 bp) and 5,063 coding sequences with a G+C content of 59.8%. PMID:23723402
Draft genome sequence of Lactococcus garvieae str. PAQ102015-99, an outbreak strain isolated from a commercial trout farm in the Northwestern United States.

USDA-ARS?s Scientific Manuscript database

We announce the draft genome assembly of Lactococcus garvieae str. PAQ102015-99, a recently isolated strain from an outbreak of lactococcosis at a commercial trout farm in the Northwestern US. The draft genome comprises 14 contigs totaling 2,068,357 bp with an N50 of 496,618 bp and average G+C conte...
The tomato genome sequence provides insight into fleshy fruit evolution

USDA-ARS?s Scientific Manuscript database

The genome of the inbred tomato cultivar ‘Heinz 1706’ was sequenced and assembled using a combination of Sanger and “next generation” technologies. The predicted genome size is ~900 Mb, consistent with prior estimates, of which 760 Mb were assembled in 91 scaffolds aligned to the 12 tomato chromosom...

The Genome of Ganderma lucidum Provide Insights into Triterpense Biosynthesis and Wood Degradation

PubMed Central

Huang, Zhuo; Zhang, Hong-Mei; Liu, Wei; Liu, Le; Ma, Junping; Xia, Zhilan; Chen, Yuxin; Chen, Yuewen; Wang, Depeng; Ni, Peixiang; Guo, An-Yuan; Xiong, Xingyao

2012-01-01

Background Ganoderma lucidum (Reishi or Ling Zhi) is one of the most famous Traditional Chinese Medicines and has been widely used in the treatment of various human diseases in Asia countries. It is also a fungus with strong wood degradation ability with potential in bioenergy production. However, genes, pathways and mechanisms of these functions are still unknown. Methodology/Principal Findings The genome of G. lucidum was sequenced and assembled into a 39.9 megabases (Mb) draft genome, which encoded 12,080 protein-coding genes and ∼83% of them were similar to public sequences. We performed comprehensive annotation for G. lucidum genes and made comparisons with genes in other fungi genomes. Genes in the biosynthesis of the main G. lucidum active ingredients, ganoderic acids (GAs), were characterized. Among the GAs synthases, we identified a fusion gene, the N and C terminal of which are homologous to two different enzymes. Moreover, the fusion gene was only found in basidiomycetes. As a white rot fungus with wood degradation ability, abundant carbohydrate-active enzymes and ligninolytic enzymes were identified in the G. lucidum genome and were compared with other fungi. Conclusions/Significance The genome sequence and well annotation of G. lucidum will provide new insights in function analyses including its medicinal mechanism. The characterization of genes in the triterpene biosynthesis and wood degradation will facilitate bio-engineering research in the production of its active ingredients and bioenergy. PMID:22567134
CIDR

Science.gov Websites

Genotyping General Information Genome Wide Association Custom FFPE Sample Options Methylation Linkage Enrichment Options 51 Mb 51 Mb plus 6.8 - 24Mb custom option 54 Mb Clinical Exome 71 Mb (includes UTRs) Next Generation Sequencing Platform Illumina HiSeq sequencers Options for Formalin-Fixed Paraffin-Embedded (FFPE
Medulloblastoma | Office of Cancer Genomics

Cancer.gov

The Medulloblastoma Project was developed to apply newly emerging genomic methods towards the discovery of novel genetic alterations in medulloblastoma (MB). MB is the most common malignant brain tumor in children, accounting for approximately 20% of all pediatric brain tumors.
Draft Genome Sequence of Roseovarius sp. A-2, an Iodide-Oxidizing Bacterium Isolated from Natural Gas Brine Water, Chiba, Japan.

PubMed

Yuliana, Tri; Nakajima, Nobuyoshi; Yamamura, Shigeki; Tomita, Masaru; Suzuki, Haruo; Amachi, Seigo

2017-01-01

Roseovarius sp. A-2 is a heterotrophic iodide (I - )-oxidizing bacterium isolated from iodide-rich natural gas brine water in Chiba, Japan. This strain oxidizes iodide to molecular iodine (I 2 ) by means of an extracellular multicopper oxidase. Here we report the draft genome sequence of strain A-2. The draft genome contained 46 tRNA genes, 1 copy of a 16S-23S-5S rRNA operon, and 4,514 protein coding DNA sequences, of which 1,207 (27%) were hypothetical proteins. The genome contained a gene encoding IoxA, a multicopper oxidase previously found to catalyze the oxidation of iodide in Iodidimonas sp. Q-1. This draft genome provides detailed insights into the metabolism and potential application of Roseovarius sp. A-2.
Draft genome sequence of two Shingopyxis sp. strains H107 and H115 isolated from a chloraminated drinking water distriburion system simulator

EPA Pesticide Factsheets

Draft genome sequence of two Shingopyxis sp. strains H107 and H115 isolated from a chloraminated drinking water distriburion system simulatorThis dataset is associated with the following publication:Gomez-Alvarez, V., S. Pfaller , and R. Revetta. Draft Genome of Two Sphingopyxis sp. Strains, Dominant Members of the Bacterial Community Associated with a Drinking Water Distribution System Simulator. Genome Announcements. American Society for Microbiology, Washington, DC, USA, 4(2): e00183-16, (2016).
Draft Genome Sequences of Clostridium tyrobutyricum Strains FAM22552 and FAM22553, Isolated from Swiss Semihard Red-Smear Cheese.

PubMed

Storari, Michelangelo; Wüthrich, Daniel; Bruggmann, Rémy; Berthoud, Hélène; Arias-Roth, Emmanuelle

2015-03-12

Clostridium tyrobutyricum is the main microorganism responsible for late blowing defect in cheeses. Here, we present the draft genome sequences of two C. tyrobutyricum strains isolated from a Swiss semihard red-smear cheese. The two draft genomes comprise 3.05 and 3.08 Mbp and contain 3,030 and 3,089 putative coding sequences, respectively. Copyright © 2015 Storari et al.
Draft Genome Sequence of the First New Delhi Metallo-β-Lactamase (NDM-1)-Producing Escherichia coli Strain Isolated in Peru.

PubMed

Tamariz, Jesus; Llanos, Carlos; Seas, Carlos; Montenegro, Paola; Lagos, Jose; Fernandes, Miriam R; Cerdeira, Louise; Lincopan, Nilton

2018-03-29

We present here the draft genome sequence of the first New Delhi metallo-β-lactamase (NDM-1)-producing Escherichia coli strain, belonging to sequence type 155 (ST155), isolated in Peru. Assembly of this draft genome resulted in 5,061,184 bp, revealing a clinically significant resistome for β-lactams, aminoglycosides, tetracyclines, phenicols, sulfonamides, trimethoprim, and fluoroquinolones. Copyright © 2018 Tamariz et al.
The draft genome sequence and annotation of the desert woodrat Neotoma lepida.

PubMed

Campbell, Michael; Oakeson, Kelly F; Yandell, Mark; Halpert, James R; Dearing, Denise

2016-09-01

We present the de novo draft genome sequence for a vertebrate mammalian herbivore, the desert woodrat (Neotoma lepida). This species is of ecological and evolutionary interest with respect to ingestion, microbial detoxification and hepatic metabolism of toxic plant secondary compounds from the highly toxic creosote bush (Larrea tridentata) and the juniper shrub (Juniperus monosperma). The draft genome sequence and annotation have been deposited at GenBank under the accession LZPO01000000.
Draft genome sequence of Staphylococcus aureus KT/312045, an ST1-MSSA PVL positive isolated from pus sample in East Coast Malaysia.

PubMed

Suhaili, Zarizal; Lean, Soo-Sum; Mohamad, Noor Muzamil; Rachman, Abdul R Abdul; Desa, Mohd Nasir Mohd; Yeo, Chew Chieng

2016-09-01

Most of the efforts in elucidating the molecular relatedness and epidemiology of Staphylococcus aureus in Malaysia have been largely focused on methicillin-resistant S. aureus (MRSA). Therefore, here we report the draft genome sequence of the methicillin-susceptible Staphylococcus aureus (MSSA) with sequence type 1 (ST1), spa type t127 with Panton-Valentine Leukocidin (pvl) pathogenic determinant isolated from pus sample designated as KT/314250 strain. The size of the draft genome is 2.86 Mbp with 32.7% of G + C content consisting 2673 coding sequences. The draft genome sequence has been deposited in DDBJ/EMBL/GenBank under the accession number AOCP00000000.
Survey of genome sequences in a wild sweet potato, Ipomoea trifida (H. B. K.) G. Don

PubMed Central

Hirakawa, Hideki; Okada, Yoshihiro; Tabuchi, Hiroaki; Shirasawa, Kenta; Watanabe, Akiko; Tsuruoka, Hisano; Minami, Chiharu; Nakayama, Shinobu; Sasamoto, Shigemi; Kohara, Mitsuyo; Kishida, Yoshie; Fujishiro, Tsunakazu; Kato, Midori; Nanri, Keiko; Komaki, Akiko; Yoshinaga, Masaru; Takahata, Yasuhiro; Tanaka, Masaru; Tabata, Satoshi; Isobe, Sachiko N.

2015-01-01

Ipomoea trifida (H. B. K.) G. Don. is the most likely diploid ancestor of the hexaploid sweet potato, I. batatas (L.) Lam. To assist in analysis of the sweet potato genome, de novo whole-genome sequencing was performed with two lines of I. trifida, namely the selfed line Mx23Hm and the highly heterozygous line 0431-1, using the Illumina HiSeq platform. We classified the sequences thus obtained as either ‘core candidates’ (common to the two lines) or ‘line specific’. The total lengths of the assembled sequences of Mx23Hm (ITR_r1.0) was 513 Mb, while that of 0431-1 (ITRk_r1.0) was 712 Mb. Of the assembled sequences, 240 Mb (Mx23Hm) and 353 Mb (0431-1) were classified into core candidate sequences. A total of 62,407 (62.4 Mb) and 109,449 (87.2 Mb) putative genes were identified, respectively, in the genomes of Mx23Hm and 0431-1, of which 11,823 were derived from core sequences of Mx23Hm, while 28,831 were from the core candidate sequence of 0431-1. There were a total of 1,464,173 single-nucleotide polymorphisms and 16,682 copy number variations (CNVs) in the two assembled genomic sequences (under the condition of log2 ratio of >1 and CNV size >1,000 bases). The results presented here are expected to contribute to the progress of genomic and genetic studies of I. trifida, as well as studies of the sweet potato and the genus Ipomoea in general. PMID:25805887
Draft Nuclear Genome, Complete Chloroplast Genome, and Complete Mitochondrial Genome for the Biofuel/Bioproduct Feedstock Species Scenedesmus obliquus Strain DOE0152z

DOE Office of Scientific and Technical Information (OSTI.GOV)

Starkenburg, S. R.; Polle, J. E. W.; Hovde, B.

ABSTRACT The green alga Scenedesmus obliquus is an emerging platform species for the industrial production of biofuels. Here, we report the draft assembly and annotation for the nuclear, plastid, and mitochondrial genomes of S. obliquus strain DOE0152z.
Draft Nuclear Genome, Complete Chloroplast Genome, and Complete Mitochondrial Genome for the Biofuel/Bioproduct Feedstock Species Scenedesmus obliquus Strain DOE0152z

DOE PAGES

Starkenburg, S. R.; Polle, J. E. W.; Hovde, B.; ...

2017-08-10

ABSTRACT The green alga Scenedesmus obliquus is an emerging platform species for the industrial production of biofuels. Here, we report the draft assembly and annotation for the nuclear, plastid, and mitochondrial genomes of S. obliquus strain DOE0152z.
Draft genome sequence of Enterococcus faecium strain LMG 8148.

PubMed

Michiels, Joran E; Van den Bergh, Bram; Fauvart, Maarten; Michiels, Jan

2016-01-01

Enterococcus faecium, traditionally considered a harmless gut commensal, is emerging as an important nosocomial pathogen showing increasing rates of multidrug resistance. We report the draft genome sequence of E. faecium strain LMG 8148, isolated in 1968 from a human in Gothenburg, Sweden. The draft genome has a total length of 2,697,490 bp, a GC-content of 38.3 %, and 2,402 predicted protein-coding sequences. The isolation of this strain predates the emergence of E. faecium as a nosocomial pathogen. Consequently, its genome can be useful in comparative genomic studies investigating the evolution of E. faecium as a pathogen.
Genome Sequence, Assembly and Characterization of Two Metschnikowia fructicola Strains Used as Biocontrol Agents of Postharvest Diseases

PubMed Central

Piombo, Edoardo; Sela, Noa; Wisniewski, Michael; Hoffmann, Maria; Gullino, Maria L.; Allard, Marc W.; Levin, Elena; Spadaro, Davide; Droby, Samir

2018-01-01

The yeast Metschnikowia fructicola was reported as an efficient biological control agent of postharvest diseases of fruits and vegetables, and it is the bases of the commercial formulated product “Shemer.” Several mechanisms of action by which M. fructicola inhibits postharvest pathogens were suggested including iron-binding compounds, induction of defense signaling genes, production of fungal cell wall degrading enzymes and relatively high amounts of superoxide anions. We assembled the whole genome sequence of two strains of M. fructicola using PacBio and Illumina shotgun sequencing technologies. Using the PacBio, a high-quality draft genome consisting of 93 contigs, with an estimated genome size of approximately 26 Mb, was obtained. Comparative analysis of M. fructicola proteins with the other three available closely related genomes revealed a shared core of homologous proteins coded by 5,776 genes. Comparing the genomes of the two M. fructicola strains using a SNP calling approach resulted in the identification of 564,302 homologous SNPs with 2,004 predicted high impact mutations. The size of the genome is exceptionally high when compared with those of available closely related organisms, and the high rate of homology among M. fructicola genes points toward a recent whole-genome duplication event as the cause of this large genome. Based on the assembled genome, sequences were annotated with a gene description and gene ontology (GO term) and clustered in functional groups. Analysis of CAZymes family genes revealed 1,145 putative genes, and transcriptomic analysis of CAZyme expression levels in M. fructicola during its interaction with either grapefruit peel tissue or Penicillium digitatum revealed a high level of CAZyme gene expression when the yeast was placed in wounded fruit tissue. PMID:29666611
Genome Sequence of the Historical Clinical Isolate Burkholderia pseudomallei PHLS 6

DOE Office of Scientific and Technical Information (OSTI.GOV)

D’haeseleer, Patrik; Johnson, Shannon L.; Davenport, Karen W.

We present the draft genome sequence ofBurkholderia pseudomalleiPHLS 6, a virulent clinical strain isolated from a melioidosis patient in Bangladesh in 1960. This draft genome consists of 39 contigs and is 7,322,181 bp long.
Genome Sequence of the Historical Clinical Isolate Burkholderia pseudomallei PHLS 6

DOE PAGES

D’haeseleer, Patrik; Johnson, Shannon L.; Davenport, Karen W.; ...

2016-06-30

We present the draft genome sequence ofBurkholderia pseudomalleiPHLS 6, a virulent clinical strain isolated from a melioidosis patient in Bangladesh in 1960. This draft genome consists of 39 contigs and is 7,322,181 bp long.
Draft Genome Sequence of Thermus sp. Strain RL, Isolated from a Hot Water Spring Located atop the Himalayan Ranges at Manikaran, India

PubMed Central

Dwivedi, Vatsala; Sangwan, Naseer; Nigam, Aeshna; Garg, Nidhi; Niharika, Neha; Khurana, Paramjit; Khurana, Jitendra P.

2012-01-01

Thermus sp. strain RL was isolated from a hot water spring (90°C to 98°C) at Manikaran, Himachal Pradesh, India. Here we report the draft genome sequence (20,36,600 bp) of this strain. The draft genome sequence consists of 17 contigs and 1,986 protein-coding sequences and has an average G+C content of 68.77%. PMID:22689228
Treatment of cells with alkaline borate buffer extends the capability of interphase FISH mapping.

PubMed

Yokota, H; van den Engh, G; Mostert, M; Trask, B J

1995-01-20

Interphase fluorescence in situ hybridization (FISH) has been shown to be a means to map DNA sequences relative to each other in the 100 kb to 1-2 Mb genomic-separation range. At distances below 0.1 Mb, probe sites are infrequently resolved in interphase chromatin. In the 0.1- to 1-Mb range, interphase chromatin can be modeled as a freely flexible chain. The mean square interphase distance between two probes is proportional to the genomic separation between the probes on the linear DNA molecule. Above 1-2 Mb, the relationship between interphase distance and genomic separation changes abruptly and appears to level off. We have used alkaline-borate treatment to expand the capability of interphase FISH mapping. We show here that alkaline-borate treatment increases nuclear diameter, the interphase distance between probes on homologous chromosomes, and the distance between probes on the same chromosome. We also show that the mean square distance between hybridization sites in borate-treated nuclei is proportional to genomic separation up to 4 Mb. Thus, alkaline-borate treatment enhances the capability of interphase FISH mapping by increasing the absolute distance between probes and extending the range of the simple relationship between interphase distance and genomic separation.
Treatment of cells with alkaline borate buffer extends the capability of interphase FISH mapping

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yokota, H.; Van Den Engh, G.; Mostert, M.

1995-01-20

Interphase fluorescence in situ hybridization (FISH) has been shown to be a means to map DNA sequences relative to each other in the 100 kb to 1-2 Mb genomic-separation range. At distances below 0.1 Mb, probe sites are infrequently resolved in interphase chromatin. In the 0.1- to 1-Mb range, interphase chromatin can be modeled as a freely flexible chain. The mean square interphase distance between two probes is proportional to the genomic separation between the probes on the linear DNA molecule. Above 1-2 Mb, the relationship between interphase distance and genomic separation changes abruptly and appears to level off. Wemore » have used alkaline-borate treatment to expand the capability of interphase FISH mapping. We show here that alkaline-borate treatment increases nuclear diameter, the interphase distance between probes on homologous chromosomes, and the distance between probes on the same chromosome. We also show that the mean square distance between hybridization sites in borate-treated nuclei is proportional to genomic separation up to 4 Mb. Thus, alkaline-borate treatment enhances the capability of interphase FISH mapping by increasing the absolute distance between probes and extending the range of the simple relationship between interphase distance and genomic separation. 31 refs., 5 figs.« less
Estimation of the genome sizes of the chigger mites Leptotrombidium pallidum and Leptotrombidium scutellare based on quantitative PCR and k-mer analysis

PubMed Central

2014-01-01

Background Leptotrombidium pallidum and Leptotrombidium scutellare are the major vector mites for Orientia tsutsugamushi, the causative agent of scrub typhus. Before these organisms can be subjected to whole-genome sequencing, it is necessary to estimate their genome sizes to obtain basic information for establishing the strategies that should be used for genome sequencing and assembly. Method The genome sizes of L. pallidum and L. scutellare were estimated by a method based on quantitative real-time PCR. In addition, a k-mer analysis of the whole-genome sequences obtained through Illumina sequencing was conducted to verify the mutual compatibility and reliability of the results. Results The genome sizes estimated using qPCR were 191 ± 7 Mb for L. pallidum and 262 ± 13 Mb for L. scutellare. The k-mer analysis-based genome lengths were estimated to be 175 Mb for L. pallidum and 286 Mb for L. scutellare. The estimates from these two independent methods were mutually complementary and within a similar range to those of other Acariform mites. Conclusions The estimation method based on qPCR appears to be a useful alternative when the standard methods, such as flow cytometry, are impractical. The relatively small estimated genome sizes should facilitate whole-genome analysis, which could contribute to our understanding of Arachnida genome evolution and provide key information for scrub typhus prevention and mite vector competence. PMID:24947244

Genome reconstructions indicate the partitioning of ecological functions inside a phytoplankton bloom in the Amundsen Sea, Antarctica

PubMed Central

Delmont, Tom O.; Eren, A. Murat; Vineis, Joseph H.; Post, Anton F.

2015-01-01

Antarctica polynyas support intense phytoplankton blooms, impacting their environment by a substantial depletion of inorganic carbon and nutrients. These blooms are dominated by the colony-forming haptophyte Phaeocystis antarctica and they are accompanied by a distinct bacterial population. Yet, the ecological role these bacteria may play in P. antarctica blooms awaits elucidation of their functional gene pool and of the geochemical activities they support. Here, we report on a metagenome (~160 million reads) analysis of the microbial community associated with a P. antarctica bloom event in the Amundsen Sea polynya (West Antarctica). Genomes of the most abundant Bacteroidetes and Proteobacteria populations have been reconstructed and a network analysis indicates a strong functional partitioning of these bacterial taxa. Three of them (SAR92, and members of the Oceanospirillaceae and Cryomorphaceae) are found in close association with P. antarctica colonies. Distinct features of their carbohydrate, nitrogen, sulfur and iron metabolisms may serve to support mutualistic relationships with P. antarctica. The SAR92 genome indicates a specialization in the degradation of fatty acids and dimethylsulfoniopropionate (compounds released by P. antarctica) into dimethyl sulfide, an aerosol precursor. The Oceanospirillaceae genome carries genes that may enhance algal physiology (cobalamin synthesis). Finally, the Cryomorphaceae genome is enriched in genes that function in cell or colony invasion. A novel pico-eukaryote, Micromonas related genome (19.6 Mb, ~94% completion) was also recovered. It contains the gene for an anti-freeze protein, which is lacking in Micromonas at lower latitudes. These draft genomes are representative for abundant microbial taxa across the Southern Ocean surface. PMID:26579075
Comparative Genomics of Field Isolates of Mycobacterium bovis and M. caprae Provides Evidence for Possible Correlates with Bacterial Viability and Virulence

PubMed Central

de la Fuente, José; Díez-Delgado, Iratxe; Contreras, Marinela; Vicente, Joaquín; Cabezas-Cruz, Alejandro; Tobes, Raquel; Manrique, Marina; López, Vladimir; Romero, Beatriz; Bezos, Javier; Dominguez, Lucas; Sevilla, Iker A.; Garrido, Joseba M.; Juste, Ramón; Madico, Guillermo; Jones-López, Edward; Gortazar, Christian

2015-01-01

Mycobacteria of the Mycobacterium tuberculosis complex (MTBC) greatly affect humans and animals worldwide. The life cycle of mycobacteria is complex and the mechanisms resulting in pathogen infection and survival in host cells are not fully understood. Recently, comparative genomics analyses have provided new insights into the evolution and adaptation of the MTBC to survive inside the host. However, most of this information has been obtained using M. tuberculosis but not other members of the MTBC such as M. bovis and M. caprae. In this study, the genome of three M. bovis (MB1, MB3, MB4) and one M. caprae (MB2) field isolates with different lesion score, prevalence and host distribution phenotypes were sequenced. Genome sequence information was used for whole-genome and protein-targeted comparative genomics analysis with the aim of finding correlates with phenotypic variation with potential implications for tuberculosis (TB) disease risk assessment and control. At the whole-genome level the results of the first comparative genomics study of field isolates of M. bovis including M. caprae showed that as previously reported for M. tuberculosis, sequential chromosomal nucleotide substitutions were the main driver of the M. bovis genome evolution. The phylogenetic analysis provided a strong support for the M. bovis/M. caprae clade, but supported M. caprae as a separate species. The comparison of the MB1 and MB4 isolates revealed differences in genome sequence, including gene families that are important for bacterial infection and transmission, thus highlighting differences with functional implications between isolates otherwise classified with the same spoligotype. Strategic protein-targeted analysis using the ESX or type VII secretion system, proteins linking stress response with lipid metabolism, host T cell epitopes of mycobacteria, antigens and peptidoglycan assembly protein identified new genetic markers and candidate vaccine antigens that warrant further study to develop tools to evaluate risks for TB disease caused by M. bovis/M.caprae and for TB control in humans and animals. PMID:26583774
Draft genome of the most devastating insect pest of coffee worldwide: the coffee berry borer, Hypothenemus hampei

USDA-ARS?s Scientific Manuscript database

The coffee berry borer, Hypothenemus hampei, is the most economically important insect pest of coffee worldwide, causing millions of dollars in yearly losses to coffee growers. We present the third genomic analysis for a Coleopteran species, a draft genome of female coffee berry borers. The genome s...
A draft genome sequence of “Candidatus Liberibacter asiaticus” from California, USA

USDA-ARS?s Scientific Manuscript database

The draft genome sequence of “Candidatus Liberibacter asiaticus” strain HHCA, collected from a lemon tree in California, USA, is reported. The HHCA strain has a genome size of 1,118,244 bp, with G+C content of 36.6%. The HHCA genome encodes 1,191 predicted open reading frames and 51 RNA genes....
High-quality permanent draft genome sequence of Bradyrhizobium sp. Th.b2, a microsymbiont of Amphicarpaea bracteata collected in Johnson City, New York

DOE PAGES

Tian, Rui; Parker, Matthew; Seshadri, Rekha; ...

2015-05-16

Bradyrhizobium sp. Th.b2 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen-fixing root nodule of Amphicarpaea bracteata collected in Johnson City, New York. Here we describe the features of Bradyrhizobium sp. Th.b2, together with high-quality permanent draft genome sequence information and annotation. The 10,118,060 high-quality draft genome is arranged in 266 scaffolds of 274 contigs, contains 9,809 protein-coding genes and 108 RNA-only encoding genes. In conclusion, this rhizobial genome was sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.
High-quality permanent draft genome sequence of Bradyrhizobium sp. Th.b2, a microsymbiont of Amphicarpaea bracteata collected in Johnson City, New York

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tian, Rui; Parker, Matthew; Seshadri, Rekha

Bradyrhizobium sp. Th.b2 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen-fixing root nodule of Amphicarpaea bracteata collected in Johnson City, New York. Here we describe the features of Bradyrhizobium sp. Th.b2, together with high-quality permanent draft genome sequence information and annotation. The 10,118,060 high-quality draft genome is arranged in 266 scaffolds of 274 contigs, contains 9,809 protein-coding genes and 108 RNA-only encoding genes. In conclusion, this rhizobial genome was sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.
Draft genome sequence of a multidrug-resistant Aeromonas hydrophila ST508 strain carrying rmtD and blaCTX-M-131 isolated from a bloodstream infection.

PubMed

Moura, Quézia; Fernandes, Miriam R; Cerdeira, Louise; Santos, Ana Carolina M; de Souza, Tiago A; Ienne, Susan; Pignatari, Antonio Carlos C; Gales, Ana C; Silva, Rosa M; Lincopan, Nilton

2017-09-01

Here we report the draft genome sequence of a multidrug-resistant (MDR) Aeromonas hydrophila strain belonging to sequence type 508 (ST508) isolated from a human bloodstream infection. Assembly and annotation of this draft genome resulted in 5028498bp and revealed the presence of 16S rRNA methylase rmtD and bla CTX-M-131 genes encoding high-level resistance to aminoglycosides and cephalosporins, respectively, as well as multiple virulence genes. This draft genome can provide significant information for understanding mechanisms on the establishment and treatment of infections caused by this pathogen. Copyright © 2017 International Society for Chemotherapy of Infection and Cancer. Published by Elsevier Ltd. All rights reserved.
Draft genome of bagasse-degrading bacteria Bacillus aryabhattai GZ03 from deep sea water.

PubMed

Wen, Jian; Ren, Chong; Huang, Nan; Liu, Yang; Zeng, Runying

2015-02-01

Bacillus aryabhattai GZ03 was isolated from deep sea water of the South China Sea, which can produce glucose and fructose by degrading bagasse at 25 °C. Here we report the draft genome sequence of Bacillus aryabhattai GZ03. The data obtained revealed 37 contigs with genome size of 5,105,129 bp and G+C content of 38.09%. The draft genome of B. aryabhattai GZ03 may provide insights into the mechanism of microbial carbohydrate and lignocellulosic material degradation. Copyright © 2014 Elsevier B.V. All rights reserved.
A Complex Structural Variation on Chromosome 27 Leads to the Ectopic Expression of HOXB8 and the Muffs and Beard Phenotype in Chickens

PubMed Central

Wang, Yanqiang; Luo, Chenglong; Liu, Ranran; Qu, Hao; Shu, Dingming; Wen, Jie; Crooijmans, Richard P. M. A.; Zhao, Yiqiang; Hu, Xiaoxiang; Li, Ning

2016-01-01

Muffs and beard (Mb) is a phenotype in chickens where groups of elongated feathers gather from both sides of the face (muffs) and below the beak (beard). It is an autosomal, incomplete dominant phenotype encoded by the Muffs and beard (Mb) locus. Here we use genome-wide association (GWA) analysis, linkage analysis, Identity-by-Descent (IBD) mapping, array-CGH, genome re-sequencing and expression analysis to show that the Mb allele causing the Mb phenotype is a derived allele where a complex structural variation (SV) on GGA27 leads to an altered expression of the gene HOXB8. This Mb allele was shown to be completely associated with the Mb phenotype in nine other independent Mb chicken breeds. The Mb allele differs from the wild-type mb allele by three duplications, one in tandem and two that are translocated to that of the tandem repeat around 1.70 Mb on GGA27. The duplications contain total seven annotated genes and their expression was tested during distinct stages of Mb morphogenesis. A continuous high ectopic expression of HOXB8 was found in the facial skin of Mb chickens, strongly suggesting that HOXB8 directs this regional feather-development. In conclusion, our results provide an interesting example of how genomic structural rearrangements alter the regulation of genes leading to novel phenotypes. Further, it again illustrates the value of utilizing derived phenotypes in domestic animals to dissect the genetic basis of developmental traits, herein providing novel insights into the likely role of HOXB8 in feather development and differentiation. PMID:27253709
The genome of black raspberry (Rubus occidentalis).

PubMed

VanBuren, Robert; Bryant, Doug; Bushakra, Jill M; Vining, Kelly J; Edger, Patrick P; Rowley, Erik R; Priest, Henry D; Michael, Todd P; Lyons, Eric; Filichkin, Sergei A; Dossett, Michael; Finn, Chad E; Bassil, Nahla V; Mockler, Todd C

2016-09-01

Black raspberry (Rubus occidentalis) is an important specialty fruit crop in the US Pacific Northwest that can hybridize with the globally commercialized red raspberry (R. idaeus). Here we report a 243 Mb draft genome of black raspberry that will serve as a useful reference for the Rosaceae and Rubus fruit crops (raspberry, blackberry, and their hybrids). The black raspberry genome is largely collinear to the diploid woodland strawberry (Fragaria vesca) with a conserved karyotype and few notable structural rearrangements. Centromeric satellite repeats are widely dispersed across the black raspberry genome, in contrast to the tight association with the centromere observed in most plants. Among the 28 005 predicted protein-coding genes, we identified 290 very recent small-scale gene duplicates enriched for sugar metabolism, fruit development, and anthocyanin related genes which may be related to key agronomic traits during black raspberry domestication. This contrasts patterns of recent duplications in the wild woodland strawberry F. vesca, which show no patterns of enrichment, suggesting gene duplications contributed to domestication traits. Expression profiles from a fruit ripening series and roots exposed to Verticillium dahliae shed insight into fruit development and disease response, respectively. The resources presented here will expedite the development of improved black and red raspberry, blackberry and other Rubus cultivars. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.
Draft Genome Sequencing and Comparative Analysis of Aspergillus sojae NBRC4239

PubMed Central

Sato, Atsushi; Oshima, Kenshiro; Noguchi, Hideki; Ogawa, Masahiro; Takahashi, Tadashi; Oguma, Tetsuya; Koyama, Yasuji; Itoh, Takehiko; Hattori, Masahira; Hanya, Yoshiki

2011-01-01

We conducted genome sequencing of the filamentous fungus Aspergillus sojae NBRC4239 isolated from the koji used to prepare Japanese soy sauce. We used the 454 pyrosequencing technology and investigated the genome with respect to enzymes and secondary metabolites in comparison with other Aspergilli sequenced. Assembly of 454 reads generated a non-redundant sequence of 39.5-Mb possessing 13 033 putative genes and 65 scaffolds composed of 557 contigs. Of the 2847 open reading frames with Pfam domain scores of >150 found in A. sojae NBRC4239, 81.7% had a high degree of similarity with the genes of A. oryzae. Comparative analysis identified serine carboxypeptidase and aspartic protease genes unique to A. sojae NBRC4239. While A. oryzae possessed three copies of α-amyalse gene, A. sojae NBRC4239 possessed only a single copy. Comparison of 56 gene clusters for secondary metabolites between A. sojae NBRC4239 and A. oryzae revealed that 24 clusters were conserved, whereas 32 clusters differed between them that included a deletion of 18 508 bp containing mfs1, mao1, dmaT, and pks-nrps for the cyclopiazonic acid (CPA) biosynthesis, explaining the no productivity of CPA in A. sojae. The A. sojae NBRC4239 genome data will be useful to characterize functional features of the koji moulds used in Japanese industries. PMID:21659486
Novel genes related to nodulation, secretion systems, and surface structures revealed by a genome draft of Rhizobium tropici strain PRF 81.

PubMed

Pinto, Fabiana G S; Chueire, Ligia M O; Vasconcelos, Ana Tereza R; Nicolás, Marisa F; Almeida, Luiz G P; Souza, Rangel C; Menna, Pâmela; Barcellos, Fernando G; Megías, Manuel; Hungria, Mariangela

2009-05-01

Rhizobium tropici is representative of the diversity of tropical rhizobia, besides comprising strains very effective in fixing N(2) in symbiosis with the common bean (Phaseolus vulgaris L.). The genome of a Brazilian commercial inoculant R. tropici strain (PRF 81, =SEMIA 4088), estimated at 7.85 Mb, was analyzed through a total of 9,026 shotgun reads, assembled in 1,668 phrap contigs, and covering approximately 30% of the genome. Annotation identified 2,135 coding DNA sequences (CDS), and only 57.2% have possible functions. The genome comprises a mosaic of genes, with CDS showing the highest similarities with 134 microorganisms, none of which represents more than 19% of the CDS with putative known functions. The high saprophytic capacity of PRF 81 may reside in a variety of genes related to transport, biodegradation of xenobiotics, defense, and secretion proteins, many of which were reported for the first time in the present study. Novelty was also found in nodulation (nodG, a double nodIJ system, nodT, nolF, nolG) and capsular polysaccharide genes, showing stronger similarities with Sinorhizobium (=Ensifer) than with the main symbionts of the common bean -- R. etli and R. leguminosarum -- suggesting that the original host of R. tropici might be another tropical legume or emphasizing the highly promiscuous nature of this rhizobial species.
The bottle gourd genome provides insights into Cucurbitaceae evolution and facilitates mapping of a Papaya ringspot virus resistance locus

USDA-ARS?s Scientific Manuscript database

Bottle gourd (Lagenaria siceraria) is an important vegetable crop as well as a rootstock for other cucurbit crops. In this study, we report a high-quality 313.4-Mb genome sequence of a bottle gourd inbred line, USVL1VR-Ls, with a scaffold N50 of 8.7 Mb and the longest of 19.0 Mb. About 98.3% of the ...
Permanent draft genome sequence of Desulfurococcus mobilis type strain DSM 2161, a thermoacidophilic sulfur-reducing crenarchaeon isolated from acidic hot springs of Hveravellir, Iceland.

PubMed

Susanti, Dwi; Johnson, Eric F; Lapidus, Alla; Han, James; Reddy, T B K; Pilay, Manoj; Ivanova, Natalia N; Markowitz, Victor M; Woyke, Tanja; Kyrpides, Nikos C; Mukhopadhyay, Biswarup

2016-01-01

This report presents the permanent draft genome sequence of Desulfurococcus mobilis type strain DSM 2161, an obligate anaerobic hyperthermophilic crenarchaeon that was isolated from acidic hot springs in Hveravellir, Iceland. D. mobilis utilizes peptides as carbon and energy sources and reduces elemental sulfur to H2S. A metabolic construction derived from the draft genome identified putative pathways for peptide degradation and sulfur respiration in this archaeon. Existence of several hydrogenase genes in the genome supported previous findings that H2 is produced during the growth of D. mobilis in the absence of sulfur. Interestingly, genes encoding glucose transport and utilization systems also exist in the D. mobilis genome though this archaeon does not utilize carbohydrate for growth. The draft genome of D. mobilis provides an additional mean for comparative genomic analysis of desulfurococci. In addition, our analysis on the Average Nucleotide Identity between D. mobilis and Desulfurococcus mucosus suggested that these two desulfurococci are two different strains of the same species.
Permanent draft genome sequence of Desulfurococcus mobilis type strain DSM 2161, a thermoacidophilic sulfur-reducing crenarchaeon isolated from acidic hot springs of Hveravellir, Iceland

DOE Office of Scientific and Technical Information (OSTI.GOV)

Susanti, Dwi; Johnson, Eric F.; Lapidus, Alla

Our report presents the permanent draft genome sequence of Desulfurococcus mobilis type strain DSM 2161, an obligate anaerobic hyperthermophilic crenarchaeon that was isolated from acidic hot springs in Hveravellir, Iceland. D. mobilis utilizes peptides as carbon and energy sources and reduces elemental sulfur to H 2S. A metabolic construction derived from the draft genome identified putative pathways for peptide degradation and sulfur respiration in this archaeon. Existence of several hydrogenase genes in the genome supported previous findings that H 2 is produced during the growth of D. mobilis in the absence of sulfur. Interestingly, genes encoding glucose transport and utilizationmore » systems also exist in the D. mobilis genome though this archaeon does not utilize carbohydrate for growth. The draft genome of D. mobilis provides an additional mean for comparative genomic analysis of desulfurococci. In addition, our analysis on the Average Nucleotide Identity between D. mobilis and Desulfurococcus mucosus suggested that these two desulfurococci are two different strains of the same species.« less
Permanent draft genome sequence of Desulfurococcus mobilis type strain DSM 2161, a thermoacidophilic sulfur-reducing crenarchaeon isolated from acidic hot springs of Hveravellir, Iceland

DOE PAGES

Susanti, Dwi; Johnson, Eric F.; Lapidus, Alla; ...

2016-01-13

Our report presents the permanent draft genome sequence of Desulfurococcus mobilis type strain DSM 2161, an obligate anaerobic hyperthermophilic crenarchaeon that was isolated from acidic hot springs in Hveravellir, Iceland. D. mobilis utilizes peptides as carbon and energy sources and reduces elemental sulfur to H 2S. A metabolic construction derived from the draft genome identified putative pathways for peptide degradation and sulfur respiration in this archaeon. Existence of several hydrogenase genes in the genome supported previous findings that H 2 is produced during the growth of D. mobilis in the absence of sulfur. Interestingly, genes encoding glucose transport and utilizationmore » systems also exist in the D. mobilis genome though this archaeon does not utilize carbohydrate for growth. The draft genome of D. mobilis provides an additional mean for comparative genomic analysis of desulfurococci. In addition, our analysis on the Average Nucleotide Identity between D. mobilis and Desulfurococcus mucosus suggested that these two desulfurococci are two different strains of the same species.« less
Draft genome sequences of Actinomyces timonensis strain 7400942T and its prophage.

PubMed

Gorlas, Aurore; Gimenez, Grégory; Raoult, Didier; Roux, Véronique

2012-12-01

A draft genome sequence of Actinomyces timonensis, an anaerobic bacterium isolated from a human clinical osteoarticular sample, is described here. CRISPR-associated proteins, insertion sequence, and toxin-antitoxin loci were found on the genome. A new virus or provirus, AT-1, was characterized.
Draft genome sequence of rice orange leaf phytoplasma from Guangdong, China

USDA-ARS?s Scientific Manuscript database

The genome of rice orange leaf phytoplasma strain LD1 from Luoding City, Guangdong, P. R. China, was sequenced. The draft LD1genome is 599,264 bp with GC content of 28.2%, 647 predicted open reading frames and 33 RNA genes....
Draft Genome Sequences of Acinetobacter and Bacillus Strains Isolated from Spacecraft-Associated Surfaces

PubMed Central

Seuylemezian, Arman; Vaishampayan, Parag; Cooper, Kerry

2018-01-01

ABSTRACT We report here the draft genome sequences of four strains isolated from spacecraft-associated surfaces exhibiting increased resistance to stressors such as UV radiation and exposure to H2O2. The draft genomes of strains 1P01SCT, FO-92T, 50v1, and 2P01AA had sizes of 5,500,894 bp, 4,699,376 bp, 3,174,402 bp, and 4,328,804 bp, respectively. PMID:29439046
Draft genome sequence of Mycobacterium tuberculosis strain B9741 of Beijing B0/W lineage from HIV positive patient from Siberia.

PubMed

Shur, K V; Zaychikova, M V; Mikheecheva, N E; Klimina, K M; Bekker, O B; Zhdanova, S N; Ogarkov, O B; Danilenko, V N

2016-12-01

We report a draft genome sequence of Mycobacterium tuberculosis strain B9741 belonging to Beijing B0/W lineage isolated from a HIV patient from Siberia, Russia. This clinical isolate showed MDR phenotype and resistance to isoniazid, rifampin, streptomycin and pyrazinamide. We analyzed SNPs associated with virulence and resistance. The draft genome sequence and annotation have been deposited at GenBank under the accession NZ_LVJJ00000000.

Draft Genome Sequence of Candida pseudohaemulonii Isolated from the Blood of a Neutropenic Patient.

PubMed

Mohd Tap, Ratna; Kamarudin, Nur Amalina; Ginsapu, Stephanie Jane; Ahmed Bakri, Ahmed Rafezzan; Ahmad, Norazah; Amran, Fairuz; Sipiczki, Matthias

2018-04-05

Candida pseudohaemulonii is phylogenetically close to the C. haemulonii complex and exhibits resistance to amphotericin B and azole agents. We report here the draft genome sequence of C. pseudohaemulonii UZ153_17 isolated from the blood culture of a neutropenic patient. The draft genome is 3,532,003,666 bp in length, with 579,838 reads, 130 contigs, and a G+C content of 47.15%. Copyright © 2018 Mohd Tap et al.
Methylobacterium genome sequences: a reference blueprint to investigate microbial metabolism of C1 compounds from natural and industrial sources.

PubMed

Vuilleumier, Stéphane; Chistoserdova, Ludmila; Lee, Ming-Chun; Bringel, Françoise; Lajus, Aurélie; Zhou, Yang; Gourion, Benjamin; Barbe, Valérie; Chang, Jean; Cruveiller, Stéphane; Dossat, Carole; Gillett, Will; Gruffaz, Christelle; Haugen, Eric; Hourcade, Edith; Levy, Ruth; Mangenot, Sophie; Muller, Emilie; Nadalig, Thierry; Pagni, Marco; Penny, Christian; Peyraud, Rémi; Robinson, David G; Roche, David; Rouy, Zoé; Saenampechek, Channakhone; Salvignol, Grégory; Vallenet, David; Wu, Zaining; Marx, Christopher J; Vorholt, Julia A; Olson, Maynard V; Kaul, Rajinder; Weissenbach, Jean; Médigue, Claudine; Lidstrom, Mary E

2009-01-01

Methylotrophy describes the ability of organisms to grow on reduced organic compounds without carbon-carbon bonds. The genomes of two pink-pigmented facultative methylotrophic bacteria of the Alpha-proteobacterial genus Methylobacterium, the reference species Methylobacterium extorquens strain AM1 and the dichloromethane-degrading strain DM4, were compared. The 6.88 Mb genome of strain AM1 comprises a 5.51 Mb chromosome, a 1.26 Mb megaplasmid and three plasmids, while the 6.12 Mb genome of strain DM4 features a 5.94 Mb chromosome and two plasmids. The chromosomes are highly syntenic and share a large majority of genes, while plasmids are mostly strain-specific, with the exception of a 130 kb region of the strain AM1 megaplasmid which is syntenic to a chromosomal region of strain DM4. Both genomes contain large sets of insertion elements, many of them strain-specific, suggesting an important potential for genomic plasticity. Most of the genomic determinants associated with methylotrophy are nearly identical, with two exceptions that illustrate the metabolic and genomic versatility of Methylobacterium. A 126 kb dichloromethane utilization (dcm) gene cluster is essential for the ability of strain DM4 to use DCM as the sole carbon and energy source for growth and is unique to strain DM4. The methylamine utilization (mau) gene cluster is only found in strain AM1, indicating that strain DM4 employs an alternative system for growth with methylamine. The dcm and mau clusters represent two of the chromosomal genomic islands (AM1: 28; DM4: 17) that were defined. The mau cluster is flanked by mobile elements, but the dcm cluster disrupts a gene annotated as chelatase and for which we propose the name "island integration determinant" (iid). These two genome sequences provide a platform for intra- and interspecies genomic comparisons in the genus Methylobacterium, and for investigations of the adaptive mechanisms which allow bacterial lineages to acquire methylotrophic lifestyles.
Draft Genome Sequence of Aspergillus oryzae ATCC 12892

DOE Office of Scientific and Technical Information (OSTI.GOV)

Deng, Shuang; Pomraning, Kyle R.; Bohutskyi, Pavlo

The draft genome sequence ofAspergillus oryzaeATCC 12892 is presented here.A. oryzaeproduces 3-nitropropionic acid, which has been investigated with regard to understanding the biosynthesis of nitroorganic compounds.
Draft Genome Sequence of Mycobacterium asiaticum Strain DSM 44297.

PubMed

Croce, Olivier; Robert, Catherine; Raoult, Didier; Drancourt, Michel

2014-04-17

We report the draft genome sequence of Mycobacterium asiaticum strain DSM 44297, a tropical mycobacterium seldom responsible for human infection. The genome of M. asiaticum has a size of 5,935,986 bp, with a 66.03% G+C content, encoding 5,591 proteins and 81 RNAs.
Reconstruction of a Nearly Complete Pseudomonas Draft Genome Sequence from a Coalbed Methane-Produced Water Metagenome

DOE PAGES

Ross, Daniel E.; Gulliver, Djuna

2016-10-06

The draft genome sequence ofPseudomonas stutzeristrain K35 was separated from a metagenome derived from a produced water microbial community of a coalbed methane well. The genome encodes a complete nitrogen fixation pathway and the upper and lower naphthalene degradation pathways.
Reconstruction of a Nearly Complete Pseudomonas Draft Genome Sequence from a Coalbed Methane-Produced Water Metagenome

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ross, Daniel E.; Gulliver, Djuna

The draft genome sequence ofPseudomonas stutzeristrain K35 was separated from a metagenome derived from a produced water microbial community of a coalbed methane well. The genome encodes a complete nitrogen fixation pathway and the upper and lower naphthalene degradation pathways.
Draft Genome Sequence of Streptococcus orisasini SH06, Isolated from a Healthy Thoroughbred Gastrointestinal Tract.

PubMed

Takagi, Misako; Nakano, Akiyo; Toh, Hidehiro; Oshima, Kenshiro; Arakawa, Kensuke; Nakajima, Fumihiko; Tashiro, Kosuke; Kikusui, Tekefumi; Yanagida, Fujitoshi; Morita, Hidetoshi

2016-01-14

Streptococcus orisasini SH06 was isolated from a healthy thoroughbred gastrointestinal tract. Here, we report the draft genome sequence of this organism. This paper is the first published report of the genomic sequence of S. orisasini. Copyright © 2016 Takagi et al.
Draft Genome Sequence of Lactobacillus panis DSM 6035T, First Isolated from Sourdough

PubMed Central

Zhu, Yixin; Fang, Daiqiong; Shi, Ding; Li, Ang; Lv, Longxian; Yan, Ren; Yao, Jian; Hua, Dasong; Hu, Xinjun; Guo, Feifei; Wu, Wenrui; Guo, Jing; Chen, Yanfei; Jiang, Xiawei; Chen, Xiaoxiao

2015-01-01

We report a draft genome sequence of Lactobacillus panis DSM 6035T, isolated from sourdough. The genome of this strain is 2,082,789 bp long, with 47.9% G+C content. A total of 2,047 protein-coding genes were predicted. PMID:26205855
A Targeted Capture Linkage Map Anchors the Genome of the Schistosomiasis Vector Snail, Biomphalaria glabrata.

PubMed

Tennessen, Jacob A; Bollmann, Stephanie R; Blouin, Michael S

2017-07-05

The aquatic planorbid snail Biomphalaria glabrata is one of the most intensively-studied mollusks due to its role in the transmission of schistosomiasis. Its 916 Mb genome has recently been sequenced and annotated, but it remains poorly assembled. Here, we used targeted capture markers to map over 10,000 B. glabrata scaffolds in a linkage cross of 94 F1 offspring, generating 24 linkage groups (LGs). We added additional scaffolds to these LGs based on linkage disequilibrium (LD) analysis of targeted capture and whole-genome sequences of 96 unrelated snails. Our final linkage map consists of 18,613 scaffolds comprising 515 Mb, representing 56% of the genome and 75% of genic and nonrepetitive regions. There are 18 large (> 10 Mb) LGs, likely representing the expected 18 haploid chromosomes, and > 50% of the genome has been assigned to LGs of at least 17 Mb. Comparisons with other gastropod genomes reveal patterns of synteny and chromosomal rearrangements. Linkage relationships of key immune-relevant genes may help clarify snail-schistosome interactions. By focusing on linkage among genic and nonrepetitive regions, we have generated a useful resource for associating snail phenotypes with causal genes, even in the absence of a complete genome assembly. A similar approach could potentially improve numerous poorly-assembled genomes in other taxa. This map will facilitate future work on this host of a serious human parasite. Copyright © 2017 Tennessen et al.
Draft genome sequence of Xylella fastidiosa pear leaf scorch strain in Taiwan

USDA-ARS?s Scientific Manuscript database

The draft genome sequence of Xylella fastidiosa pear leaf scorch strain (PLS229) isolated from pear cultivar Hengshan (Pyrus pyrifolia) in Taiwan is reported. The bacterium has a genome size of 2,733,013 bp with a G+C content of 53.1%. The PLS229 strain genome was annotated to have 3,259 open readin...
ARKS: chromosome-scale scaffolding of human genome drafts with linked read kmers.

PubMed

Coombe, Lauren; Zhang, Jessica; Vandervalk, Benjamin P; Chu, Justin; Jackman, Shaun D; Birol, Inanc; Warren, René L

2018-06-20

The long-range sequencing information captured by linked reads, such as those available from 10× Genomics (10xG), helps resolve genome sequence repeats, and yields accurate and contiguous draft genome assemblies. We introduce ARKS, an alignment-free linked read genome scaffolding methodology that uses linked reads to organize genome assemblies further into contiguous drafts. Our approach departs from other read alignment-dependent linked read scaffolders, including our own (ARCS), and uses a kmer-based mapping approach. The kmer mapping strategy has several advantages over read alignment methods, including better usability and faster processing, as it precludes the need for input sequence formatting and draft sequence assembly indexing. The reliance on kmers instead of read alignments for pairing sequences relaxes the workflow requirements, and drastically reduces the run time. Here, we show how linked reads, when used in conjunction with Hi-C data for scaffolding, improve a draft human genome assembly of PacBio long-read data five-fold (baseline vs. ARKS NG50 = 4.6 vs. 23.1 Mbp, respectively). We also demonstrate how the method provides further improvements of a megabase-scale Supernova human genome assembly (NG50 = 14.74 Mbp vs. 25.94 Mbp before and after ARKS), which itself exclusively uses linked read data for assembly, with an execution speed six to nine times faster than competitive linked read scaffolders (~ 10.5 h compared to 75.7 h, on average). Following ARKS scaffolding of a human genome 10xG Supernova assembly (of cell line NA12878), fewer than 9 scaffolds cover each chromosome, except the largest (chromosome 1, n = 13). ARKS uses a kmer mapping strategy instead of linked read alignments to record and associate the barcode information needed to order and orient draft assembly sequences. The simplified workflow, when compared to that of our initial implementation, ARCS, markedly improves run time performances on experimental human genome datasets. Furthermore, the novel distance estimator in ARKS utilizes barcoding information from linked reads to estimate gap sizes. It accomplishes this by modeling the relationship between known distances of a region within contigs and calculating associated Jaccard indices. ARKS has the potential to provide correct, chromosome-scale genome assemblies, promptly. We expect ARKS to have broad utility in helping refine draft genomes.
Gleaning evolutionary insights from the genome sequence of a probiotic yeast Saccharomyces boulardii

PubMed Central

2013-01-01

Background The yeast Saccharomyces boulardii is used worldwide as a probiotic to alleviate the effects of several gastrointestinal diseases and control antibiotics-associated diarrhea. While many studies report the probiotic effects of S. boulardii, no genome information for this yeast is currently available in the public domain. Results We report the 11.4 Mbp draft genome of this probiotic yeast. The draft genome was obtained by assembling Roche 454 FLX + shotgun data into 194 contigs with an N50 of 251 Kbp. We compare our draft genome with all other Saccharomyces cerevisiae genomes. Conclusions Our analysis confirms the close similarity of S. boulardii to S. cerevisiae strains and provides a framework to understand the probiotic effects of this yeast, which exhibits unique physiological and metabolic properties. PMID:24148866
High-quality permanent draft genome sequence of Bradyrhizobium sp. Tv2a.2, a microsymbiont of Tachigali versicolor discovered in Barro Colorado Island of Panama

DOE PAGES

Tian, Rui; Parker, Matthew; Seshadri, Rekha; ...

2015-05-17

Bradyrhizobiumsp. Tv2a.2 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen-fixing root nodule of Tachigali versicolor collected in Barro Colorado Island of Panama. Here we describe the features of Bradyrhizobiumsp. Tv2a.2, together with high-quality permanent draft genome sequence information and annotation. The 8,496,279 bp high-quality draft genome is arranged in 87 scaffolds of 87 contigs, contains 8,109 protein-coding genes and 72 RNA-only encoding genes. In conclusion, this rhizobial genome was sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.
Gleaning evolutionary insights from the genome sequence of a probiotic yeast Saccharomyces boulardii.

PubMed

Khatri, Indu; Akhtar, Akil; Kaur, Kamaldeep; Tomar, Rajul; Prasad, Gandham Satyanarayana; Ramya, Thirumalai Nallan Chakravarthy; Subramanian, Srikrishna

2013-10-22

The yeast Saccharomyces boulardii is used worldwide as a probiotic to alleviate the effects of several gastrointestinal diseases and control antibiotics-associated diarrhea. While many studies report the probiotic effects of S. boulardii, no genome information for this yeast is currently available in the public domain. We report the 11.4 Mbp draft genome of this probiotic yeast. The draft genome was obtained by assembling Roche 454 FLX + shotgun data into 194 contigs with an N50 of 251 Kbp. We compare our draft genome with all other Saccharomyces cerevisiae genomes. Our analysis confirms the close similarity of S. boulardii to S. cerevisiae strains and provides a framework to understand the probiotic effects of this yeast, which exhibits unique physiological and metabolic properties.
Phase III Archives | NOAA Gulf Spill Restoration

Science.gov Websites

III Early Restoration Plan and Draft Early Restoration PEIS Executive Summary (pdf, 3.4 MB) Project Summary Table (pdf, 80 KB) Public Repositories (pdf, 113 KB) Press Release (pdf, 501 KB) Press Release
MBGD update 2015: microbial genome database for flexible ortholog analysis utilizing a diverse set of genomic data.

PubMed

Uchiyama, Ikuo; Mihara, Motohiro; Nishide, Hiroyo; Chiba, Hirokazu

2015-01-01

The microbial genome database for comparative analysis (MBGD) (available at http://mbgd.genome.ad.jp/) is a comprehensive ortholog database for flexible comparative analysis of microbial genomes, where the users are allowed to create an ortholog table among any specified set of organisms. Because of the rapid increase in microbial genome data owing to the next-generation sequencing technology, it becomes increasingly challenging to maintain high-quality orthology relationships while allowing the users to incorporate the latest genomic data available into an analysis. Because many of the recently accumulating genomic data are draft genome sequences for which some complete genome sequences of the same or closely related species are available, MBGD now stores draft genome data and allows the users to incorporate them into a user-specific ortholog database using the MyMBGD functionality. In this function, draft genome data are incorporated into an existing ortholog table created only from the complete genome data in an incremental manner to prevent low-quality draft data from affecting clustering results. In addition, to provide high-quality orthology relationships, the standard ortholog table containing all the representative genomes, which is first created by the rapid classification program DomClust, is now refined using DomRefine, a recently developed program for improving domain-level clustering using multiple sequence alignment information. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Draft Sequencing of the Heterozygous Diploid Genome of Satsuma (Citrus unshiu Marc.) Using a Hybrid Assembly Approach

PubMed Central

Shimizu, Tokurou; Tanizawa, Yasuhiro; Mochizuki, Takako; Nagasaki, Hideki; Yoshioka, Terutaka; Toyoda, Atsushi; Fujiyama, Asao; Kaminuma, Eli; Nakamura, Yasukazu

2017-01-01

Satsuma (Citrus unshiu Marc.) is one of the most abundantly produced mandarin varieties of citrus, known for its seedless fruit production and as a breeding parent of citrus. De novo assembly of the heterozygous diploid genome of Satsuma (“Miyagawa Wase”) was conducted by a hybrid assembly approach using short-read sequences, three mate-pair libraries, and a long-read sequence of PacBio by the PLATANUS assembler. The assembled sequence, with a total size of 359.7 Mb at the N50 length of 386,404 bp, consisted of 20,876 scaffolds. Pseudomolecules of Satsuma constructed by aligning the scaffolds to three genetic maps showed genome-wide synteny to the genomes of Clementine, pummelo, and sweet orange. Gene prediction by modeling with MAKER-P proposed 29,024 genes and 37,970 mRNA; additionally, gene prediction analysis found candidates for novel genes in several biosynthesis pathways for gibberellin and violaxanthin catabolism. BUSCO scores for the assembled scaffold and predicted transcripts, and another analysis by BAC end sequence mapping indicated the assembled genome consistency was close to those of the haploid Clementine, pummel, and sweet orange genomes. The number of repeat elements and long terminal repeat retrotransposon were comparable to those of the seven citrus genomes; this suggested no significant failure in the assembly at the repeat region. A resequencing application using the assembled sequence confirmed that both kunenbo-A and Satsuma are offsprings of Kishu, and Satsuma is a back-crossed offspring of Kishu. These results illustrated the performance of the hybrid assembly approach and its ability to construct an accurate heterozygous diploid genome. PMID:29259619
Draft Sequencing of the Heterozygous Diploid Genome of Satsuma (Citrus unshiu Marc.) Using a Hybrid Assembly Approach.

PubMed

Shimizu, Tokurou; Tanizawa, Yasuhiro; Mochizuki, Takako; Nagasaki, Hideki; Yoshioka, Terutaka; Toyoda, Atsushi; Fujiyama, Asao; Kaminuma, Eli; Nakamura, Yasukazu

2017-01-01

Satsuma ( Citrus unshiu Marc.) is one of the most abundantly produced mandarin varieties of citrus, known for its seedless fruit production and as a breeding parent of citrus. De novo assembly of the heterozygous diploid genome of Satsuma ("Miyagawa Wase") was conducted by a hybrid assembly approach using short-read sequences, three mate-pair libraries, and a long-read sequence of PacBio by the PLATANUS assembler. The assembled sequence, with a total size of 359.7 Mb at the N 50 length of 386,404 bp, consisted of 20,876 scaffolds. Pseudomolecules of Satsuma constructed by aligning the scaffolds to three genetic maps showed genome-wide synteny to the genomes of Clementine, pummelo, and sweet orange. Gene prediction by modeling with MAKER-P proposed 29,024 genes and 37,970 mRNA; additionally, gene prediction analysis found candidates for novel genes in several biosynthesis pathways for gibberellin and violaxanthin catabolism. BUSCO scores for the assembled scaffold and predicted transcripts, and another analysis by BAC end sequence mapping indicated the assembled genome consistency was close to those of the haploid Clementine, pummel, and sweet orange genomes. The number of repeat elements and long terminal repeat retrotransposon were comparable to those of the seven citrus genomes; this suggested no significant failure in the assembly at the repeat region. A resequencing application using the assembled sequence confirmed that both kunenbo-A and Satsuma are offsprings of Kishu, and Satsuma is a back-crossed offspring of Kishu. These results illustrated the performance of the hybrid assembly approach and its ability to construct an accurate heterozygous diploid genome.
Genome, Functional Gene Annotation, and Nuclear Transformation of the Heterokont Oleaginous Alga Nannochloropsis oceanica CCMP1779

DTIC Science & Technology

2012-11-15

the 28.7 Mb genome of N. oceanica CCMP1779. RNA sequencing data from nitrogen-replete and nitrogen- depleted growth conditions support a total of... sequence and its analysis, protocols for the transformation of N. oceanica CCMP1779 are provided. The availability of genomic and transcriptomic data for...biochemistry of this fascinating organism group. Here we present the assembly of the 28.7 Mb genome of N. oceanica CCMP1779. RNA sequencing data from
Draft genome of tule elk Cervus canadensis nannodes.

PubMed

Mizzi, Jessica E; Lounsberry, Zachary T; Brown, C Titus; Sacks, Benjamin N

2017-01-01

This paper presents the first draft genome of the tule elk ( Cervus elaphus nannodes ), a subspecies native to California that underwent an extreme genetic bottleneck in the late 1800s. The genome was generated from Illumina HiSeq 3000 whole genome sequencing of four individuals, resulting in the assembly of 2.395 billion base pairs (Gbp) over 602,862 contigs over 500 bp and N50 = 6,885 bp. This genome provides a resource to facilitate future genomic research on elk and other cervids.

Draft genome sequence of the coccolithovirus Emiliania huxleyi virus 202.

PubMed

Nissimov, Jozef I; Worthy, Charlotte A; Rooks, Paul; Napier, Johnathan A; Kimmance, Susan A; Henn, Matthew R; Ogata, Hiroyuki; Allen, Michael J

2012-02-01

Emiliania huxleyi virus 202 (EhV-202) is a member of the Coccolithoviridae, a group of viruses that infect the marine coccolithophorid Emiliania huxleyi. EhV-202 has a 160- to 180-nm-diameter icosahedral structure and a genome of approximately 407 kbp, consisting of 485 coding sequences (CDSs). Here we describe the genomic features of EhV-202, together with a draft genome sequence and its annotation, highlighting the homology and heterogeneity of this genome in comparison with the EhV-86 reference genome.
Draft genome sequence of the Coccolithovirus Emiliania huxleyi virus 203.

PubMed

Nissimov, Jozef I; Worthy, Charlotte A; Rooks, Paul; Napier, Johnathan A; Kimmance, Susan A; Henn, Matthew R; Ogata, Hiroyuki; Allen, Michael J

2011-12-01

The Coccolithoviridae are a recently discovered group of viruses that infect the marine coccolithophorid Emiliania huxleyi. Emiliania huxleyi virus 203 (EhV-203) has a 160- to 180-nm-diameter icosahedral structure and a genome of approximately 400 kbp, consisting of 464 coding sequences (CDSs). Here we describe the genomic features of EhV-203 together with a draft genome sequence and its annotation, highlighting the homology and heterogeneity of this genome in comparison with the EhV-86 reference genome.
Draft Genome of the Marine Gammaproteobacterium Halomonas titanicae

PubMed Central

Sánchez-Porro, Cristina; de la Haba, Rafael R.; Cruz-Hernández, Norge; González, Juan M.; Reyes-Guirao, Cristina; Navarro-Sampedro, Laura; Carballo, Modesto

2013-01-01

Halomonas titanicae strain BH1 is a heterotrophic, aerobic marine bacterium which was isolated from rusticles of the RMS Titanic wreck. Here we report the draft genome sequence of this halophilic gammaproteobacterium. PMID:23516210
Elucidating the Small Regulatory RNA Repertoire of the Sea Anemone Anemonia viridis Based on Whole Genome and Small RNA Sequencing

PubMed Central

Patel, Hardip; Forêt, Sylvain; Karlsen, Bård Ove; Jørgensen, Tor Erik; Hall-Spencer, Jason M

2018-01-01

Abstract Cnidarians harbor a variety of small regulatory RNAs that include microRNAs (miRNAs) and PIWI-interacting RNAs (piRNAs), but detailed information is limited. Here, we report the identification and expression of novel miRNAs and putative piRNAs, as well as their genomic loci, in the symbiotic sea anemone Anemonia viridis. We generated a draft assembly of the A. viridis genome with putative size of 313 Mb that appeared to be composed of about 36% repeats, including known transposable elements. We detected approximately equal fractions of DNA transposons and retrotransposons. Deep sequencing of small RNA libraries constructed from A. viridis adults sampled at a natural CO2 gradient off Vulcano Island, Italy, identified 70 distinct miRNAs. Eight were homologous to previously reported miRNAs in cnidarians, whereas 62 appeared novel. Nine miRNAs were recognized as differentially expressed along the natural seawater pH gradient. We found a highly abundant and diverse population of piRNAs, with a substantial fraction showing ping–pong signatures. We identified nearly 22% putative piRNAs potentially targeting transposable elements within the A. viridis genome. The A. viridis genome appeared similar in size to that of other hexacorals with a very high divergence of transposable elements resembling that of the sea anemone genus Exaiptasia. The genome encodes and expresses a high number of small regulatory RNAs, which include novel miRNAs and piRNAs. Differentially expressed small RNAs along the seawater pH gradient indicated regulatory gene responses to environmental stressors. PMID:29385567
Draft genome sequence of Dactylonectria macrodydima, a plant pathogenic fungus in the Nectriaceae

USDA-ARS?s Scientific Manuscript database

Dactylonectria macrodidyma is part of the Nectriaceae, a family containing important plant pathogens. This species possesses the ability to induce disease on grapevine, avocado and olive. Here, we report the first draft genome of D. macrodidyma isolate JAC15-08. The assembled genome was 58 Mbp and c...
Draft Genome Sequence of Mycobacterium triplex DSM 44626.

PubMed

Sassi, Mohamed; Croce, Olivier; Robert, Catherine; Raoult, Didier; Drancourt, Michel

2014-05-29

We announce the draft genome sequence of Mycobacterium triplex strain DSM 44626, a nontuberculosis species responsible for opportunistic infections. The genome described here is composed of 6,382,840 bp, with a G+C content of 66.57%, and contains 5,988 protein-coding genes and 81 RNA genes. Copyright © 2014 Sassi et al.
Draft Genome Sequence of Two Sphingopyxis sp. Strains, Dominant Members of the Bacterial Community Associated with a Drinking Water Distribution System Simulator

EPA Science Inventory

We report the draft genome of two Sphingopyxis spp. strains isolated from a chloraminated drinking water distribution system simulator. Both strains are ubiquitous residents and early colonizers of water distribution systems. Genomic annotation identified a class 1 integron (in...
Draft Genome Sequences of 37 Salmonella enterica Strains Isolated from Poultry Sources in Nigeria

PubMed Central

Useh, Nicodemus M.; Ngbede, Emmanuel O.; Akange, Nguavese; Thomas, Milton; Foley, Andrew; Keena, Mitchel Chan; Nelson, Eric; Christopher-Hennings, Jane; Tomita, Masaru

2016-01-01

Here, we report the availability of draft genomes of several Salmonella serotypes, isolated from poultry sources from Nigeria. These genomes will help to further understand the biological diversity of S. enterica and will serve as references in microbial trace-back studies to improve food safety. PMID:27151793
Draft Genome Sequence of Lactobacillus farciminis NBRC 111452, Isolated from Kôso, a Japanese Sugar-Vegetable Fermented Beverage

PubMed Central

Oshima, Kenshiro; Suda, Wataru; Hattori, Masahira; Takahashi, Tomoya

2016-01-01

Here, we report the draft genome sequence of the Lactobacillus farciminis strain NBRC 111452, isolated from kôso, a Japanese sugar-vegetable fermented beverage. This genome information is of potential use in studies of Lactobacillus farciminis as a probiotic. PMID:26769925
Draft Genome Sequence of Lactobacillus johnsonii Strain 16, Isolated from Mice.

PubMed

Buhnik-Rosenblau, Keren; Danin-Poleg, Yael; Elgavish, Sharona; Kashi, Yechezkel

2015-10-08

Here, we report the genome sequence of Lactobacillus johnsonii, a member of the gut lactobacilli. This draft genome of L. johnsonii strain 16 isolated from C57BL/6J mice enables the identification of bacterial genes responsible for host-specific gut persistence. Copyright © 2015 Buhnik-Rosenblau et al.
Chromosomal-Level Assembly of the Asian Seabass Genome Using Long Sequence Reads and Multi-layered Scaffolding

PubMed Central

Vij, Shubha; Kuhl, Heiner; Kuznetsova, Inna S.; Komissarov, Aleksey; Yurchenko, Andrey A.; Van Heusden, Peter; Singh, Siddharth; Thevasagayam, Natascha M.; Prakki, Sai Rama Sridatta; Purushothaman, Kathiresan; Saju, Jolly M.; Jiang, Junhui; Mbandi, Stanley Kimbung; Jonas, Mario; Hin Yan Tong, Amy; Mwangi, Sarah; Lau, Doreen; Ngoh, Si Yan; Liew, Woei Chang; Shen, Xueyan; Hon, Lawrence S.; Drake, James P.; Boitano, Matthew; Hall, Richard; Chin, Chen-Shan; Lachumanan, Ramkumar; Korlach, Jonas; Trifonov, Vladimir; Kabilov, Marsel; Tupikin, Alexey; Green, Darrell; Moxon, Simon; Garvin, Tyler; Sedlazeck, Fritz J.; Vurture, Gregory W.; Gopalapillai, Gopikrishna; Kumar Katneni, Vinaya; Noble, Tansyn H.; Scaria, Vinod; Sivasubbu, Sridhar; Jerry, Dean R.; O'Brien, Stephen J.; Schatz, Michael C.; Dalmay, Tamás; Turner, Stephen W.; Lok, Si; Christoffels, Alan; Orbán, László

2016-01-01

We report here the ~670 Mb genome assembly of the Asian seabass (Lates calcarifer), a tropical marine teleost. We used long-read sequencing augmented by transcriptomics, optical and genetic mapping along with shared synteny from closely related fish species to derive a chromosome-level assembly with a contig N50 size over 1 Mb and scaffold N50 size over 25 Mb that span ~90% of the genome. The population structure of L. calcarifer species complex was analyzed by re-sequencing 61 individuals representing various regions across the species’ native range. SNP analyses identified high levels of genetic diversity and confirmed earlier indications of a population stratification comprising three clades with signs of admixture apparent in the South-East Asian population. The quality of the Asian seabass genome assembly far exceeds that of any other fish species, and will serve as a new standard for fish genomics. PMID:27082250
Genome Structure of the Legume, Lotus japonicus

PubMed Central

Sato, Shusei; Nakamura, Yasukazu; Kaneko, Takakazu; Asamizu, Erika; Kato, Tomohiko; Nakao, Mitsuteru; Sasamoto, Shigemi; Watanabe, Akiko; Ono, Akiko; Kawashima, Kumiko; Fujishiro, Tsunakazu; Katoh, Midori; Kohara, Mitsuyo; Kishida, Yoshie; Minami, Chiharu; Nakayama, Shinobu; Nakazaki, Naomi; Shimizu, Yoshimi; Shinpo, Sayaka; Takahashi, Chika; Wada, Tsuyuko; Yamada, Manabu; Ohmido, Nobuko; Hayashi, Makoto; Fukui, Kiichi; Baba, Tomoya; Nakamichi, Tomoko; Mori, Hirotada; Tabata, Satoshi

2008-01-01

The legume Lotus japonicus has been widely used as a model system to investigate the genetic background of legume-specific phenomena such as symbiotic nitrogen fixation. Here, we report structural features of the L. japonicus genome. The 315.1-Mb sequences determined in this and previous studies correspond to 67% of the genome (472 Mb), and are likely to cover 91.3% of the gene space. Linkage mapping anchored 130-Mb sequences onto the six linkage groups. A total of 10 951 complete and 19 848 partial structures of protein-encoding genes were assigned to the genome. Comparative analysis of these genes revealed the expansion of several functional domains and gene families that are characteristic of L. japonicus. Synteny analysis detected traces of whole-genome duplication and the presence of synteny blocks with other plant genomes to various degrees. This study provides the first opportunity to look into the complex and unique genetic system of legumes. PMID:18511435
Draft Genome Sequence of Sphingobium fuliginis OMI, a Bacterium That Degrades Alkylphenols and Bisphenols

PubMed Central

Ogata, Yuka; Yahara, Tatsuya; Yokoyama, Takashi; Ishizawa, Hidehiro; Takada, Kazuki; Inoue, Daisuke; Sei, Kazunari

2017-01-01

ABSTRACT Sphingobium fuliginis OMI is a bacterium that can degrade a variety of recalcitrant alkylphenols and bisphenols. This study reports the draft genome sequence of S. fuliginis OMI. PMID:29167253
Draft Genome Sequence of Bioactive-Compound-Producing Cyanobacterium Tolypothrix campylonemoides Strain VB511288

PubMed Central

Das, Subhadeep; Singh, Deeksha; Madduluri, Madhavi; Chandrababunaidu, Mathu Malar; Gupta, Akash

2015-01-01

We report here the draft genome sequence of Tolypothrix campylonemoides VB511288, isolated from building facades in Santiniketan, India. The members of this genus produce several compounds of commercial importance. The draft assembly is 10,627,177 bases in 135 scaffolds, and it contains 7,886 protein-coding genes, 994 pseudogenes, 18 rRNA genes, and 76 tRNA genes. PMID:25838485
Clinical and Genetic Implications of Mutation Burden in Squamous Cell Carcinoma of the Lung.

PubMed

Okamoto, Tatsuro; Takada, Kazuki; Sato, Seijiro; Toyokawa, Gouji; Tagawa, Tetsuzo; Shoji, Fumihiro; Nakanishi, Ryota; Oki, Eiji; Koike, Terumoto; Nagahashi, Masayuki; Ichikawa, Hiroshi; Shimada, Yoshifumi; Watanabe, Satoshi; Kikuchi, Toshiaki; Akazawa, Kouhei; Lyle, Stephen; Takabe, Kazuaki; Okuda, Shujiro; Sugio, Kenji; Wakai, Toshifumi; Tsuchida, Masanori; Maehara, Yoshihiko

2018-06-01

Lung squamous cell carcinoma (LSCC) is a major histological subtype of lung cancer. In this study, we investigated genomic alterations in LSCC and evaluated the clinical implications of mutation burden (MB) in LSCC. Genomic alterations were determined in Japanese patients with LSCC (N = 67) using next-generation sequencing of 415 known cancer genes. MB was defined as the number of non-synonymous mutations per 1 Mbp. Programmed death-ligand 1 (PD-L1) protein expression in cancer cells was evaluated by immunohistochemical analysis. TP53 gene mutations were the most common alteration (n = 51/67, 76.1%), followed by gene alterations in cyclin-dependent kinase inhibitor 2B (CDKN2B; 35.8%), CDKN2A (31.3%), phosphatase and tensin homolog (30.0%), and sex-determining region Y-box 2 (SOX2, 28.3%). Histological differentiation was significantly poorer in tumors with high MB (greater than or equal to the median MB) compared with that in tumors with low MB (less than the median MB; p = 0.0446). The high MB group had more tumors located in the upper or middle lobe than tumors located in the lower lobe (p = 0.0019). Moreover, cancers in the upper or middle lobes had significantly higher MB than cancers in the lower lobes (p = 0.0005), and tended to show higher PD-L1 protein expression (p = 0.0573). SOX2 and tyrosine kinase non-receptor 2 amplifications were associated with high MB (p = 0.0065 and p = 0.0010, respectively). The MB level differed according to the tumor location in LSCC, suggesting that the location of cancer development may influence the genomic background of the tumor.
Apophysomyces variabilis: draft genome sequence and comparison of predictive virulence determinants with other medically important Mucorales.

PubMed

Prakash, Hariprasath; Rudramurthy, Shivaprakash Mandya; Gandham, Prasad S; Ghosh, Anup Kumar; Kumar, Milner M; Badapanda, Chandan; Chakrabarti, Arunaloke

2017-09-18

Apophysomyces species are prevalent in tropical countries and A. variabilis is the second most frequent agent causing mucormycosis in India. Among Apophysomyces species, A. elegans, A. trapeziformis and A. variabilis are commonly incriminated in human infections. The genome sequences of A. elegans and A. trapeziformis are available in public database, but not A. variabilis. We, therefore, performed the whole genome sequence of A. variabilis to explore its genomic structure and possible genes determining the virulence of the organism. The whole genome of A. variabilis NCCPF 102052 was sequenced and the genomic structure of A. variabilis was compared with already available genome structures of A. elegans, A. trapeziformis and other medically important Mucorales. The total size of genome assembly of A. variabilis was 39.38 Mb with 12,764 protein-coding genes. The transposable elements (TEs) were low in Apophysomyces genome and the retrotransposon Ty3-gypsy was the common TE. Phylogenetically, Apophysomyces species were grouped closely with Phycomyces blakesleeanus. OrthoMCL analysis revealed 3025 orthologues proteins, which were common in those three pathogenic Apophysomyces species. Expansion of multiple gene families/duplication was observed in Apophysomyces genomes. Approximately 6% of Apophysomyces genes were predicted to be associated with virulence on PHIbase analysis. The virulence determinants included the protein families of CotH proteins (invasins), proteases, iron utilisation pathways, siderophores and signal transduction pathways. Serine proteases were the major group of proteases found in all Apophysomyces genomes. The carbohydrate active enzymes (CAZymes) constitute the majority of the secretory proteins. The present study is the maiden attempt to sequence and analyze the genomic structure of A. variabilis. Together with available genome sequence of A. elegans and A. trapeziformis, the study helped to indicate the possible virulence determinants of pathogenic Apophysomyces species. The presence of unique CAZymes in cell wall might be exploited in future for antifungal drug development.
Draft Genome Sequence of Pseudomonas sp. Strain B1, Isolated from a Contaminated Sediment

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pathak, Ashish; Jaswal, Rajneesh; Stothard, Paul

ABSTRACT The draft genome sequence of Pseudomonas sp. strain B1, isolated from a contaminated soil, is reported. The genome comprises 6,706,934 bases, 6,059 coding sequences, and 70 RNAs and has a G+C content of 60.3%. A suite of biodegradative genes, many located on genomic islands, were identified from strain B1, further enhancing our understanding of the versatile pseudomonads.
Draft Genome Sequence of Pseudomonas sp. EpS/L25, Isolated from the Medicinal Plant Echinacea purpurea and Able To Synthesize Antimicrobial Compounds.

PubMed

Presta, Luana; Bosi, Emanuele; Fondi, Marco; Maida, Isabel; Perrin, Elena; Miceli, Elisangela; Maggini, Valentina; Bogani, Patrizia; Firenzuoli, Fabio; Di Pilato, Vincenzo; Rossolini, Gian Maria; Mengoni, Alessio; Fani, Renato

2016-05-05

We announce here the draft genome sequence of Pseudomonas sp. strain EpS/L25, isolated from the stem/leaves of the medicinal plant Echinacea purpurea This genome will allow for comparative genomics in order to identify genes associated with the production of bioactive compounds and antibiotic resistance. Copyright © 2016 Presta et al.
Draft Genome Sequence of Pseudomonas sp. Strain B1, Isolated from a Contaminated Sediment

DOE PAGES

Pathak, Ashish; Jaswal, Rajneesh; Stothard, Paul; ...

2018-06-21

ABSTRACT The draft genome sequence of Pseudomonas sp. strain B1, isolated from a contaminated soil, is reported. The genome comprises 6,706,934 bases, 6,059 coding sequences, and 70 RNAs and has a G+C content of 60.3%. A suite of biodegradative genes, many located on genomic islands, were identified from strain B1, further enhancing our understanding of the versatile pseudomonads.
Draft Genome Sequence of Leptolyngbya sp. KIOST-1, a Filamentous Cyanobacterium with Biotechnological Potential for Alimentary Purposes

PubMed Central

Kim, Ji Hyung

2016-01-01

Here, we report the draft genome of cyanobacterium Leptolyngbya sp. KIOST-1 isolated from a microalgal culture pond in South Korea. The genome consists of 13 contigs containing 6,320,172 bp, and a total of 5,327 coding sequences were predicted. This genomic information will allow further exploitation of its biotechnological potential for alimentary purposes. PMID:27635005

Draft Genome Sequence of Limnobacter sp. Strain CACIAM 66H1, a Heterotrophic Bacterium Associated with Cyanobacteria

PubMed Central

da Silva, Fábio Daniel Florêncio; Lima, Alex Ranieri Jerônimo; Moraes, Pablo Henrique Gonçalves; Siqueira, Andrei Santos; Dall’Agnol, Leonardo Teixeira; Baraúna, Anna Rafaella Ferreira; Martins, Luisa Carício; Oliveira, Karol Guimarães; de Lima, Clayton Pereira Silva; Nunes, Márcio Roberto Teixeira; Vianez-Júnior, João Lídio Silva Gonçalves

2016-01-01

Ecological interactions between cyanobacteria and heterotrophic prokaryotes are poorly known. To improve the genomic studies of heterotrophic bacterium-cyanobacterium associations, the draft genome sequence (3.2 Mbp) of Limnobacter sp. strain CACIAM 66H1, found in a nonaxenic culture of Synechococcus sp. (cyanobacteria), is presented here. PMID:27198027
Draft genome sequence of the New Jersey aster yellows strain of ‘Candidatus Phytoplasma asteris’

USDA-ARS?s Scientific Manuscript database

The NJAY (New Jersey aster yellows) strain of ‘Candidatus Phytoplasma asteris’ is a significant plant pathogen responsible for causing severe lettuce yellows in the U.S. state of New Jersey. A draft genome sequence was prepared for this organism and used for genome- and gene-based comparative phylog...
Draft Genome Sequences for Five Strains of Trabulsiella odontotermitis, Isolated from Heterotermes sp. Termite Gut

PubMed Central

Olvera-García, Myrna; Fontes-Perez, Héctor; Chávez-Martínez, America; Ruiz Barrera, Oscar; Rodríguez-Almeida, Felipe A.

2015-01-01

Trabulsiella odontotermitis represents a novel species in the genus Trabulsiella with no complete genome reported yet. Here, we describe the draft genome sequences of five isolates from termites present in the north of Mexico, which have an interesting pool of genes related to cellulose degradation with biotechnological application. PMID:26543120
Draft Genome Sequence of a Bacillus Bacterium from the Atacama Desert Wetlands Metagenome

PubMed Central

Vilo, Claudia; Galetovic, Alexandra; Araya, Jorge E.; Dong, Qunfeng

2015-01-01

We report here the draft genome sequence of a Bacillus bacterium isolated from the microflora of Nostoc colonies grown at the Andean wetlands in northern Chile. We consider this genome sequence to be a molecular tool for exploring microbial relationships and adaptation strategies to the prevailing extreme conditions at the Atacama Desert. PMID:26294639
Draft genome sequence of “Candidatus Liberibacter asiaticus” from Diaphorina citri in Guangdong, China

USDA-ARS?s Scientific Manuscript database

The draft genome sequence of “Candidatus Liberibacter asiaticus” strain YCPsy from an Asian citrus psyllid (Diaphorina citri) in Guangdong of China is reported. The YCPsy strain has a genome size of 1,233,647 bp, 36.5% G+C content, 1,171 open reading frames (ORFs), and 53 RNAs....
Draft Genome Sequence of Pseudomonas sp. BDAL1 Reconstructed from a Bakken Shale Hydraulic Fracturing-Produced Water Storage Tank Metagenome

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lipus, Daniel; Ross, Daniel; Bibby, Kyle

We report the 5,425,832 bp draft genome ofPseudomonassp. strain BDAL1, recovered from a Bakken shale hydraulic fracturing-produced water tank metagenome. Genome annotation revealed several key biofilm formation genes and osmotic stress response mechanisms necessary for survival in hydraulic fracturing-produced water.
Draft Genome Sequences of 37 Salmonella enterica Strains Isolated from Poultry Sources in Nigeria.

PubMed

Useh, Nicodemus M; Ngbede, Emmanuel O; Akange, Nguavese; Thomas, Milton; Foley, Andrew; Keena, Mitchel Chan; Nelson, Eric; Christopher-Hennings, Jane; Tomita, Masaru; Suzuki, Haruo; Scaria, Joy

2016-05-05

Here, we report the availability of draft genomes of several Salmonella serotypes, isolated from poultry sources from Nigeria. These genomes will help to further understand the biological diversity of S. enterica and will serve as references in microbial trace-back studies to improve food safety. Copyright © 2016 Useh et al.
Draft genome sequence of Pyrodictium occultum PL19 T, a marine hyperthermophilic species of Archaea that grows optimally at 105°C

DOE PAGES

Utturkar, Sagar M.; Huber, Harald; Leptihn, Sebastian; ...

2016-02-25

We report here the draft genome sequence of Pyrodictium occultum PL19 T, a marine hyperthermophilic archaeon. In addition, the genome provides insights into molecular and cellular adaptation mechanisms to life in extreme environments and the evolution of early organisms on Earth.
Draft Genome Sequence of “Cohnella kolymensis” B-2846

PubMed Central

Kudryashova, Ekaterina B.; Ariskina, Elena V.

2016-01-01

A draft genome sequence of “Cohnella kolymensis” strain B-2846 was derived using IonTorrent sequencing technology. The size of the assembly and G+C content were in agreement with those of other species of this genus. Characterization of the genome of a novel species of Cohnella will assist in bacterial systematics. PMID:26769947
Draft Genome Sequences of Clostridium Strains Native to Colombia with the Potential To Produce Solvents

PubMed Central

Rosas-Morales, Juan Pablo; Perez-Mancilla, Ximena; López-Kleine, Liliana

2015-01-01

Genomes from four Clostridium sp. strains considered to be mesophilic anaerobic bacteria, isolated from crop soil in Colombia, with a strong potential to produce alcohols like 1,3-propanediol, were analyzed. We present the draft genome of these strains, which will be useful for developing genetic engineering strategies. PMID:25999575
Draft Genome Sequence of Lactobacillus farciminis NBRC 111452, Isolated from Kôso, a Japanese Sugar-Vegetable Fermented Beverage.

PubMed

Chiou, Tai-Ying; Oshima, Kenshiro; Suda, Wataru; Hattori, Masahira; Takahashi, Tomoya

2016-01-14

Here, we report the draft genome sequence of the Lactobacillus farciminis strain NBRC 111452, isolated from kôso, a Japanese sugar-vegetable fermented beverage. This genome information is of potential use in studies of Lactobacillus farciminis as a probiotic. Copyright © 2016 Chiou et al.
Draft Genome Sequence of Pseudomonas sp. BDAL1 Reconstructed from a Bakken Shale Hydraulic Fracturing-Produced Water Storage Tank Metagenome

DOE PAGES

Lipus, Daniel; Ross, Daniel; Bibby, Kyle; ...

2017-03-16

We report the 5,425,832 bp draft genome ofPseudomonassp. strain BDAL1, recovered from a Bakken shale hydraulic fracturing-produced water tank metagenome. Genome annotation revealed several key biofilm formation genes and osmotic stress response mechanisms necessary for survival in hydraulic fracturing-produced water.
Draft Genome Sequence of Fish Pathogen Aeromonas bestiarum GA97-22.

PubMed

Kumru, Salih; Tekedar, Hasan C; Griffin, Matt J; Waldbieser, Geoffrey C; Liles, Mark R; Sonstegard, Tad; Schroeder, Steven G; Lawrence, Mark L; Karsi, Attila

2018-06-14

Aeromonas bestiarum is a Gram-negative mesophilic motile bacterium causing acute hemorrhagic septicemia or chronic skin ulcers in fish. Here, we report the draft genome sequence of A. bestiarum strain GA97-22, which was isolated from rainbow trout in 1997. This genome sequence will improve our understanding of the complex taxonomy of motile aeromonads.
Draft Genome Sequence of Clostridium pasteurianum NRRL B-598, a Potential Butanol or Hydrogen Producer.

PubMed

Kolek, Jan; Sedlár, Karel; Provazník, Ivo; Patáková, Petra

2014-03-20

We present a draft genome sequence of Clostridium pasteurianum NRRL B-598. This strain ferments saccharides by two-stage acetone-butanol (AB) fermentation, is oxygen tolerant, and has high hydrogen yields.
Draft Genome Sequence of Aeromonas caviae Strain 429865 INP, Isolated from a Mexican Patient

PubMed Central

Padilla, Juan Carlos A.; Bustos, Patricia; Sánchez-Varela, Alejandro; Palma-Martinez, Ingrid; Arzate-Barbosa, Patricia; García-Pérez, Carlos A.; López-López, María de Jesús; González, Víctor

2015-01-01

Aeromonas caviae is an emerging human pathogen. Here, we report the draft genome sequence of Aeromonas caviae strain 429865 INP which shows the presence of various putative virulence-related genes. PMID:26494682
Draft Genome of Rhodococcus rhodochrous TRN7, Isolated from the Coast of Trindade Island, Brazil

PubMed Central

Rodrigues, Edmo M.; Pylro, Victor S.; Dobbler, Priscila T.; Victoria, Filipe

2016-01-01

Here, we present a draft genome and annotation of Rhodococcus rhodochrous TRN7, isolated from Trindade Island, Brazil, which will provide genetic data to benefit the understanding of its metabolism. PMID:26941155
Draft Genome Sequence of Lactobacillus plantarum Strain IPLA 88

PubMed Central

Ladero, Victor; Alvarez-Sieiro, Patricia; Redruello, Begoña; del Rio, Beatriz; Linares, Daniel M.; Martin, M. Cruz; Fernández, María

2013-01-01

Here, we report a 3.2-Mbp draft assembly for the genome of Lactobacillus plantarum IPLA 88. The sequence of this sourdough isolate provides insight into the adaptation of this versatile species to different environments. PMID:23887921
Revealing the inventory of type III effectors in Pantoea agglomerans gall-forming pathovars using draft genome sequences and a machine-learning approach.

PubMed

Nissan, Gal; Gershovits, Michael; Morozov, Michael; Chalupowicz, Laura; Sessa, Guido; Manulis-Sasson, Shulamit; Barash, Isaac; Pupko, Tal

2018-02-01

Pantoea agglomerans, a widespread epiphytic bacterium, has evolved into a hypersensitive response and pathogenicity (hrp)-dependent and host-specific gall-forming pathogen by the acquisition of a pathogenicity plasmid containing a type III secretion system (T3SS) and its effectors (T3Es). Pantoea agglomerans pv. betae (Pab) elicits galls on beet (Beta vulgaris) and gypsophila (Gypsophila paniculata), whereas P. agglomerans pv. gypsophilae (Pag) incites galls on gypsophila and a hypersensitive response (HR) on beet. Draft genome sequences were generated and employed in combination with a machine-learning approach and a translocation assay into beet roots to identify the pools of T3Es in the two pathovars. The genomes of the sequenced Pab4188 and Pag824-1 strains have a similar size (∼5 MB) and GC content (∼55%). Mutational analysis revealed that, in Pab4188, eight T3Es (HsvB, HsvG, PseB, DspA/E, HopAY1, HopX2, HopAF1 and HrpK) contribute to pathogenicity on beet and gypsophila. In Pag824-1, nine T3Es (HsvG, HsvB, PthG, DspA/E, HopAY1, HopD1, HopX2, HopAF1 and HrpK) contribute to pathogenicity on gypsophila, whereas the PthG effector triggers HR on beet. HsvB, HsvG, PthG and PseB appear to endow pathovar specificities to Pab and Pag, and no homologous T3Es were identified for these proteins in other phytopathogenic bacteria. Conversely, the remaining T3Es contribute to the virulence of both pathovars, and homologous T3Es were found in other phytopathogenic bacteria. Remarkably, HsvG and HsvB, which act as host-specific transcription factors, displayed the largest contribution to disease development. © 2016 BSPP AND JOHN WILEY & SONS LTD.
Assembly of the draft genome of buckwheat and its applications in identifying agronomically useful genes

PubMed Central

Yasui, Yasuo; Hirakawa, Hideki; Ueno, Mariko; Matsui, Katsuhiro; Katsube-Tanaka, Tomoyuki; Yang, Soo Jung; Aii, Jotaro; Sato, Shingo; Mori, Masashi

2016-01-01

Buckwheat (Fagopyrum esculentum Moench; 2n = 2x = 16) is a nutritionally dense annual crop widely grown in temperate zones. To accelerate molecular breeding programmes of this important crop, we generated a draft assembly of the buckwheat genome using short reads obtained by next-generation sequencing (NGS), and constructed the Buckwheat Genome DataBase. After assembling short reads, we determined 387,594 scaffolds as the draft genome sequence (FES_r1.0). The total length of FES_r1.0 was 1,177,687,305 bp, and the N50 of the scaffolds was 25,109 bp. Gene prediction analysis revealed 286,768 coding sequences (CDSs; FES_r1.0_cds) including those related to transposable elements. The total length of FES_r1.0_cds was 212,917,911 bp, and the N50 was 1,101 bp. Of these, the functions of 35,816 CDSs excluding those for transposable elements were annotated by BLAST analysis. To demonstrate the utility of the database, we conducted several test analyses using BLAST and keyword searches. Furthermore, we used the draft genome as a reference sequence for NGS-based markers, and successfully identified novel candidate genes controlling heteromorphic self-incompatibility of buckwheat. The database and draft genome sequence provide a valuable resource that can be used in efforts to develop buckwheat cultivars with superior agronomic traits. PMID:27037832
Methylobacterium Genome Sequences: A Reference Blueprint to Investigate Microbial Metabolism of C1 Compounds from Natural and Industrial Sources

PubMed Central

Lee, Ming-Chun; Bringel, Françoise; Lajus, Aurélie; Zhou, Yang; Gourion, Benjamin; Barbe, Valérie; Chang, Jean; Cruveiller, Stéphane; Dossat, Carole; Gillett, Will; Gruffaz, Christelle; Haugen, Eric; Hourcade, Edith; Levy, Ruth; Mangenot, Sophie; Muller, Emilie; Nadalig, Thierry; Pagni, Marco; Penny, Christian; Peyraud, Rémi; Robinson, David G.; Roche, David; Rouy, Zoé; Saenampechek, Channakhone; Salvignol, Grégory; Vallenet, David; Wu, Zaining; Marx, Christopher J.; Vorholt, Julia A.; Olson, Maynard V.; Kaul, Rajinder; Weissenbach, Jean; Médigue, Claudine; Lidstrom, Mary E.

2009-01-01

Background Methylotrophy describes the ability of organisms to grow on reduced organic compounds without carbon-carbon bonds. The genomes of two pink-pigmented facultative methylotrophic bacteria of the Alpha-proteobacterial genus Methylobacterium, the reference species Methylobacterium extorquens strain AM1 and the dichloromethane-degrading strain DM4, were compared. Methodology/Principal Findings The 6.88 Mb genome of strain AM1 comprises a 5.51 Mb chromosome, a 1.26 Mb megaplasmid and three plasmids, while the 6.12 Mb genome of strain DM4 features a 5.94 Mb chromosome and two plasmids. The chromosomes are highly syntenic and share a large majority of genes, while plasmids are mostly strain-specific, with the exception of a 130 kb region of the strain AM1 megaplasmid which is syntenic to a chromosomal region of strain DM4. Both genomes contain large sets of insertion elements, many of them strain-specific, suggesting an important potential for genomic plasticity. Most of the genomic determinants associated with methylotrophy are nearly identical, with two exceptions that illustrate the metabolic and genomic versatility of Methylobacterium. A 126 kb dichloromethane utilization (dcm) gene cluster is essential for the ability of strain DM4 to use DCM as the sole carbon and energy source for growth and is unique to strain DM4. The methylamine utilization (mau) gene cluster is only found in strain AM1, indicating that strain DM4 employs an alternative system for growth with methylamine. The dcm and mau clusters represent two of the chromosomal genomic islands (AM1: 28; DM4: 17) that were defined. The mau cluster is flanked by mobile elements, but the dcm cluster disrupts a gene annotated as chelatase and for which we propose the name “island integration determinant” (iid). Conclusion/Significance These two genome sequences provide a platform for intra- and interspecies genomic comparisons in the genus Methylobacterium, and for investigations of the adaptive mechanisms which allow bacterial lineages to acquire methylotrophic lifestyles. PMID:19440302

Dissection of the Octoploid Strawberry Genome by Deep Sequencing of the Genomes of Fragaria Species

PubMed Central

Hirakawa, Hideki; Shirasawa, Kenta; Kosugi, Shunichi; Tashiro, Kosuke; Nakayama, Shinobu; Yamada, Manabu; Kohara, Mistuyo; Watanabe, Akiko; Kishida, Yoshie; Fujishiro, Tsunakazu; Tsuruoka, Hisano; Minami, Chiharu; Sasamoto, Shigemi; Kato, Midori; Nanri, Keiko; Komaki, Akiko; Yanagi, Tomohiro; Guoxin, Qin; Maeda, Fumi; Ishikawa, Masami; Kuhara, Satoru; Sato, Shusei; Tabata, Satoshi; Isobe, Sachiko N.

2014-01-01

Cultivated strawberry (Fragaria x ananassa) is octoploid and shows allogamous behaviour. The present study aims at dissecting this octoploid genome through comparison with its wild relatives, F. iinumae, F. nipponica, F. nubicola, and F. orientalis by de novo whole-genome sequencing on an Illumina and Roche 454 platforms. The total length of the assembled Illumina genome sequences obtained was 698 Mb for F. x ananassa, and ∼200 Mb each for the four wild species. Subsequently, a virtual reference genome termed FANhybrid_r1.2 was constructed by integrating the sequences of the four homoeologous subgenomes of F. x ananassa, from which heterozygous regions in the Roche 454 and Illumina genome sequences were eliminated. The total length of FANhybrid_r1.2 thus created was 173.2 Mb with the N50 length of 5137 bp. The Illumina-assembled genome sequences of F. x ananassa and the four wild species were then mapped onto the reference genome, along with the previously published F. vesca genome sequence to establish the subgenomic structure of F. x ananassa. The strategy adopted in this study has turned out to be successful in dissecting the genome of octoploid F. x ananassa and appears promising when applied to the analysis of other polyploid plant species. PMID:24282021
Draft Genome Sequence of Bioactive-Compound-Producing Cyanobacterium Tolypothrix campylonemoides Strain VB511288.

PubMed

Das, Subhadeep; Singh, Deeksha; Madduluri, Madhavi; Chandrababunaidu, Mathu Malar; Gupta, Akash; Adhikary, Siba Prasad; Tripathy, Sucheta

2015-04-02

We report here the draft genome sequence of Tolypothrix campylonemoides VB511288, isolated from building facades in Santiniketan, India. The members of this genus produce several compounds of commercial importance. The draft assembly is 10,627,177 bases in 135 scaffolds, and it contains 7,886 protein-coding genes, 994 pseudogenes, 18 rRNA genes, and 76 tRNA genes. Copyright © 2015 Das et al.
Genome of a Low-Salinity Ammonia-Oxidizing Archaeon Determined by Single-Cell and Metagenomic Analysis

PubMed Central

Potanina, Anastasia; Francis, Christopher A.; Quake, Stephen R.

2011-01-01

Ammonia-oxidizing archaea (AOA) are thought to be among the most abundant microorganisms on Earth and may significantly impact the global nitrogen and carbon cycles. We sequenced the genome of AOA in an enrichment culture from low-salinity sediments in San Francisco Bay using single-cell and metagenomic genome sequence data. Five single cells were isolated inside an integrated microfluidic device using laser tweezers, the cells' genomic DNA was amplified by multiple displacement amplification (MDA) in 50 nL volumes and then sequenced by high-throughput DNA pyrosequencing. This microscopy-based approach to single-cell genomics minimizes contamination and allows correlation of high-resolution cell images with genomic sequences. Statistical properties of coverage across the five single cells, in combination with the contrasting properties of the metagenomic dataset allowed the assembly of a high-quality draft genome. The genome of this AOA, which we designate Candidatus Nitrosoarchaeum limnia SFB1, is ∼1.77 Mb with >2100 genes and a G+C content of 32%. Across the entire genome, the average nucleotide identity to Nitrosopumilus maritimus, the only AOA in pure culture, is ∼70%, suggesting this AOA represents a new genus of Crenarchaeota. Phylogenetically, the 16S rRNA and ammonia monooxygenase subunit A (amoA) genes of this AOA are most closely related to sequences reported from a wide variety of freshwater ecosystems. Like N. maritimus, the low-salinity AOA genome appears to have an ammonia oxidation pathway distinct from ammonia oxidizing bacteria (AOB). In contrast to other described AOA, these low-salinity AOA appear to be motile, based on the presence of numerous motility- and chemotaxis-associated genes in the genome. This genome data will be used to inform targeted physiological and metabolic studies of this novel group of AOA, which may ultimately advance our understanding of AOA metabolism and their impacts on the global carbon and nitrogen cycles. PMID:21364937
Genome of a low-salinity ammonia-oxidizing archaeon determined by single-cell and metagenomic analysis.

PubMed

Blainey, Paul C; Mosier, Annika C; Potanina, Anastasia; Francis, Christopher A; Quake, Stephen R

2011-02-22

Ammonia-oxidizing archaea (AOA) are thought to be among the most abundant microorganisms on Earth and may significantly impact the global nitrogen and carbon cycles. We sequenced the genome of AOA in an enrichment culture from low-salinity sediments in San Francisco Bay using single-cell and metagenomic genome sequence data. Five single cells were isolated inside an integrated microfluidic device using laser tweezers, the cells' genomic DNA was amplified by multiple displacement amplification (MDA) in 50 nL volumes and then sequenced by high-throughput DNA pyrosequencing. This microscopy-based approach to single-cell genomics minimizes contamination and allows correlation of high-resolution cell images with genomic sequences. Statistical properties of coverage across the five single cells, in combination with the contrasting properties of the metagenomic dataset allowed the assembly of a high-quality draft genome. The genome of this AOA, which we designate Candidatus Nitrosoarchaeum limnia SFB1, is ∼1.77 Mb with >2100 genes and a G+C content of 32%. Across the entire genome, the average nucleotide identity to Nitrosopumilus maritimus, the only AOA in pure culture, is ∼70%, suggesting this AOA represents a new genus of Crenarchaeota. Phylogenetically, the 16S rRNA and ammonia monooxygenase subunit A (amoA) genes of this AOA are most closely related to sequences reported from a wide variety of freshwater ecosystems. Like N. maritimus, the low-salinity AOA genome appears to have an ammonia oxidation pathway distinct from ammonia oxidizing bacteria (AOB). In contrast to other described AOA, these low-salinity AOA appear to be motile, based on the presence of numerous motility- and chemotaxis-associated genes in the genome. This genome data will be used to inform targeted physiological and metabolic studies of this novel group of AOA, which may ultimately advance our understanding of AOA metabolism and their impacts on the global carbon and nitrogen cycles.
Draft Genome Sequence of Gordonia sp. Strain UCD-TK1 (Phylum Actinobacteria)

PubMed Central

Koenigsaecker, Tynisha M.; Coil, David A.

2016-01-01

Here, we present the draft genome of Gordonia sp. strain UCD-TK1. The assembly contains 5,470,576 bp in 98 contigs. This strain was isolated from a disinfected ambulatory surgery center. PMID:27738036
Draft Genome Sequence of Bacillus altitudinis YNP4-TSU, Isolated from Yellowstone National Park

PubMed Central

OHair, Joshua A.; Li, Hui; Thapa, Santosh; Scholz, Matthew

2017-01-01

ABSTRACT Undisturbed hot springs inside Yellowstone National Park remain a dynamic biome for novel cellulolytic thermophiles. We report here the draft genome sequence of one of these isolates, Bacillus altitudinis YNP4-TSU. PMID:28705979
Draft genome sequence of Xylella fastidiosa subsp. fastidiosa strain Stag’s Leap

USDA-ARS?s Scientific Manuscript database

Xylella fastidiosa subsp. fastidiosa causes Pierce’s disease of grapevine. Presented here is the draft genome sequence of the Stag’s Leap strain, previously used in pathogenicity/virulence assays to evaluate grapevine germplasm bearing Pierce’s disease....
Draft Genome Sequence of Sphingobium fuliginis OMI, a Bacterium That Degrades Alkylphenols and Bisphenols.

PubMed

Kuroda, Masashi; Ogata, Yuka; Yahara, Tatsuya; Yokoyama, Takashi; Ishizawa, Hidehiro; Takada, Kazuki; Inoue, Daisuke; Sei, Kazunari; Ike, Michihiko

2017-11-22

Sphingobium fuliginis OMI is a bacterium that can degrade a variety of recalcitrant alkylphenols and bisphenols. This study reports the draft genome sequence of S. fuliginis OMI. Copyright © 2017 Kuroda et al.
Draft Genome Sequence of Janthinobacterium sp. Strain ROICE36, a Putative Secondary Metabolite-Synthesizing Bacterium Isolated from Antarctic Snow

PubMed Central

Chiriac, Cecilia; Baricz, Andreea

2018-01-01

ABSTRACT The draft genome assembly of Janthinobacterium sp. strain ROICE36 has 207 contigs, with a total genome size of 5,977,006 bp and a G+C content of 62%. Preliminary genome analysis identified 5,363 protein-coding genes and a total of 7 secondary metabolic gene clusters (encoding bacteriocins, nonribosomal peptide-synthetase [NRPS], terpene, hserlactone, and other ketide synthases). PMID:29650588
Draft Genome Sequence of Leptolyngbya sp. KIOST-1, a Filamentous Cyanobacterium with Biotechnological Potential for Alimentary Purposes.

PubMed

Kim, Ji Hyung; Kang, Do-Hyung

2016-09-15

Here, we report the draft genome of cyanobacterium Leptolyngbya sp. KIOST-1 isolated from a microalgal culture pond in South Korea. The genome consists of 13 contigs containing 6,320,172 bp, and a total of 5,327 coding sequences were predicted. This genomic information will allow further exploitation of its biotechnological potential for alimentary purposes. Copyright © 2016 Kim and Kang.
The draft genome of sweet orange (Citrus sinensis).

PubMed

Xu, Qiang; Chen, Ling-Ling; Ruan, Xiaoan; Chen, Dijun; Zhu, Andan; Chen, Chunli; Bertrand, Denis; Jiao, Wen-Biao; Hao, Bao-Hai; Lyon, Matthew P; Chen, Jiongjiong; Gao, Song; Xing, Feng; Lan, Hong; Chang, Ji-Wei; Ge, Xianhong; Lei, Yang; Hu, Qun; Miao, Yin; Wang, Lun; Xiao, Shixin; Biswas, Manosh Kumar; Zeng, Wenfang; Guo, Fei; Cao, Hongbo; Yang, Xiaoming; Xu, Xi-Wen; Cheng, Yun-Jiang; Xu, Juan; Liu, Ji-Hong; Luo, Oscar Junhong; Tang, Zhonghui; Guo, Wen-Wu; Kuang, Hanhui; Zhang, Hong-Yu; Roose, Mikeal L; Nagarajan, Niranjan; Deng, Xiu-Xin; Ruan, Yijun

2013-01-01

Oranges are an important nutritional source for human health and have immense economic value. Here we present a comprehensive analysis of the draft genome of sweet orange (Citrus sinensis). The assembled sequence covers 87.3% of the estimated orange genome, which is relatively compact, as 20% is composed of repetitive elements. We predicted 29,445 protein-coding genes, half of which are in the heterozygous state. With additional sequencing of two more citrus species and comparative analyses of seven citrus genomes, we present evidence to suggest that sweet orange originated from a backcross hybrid between pummelo and mandarin. Focused analysis on genes involved in vitamin C metabolism showed that GalUR, encoding the rate-limiting enzyme of the galacturonate pathway, is significantly upregulated in orange fruit, and the recent expansion of this gene family may provide a genomic basis. This draft genome represents a valuable resource for understanding and improving many important citrus traits in the future.
Draft Genome Sequence of Limnobacter sp. Strain CACIAM 66H1, a Heterotrophic Bacterium Associated with Cyanobacteria.

PubMed

da Silva, Fábio Daniel Florêncio; Lima, Alex Ranieri Jerônimo; Moraes, Pablo Henrique Gonçalves; Siqueira, Andrei Santos; Dall'Agnol, Leonardo Teixeira; Baraúna, Anna Rafaella Ferreira; Martins, Luisa Carício; Oliveira, Karol Guimarães; de Lima, Clayton Pereira Silva; Nunes, Márcio Roberto Teixeira; Vianez-Júnior, João Lídio Silva Gonçalves; Gonçalves, Evonnildo Costa

2016-05-19

Ecological interactions between cyanobacteria and heterotrophic prokaryotes are poorly known. To improve the genomic studies of heterotrophic bacterium-cyanobacterium associations, the draft genome sequence (3.2 Mbp) of Limnobacter sp. strain CACIAM 66H1, found in a nonaxenic culture of Synechococcus sp. (cyanobacteria), is presented here. Copyright © 2016 da Silva et al.
Draft Genome Sequence of Acinetobacter calcoaceticus Strain P23, a Plant Growth-Promoting Bacterium of Duckweed

PubMed Central

Hosoyama, Akira; Yamazoe, Atsushi; Morikawa, Masaaki

2015-01-01

Acinetobacter calcoaceticus strain P23 is a plant growth-promoting bacterium, which was isolated from the surface of duckweed. We report here the draft genome sequence of strain P23. The genome data will serve as a valuable reference for understanding the molecular mechanism of plant growth promotion in aquatic plants. PMID:25720680
Draft Genome Sequence of Microbacterium sp. Strain UCD-TDU (Phylum Actinobacteria)

PubMed Central

Bendiks, Zachary A.; Lang, Jenna M.; Darling, Aaron E.; Coil, David A.

2013-01-01

Here, we present the draft genome sequence of Microbacterium sp. strain UCD-TDU, a member of the phylum Actinobacteria. The assembly contains 3,746,321 bp (in 8 scaffolds). This strain was isolated from a residential toilet as part of an undergraduate student research project to sequence reference genomes of microbes from the built environment. PMID:23516225
A draft whole genome sequence of “Candidatus Liberibacter asiaticus” strain TX2351 from Asian citrus psyllids in Texas, USA

USDA-ARS?s Scientific Manuscript database

The draft genome sequence of “Candidatus Liberibacter asiaticus” strain TX2351 collected from ACP in South Texas has been determined. The TX2351 genome is 1,252,043 bp in size with a 36.5% G+C content, encoding 1,184 predicted open reading frames and 51 RNA genes....
Draft Genome Sequence of Lactobacillus paracasei DmW181, a Bacterium Isolated from Wild Drosophila.

PubMed

Hammer, Austin J; Walters, Amber; Carroll, Courtney; Newell, Peter D; Chaston, John M

2017-07-06

The draft genome sequence of Lactobacillus paracasei DmW181, an anaerobic bacterium isolate from wild Drosophila flies, is reported here. Strain DmW181 possesses genes for sialic acid and mannose metabolism. The assembled genome is 3,201,429 bp, with 3,454 predicted genes. Copyright © 2017 Hammer et al.
Draft Genome Sequence of Methanohalophilus mahii Strain DAL1 Reconstructed from a Hydraulic Fracturing-Produced Water Metagenome

PubMed Central

Lipus, Daniel; Vikram, Amit

2016-01-01

We report here the 1,882,100-bp draft genome sequence of Methanohalophilus mahii strain DAL1, recovered from Marcellus Shale hydraulic fracturing-produced water using metagenomic contig binning. Genome annotation revealed several key methanogenesis genes and provides valuable information on archaeal activity associated with hydraulic fracturing-produced water environments. PMID:27587817
Draft Genome Sequence of Pseudomonas sp. BDAL1 Reconstructed from a Bakken Shale Hydraulic Fracturing-Produced Water Storage Tank Metagenome

PubMed Central

Lipus, Daniel; Ross, Daniel

2017-01-01

ABSTRACT We report the 5,425,832 bp draft genome of Pseudomonas sp. strain BDAL1, recovered from a Bakken shale hydraulic fracturing-produced water tank metagenome. Genome annotation revealed several key biofilm formation genes and osmotic stress response mechanisms necessary for survival in hydraulic fracturing-produced water. PMID:28302780
Draft Genome Sequence of Brevibacterium linens AE038-8, an Extremely Arsenic-Resistant Bacterium

DOE Office of Scientific and Technical Information (OSTI.GOV)

Maizel, Daniela; Utturkar, Sagar M.; Brown, Steven D.

To understand the arsenic biogeocycles in the groundwaters at Tucumán, Argentina, we isolated Brevibacterium linens sp. strain AE38-8, obtained from arsenic-contaminated well water. This strain is extremely resistant to arsenicals and has arsenic resistance (ars) genes in its genome. Here, we report the draft genome sequence of B. linens AE38-8.
Draft Genome Sequence of Brevibacterium linens AE038-8, an Extremely Arsenic-Resistant Bacterium

DOE PAGES

Maizel, Daniela; Utturkar, Sagar M.; Brown, Steven D.; ...

2015-04-16

To understand the arsenic biogeocycles in the groundwaters at Tucumán, Argentina, we isolated Brevibacterium linens sp. strain AE38-8, obtained from arsenic-contaminated well water. This strain is extremely resistant to arsenicals and has arsenic resistance (ars) genes in its genome. Here, we report the draft genome sequence of B. linens AE38-8.

Permanent Improved High-Quality Draft Genome Sequence of Nocardia casuarinae Strain BMG51109, an Endophyte of Actinorhizal Root Nodules of Casuarina glauca

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ghodhbane-Gtari, Faten; Beauchemin, Nicholas; Louati, Moussa

Here, we report the first genome sequence of a Nocardia plant endophyte, N. casuarinae strain BMG51109, isolated from Casuarina glauca root nodules. The improved high-quality draft genome sequence contains 8,787,999 bp with a 68.90% GC content and 7,307 predicted protein-coding genes.
Draft Genome Sequence of a Violacein-Producing Iodobacter sp. from the Hudson Valley Watershed

PubMed Central

Doing, Georgia

2018-01-01

ABSTRACT Iodobacter species are among a number of freshwater Gram-negative violacein-producing bacteria. Janthinobacterium lividum and Chromobacterium violaceum have had their whole genomes sequenced and annotated. This is the first report of a draft whole-genome sequence of a violacein-producing Iodobacter strain that was isolated from the Hudson Valley watershed. PMID:29301892
Draft Genome Sequence and Description of Janthinobacterium sp. Strain CG3, a Psychrotolerant Antarctic Supraglacial Stream Bacterium

PubMed Central

Smith, Heidi; Akiyama, Tatsuya; Franklin, Michael; Woyke, Tanja; Teshima, Hazuki; Davenport, Karen; Daligault, Hajnalka; Erkkila, Tracy; Goodwin, Lynne; Gu, Wei; Xu, Yan; Chain, Patrick

2013-01-01

Here we present the draft genome sequence of Janthinobacterium sp. strain CG3, a psychrotolerant non-violacein-producing bacterium that was isolated from the Cotton Glacier supraglacial stream. The genome sequence of this organism will provide insight as to the mechanisms necessary for bacteria to survive in UV-stressed icy environments. PMID:24265494
Draft Genome Sequence of a Violacein-Producing Iodobacter sp. from the Hudson Valley Watershed.

PubMed

Doing, Georgia; Perron, Gabriel G; Jude, Brooke A

2018-01-04

Iodobacter species are among a number of freshwater Gram-negative violacein-producing bacteria. Janthinobacterium lividum and Chromobacterium violaceum have had their whole genomes sequenced and annotated. This is the first report of a draft whole-genome sequence of a violacein-producing Iodobacter strain that was isolated from the Hudson Valley watershed. Copyright © 2018 Doing et al.
Draft genome sequence of Sulfurospirillum sp. strain MES, reconstructed from the metagenome of a microbial electrosynthesis system

DOE PAGES

Ross, Daniel E.; Marshall, Christopher W.; May, Harold D.; ...

2015-01-15

A draft genome of Sulfurospirillum sp. strain MES was isolated through taxonomic binning of a metagenome sequenced from a microbial electrosynthesis system (MES) actively producing acetate and hydrogen. The genome contains the nosZDFLY genes, which are involved in nitrous oxide reduction, suggesting the potential role of this strain in denitrification.
Draft Genome Sequences of Two Mycobacterium bovis Strains Isolated from Beef Cattle in Paraguay

PubMed Central

Sanabria, Lidia; Lagrave, Lorena; Nishibe, Christiane; Ribas, Augusto C. A.; Zumárraga, Martín J.; Araújo, Flábio R.

2017-01-01

ABSTRACT This work reports the draft genome sequences of the Mycobacterium bovis strains M1009 and M1010, isolated from the lymph nodes of two infected cows on a beef farm in Paraguay. Comparative genomics between these strains and other regional strains may provide more insights regarding M. bovis epidemiology in South America. PMID:28705977
Permanent Improved High-Quality Draft Genome Sequence of Nocardia casuarinae Strain BMG51109, an Endophyte of Actinorhizal Root Nodules of Casuarina glauca

DOE PAGES

Ghodhbane-Gtari, Faten; Beauchemin, Nicholas; Louati, Moussa; ...

2016-08-04

Here, we report the first genome sequence of a Nocardia plant endophyte, N. casuarinae strain BMG51109, isolated from Casuarina glauca root nodules. The improved high-quality draft genome sequence contains 8,787,999 bp with a 68.90% GC content and 7,307 predicted protein-coding genes.
Draft Genome Sequence of Methylobacterium radiotolerans Strain MAMP 4754, a Bacterial Endophyte Isolated from Combretum erythrophyllum in South Africa

PubMed Central

Photolo, Mampolelo M.; Mavumengwana, Vuyo; Serepa-Dlamini, Mahloro H.

2017-01-01

ABSTRACT We announce here the draft genome sequence of Methylobacterium radiotolerans strain MAMP 4754, isolated from the roots of the medicinal plant Combretum erythrophyllum. M. radiotolerans has a genome size of 7,389,282 bp with 7,166 genes and a G+C content of 70.5%. PMID:28982992
SSPACE-LongRead: scaffolding bacterial draft genomes using long read sequence information

PubMed Central

2014-01-01

Background The recent introduction of the Pacific Biosciences RS single molecule sequencing technology has opened new doors to scaffolding genome assemblies in a cost-effective manner. The long read sequence information is promised to enhance the quality of incomplete and inaccurate draft assemblies constructed from Next Generation Sequencing (NGS) data. Results Here we propose a novel hybrid assembly methodology that aims to scaffold pre-assembled contigs in an iterative manner using PacBio RS long read information as a backbone. On a test set comprising six bacterial draft genomes, assembled using either a single Illumina MiSeq or Roche 454 library, we show that even a 50× coverage of uncorrected PacBio RS long reads is sufficient to drastically reduce the number of contigs. Comparisons to the AHA scaffolder indicate our strategy is better capable of producing (nearly) complete bacterial genomes. Conclusions The current work describes our SSPACE-LongRead software which is designed to upgrade incomplete draft genomes using single molecule sequences. We conclude that the recent advances of the PacBio sequencing technology and chemistry, in combination with the limited computational resources required to run our program, allow to scaffold genomes in a fast and reliable manner. PMID:24950923
Improved genomic resources and new bioinformatic workflow for the carcinogenic parasite Clonorchis sinensis: Biotechnological implications.

PubMed

Wang, Daxi; Korhonen, Pasi K; Gasser, Robin B; Young, Neil D

Clonorchis sinensis (family Opisthorchiidae) is an important foodborne parasite that has a major socioeconomic impact on ~35 million people predominantly in China, Vietnam, Korea and the Russian Far East. In humans, infection with C. sinensis causes clonorchiasis, a complex hepatobiliary disease that can induce cholangiocarcinoma (CCA), a malignant cancer of the bile ducts. Central to understanding the epidemiology of this disease is knowledge of genetic variation within and among populations of this parasite. Although most published molecular studies seem to suggest that C. sinensis represents a single species, evidence of karyotypic variation within C. sinensis and cryptic species within a related opisthorchiid fluke (Opisthorchis viverrini) emphasise the importance of studying and comparing the genes and genomes of geographically distinct isolates of C. sinensis. Recently, we sequenced, assembled and characterised a draft nuclear genome of a C. sinensis isolate from Korea and compared it with a published draft genome of a Chinese isolate of this species using a bioinformatic workflow established for comparing draft genome assemblies and their gene annotations. We identified that 50.6% and 51.3% of the Korean and Chinese C. sinensis genomic scaffolds were syntenic, respectively. Within aligned syntenic blocks, the genomes had a high level of nucleotide identity (99.1%) and encoded 15 variable proteins likely to be involved in diverse biological processes. Here, we review current technical challenges of using draft genome assemblies to undertake comparative genomic analyses to quantify genetic variation between isolates of the same species. Using a workflow that overcomes these challenges, we report on a high-quality draft genome for C. sinensis from Korea and comparative genomic analyses, as a basis for future investigations of the genetic structures of C. sinensis populations, and discuss the biotechnological implications of these explorations. Copyright © 2018 Elsevier Inc. All rights reserved.
Draft Genome Sequence of Magnesium-Dissolving Lactococcus garvieae A1, Isolated from Soil

PubMed Central

Altın, Gonca; Şahin, Fikrettin

2017-01-01

ABSTRACT The probiotic bacterium Lactococcus garvieae A1, isolated from soil, is interesting for biomining applications. Here, we report the draft genome sequence and annotation of this strain, with a focus on metal transporter enzymes. PMID:28546485
Draft Genome Sequence of Enterococcus hirae Strain INF E1 Isolated from Cultured Milk.

PubMed

Porcellato, Davide; Ostlie, Hilde M; Skeie, Siv B

2014-07-17

Here, we present the draft genome of Enterococcus hirae INF E1, found as a contaminant in cultured milk and studied for its ability to metabolize milk fat globule membrane glycoconjugates. Copyright © 2014 Porcellato et al.
Draft Genome Sequence of Herbaspirillum lusitanum P6-12, an Endophyte Isolated from Root Nodules of Phaseolus vulgaris

PubMed Central

Weiss, Vinícius Almir; Faoro, Helisson; Tadra-Sfeir, Michelle Zibbetti; Raittz, Roberto Tadeu; de Souza, Emanuel Maltempi; Monteiro, Rose Adele; Cardoso, Rodrigo Luis Alves; Wassem, Roseli; Chubatsu, Leda Satie; Huergo, Luciano Fernandes; Müller-Santos, Marcelo; Steffens, Maria Berenice Reynaud; Rigo, Liu Un; Pedrosa, Fábio de Oliveira

2012-01-01

Herbaspirillum lusitanum strain P6-12 (DSM 17154) is, so far, the only species of Herbaspirillum isolated from plant root nodules. Here we report a draft genome sequence of this organism. PMID:22815451
Draft Genome of Rhodococcus rhodochrous TRN7, Isolated from the Coast of Trindade Island, Brazil.

PubMed

Rodrigues, Edmo M; Pylro, Victor S; Dobbler, Priscila T; Victoria, Filipe; Roesch, Luiz F W; Tótola, Marcos R

2016-03-03

Here, we present a draft genome and annotation of Rhodococcus rhodochrous TRN7, isolated from Trindade Island, Brazil, which will provide genetic data to benefit the understanding of its metabolism. Copyright © 2016 Rodrigues et al.
MAIZEGDB.ORG, the Maize Genetics Cooperation and the 2500 MB B73 Genome-Generated Tsunami

USDA-ARS?s Scientific Manuscript database

Advances in sequencing technology have made it possible to sequence the 2500 MB B73 maize genome, both cheaply and in a relatively short time. Nearly simultaneously, other sequencing-based data are on the leading edge of a data tsunami: sequenced differences (currently >300,000 SNP for >1000 inbre...
An Approach to Using Toxicogenomic Data in US EPA Human ...

EPA Pesticide Factsheets

This draft report is a description of an approach to evaluate genomic data for use in risk assessment and a case study to illustrate the approach. The dibutyl phthalate (DBP) case study example focuses on male reproductive developmental effects and the qualitative application of the available genomic data. The case study presented in this draft document is a separate activity from any of the ongoing IRIS human health assessments for the phthalates. This draft report is a description of an approach to evaluate genomic data for use in risk assessment and a case study to illustrate the approach. The dibutyl phthalate (DBP) case study example focuses on male reproductive developmental effects and the qualitative application of the available genomic data.
Whole genome comparison between table and wine grapes reveals a comprehensive catalog of structural variants

PubMed Central

2014-01-01

Background Grapevine (Vitis vinifera L.) is the most important Mediterranean fruit crop, used to produce both wine and spirits as well as table grape and raisins. Wine and table grape cultivars represent two divergent germplasm pools with different origins and domestication history, as well as differential characteristics for berry size, cluster architecture and berry chemical profile, among others. ‘Sultanina’ plays a pivotal role in modern table grape breeding providing the main source of seedlessness. This cultivar is also one of the most planted for fresh consumption and raisins production. Given its importance, we sequenced it and implemented a novel strategy for the de novo assembly of its highly heterozygous genome. Results Our approach produced a draft genome of 466 Mb, recovering 82% of the genes present in the grapevine reference genome; in addition, we identified 240 novel genes. A large number of structural variants and SNPs were identified. Among them, 45 (21 SNPs and 24 INDELs) were experimentally confirmed in ‘Sultanina’ and six SNPs in other 23 table grape varieties. Transposable elements corresponded to ca. 80% of the repetitive sequences involved in structural variants and more than 2,000 genes were affected in their structure by these variants. Some of these genes are likely involved in embryo development, suggesting that they may contribute to seedlessness, a key trait for table grapes. Conclusions This work produced the first structural variants and SNPs catalog for grapevine, constituting a novel and very powerful tool for genomic studies in this key fruit crop, particularly useful to support marker assisted breeding in table grapes. PMID:24397443
Draft genome of the reindeer (Rangifer tarandus).

PubMed

Li, Zhipeng; Lin, Zeshan; Ba, Hengxing; Chen, Lei; Yang, Yongzhi; Wang, Kun; Qiu, Qiang; Wang, Wen; Li, Guangyu

2017-12-01

The reindeer (Rangifer tarandus) is the only fully domesticated species in the Cervidae family, and it is the only cervid with a circumpolar distribution. Unlike all other cervids, female reindeer, as well as males, regularly grow cranial appendages (antlers, the defining characteristics of cervids). Moreover, reindeer milk contains more protein and less lactose than bovids' milk. A high-quality reference genome of this species will assist efforts to elucidate these and other important features in the reindeer. We obtained 615 Gb (Gigabase) of usable sequences by filtering the low-quality reads of the raw data generated from the Illumina Hiseq 4000 platform, and a 2.64-Gb final assembly, representing 95.7% of the estimated genome (2.76 Gb according to k-mer analysis), including 92.6% of expected genes according to BUSCO analysis. The contig N50 and scaffold N50 sizes were 89.7 kilo base (kb) and 0.94 mega base (Mb), respectively. We annotated 21 555 protein-coding genes and 1.07 Gb of repetitive sequences by de novo and homology-based prediction. Homology-based searches detected 159 rRNA, 547 miRNA, 1339 snRNA, and 863 tRNA sequences in the genome of R. tarandus. The divergence time between R. tarandus and ancestors of Bos taurus and Capra hircus is estimated to be about 29.5 million years ago. Our results provide the first high-quality reference genome for the reindeer and a valuable resource for studying the evolution, domestication, and other unusual characteristics of the reindeer. © The Authors 2017. Published by Oxford University Press.
A Single Molecule Scaffold for the Maize Genome

PubMed Central

Zhou, Shiguo; Wei, Fusheng; Nguyen, John; Bechner, Mike; Potamousis, Konstantinos; Goldstein, Steve; Pape, Louise; Mehan, Michael R.; Churas, Chris; Pasternak, Shiran; Forrest, Dan K.; Wise, Roger; Ware, Doreen; Wing, Rod A.; Waterman, Michael S.; Livny, Miron; Schwartz, David C.

2009-01-01

About 85% of the maize genome consists of highly repetitive sequences that are interspersed by low-copy, gene-coding sequences. The maize community has dealt with this genomic complexity by the construction of an integrated genetic and physical map (iMap), but this resource alone was not sufficient for ensuring the quality of the current sequence build. For this purpose, we constructed a genome-wide, high-resolution optical map of the maize inbred line B73 genome containing >91,000 restriction sites (averaging 1 site/∼23 kb) accrued from mapping genomic DNA molecules. Our optical map comprises 66 contigs, averaging 31.88 Mb in size and spanning 91.5% (2,103.93 Mb/∼2,300 Mb) of the maize genome. A new algorithm was created that considered both optical map and unfinished BAC sequence data for placing 60/66 (2,032.42 Mb) optical map contigs onto the maize iMap. The alignment of optical maps against numerous data sources yielded comprehensive results that proved revealing and productive. For example, gaps were uncovered and characterized within the iMap, the FPC (fingerprinted contigs) map, and the chromosome-wide pseudomolecules. Such alignments also suggested amended placements of FPC contigs on the maize genetic map and proactively guided the assembly of chromosome-wide pseudomolecules, especially within complex genomic regions. Lastly, we think that the full integration of B73 optical maps with the maize iMap would greatly facilitate maize sequence finishing efforts that would make it a valuable reference for comparative studies among cereals, or other maize inbred lines and cultivars. PMID:19936062
Draft genome sequence of marine alphaproteobacterial strain HIMB11, the first cultivated representative of a unique lineage within the Roseobacter clade possessing an unusually small genome

PubMed Central

Durham, Bryndan P.; Grote, Jana; Whittaker, Kerry A.; Bender, Sara J.; Luo, Haiwei; Grim, Sharon L.; Brown, Julia M.; Casey, John R.; Dron, Antony; Florez-Leiva, Lennin; Krupke, Andreas; Luria, Catherine M.; Mine, Aric H.; Nigro, Olivia D.; Pather, Santhiska; Talarmin, Agathe; Wear, Emma K.; Weber, Thomas S.; Wilson, Jesse M.; Church, Matthew J.; DeLong, Edward F.; Karl, David M.; Steward, Grieg F.; Eppley, John M.; Kyrpides, Nikos C.; Schuster, Stephan; Rappé, Michael S.

2014-01-01

Strain HIMB11 is a planktonic marine bacterium isolated from coastal seawater in Kaneohe Bay, Oahu, Hawaii belonging to the ubiquitous and versatile Roseobacter clade of the alphaproteobacterial family Rhodobacteraceae. Here we describe the preliminary characteristics of strain HIMB11, including annotation of the draft genome sequence and comparative genomic analysis with other members of the Roseobacter lineage. The 3,098,747 bp draft genome is arranged in 34 contigs and contains 3,183 protein-coding genes and 54 RNA genes. Phylogenomic and 16S rRNA gene analyses indicate that HIMB11 represents a unique sublineage within the Roseobacter clade. Comparison with other publicly available genome sequences from members of the Roseobacter lineage reveals that strain HIMB11 has the genomic potential to utilize a wide variety of energy sources (e.g. organic matter, reduced inorganic sulfur, light, carbon monoxide), while possessing a reduced number of substrate transporters. PMID:25197450

Draft genome sequence of marine alphaproteobacterial strain HIMB11, the first cultivated representative of a unique lineage within the Roseobacter clade possessing an unusually small genome.

PubMed

Durham, Bryndan P; Grote, Jana; Whittaker, Kerry A; Bender, Sara J; Luo, Haiwei; Grim, Sharon L; Brown, Julia M; Casey, John R; Dron, Antony; Florez-Leiva, Lennin; Krupke, Andreas; Luria, Catherine M; Mine, Aric H; Nigro, Olivia D; Pather, Santhiska; Talarmin, Agathe; Wear, Emma K; Weber, Thomas S; Wilson, Jesse M; Church, Matthew J; DeLong, Edward F; Karl, David M; Steward, Grieg F; Eppley, John M; Kyrpides, Nikos C; Schuster, Stephan; Rappé, Michael S

2014-06-15

Strain HIMB11 is a planktonic marine bacterium isolated from coastal seawater in Kaneohe Bay, Oahu, Hawaii belonging to the ubiquitous and versatile Roseobacter clade of the alphaproteobacterial family Rhodobacteraceae. Here we describe the preliminary characteristics of strain HIMB11, including annotation of the draft genome sequence and comparative genomic analysis with other members of the Roseobacter lineage. The 3,098,747 bp draft genome is arranged in 34 contigs and contains 3,183 protein-coding genes and 54 RNA genes. Phylogenomic and 16S rRNA gene analyses indicate that HIMB11 represents a unique sublineage within the Roseobacter clade. Comparison with other publicly available genome sequences from members of the Roseobacter lineage reveals that strain HIMB11 has the genomic potential to utilize a wide variety of energy sources (e.g. organic matter, reduced inorganic sulfur, light, carbon monoxide), while possessing a reduced number of substrate transporters.
Draft Genome Sequence of Leuconostoc mesenteroides 406 Isolated from the Traditional Fermented Mare Milk Airag in Tuv Aimag, Mongolia

PubMed Central

Toh, Hidehiro; Oshima, Kenshiro; Nakano, Akiyo; Hano, Chihiro; Yoshida, Saki; Nguyen, Tien Thi Thuy; Wulijideligen; Tashiro, Kosuke; Arakawa, Kensuke; Miyamoto, Taku

2016-01-01

Leuconostoc mesenteroides 406 was isolated from the traditional fermented mare milk airag in Tuv Aimag, Mongolia. This strain produces an antilisterial bacteriocin. Here, we report the draft genome sequence of this organism. PMID:27013047
The draft genome of a diploid cotton Gossypium raimondii

USDA-ARS?s Scientific Manuscript database

We have sequenced and assembled the draft genome of Gossypium raimondii, whose progenitor is considered the contributor of the D-subgenome to the economically important natural textile fiber producer, G. hirsutum. Next-generation Illumina pair-end (PE) sequencing strategies were employed to obtain ...
Draft Genome Sequence of Komagataeibacter rhaeticus Strain AF1, a High Producer of Cellulose, Isolated from Kombucha Tea.

PubMed

Dos Santos, Renato Augusto Corrêa; Berretta, Andresa A; Barud, Hernane da Silva; Ribeiro, Sidney José Lima; González-García, Laura Natalia; Zucchi, Tiago Domingues; Goldman, Gustavo H; Riaño-Pachón, Diego M

2014-07-24

Here, we present the draft genome sequence of Komagatabaeicter rhaeticus strain AF1, which was isolated from Kombucha tea and is capable of producing high levels of cellulose. Copyright © 2014 dos Santos et al.
Draft Genome Sequence of Catellicoccus marimammalium, a Novel Species Commonly Found in Gull Feces

EPA Science Inventory

Catellicoccus marimammalium is a relatively uncharacterized Gram-positive, facultative anaerobe with potential utility as an indicator of waterfowl fecal contamination. Here we report an annotated draft genome sequence that suggests this organism may be a symbiotic gut microbe.
Draft genome sequence of the phenazine-producing Pseudomonas fluorescens strain 2-79

USDA-ARS?s Scientific Manuscript database

Pseudomonas fluorescens strain 2-79, a natural isolate of the rhizosphere of wheat (Triticum aestivum L.), possesses antagonistic potential toward several fungal pathogens. We report the draft genome sequence of strain 2-79, which comprises 5,674 protein-coding sequences....
Toward Universal Forward Genetics: Using a Draft Genome Sequence of the Nematode Oscheius tipulae To Identify Mutations Affecting Vulva Development

PubMed Central

Besnard, Fabrice; Koutsovoulos, Georgios; Dieudonné, Sana; Blaxter, Mark; Félix, Marie-Anne

2017-01-01

Mapping-by-sequencing has become a standard method to map and identify phenotype-causing mutations in model species. Here, we show that a fragmented draft assembly is sufficient to perform mapping-by-sequencing in nonmodel species. We generated a draft assembly and annotation of the genome of the free-living nematode Oscheius tipulae, a distant relative of the model Caenorhabditis elegans. We used this draft to identify the likely causative mutations at the O. tipulae cov-3 locus, which affect vulval development. The cov-3 locus encodes the O. tipulae ortholog of C. elegans mig-13, and we further show that Cel-mig-13 mutants also have an unsuspected vulval-development phenotype. In a virtuous circle, we were able to use the linkage information collected during mutant mapping to improve the genome assembly. These results showcase the promise of genome-enabled forward genetics in nonmodel species. PMID:28630114
Toward Universal Forward Genetics: Using a Draft Genome Sequence of the Nematode Oscheius tipulae To Identify Mutations Affecting Vulva Development.

PubMed

Besnard, Fabrice; Koutsovoulos, Georgios; Dieudonné, Sana; Blaxter, Mark; Félix, Marie-Anne

2017-08-01

Mapping-by-sequencing has become a standard method to map and identify phenotype-causing mutations in model species. Here, we show that a fragmented draft assembly is sufficient to perform mapping-by-sequencing in nonmodel species. We generated a draft assembly and annotation of the genome of the free-living nematode Oscheius tipulae , a distant relative of the model Caenorhabditis elegans We used this draft to identify the likely causative mutations at the O. tipulae cov -3 locus, which affect vulval development. The cov-3 locus encodes the O. tipulae ortholog of C. elegans mig-13 , and we further show that Cel-mig-13 mutants also have an unsuspected vulval-development phenotype. In a virtuous circle, we were able to use the linkage information collected during mutant mapping to improve the genome assembly. These results showcase the promise of genome-enabled forward genetics in nonmodel species. Copyright © 2017 by the Genetics Society of America.
Assembly of the draft genome of buckwheat and its applications in identifying agronomically useful genes.

PubMed

Yasui, Yasuo; Hirakawa, Hideki; Ueno, Mariko; Matsui, Katsuhiro; Katsube-Tanaka, Tomoyuki; Yang, Soo Jung; Aii, Jotaro; Sato, Shingo; Mori, Masashi

2016-06-01

Buckwheat (Fagopyrum esculentum Moench; 2n = 2x = 16) is a nutritionally dense annual crop widely grown in temperate zones. To accelerate molecular breeding programmes of this important crop, we generated a draft assembly of the buckwheat genome using short reads obtained by next-generation sequencing (NGS), and constructed the Buckwheat Genome DataBase. After assembling short reads, we determined 387,594 scaffolds as the draft genome sequence (FES_r1.0). The total length of FES_r1.0 was 1,177,687,305 bp, and the N50 of the scaffolds was 25,109 bp. Gene prediction analysis revealed 286,768 coding sequences (CDSs; FES_r1.0_cds) including those related to transposable elements. The total length of FES_r1.0_cds was 212,917,911 bp, and the N50 was 1,101 bp. Of these, the functions of 35,816 CDSs excluding those for transposable elements were annotated by BLAST analysis. To demonstrate the utility of the database, we conducted several test analyses using BLAST and keyword searches. Furthermore, we used the draft genome as a reference sequence for NGS-based markers, and successfully identified novel candidate genes controlling heteromorphic self-incompatibility of buckwheat. The database and draft genome sequence provide a valuable resource that can be used in efforts to develop buckwheat cultivars with superior agronomic traits. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Draft Genome Sequence of Thermus scotoductus Strain K1, Isolated from a Geothermal Spring in Karvachar, Nagorno Karabakh

PubMed Central

Saghatelyan, Ani; Poghosyan, Lianna

2015-01-01

The 2,379,636-bp draft genome sequence of Thermus scotoductus strain K1, isolated from geothermal spring outlet located in the Karvachar region in Nagorno Karabakh is presented. Strain K1 shares about 80% genome sequence similarity with T. scotoductus strain SA-01, recovered from a deep gold mine in South Africa. PMID:26564055
Draft genome sequence of marine Streptomyces sp. strain W007, which produces angucyclinone antibiotics with a benz[a]anthracene skeleton.

PubMed

Qin, Song; Zhang, Hongyu; Li, Fuchao; Zhu, Benwei; Zheng, Huajun

2012-03-01

A series of angucyclinone antibiotics have been isolated from marine Streptomyces sp. strain W007 and identified. Here, a draft genome sequence of Streptomyces sp. W007 is presented. The genome contains an intact biosynthetic gene cluster for angucyclinone antibiotics, which provides insight into the combinatorial biosynthesis of angucyclinone antibiotics produced by marine streptomycetes.
Draft Genome Sequence of a Dictyoglomus sp. from an Enrichment Culture of a New Zealand Geothermal Spring

DOE PAGES

Reysenbach, Anna-Louise; Donaho, John; Kelley, John; ...

2018-03-15

A draft genome of a novelDictyoglomussp., NZ13-RE01, was obtained from a New Zealand hot spring enrichment culture. The 1,927,012-bp genome is similar in both size and G+C content to otherDictyoglomusspp. Like its relatives,Dictyoglomussp. NZ13-RE01 encodes many genes involved in complex carbohydrate metabolism.
Draft Genome Sequence of Pedobacter sp. Strain NL19, a Producer of Potent Antibacterial Compounds

PubMed Central

2015-01-01

Here, we report the draft genome sequence of Pedobacter sp. strain NL19. The genome has 5.99 Mbp and a G+C content of 39.0%. NL19 was isolated from sludge from an abandoned uranium mine in the north of Portugal, and it produces potent antibacterials against Gram-positive and Gram-negative bacteria. PMID:25814603
Draft Genome Sequence of Pedobacter agri PB92T, Which Belongs to the Family Sphingobacteriaceae

PubMed Central

Lee, Myunglip; Roh, Seong Woon; Lee, Hae-Won; Yim, Kyung June; Kim, Kil-Nam; Bae, Jin-Woo; Choi, Kwang-Sik; Jeon, You-Jin; Jung, Won-Kyo; Kang, Heewan

2012-01-01

Strain PB92T of Pedobacter agri, which belongs to the family Sphingobacteriaceae, was isolated from soil in the Republic of Korea. The draft genome of strain PB92T contains 5,141,552 bp, with a G+C content of 38.0%. This is the third genome sequencing project of the type strains among the Pedobacter species. PMID:22740666
Draft Genome Sequences of Five Enterococcus Species Isolated from the Gut of Patients with Suspected Clostridium difficile Infection

PubMed Central

Castro-Nallar, Eduardo; Valenzuela, Sandro L.; Baquedano, Sebastián; Sánchez, Carolina; Fernández, Fabiola

2017-01-01

ABSTRACT We present draft genome sequences of five Enterococcus species from patients suspected of Clostridium difficile infection. Genome completeness was confirmed by presence of bacterial orthologs (97%). Gene searches using Hidden-Markov models revealed that the isolates harbor between seven and 11 genes involved in antibiotic resistance to tetracyclines, beta-lactams, and vancomycin. PMID:28522725
Draft Genome Sequence of Marinobacter sp. Strain ANT_B65, Isolated from Antarctic Marine Sponge.

PubMed

de França, Paula; Camilo, Esther; Fantinatti-Garboginni, Fabiana

2018-01-04

Marinobacter sp. strain ANT_B65 was isolated from sponge collected in King George Island, Antarctica. The draft genome of 4,173,840 bp encodes 3,743 protein-coding open reading frames. The genome will provide insights into the strain's potential use in the production of natural products. Copyright © 2018 de França et al.
Draft genome sequence of Xylella fastidiosa supsp. multiplex strain Griffin-1 from Quercus rubra in Georgia

USDA-ARS?s Scientific Manuscript database

The draft genome sequence of Xylella fastidiosa subsp. multiplex Strain Griffin-1 isolated from a red oak tree (Quercus rubra) in Georgia, U.S.A. is reported. The bacterium has a genome size of 2,387,314 bp with 51.7% G+C content and comprises 2,903 predicted open reading frames (ORFs), and 50 RNA g...
Draft Genome Sequences of New Genomospecies "Candidatus Pectobacterium maceratum" Strains, Which Cause Soft Rot in Plants.

PubMed

Shirshikov, Fedor V; Korzhenkov, Aleksei A; Miroshnikov, Kirill K; Kabanova, Anastasia P; Barannik, Alla P; Ignatov, Alexander N; Miroshnikov, Konstantin A

2018-04-12

Investigation of collections of phytopathogenic bacteria has revealed some strains distinct from known Pectobacterium spp. We report here the draft genome sequences of five such strains, isolated during the period of 1947 to 2012. Based on comparative genomics, we propose a new candidate genomospecies of the genus Pectobacterium , " Candidatus Pectobacterium maceratum." Copyright © 2018 Shirshikov et al.
Draft Genome Sequences of New Genomospecies “Candidatus Pectobacterium maceratum” Strains, Which Cause Soft Rot in Plants

PubMed Central

2018-01-01

ABSTRACT Investigation of collections of phytopathogenic bacteria has revealed some strains distinct from known Pectobacterium spp. We report here the draft genome sequences of five such strains, isolated during the period of 1947 to 2012. Based on comparative genomics, we propose a new candidate genomospecies of the genus Pectobacterium, “Candidatus Pectobacterium maceratum.” PMID:29650577
Draft Genome Sequence of a Dictyoglomus sp. from an Enrichment Culture of a New Zealand Geothermal Spring

DOE Office of Scientific and Technical Information (OSTI.GOV)

Reysenbach, Anna-Louise; Donaho, John; Kelley, John

A draft genome of a novelDictyoglomussp., NZ13-RE01, was obtained from a New Zealand hot spring enrichment culture. The 1,927,012-bp genome is similar in both size and G+C content to otherDictyoglomusspp. Like its relatives,Dictyoglomussp. NZ13-RE01 encodes many genes involved in complex carbohydrate metabolism.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.