Extensive Error in the Number of Genes Inferred from Draft Genome Assemblies
Denton, James F.; Lugo-Martinez, Jose; Tucker, Abraham E.; Schrider, Daniel R.; Warren, Wesley C.; Hahn, Matthew W.
2014-01-01
Current sequencing methods produce large amounts of data, but genome assemblies based on these data are often woefully incomplete. These incomplete and error-filled assemblies result in many annotation errors, especially in the number of genes present in a genome. In this paper we investigate the magnitude of the problem, both in terms of total gene number and the number of copies of genes in specific families. To do this, we compare multiple draft assemblies against higher-quality versions of the same genomes, using several new assemblies of the chicken genome based on both traditional and next-generation sequencing technologies, as well as published draft assemblies of chimpanzee. We find that upwards of 40% of all gene families are inferred to have the wrong number of genes in draft assemblies, and that these incorrect assemblies both add and subtract genes. Using simulated genome assemblies of Drosophila melanogaster, we find that the major cause of increased gene numbers in draft genomes is the fragmentation of genes onto multiple individual contigs. Finally, we demonstrate the usefulness of RNA-Seq in improving the gene annotation of draft assemblies, largely by connecting genes that have been fragmented in the assembly process. PMID:25474019
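The family-level comparison described in this abstract can be illustrated with a minimal sketch. It assumes that per-family gene counts are already available as dictionaries for a reference and a draft assembly; the family names and counts below are invented for illustration and are not data from the study.

```python
def family_count_errors(reference, draft):
    """Classify each gene family by how its draft copy number differs from the reference."""
    summary = {"correct": 0, "too_many": 0, "too_few": 0}
    for family in set(reference) | set(draft):
        ref_n = reference.get(family, 0)    # a family missing from one set counts as 0 genes
        draft_n = draft.get(family, 0)
        if draft_n == ref_n:
            summary["correct"] += 1
        elif draft_n > ref_n:
            summary["too_many"] += 1        # e.g. one gene fragmented onto several contigs
        else:
            summary["too_few"] += 1         # e.g. copies collapsed or missing from the draft
    return summary


def fraction_wrong(summary):
    """Fraction of families whose draft gene count differs from the reference."""
    total = sum(summary.values())
    return (summary["too_many"] + summary["too_few"]) / total if total else 0.0


# Hypothetical counts, for illustration only.
reference_counts = {"FAM_A": 3, "FAM_B": 1, "FAM_C": 5, "FAM_D": 2, "FAM_E": 1}
draft_counts = {"FAM_A": 4, "FAM_B": 1, "FAM_C": 3, "FAM_D": 2, "FAM_E": 1}

summary = family_count_errors(reference_counts, draft_counts)
print(summary, fraction_wrong(summary))
```

Keeping the "too many" and "too few" tallies separate mirrors the abstract's point that draft assemblies both add and subtract genes.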
Improving draft genome contiguity with reference-derived in silico mate-pair libraries.
Grau, José Horacio; Hackl, Thomas; Koepfli, Klaus-Peter; Hofreiter, Michael
2018-05-01
Contiguous genome assemblies are a highly valued biological resource because of the higher number of completely annotated genes and genomic elements that are usable compared to fragmented draft genomes. Nonetheless, contiguity is difficult to obtain if only low coverage data and/or only distantly related reference genome assemblies are available. In order to improve genome contiguity, we have developed Cross-Species Scaffolding, a new pipeline that imports long-range distance information directly into the de novo assembly process by constructing mate-pair libraries in silico. We show how genome assembly metrics and gene prediction dramatically improve with our pipeline by assembling two primate genomes solely based on ∼30x coverage of shotgun sequencing data.
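The idea of constructing read pairs in silico from a reference can be sketched as follows. This is a toy illustration, not the Cross-Species Scaffolding pipeline itself: the function names, the fixed sampling step, and the read orientation (first read forward, second read reverse-complemented) are all assumptions made for the example.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase A/C/G/T sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def in_silico_pairs(reference, insert_size, read_length, step):
    """Cut fragments of a fixed insert size from a reference and keep only their two ends."""
    pairs = []
    for start in range(0, len(reference) - insert_size + 1, step):
        fragment = reference[start:start + insert_size]
        left = fragment[:read_length]                         # read from the fragment start
        right = reverse_complement(fragment[-read_length:])   # read from the fragment end
        pairs.append((left, right))
    return pairs


# Toy reference; real insert sizes would be thousands of bases, not 8.
pairs = in_silico_pairs("ACGTACGTAC", insert_size=8, read_length=3, step=1)
print(pairs)
```

Each pair encodes a known distance between two short sequences, which is the long-range information a scaffolder can use.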
Sato, Kengo; Kuroki, Yoko; Kumita, Wakako; Fujiyama, Asao; Toyoda, Atsushi; Kawai, Jun; Iriki, Atsushi; Sasaki, Erika; Okano, Hideyuki; Sakakibara, Yasubumi
2015-11-20
The first draft of the common marmoset (Callithrix jacchus) genome was published by the Marmoset Genome Sequencing and Analysis Consortium. The draft was based on whole-genome shotgun sequencing, and the current assembly version is Callithrix_jacches-3.2.1, but there still exist 187,214 undetermined gap regions and supercontigs and relatively short contigs that are unmapped to chromosomes in the draft genome. We performed resequencing and assembly of the genome of common marmoset by deep sequencing with high-throughput sequencing technology. Several different sequence runs using Illumina sequencing platforms were executed, and 181 Gbp of high-quality bases including mate-pairs with long insert lengths of 3, 8, 20, and 40 Kbp were obtained, that is, approximately 60× coverage. The resequencing significantly improved the MGSAC draft genome sequence. The N50 of the contigs, which is a statistical measure used to evaluate assembly quality, doubled. As a result, 51% of the contigs (total length: 299 Mbp) that were unmapped to chromosomes in the MGSAC draft were merged with chromosomal contigs, and the improved genome sequence helped to detect 5,288 new genes that are homologous to human cDNAs and the gaps in 5,187 transcripts of the Ensembl gene annotations were completely filled.
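Two quantities in this abstract, contig N50 and fold coverage, are simple to compute. The sketch below uses the common definition of N50 (the contig length at which the running total of length-sorted contigs first reaches half the assembly size). The contig lengths and the 3.0 Gbp genome size are illustrative assumptions, not values taken from the paper.

```python
def n50(lengths):
    """Length L such that contigs of length >= L contain at least half of all assembled bases."""
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half:
            return length
    return 0  # empty assembly


def fold_coverage(total_bases, genome_size):
    """Average number of times each genome position is covered by sequenced bases."""
    return total_bases / genome_size


# Hypothetical contig lengths: total 300, half 150, reached at the second contig.
print(n50([100, 80, 60, 40, 20]))

# 181 Gbp of sequence over an assumed ~3.0 Gbp genome gives roughly 60x coverage.
print(fold_coverage(181e9, 3.0e9))
```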
Single molecule sequencing-guided scaffolding and correction of draft assemblies.
Zhu, Shenglong; Chen, Danny Z; Emrich, Scott J
2017-12-06
Although single molecule sequencing is still improving, the lengths of the generated sequences are inevitably an advantage in genome assembly. Prior work that utilizes long reads to conduct genome assembly has mostly focused on correcting sequencing errors and improving contiguity of de novo assemblies. We propose a disassembling-reassembling approach for both correcting structural errors in the draft assembly and scaffolding a target assembly based on error-corrected single molecule sequences. To achieve this goal, we formulate a maximum alternating path cover problem. We prove that this problem is NP-hard, and solve it with a 2-approximation algorithm. Our experimental results show that our approach can improve the structural correctness of target assemblies at the cost of some contiguity, even with smaller amounts of long reads. In addition, our reassembling process can also serve as a competitive scaffolder relative to well-established assembly benchmarks.
ARKS: chromosome-scale scaffolding of human genome drafts with linked read kmers.
Coombe, Lauren; Zhang, Jessica; Vandervalk, Benjamin P; Chu, Justin; Jackman, Shaun D; Birol, Inanc; Warren, René L
2018-06-20
The long-range sequencing information captured by linked reads, such as those available from 10× Genomics (10xG), helps resolve genome sequence repeats, and yields accurate and contiguous draft genome assemblies. We introduce ARKS, an alignment-free linked read genome scaffolding methodology that uses linked reads to organize genome assemblies further into contiguous drafts. Our approach departs from other read alignment-dependent linked read scaffolders, including our own (ARCS), and uses a kmer-based mapping approach. The kmer mapping strategy has several advantages over read alignment methods, including better usability and faster processing, as it precludes the need for input sequence formatting and draft sequence assembly indexing. The reliance on kmers instead of read alignments for pairing sequences relaxes the workflow requirements, and drastically reduces the run time. Here, we show how linked reads, when used in conjunction with Hi-C data for scaffolding, improve a draft human genome assembly of PacBio long-read data five-fold (baseline vs. ARKS NG50 = 4.6 vs. 23.1 Mbp, respectively). We also demonstrate how the method provides further improvements of a megabase-scale Supernova human genome assembly (NG50 = 14.74 Mbp vs. 25.94 Mbp before and after ARKS), which itself exclusively uses linked read data for assembly, with an execution speed six to nine times faster than competitive linked read scaffolders (~ 10.5 h compared to 75.7 h, on average). Following ARKS scaffolding of a human genome 10xG Supernova assembly (of cell line NA12878), fewer than 9 scaffolds cover each chromosome, except the largest (chromosome 1, n = 13). ARKS uses a kmer mapping strategy instead of linked read alignments to record and associate the barcode information needed to order and orient draft assembly sequences. The simplified workflow, when compared to that of our initial implementation, ARCS, markedly improves run time performances on experimental human genome datasets. 
Furthermore, the novel distance estimator in ARKS utilizes barcoding information from linked reads to estimate gap sizes. It accomplishes this by modeling the relationship between known distances of a region within contigs and calculating associated Jaccard indices. ARKS has the potential to provide correct, chromosome-scale genome assemblies, promptly. We expect ARKS to have broad utility in helping refine draft genomes.
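The Jaccard index mentioned in connection with the distance estimator is the size of the intersection of two sets divided by the size of their union. The sketch below applies it to sets of linked-read barcodes observed at two contig ends. It is not the ARKS implementation, and the barcode labels are hypothetical.

```python
def jaccard(a, b):
    """Jaccard index |A ∩ B| / |A ∪ B| of two collections, treated as sets."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


# Hypothetical barcodes seen at the ends of two contigs.
end_of_contig_1 = {"BX1", "BX2", "BX3", "BX4"}
start_of_contig_2 = {"BX3", "BX4", "BX5"}

# 2 shared barcodes out of 5 distinct ones.
print(jaccard(end_of_contig_1, start_of_contig_2))
```

Intuitively, regions that lie close together on the same molecule share more barcodes, so a higher index suggests a shorter gap. The exact model ARKS fits between index and distance is not reproduced here.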
GFinisher: a new strategy to refine and finish bacterial genome assemblies
NASA Astrophysics Data System (ADS)
Guizelini, Dieval; Raittz, Roberto T.; Cruz, Leonardo M.; Souza, Emanuel M.; Steffens, Maria B. R.; Pedrosa, Fabio O.
2016-10-01
Despite developments in DNA sequencing technology that have improved the number and length of reads, the reconstruction of complete genome sequences, the so-called genome assembly, remains complex. Only 13% of the prokaryotic genome sequencing projects have been completed. Draft genome sequences deposited in public databases are fragmented into contigs and may lack the full gene complement. The aim of the present work is to identify assembly errors and improve the assembly process of bacterial genomes. The biological patterns observed in genomic sequences and the application of a priori information can allow the identification of misassembled regions, and the reorganization and improvement of the overall de novo genome assembly. GFinisher starts by generating fuzzy GC skew graphs for each contig in an assembly and then breaks the contigs at critical points in order to reassemble and close them using jFGap. This has been successfully applied to a dataset of 96 genome assemblies, decreasing the number of contigs by up to 86%. GFinisher can easily optimize assemblies of prokaryotic draft genomes and can be used to improve the assembly programs based on nucleotide sequence patterns in the genome. The software and source code are available at http://gfinisher.sourceforge.net/.
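GC skew, the signal underlying the graphs mentioned in this abstract, is conventionally computed as (G - C) / (G + C) over windows of a sequence. The sketch below computes this plain windowed skew and reports where its sign flips. It does not implement GFinisher's fuzzy variant, and the window size and toy sequence are assumptions.

```python
def gc_skew(sequence, window):
    """(G - C) / (G + C) for consecutive non-overlapping windows of the sequence."""
    seq = sequence.upper()
    skews = []
    for start in range(0, len(seq) - window + 1, window):
        chunk = seq[start:start + window]
        g, c = chunk.count("G"), chunk.count("C")
        skews.append((g - c) / (g + c) if g + c else 0.0)
    return skews


def sign_changes(skews):
    """Indices of windows whose skew has the opposite sign to the previous window."""
    return [i for i in range(1, len(skews)) if skews[i - 1] * skews[i] < 0]


# Toy contig: G-rich first half, C-rich second half.
toy = "GGGGCGGGGC" + "CCCCGCCCCG"
skews = gc_skew(toy, window=5)
print(skews, sign_changes(skews))
```

A sign flip in the middle of a contig is the kind of pattern that could flag a candidate break point, although real bacterial chromosomes also flip skew legitimately at the replication origin and terminus.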
Moll, Karen M; Zhou, Peng; Ramaraj, Thiruvarangan; Fajardo, Diego; Devitt, Nicholas P; Sadowsky, Michael J; Stupar, Robert M; Tiffin, Peter; Miller, Jason R; Young, Nevin D; Silverstein, Kevin A T; Mudge, Joann
2017-08-04
Third generation sequencing technologies, with sequencing reads in the tens of kilobases, facilitate genome assembly by spanning ambiguous regions and improving continuity. This has been critical for plant genomes, which are difficult to assemble due to high repeat content, gene family expansions, segmental and tandem duplications, and polyploidy. Recently, high-throughput mapping and scaffolding strategies have further improved continuity. Together, these long-range technologies enable quality draft assemblies of complex genomes in a cost-effective and timely manner. Here, we present high quality genome assemblies of the model legume plant, Medicago truncatula (R108) using PacBio, Dovetail Chicago (hereafter, Dovetail) and BioNano technologies. To test these technologies for plant genome assembly, we generated five assemblies using all possible combinations and ordering of these three technologies in the R108 assembly. While the BioNano and Dovetail joins overlapped, they also showed complementary gains in continuity and join numbers. Both technologies spanned repetitive regions that PacBio alone was unable to bridge. Combining technologies, particularly Dovetail followed by BioNano, resulted in notable improvements compared to Dovetail or BioNano alone. A combination of PacBio, Dovetail, and BioNano was used to generate a high quality draft assembly of R108, a M. truncatula accession widely used in studies of functional genomics. As a test for the usefulness of the resulting genome sequence, the new R108 assembly was used to pinpoint breakpoints and characterize flanking sequence of a previously identified translocation between chromosomes 4 and 8, identifying more than 22.7 Mb of novel sequence not present in the earlier A17 reference assembly. Adding Dovetail followed by BioNano data yielded complementary improvements in continuity over the original PacBio assembly.
This strategy proved efficient and cost-effective for developing a quality draft assembly compared to traditional reference assemblies.
Positional bias in variant calls against draft reference assemblies.
Briskine, Roman V; Shimizu, Kentaro K
2017-03-28
Whole genome resequencing projects may implement variant calling using draft reference genomes assembled de novo from short-read libraries. Despite the lower quality of such assemblies, they have allowed researchers to extend a wide range of population genetic and genome-wide association analyses to non-model species. As the variant calling pipelines are complex and involve many software packages, it is important to understand inherent biases and limitations at each step of the analysis. In this article, we report a positional bias present in variant calling performed against draft reference assemblies constructed from de Bruijn or string overlap graphs. We assessed how frequently variants appeared at each position counted from ends of a contig or scaffold sequence, and discovered an unexpectedly high number of variants at the positions related to the length of either k-mers or reads used for the assembly. We detected the bias in both publicly available draft assemblies from the Assemblathon 2 competition as well as in the assemblies we generated from our simulated short-read data. Simulations confirmed that the bias-causing variants are predominantly false positives induced by reads from spatially distant repeated sequences. The bias is particularly strong in contig assemblies. Scaffolding does not eliminate the bias but tends to mitigate it because of the changes in variants' relative positions and alterations in read alignments. The bias can be effectively reduced by filtering out the variants that reside in repetitive elements. Draft genome sequences generated by several popular assemblers appear to be susceptible to the positional bias potentially affecting many resequencing projects in non-model species. The bias is inherent to the assembly algorithms and arises from their particular handling of repeated sequences. It is recommended to reduce the bias by filtering especially if a higher-quality genome assembly cannot be achieved.
Our findings can help other researchers to improve the quality of their variant data sets and reduce artefactual findings in downstream analyses.
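The positional analysis described in this abstract, counting how often variants fall at each distance from a contig end, can be sketched in a few lines. The data layout (a list of contig name and 1-based position tuples, plus a dictionary of contig lengths) and the example values are assumptions for illustration. A real analysis would read positions from a VCF file and lengths from the assembly index.

```python
from collections import Counter


def distance_from_nearest_end(position, contig_length):
    """Distance in bases from a 1-based position to the nearest contig end (0 at a terminal base)."""
    return min(position - 1, contig_length - position)


def end_distance_histogram(variants, contig_lengths):
    """Count variants at each distance from the nearest end of their contig."""
    hist = Counter()
    for contig, pos in variants:
        hist[distance_from_nearest_end(pos, contig_lengths[contig])] += 1
    return hist


# Hypothetical contigs and variant calls.
contig_lengths = {"c1": 1000, "c2": 500}
variants = [("c1", 32), ("c1", 969), ("c2", 32), ("c2", 250)]

hist = end_distance_histogram(variants, contig_lengths)
print(hist.most_common(1))
```

A spike at a distance close to the assembler's k-mer size or the read length would be the signature of the bias the authors describe.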
Besnard, Fabrice; Koutsovoulos, Georgios; Dieudonné, Sana; Blaxter, Mark; Félix, Marie-Anne
2017-01-01
Mapping-by-sequencing has become a standard method to map and identify phenotype-causing mutations in model species. Here, we show that a fragmented draft assembly is sufficient to perform mapping-by-sequencing in nonmodel species. We generated a draft assembly and annotation of the genome of the free-living nematode Oscheius tipulae, a distant relative of the model Caenorhabditis elegans. We used this draft to identify the likely causative mutations at the O. tipulae cov-3 locus, which affect vulval development. The cov-3 locus encodes the O. tipulae ortholog of C. elegans mig-13, and we further show that Cel-mig-13 mutants also have an unsuspected vulval-development phenotype. In a virtuous circle, we were able to use the linkage information collected during mutant mapping to improve the genome assembly. These results showcase the promise of genome-enabled forward genetics in nonmodel species. PMID:28630114
De novo genome assembly of the red silk cotton tree (Bombax ceiba).
Gao, Yong; Wang, Haibo; Liu, Chao; Chu, Honglong; Dai, Dongqin; Song, Shengnan; Yu, Long; Han, Lihong; Fu, Yi; Tian, Bin; Tang, Lizhou
2018-05-01
Bombax ceiba L. (the red silk cotton tree) is a large deciduous tree that is distributed in tropical and sub-tropical Asia as well as northern Australia. It has great economic and ecological importance, with several applications in industry and traditional medicine in many Asian countries. To facilitate further utilization of this plant resource, we present here the draft genome sequence for B. ceiba. We assembled a relatively intact genome of B. ceiba by using PacBio single-molecule sequencing and BioNano optical mapping technologies. The final draft genome is approximately 895 Mb long, with contig and scaffold N50 sizes of 1.0 Mb and 2.06 Mb, respectively. The high-quality draft genome assembly of B. ceiba will be a valuable resource enabling further genetic improvement and more effective use of this tree species.
A post-assembly genome-improvement toolkit (PAGIT) to obtain annotated genomes from contigs.
Swain, Martin T; Tsai, Isheng J; Assefa, Samual A; Newbold, Chris; Berriman, Matthew; Otto, Thomas D
2012-06-07
Genome projects now produce draft assemblies within weeks owing to advanced high-throughput sequencing technologies. For milestone projects such as Escherichia coli or Homo sapiens, teams of scientists were employed to manually curate and finish these genomes to a high standard. Nowadays, this is not feasible for most projects, and the quality of genomes is generally of a much lower standard. This protocol describes software (PAGIT) that is used to improve the quality of draft genomes. It offers flexible functionality to close gaps in scaffolds, correct base errors in the consensus sequence and exploit reference genomes (if available) in order to improve scaffolding and generate annotations. The protocol is most accessible for bacterial and small eukaryotic genomes (up to 300 Mb), such as pathogenic bacteria, malaria and parasitic worms. Applying PAGIT to an E. coli assembly takes ∼24 h: it doubles the average contig size and annotates over 4,300 gene models.
Utturkar, Sagar M.; Bayer, Edward A.; Borovok, Ilya; ...
2016-09-29
Here, we and others have shown the utility of long sequence reads to improve genome assembly quality. In this study, we generated PacBio DNA sequence data to improve the assemblies of draft genomes for Clostridium thermocellum AD2, Clostridium thermocellum LQRI, and Pelosinus fermentans R7.
Utturkar, Sagar M.; Klingeman, Dawn Marie; Land, Miriam L.; ...
2014-06-14
Our motivation with this work was to assess the potential of different types of sequence data combined with de novo and hybrid assembly approaches to improve existing draft genome sequences. Our results show Illumina, 454 and PacBio sequencing technologies were used to generate de novo and hybrid genome assemblies for four different bacteria, which were assessed for quality using summary statistics (e.g. number of contigs, N50) and in silico evaluation tools. Differences in predictions of multiple copies of rDNA operons for each respective bacterium were evaluated by PCR and Sanger sequencing, and then the validated results were applied as an additional criterion to rank assemblies. In general, assemblies using longer PacBio reads were better able to resolve repetitive regions. In this study, the combination of Illumina and PacBio sequence data assembled through the ALLPATHS-LG algorithm gave the best summary statistics and most accurate rDNA operon number predictions. This study will aid others looking to improve existing draft genome assemblies. As to availability and implementation: all assembly tools except CLC Genomics Workbench are freely available under GNU General Public License.
Dramatic improvement in genome assembly achieved using doubled-haploid genomes.
Zhang, Hong; Tan, Engkong; Suzuki, Yutaka; Hirose, Yusuke; Kinoshita, Shigeharu; Okano, Hideyuki; Kudoh, Jun; Shimizu, Atsushi; Saito, Kazuyoshi; Watabe, Shugo; Asakawa, Shuichi
2014-10-27
Improvement in de novo assembly of large genomes is still to be desired. Here, we improved draft genome sequence quality by employing doubled-haploid individuals. We sequenced wildtype and doubled-haploid Takifugu rubripes genomes, under the same conditions, using the Illumina platform and assembled contigs with SOAPdenovo2. We observed 5.4-fold and 2.6-fold improvement in the sizes of the N50 contig and scaffold of doubled-haploid individuals, respectively, compared to the wildtype, indicating that the use of a doubled-haploid genome aids in accurate genome analysis.
Genome assemblies for 11 Yersinia pestis strains isolated in the Caucasus region
Zhgenti, Ekaterine; Johnson, Shannon L.; Davenport, Karen W.; ...
2015-09-17
Yersinia pestis, the causative agent of plague, is endemic to the Caucasus region but few reference strain genome sequences from that region are available. We present the improved draft or finished assembled genomes from 11 strains isolated in the nation of Georgia and surrounding countries.
2018-01-01
Karnal bunt of wheat is an internationally quarantined fungal pathogen disease caused by Tilletia indica and affects the international commercial seed trade of wheat. We announce here the first improved draft genome assembly of a monoteliosporic culture of the Tilletia indica fungus, consisting of 787 scaffolds with an approximate total genome size of 31.83 Mbp, which is more accurate and near to complete than the previous version. PMID:29773612
Kumar, Anil; Mishra, Pallavi; Maurya, Ranjeet; Mishra, A K; Gupta, Vijai K; Ramteke, Pramod W; Marla, Soma S
2018-05-17
Karnal bunt of wheat is an internationally quarantined fungal pathogen disease caused by Tilletia indica and affects the international commercial seed trade of wheat. We announce here the first improved draft genome assembly of a monoteliosporic culture of the Tilletia indica fungus, consisting of 787 scaffolds with an approximate total genome size of 31.83 Mbp, which is more accurate and near to complete than the previous version. Copyright © 2018 Kumar et al.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lapidus, Alla L.
From the date its role in heredity was discovered, DNA has been generating interest among scientists from different fields of knowledge: physicists have studied the three dimensional structure of the DNA molecule, biologists have tried to decode the secrets of life hidden within these long molecules, and technologists have invented and improved methods of DNA analysis. The analysis of the nucleotide sequence of DNA occupies a special place among the methods developed. Thanks to the variety of sequencing technologies available, the process of decoding the sequence of genomic DNA (or whole genome sequencing) has become robust and inexpensive. Meanwhile the assembly of whole genome sequences remains a challenging task. In addition to the need to assemble millions of DNA fragments of different length (from 35 bp (Solexa) to 800 bp (Sanger)), great interest in analysis of microbial communities (metagenomes) of different complexities raises new problems and pushes some new requirements for sequence assembly tools to the forefront. The genome assembly process can be divided into two steps: draft assembly and assembly improvement (finishing). Despite the fact that automatically performed assembly (or draft assembly) is capable of covering up to 98% of the genome, in most cases, it still contains incorrectly assembled reads. The error rate of the consensus sequence produced at this stage is about 1/2000 bp. A finished genome represents the genome assembly of much higher accuracy (with no gaps or incorrectly assembled areas) and quality (~1 error/10,000 bp), validated through a number of computer and laboratory experiments.
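The two error rates quoted in this abstract translate directly into expected error counts for a genome of a given size. The sketch below does that arithmetic. The 5 Mbp genome size is an assumed example for a typical bacterium and is not a figure from the text.

```python
def expected_errors(genome_size_bp, bases_per_error):
    """Expected number of consensus errors given one error per `bases_per_error` bases."""
    return genome_size_bp / bases_per_error


genome_size = 5_000_000  # assumed example size (5 Mbp)

draft_errors = expected_errors(genome_size, 2_000)       # draft: about 1 error per 2,000 bp
finished_errors = expected_errors(genome_size, 10_000)   # finished: about 1 error per 10,000 bp

print(draft_errors, finished_errors, draft_errors / finished_errors)
```

Whatever the genome size, the ratio between the two stages is the same: finishing at these rates removes four out of every five consensus errors.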
USDA-ARS's Scientific Manuscript database
Although draft genomes are available for most agronomically important plant species, the majority are incomplete, highly fragmented, and often riddled with assembly and scaffolding errors. These assembly issues hinder advances in tool development for functional genomics and systems biology. Here we ...
Assembly of 913 microbial genomes from metagenomic sequencing of the cow rumen.
Stewart, Robert D; Auffret, Marc D; Warr, Amanda; Wiser, Andrew H; Press, Maximilian O; Langford, Kyle W; Liachko, Ivan; Snelling, Timothy J; Dewhurst, Richard J; Walker, Alan W; Roehe, Rainer; Watson, Mick
2018-02-28
The cow rumen is adapted for the breakdown of plant material into energy and nutrients, a task largely performed by enzymes encoded by the rumen microbiome. Here we present 913 draft bacterial and archaeal genomes assembled from over 800 Gb of rumen metagenomic sequence data derived from 43 Scottish cattle, using both metagenomic binning and Hi-C-based proximity-guided assembly. Most of these genomes represent previously unsequenced strains and species. The draft genomes contain over 69,000 proteins predicted to be involved in carbohydrate metabolism, over 90% of which do not have a good match in public databases. Inclusion of the 913 genomes presented here improves metagenomic read classification by sevenfold against our own data, and by fivefold against other publicly available rumen datasets. Thus, our dataset substantially improves the coverage of rumen microbial genomes in the public databases and represents a valuable resource for biomass-degrading enzyme discovery and studies of the rumen microbiome.
Genome Improvement at JGI-HAGSC
DOE Office of Scientific and Technical Information (OSTI.GOV)
Grimwood, Jane; Schmutz, Jeremy J.; Myers, Richard M.
Since the completion of the sequencing of the human genome, the Joint Genome Institute (JGI) has rapidly expanded its scientific goals in several DOE mission-relevant areas. At the JGI-HAGSC, we have kept pace with this rapid expansion of projects with our focus on assessing, assembling, improving and finishing eukaryotic whole genome shotgun (WGS) projects for which the shotgun sequence is generated at the Production Genomic Facility (JGI-PGF). We follow this by combining the draft WGS with genomic resources generated at JGI-HAGSC or in collaborator laboratories (including BAC end sequences, genetic maps and FLcDNA sequences) to produce an improved draft sequence. For eukaryotic genomes important to the DOE mission, we then add further information from directed experiments to produce reference genomic sequences that are publicly available for any scientific researcher. Also, we have continued our program for producing BAC-based finished sequence, both for adding information to JGI genome projects and for small BAC-based sequencing projects proposed through any of the JGI sequencing programs. We have now built our computational expertise in WGS assembly and analysis and have moved eukaryotic genome assembly from the JGI-PGF to JGI-HAGSC. We have concentrated our assembly development work on large plant genomes and complex fungal and algal genomes.
Draft genome of a Xanthomonas perforans strain associated with pith necrosis.
Torelli, Emanuela; Aiello, Dalia; Polizzi, Giancarlo; Firrao, Giuseppe; Cirvilleri, Gabriella
2015-02-01
Xanthomonas perforans causes bacterial spot of tomato and pepper. A genome draft of an unusual isolate (strain 4P1S2), differing in that it was associated with stem pith necrosis, was assembled from Illumina MiSeq sequencing data using the draft of X. perforans strain 91-118 as a reference. The resulting draft (accession number JRWW00000000) largely overlapped with the reference draft. In addition, the reads not mapping on the reference assembly were selected and used for a further assembly, that revealed a large putative plasmid. The analysis of the predicted proteins showed only few gene features that could be potentially implicated in the switch of a phytopathological behavior. © FEMS 2015.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gonzalez-Esquer, C. Raul; Twary, Scott N.; Hovde, Blake T.
Picochlorum soloecismus is a halotolerant, fast-growing, and moderate-lipid-producing microalga that is being evaluated as a renewable feedstock for biofuel production. Herein, we report on an improved high-quality draft assembly and annotation for the nuclear, chloroplast, and mitochondrial genomes of P. soloecismus DOE 101.
The reconstruction of 2,631 draft metagenome-assembled genomes from the global oceans.
Tully, Benjamin J; Graham, Elaina D; Heidelberg, John F
2018-01-16
Microorganisms play a crucial role in mediating global biogeochemical cycles in the marine environment. By reconstructing the genomes of environmental organisms through metagenomics, researchers are able to study the metabolic potential of Bacteria and Archaea that are resistant to isolation in the laboratory. Utilizing the large metagenomic dataset generated from 234 samples collected during the Tara Oceans circumnavigation expedition, we were able to assemble 102 billion paired-end reads into 562 million contigs, which in turn were co-assembled and consolidated into 7.2 million contigs ≥2 kb in length. Approximately 1 million of these contigs were binned to reconstruct draft genomes. In total, 2,631 draft genomes with an estimated completion of ≥50% were generated (1,491 draft genomes >70% complete; 603 genomes >90% complete). A majority of the draft genomes were manually assigned phylogeny based on sets of concatenated phylogenetic marker genes and/or 16S rRNA gene sequences. The draft genomes are now publicly available for the research community at large.
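The completeness tiers reported in this abstract amount to counting genomes that pass successive thresholds. The sketch below shows that tally on a hypothetical list of completeness estimates, then derives the tier proportions from the counts given in the abstract. The function name and the example estimates are assumptions.

```python
def completeness_tiers(estimates):
    """Count genomes passing the >=50%, >70% and >90% completeness thresholds."""
    return {
        ">=50": sum(1 for e in estimates if e >= 50),
        ">70": sum(1 for e in estimates if e > 70),
        ">90": sum(1 for e in estimates if e > 90),
    }


# Hypothetical completeness estimates (percent) for six bins.
tiers = completeness_tiers([52.0, 71.5, 93.2, 49.9, 88.0, 95.0])
print(tiers)

# Proportions implied by the counts reported in the abstract.
reported = {">=50": 2631, ">70": 1491, ">90": 603}
share_over_70 = round(100 * reported[">70"] / reported[">=50"], 1)
share_over_90 = round(100 * reported[">90"] / reported[">=50"], 1)
print(share_over_70, share_over_90)
```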
Thompson, Sarah M.; Kalamorz, Falk; David, Charles; Addison, Shea M.; Smith, Grant R.
2018-01-01
Here, we report the draft genome sequence of “Candidatus Liberibacter europaeus” ASNZ1, assembled from broom psyllids (Arytainilla spartiophila) from New Zealand. The assembly comprises 15 contigs, with a total length of 1.33 Mb and a G+C content of 33.5%. PMID:29773636
An improved assembly of the loblolly pine mega-genome using long-read single-molecule sequencing.
Zimin, Aleksey V; Stevens, Kristian A; Crepeau, Marc W; Puiu, Daniela; Wegrzyn, Jill L; Yorke, James A; Langley, Charles H; Neale, David B; Salzberg, Steven L
2017-01-01
The 22-gigabase genome of loblolly pine (Pinus taeda) is one of the largest ever sequenced. The draft assembly published in 2014 was built entirely from short Illumina reads, with lengths ranging from 100 to 250 base pairs (bp). The assembly was quite fragmented, containing over 11 million contigs whose weighted average (N50) size was 8206 bp. To improve this result, we generated approximately 12-fold coverage in long reads using the Single Molecule Real Time sequencing technology developed at Pacific Biosciences. We assembled the long and short reads together using the MaSuRCA mega-reads assembly algorithm, which produced a substantially better assembly, P. taeda version 2.0. The new assembly has an N50 contig size of 25 361, more than three times as large as achieved in the original assembly, and an N50 scaffold size of 107 821, 61% larger than the previous assembly. © The Author 2017. Published by Oxford University Press.
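The N50 statistic quoted in the abstract above is a length-weighted median: the contig length at which contigs of that size or longer hold at least half of the assembled bases. A minimal sketch of the usual calculation follows; the function name `n50` and the toy lengths are illustrative assumptions, not taken from the paper or from MaSuRCA.

```python
def n50(lengths):
    """Return the N50 of a collection of contig (or scaffold) lengths.

    N50 is the length L such that contigs of length >= L together
    contain at least half of all assembled bases.
    """
    ordered = sorted(lengths, reverse=True)  # longest contigs first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("n50() needs at least one contig length")


# Toy example (hypothetical lengths): total is 20, so half is 10.
# 6 + 5 = 11 reaches the halfway point, so the N50 is 5.
toy_n50 = n50([2, 3, 4, 5, 6])

# Ratio of the two contig N50 values reported in the abstract.
contig_n50_ratio = 25361 / 8206
```

The ratio of the two reported contig N50 values is just above 3, which matches the abstract's "more than three times as large".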
Zimin, Aleksey V; Stevens, Kristian A; Crepeau, Marc W; Puiu, Daniela; Wegrzyn, Jill L; Yorke, James A; Langley, Charles H; Neale, David B; Salzberg, Steven L
2017-10-01
The 22-gigabase genome of loblolly pine (Pinus taeda) is one of the largest ever sequenced. The draft assembly published in 2014 was built entirely from short Illumina reads, with lengths ranging from 100 to 250 base pairs (bp). The assembly was quite fragmented, containing over 11 million contigs whose weighted average (N50) size was 8206 bp. To improve this result, we generated approximately 12-fold coverage in long reads using the Single Molecule Real Time sequencing technology developed at Pacific Biosciences. We assembled the long and short reads together using the MaSuRCA mega-reads assembly algorithm, which produced a substantially better assembly, P. taeda version 2.0. The new assembly has an N50 contig size of 25 361, more than three times as large as achieved in the original assembly, and an N50 scaffold size of 107 821, 61% larger than the previous assembly. © The Authors 2017. Published by Oxford University Press.
Yoshida, Catherine E; Kruczkiewicz, Peter; Laing, Chad R; Lingohr, Erika J; Gannon, Victor P J; Nash, John H E; Taboada, Eduardo N
2016-01-01
For nearly 100 years serotyping has been the gold standard for the identification of Salmonella serovars. Despite the increasing adoption of DNA-based subtyping approaches, serotype information remains a cornerstone in food safety and public health activities aimed at reducing the burden of salmonellosis. At the same time, recent advances in whole-genome sequencing (WGS) promise to revolutionize our ability to perform advanced pathogen characterization in support of improved source attribution and outbreak analysis. We present the Salmonella In Silico Typing Resource (SISTR), a bioinformatics platform for rapidly performing simultaneous in silico analyses for several leading subtyping methods on draft Salmonella genome assemblies. In addition to performing serovar prediction by genoserotyping, this resource integrates sequence-based typing analyses for: Multi-Locus Sequence Typing (MLST), ribosomal MLST (rMLST), and core genome MLST (cgMLST). We show how phylogenetic context from cgMLST analysis can supplement the genoserotyping analysis and increase the accuracy of in silico serovar prediction to over 94.6% on a dataset comprised of 4,188 finished genomes and WGS draft assemblies. In addition to allowing analysis of user-uploaded whole-genome assemblies, the SISTR platform incorporates a database comprising over 4,000 publicly available genomes, allowing users to place their isolates in a broader phylogenetic and epidemiological context. The resource incorporates several metadata driven visualizations to examine the phylogenetic, geospatial and temporal distribution of genome-sequenced isolates. As sequencing of Salmonella isolates at public health laboratories around the world becomes increasingly common, rapid in silico analysis of minimally processed draft genome assemblies provides a powerful approach for molecular epidemiology in support of public health investigations. 
Moreover, this type of integrated analysis using multiple sequence-based methods of sub-typing allows for continuity with historical serotyping data as we transition towards the increasing adoption of genomic analyses in epidemiology. The SISTR platform is freely available on the web at https://lfz.corefacility.ca/sistr-app/.
The reconstruction of 2,631 draft metagenome-assembled genomes from the global oceans
Tully, Benjamin J.; Graham, Elaina D.; Heidelberg, John F.
2018-01-01
Microorganisms play a crucial role in mediating global biogeochemical cycles in the marine environment. By reconstructing the genomes of environmental organisms through metagenomics, researchers are able to study the metabolic potential of Bacteria and Archaea that are resistant to isolation in the laboratory. Utilizing the large metagenomic dataset generated from 234 samples collected during the Tara Oceans circumnavigation expedition, we were able to assemble 102 billion paired-end reads into 562 million contigs, which in turn were co-assembled and consolidated into 7.2 million contigs ≥2 kb in length. Approximately 1 million of these contigs were binned to reconstruct draft genomes. In total, 2,631 draft genomes with an estimated completion of ≥50% were generated (1,491 draft genomes >70% complete; 603 genomes >90% complete). A majority of the draft genomes were manually assigned phylogeny based on sets of concatenated phylogenetic marker genes and/or 16S rRNA gene sequences. The draft genomes are now publicly available for the research community at large. PMID:29337314
Finishing bacterial genome assemblies with Mix.
Soueidan, Hayssam; Maurier, Florence; Groppi, Alexis; Sirand-Pugnet, Pascal; Tardy, Florence; Citti, Christine; Dupuy, Virginie; Nikolski, Macha
2013-01-01
Among challenges that hamper reaping the benefits of genome assembly are both unfinished assemblies and the ensuing experimental costs. First, numerous software solutions for genome de novo assembly are available, each having its advantages and drawbacks, without clear guidelines as to how to choose among them. Second, these solutions produce draft assemblies that often require a resource-intensive finishing phase. In this paper we address these two aspects by developing Mix, a tool that mixes two or more draft assemblies without relying on a reference genome, with the goal of reducing contig fragmentation and thus speeding up genome finishing. The proposed algorithm builds an extension graph where vertices represent extremities of contigs and edges represent existing alignments between these extremities. These alignment edges are used for contig extension. The resulting output assembly corresponds to a set of paths in the extension graph that maximizes the cumulative contig length. We evaluate the performance of Mix on bacterial NGS data from the GAGE-B study and apply it to newly sequenced Mycoplasma genomes. Resulting final assemblies demonstrate a significant improvement in the overall assembly quality. In particular, Mix is consistent by providing better overall quality results even when the choice is guided solely by standard assembly statistics, as is the case for de novo projects. Mix is implemented in Python and is available at https://github.com/cbib/MIX; novel data for our Mycoplasma study are available at http://services.cbib.u-bordeaux2.fr/mix/.
Augmenting Chinese hamster genome assembly by identifying regions of high confidence.
Vishwanathan, Nandita; Bandyopadhyay, Arpan A; Fu, Hsu-Yuan; Sharma, Mohit; Johnson, Kathryn C; Mudge, Joann; Ramaraj, Thiruvarangan; Onsongo, Getiria; Silverstein, Kevin A T; Jacob, Nitya M; Le, Huong; Karypis, George; Hu, Wei-Shou
2016-09-01
Chinese hamster ovary (CHO) cell lines are the dominant industrial workhorses for therapeutic recombinant protein production. The availability of the genome sequences of Chinese hamster and CHO cells will spur further genome and RNA sequencing of producing cell lines. However, mammalian genomes assembled using shotgun sequencing data still contain regions of uncertain quality due to assembly errors. Identifying high confidence regions in the assembled genome will facilitate its use for cell engineering and genome engineering. We assembled two independent drafts of the Chinese hamster genome by de novo assembly from shotgun sequencing reads and by re-scaffolding and gap-filling the draft genome from NCBI for improved scaffold lengths and gap fractions. We then used the two independent assemblies to identify high confidence regions using two different approaches. First, the two independent assemblies were compared at the sequence level to identify their consensus regions as "high confidence regions", which account for at least 78% of the assembled genome. Further, a genome-wide comparison of the Chinese hamster scaffolds with mouse chromosomes revealed scaffolds with large blocks of collinearity, which were also compiled as high-quality scaffolds. Genome-scale collinearity was complemented with EST-based synteny, which also revealed conserved gene order compared to mouse. As cell line sequencing becomes more commonly practiced, the approaches reported here are useful for assessing the quality of assembly and potentially facilitate the engineering of cell lines. Copyright © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Rivera, Yazmin; Zeller, Kurt; Srivastava, Subodh K; Sutherland, Jeremy; Galvez, Marco E; Nakhla, Mark K; Poniatowska, Anna; Schnabel, Guido; Sundin, George W; Abad, Gloria
2018-05-03
Fungi in the genus Monilinia are known to cause devastating brown rot disease of stone and pome fruits. Here, we report the draft genome assemblies of four important phytopathogenic species: Monilinia fructicola, Monilinia fructigena, Monilinia polystroma, and Monilinia laxa. The draft genome assemblies were 39 Mb (M. fructigena), 42 Mb (M. laxa), 43 Mb (M. fructicola), and 45 Mb (M. polystroma) with as few as 550 contigs (M. laxa). These are the first draft genome resources publicly available for M. laxa, M. fructigena, and M. polystroma.
Seuylemezian, Arman; Cooper, Kerry; Schubert, Wayne
2018-01-01
Spore-forming microorganisms are of concern for forward contamination because they can survive harsh interplanetary travel. Here, we report the draft genome sequences of 12 spore-forming strains isolated from the Manned Spacecraft Operations Building (MSOB) and the Vehicle Assembly Building (VAB) in Cape Canaveral, FL, where the Viking spacecraft were assembled. PMID:29567731
ERIC Educational Resources Information Center
Spear, Richard
2008-01-01
The Welsh Assembly Government is nothing if not consultative. The most recent draft policy out for public debate aims to improve the way adult community learning (ACL) is planned and delivered--in order to provide demonstrable benefits for learners. This latest consultation sits under "Skills that Work for Wales," the Assembly…
Zhong, Xingyu; Tian, Yuqing; Niu, Guoqing; Tan, Huarong
2013-07-01
A draft genome sequence of Streptomyces ansochromogenes 7100 was generated using 454 sequencing technology. In combination with local BLAST searches and gap filling techniques, a comprehensive antiSMASH-based method was adopted to assemble the secondary metabolite biosynthetic gene clusters in the draft genome of S. ansochromogenes. A total of at least 35 putative gene clusters were identified and assembled. Transcriptional analysis showed that 20 of the 35 gene clusters were expressed in either or all of the three different media tested, whereas the other 15 gene clusters were silent in all three different media. This study provides a comprehensive method to identify and assemble secondary metabolite biosynthetic gene clusters in draft genomes of Streptomyces, and will significantly promote functional studies of these secondary metabolite biosynthetic gene clusters.
Seuylemezian, Arman; Cooper, Kerry; Schubert, Wayne; Vaishampayan, Parag
2018-03-22
Spore-forming microorganisms are of concern for forward contamination because they can survive harsh interplanetary travel. Here, we report the draft genome sequences of 12 spore-forming strains isolated from the Manned Spacecraft Operations Building (MSOB) and the Vehicle Assembly Building (VAB) in Cape Canaveral, FL, where the Viking spacecraft were assembled. Copyright © 2018 Seuylemezian et al.
Sequencing and De novo Draft Assemblies of the Fathead Minnow (Pimephales promelas) Reference Genome
This study was undertaken to develop genome-scale resources for the fathead minnow (Pimephales promelas), an important model organism widely used in both aquatic ecotoxicology research and in regulatory toxicity testing. We report on the first sequencing and two draft assemblies fo...
SSPACE-LongRead: scaffolding bacterial draft genomes using long read sequence information
2014-01-01
Background: The recent introduction of the Pacific Biosciences RS single molecule sequencing technology has opened new doors to scaffolding genome assemblies in a cost-effective manner. The long read sequence information promises to enhance the quality of incomplete and inaccurate draft assemblies constructed from Next Generation Sequencing (NGS) data. Results: Here we propose a novel hybrid assembly methodology that aims to scaffold pre-assembled contigs in an iterative manner using PacBio RS long read information as a backbone. On a test set comprising six bacterial draft genomes, assembled using either a single Illumina MiSeq or Roche 454 library, we show that even a 50× coverage of uncorrected PacBio RS long reads is sufficient to drastically reduce the number of contigs. Comparisons to the AHA scaffolder indicate our strategy is better capable of producing (nearly) complete bacterial genomes. Conclusions: The current work describes our SSPACE-LongRead software, which is designed to upgrade incomplete draft genomes using single molecule sequences. We conclude that the recent advances of the PacBio sequencing technology and chemistry, in combination with the limited computational resources required to run our program, allow genomes to be scaffolded in a fast and reliable manner. PMID:24950923
Use of a draft genome of coffee (Coffea arabica) to identify SNPs associated with caffeine content.
Tran, Hue T M; Ramaraj, Thiruvarangan; Furtado, Agnelo; Lee, Leonard Slade; Henry, Robert J
2018-03-07
Arabica coffee (Coffea arabica) has a small gene pool limiting genetic improvement. Selection for caffeine content within this gene pool would be assisted by identification of the genes controlling this important trait. Sequencing of DNA bulks from 18 genotypes with extreme high- or low-caffeine content from a population of 232 genotypes was used to identify linked polymorphisms. To obtain a reference genome, a whole genome assembly of arabica coffee (variety K7) was achieved by sequencing using short read (Illumina) and long-read (PacBio) technology. Assembly was performed using a range of assembly tools resulting in 76 409 scaffolds with a scaffold N50 of 54 544 bp and a total scaffold length of 1448 Mb. Validation of the genome assembly using different tools showed high completeness of the genome. More than 99% of transcriptome sequences mapped to the C. arabica draft genome, and 89% of BUSCOs were present. The assembled genome annotated using AUGUSTUS yielded 99 829 gene models. Using the draft arabica genome as reference in mapping and variant calling allowed the detection of 1444 nonsynonymous single nucleotide polymorphisms (SNPs) associated with caffeine content. Based on Kyoto Encyclopaedia of Genes and Genomes pathway-based analysis, 65 caffeine-associated SNPs were discovered, among which 11 SNPs were associated with genes encoding enzymes involved in the conversion of substrates, which participate in the caffeine biosynthesis pathways. This analysis demonstrated the complex genetic control of this key trait in coffee. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
CAR: contig assembly of prokaryotic draft genomes using rearrangements.
Lu, Chin Lung; Chen, Kun-Tze; Huang, Shih-Yuan; Chiu, Hsien-Tai
2014-11-28
Next generation sequencing technology has allowed efficient production of draft genomes for many organisms of interest. However, most draft genomes are just collections of independent contigs, whose relative positions and orientations along the genome being sequenced are unknown. Although several tools have been developed to order and orient the contigs of draft genomes, more accurate tools are still needed. In this study, we present a novel reference-based contig assembly (or scaffolding) tool, named CAR, that can efficiently and more accurately order and orient the contigs of a prokaryotic draft genome based on a reference genome of a related organism. Given a set of contigs in multi-FASTA format and a reference genome in FASTA format, CAR can output a list of scaffolds, each of which is a set of ordered and oriented contigs. For validation, we have tested CAR on a real dataset composed of several prokaryotic genomes and also compared its performance with several other reference-based contig assembly tools. Our experimental results show that CAR performs better than all these other reference-based contig assembly tools in terms of sensitivity, precision and genome coverage. CAR serves as an efficient tool that can more accurately order and orient the contigs of a prokaryotic draft genome based on a reference genome. The web server of CAR is freely available at http://genome.cs.nthu.edu.tw/CAR/ and its stand-alone program can also be downloaded from the same website.
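To make the idea of "ordered and oriented contigs" concrete, here is a minimal sketch of the simplest reference-based approach: sort contigs by where their best alignment starts on the reference, and carry the alignment strand as the orientation. This is not CAR's rearrangement-based algorithm. The `Hit` record, its fields, and the example coordinates are hypothetical, and the sketch assumes each contig already has one best alignment computed by some external aligner.

```python
from typing import List, NamedTuple, Tuple


class Hit(NamedTuple):
    """Hypothetical best alignment of one contig on the reference."""
    contig: str
    ref_start: int  # leftmost reference coordinate covered by the contig
    strand: str     # "+" if the contig aligns forward, "-" if reversed


def order_and_orient(hits: List[Hit]) -> List[Tuple[str, str]]:
    """Return (contig, strand) pairs in left-to-right reference order."""
    ordered = sorted(hits, key=lambda h: h.ref_start)
    return [(h.contig, h.strand) for h in ordered]


# Hypothetical alignments for three contigs.
example_hits = [
    Hit("contig_3", 9000, "+"),
    Hit("contig_1", 100, "-"),
    Hit("contig_2", 4200, "+"),
]
scaffold = order_and_orient(example_hits)
```

A naive sort like this breaks down when the draft genome and the reference differ by rearrangements, which is the case that tools such as CAR are designed to handle.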
Comparing memory-efficient genome assemblers on stand-alone and cloud infrastructures.
Kleftogiannis, Dimitrios; Kalnis, Panos; Bajic, Vladimir B
2013-01-01
A fundamental problem in bioinformatics is genome assembly. Next-generation sequencing (NGS) technologies produce large volumes of fragmented genome reads, which require large amounts of memory to assemble the complete genome efficiently. With recent improvements in DNA sequencing technologies, it is expected that the memory footprint required for the assembly process will increase dramatically and will emerge as a limiting factor in processing widely available NGS-generated reads. In this report, we compare current memory-efficient techniques for genome assembly with respect to quality, memory consumption and execution time. Our experiments prove that it is possible to generate draft assemblies of reasonable quality on conventional multi-purpose computers with very limited available memory by choosing suitable assembly methods. Our study reveals the minimum memory requirements for different assembly programs even when data volume exceeds memory capacity by orders of magnitude. By combining existing methodologies, we propose two general assembly strategies that can improve short-read assembly approaches and result in reduction of the memory footprint. Finally, we discuss the possibility of utilizing cloud infrastructures for genome assembly and we comment on some findings regarding suitable computational resources for assembly.
Wang, Daxi; Korhonen, Pasi K; Gasser, Robin B; Young, Neil D
Clonorchis sinensis (family Opisthorchiidae) is an important foodborne parasite that has a major socioeconomic impact on ~35 million people predominantly in China, Vietnam, Korea and the Russian Far East. In humans, infection with C. sinensis causes clonorchiasis, a complex hepatobiliary disease that can induce cholangiocarcinoma (CCA), a malignant cancer of the bile ducts. Central to understanding the epidemiology of this disease is knowledge of genetic variation within and among populations of this parasite. Although most published molecular studies seem to suggest that C. sinensis represents a single species, evidence of karyotypic variation within C. sinensis and cryptic species within a related opisthorchiid fluke (Opisthorchis viverrini) emphasise the importance of studying and comparing the genes and genomes of geographically distinct isolates of C. sinensis. Recently, we sequenced, assembled and characterised a draft nuclear genome of a C. sinensis isolate from Korea and compared it with a published draft genome of a Chinese isolate of this species using a bioinformatic workflow established for comparing draft genome assemblies and their gene annotations. We identified that 50.6% and 51.3% of the Korean and Chinese C. sinensis genomic scaffolds were syntenic, respectively. Within aligned syntenic blocks, the genomes had a high level of nucleotide identity (99.1%) and encoded 15 variable proteins likely to be involved in diverse biological processes. Here, we review current technical challenges of using draft genome assemblies to undertake comparative genomic analyses to quantify genetic variation between isolates of the same species. Using a workflow that overcomes these challenges, we report on a high-quality draft genome for C. sinensis from Korea and comparative genomic analyses, as a basis for future investigations of the genetic structures of C. sinensis populations, and discuss the biotechnological implications of these explorations. 
Copyright © 2018 Elsevier Inc. All rights reserved.
Haemonchus contortus: Genome Structure, Organization and Comparative Genomics.
Laing, R; Martinelli, A; Tracey, A; Holroyd, N; Gilleard, J S; Cotton, J A
2016-01-01
One of the first genome sequencing projects for a parasitic nematode was that for Haemonchus contortus. The open access data from the Wellcome Trust Sanger Institute provided a valuable early resource for the research community, particularly for the identification of specific genes and genetic markers. Later, a second sequencing project was initiated by the University of Melbourne, and the two draft genome sequences for H. contortus were published back-to-back in 2013. There is a pressing need for long-range genomic information for genetic mapping, population genetics and functional genomic studies, so we are continuing to improve the Wellcome Trust Sanger Institute assembly to provide a finished reference genome for H. contortus. This review describes this process, compares the H. contortus genome assemblies with draft genomes from other members of the strongylid group and discusses future directions for parasite genomics using the H. contortus model. Copyright © 2016 Elsevier Ltd. All rights reserved.
Draft Genome Assembly of a Wolbachia Endosymbiont of Plutella australiana
Ward, Christopher M.
2017-01-01
Wolbachia spp. are endosymbiotic bacteria that infect around 50% of arthropods and cause a broad range of effects, including manipulating host reproduction. Here, we present the annotated draft genome assembly of Wolbachia strain wAus, which infects Plutella australiana, a cryptic ally of the major Brassica pest Plutella xylostella (diamondback moth). PMID:29074653
Draft Genome Assembly of a Wolbachia Endosymbiont of Plutella australiana.
Ward, Christopher M; Baxter, Simon W
2017-10-26
Wolbachia spp. are endosymbiotic bacteria that infect around 50% of arthropods and cause a broad range of effects, including manipulating host reproduction. Here, we present the annotated draft genome assembly of Wolbachia strain wAus, which infects Plutella australiana, a cryptic ally of the major Brassica pest Plutella xylostella (diamondback moth). Copyright © 2017 Ward and Baxter.
De novo assembly of a draft genome for Cucumis hystrix, the closest relative of cucumber
USDA-ARS's Scientific Manuscript database
Cucumis hystrix (2n = 2x = 24, HH) is the only known species that is cross-compatible with cucumber and has a great potential for cucumber improvement. To facilitate introgression of C. hystrix chromatins into cucumber genetic background through development of introgression library, we sequenced two...
Gonzalez-Esquer, C. Raul; Twary, Scott N.; Hovde, Blake T.; ...
2018-01-25
Picochlorum soloecismus is a halotolerant, fast-growing, and moderate-lipid-producing microalga that is being evaluated as a renewable feedstock for biofuel production. Herein, we report on an improved high-quality draft assembly and annotation for the nuclear, chloroplast, and mitochondrial genomes of P. soloecismus DOE 101.
USDA-ARS's Scientific Manuscript database
The Asian citrus psyllid (Diaphorina citri Kuwayama) is the insect vector of the bacterium Candidatus Liberibacter asiaticus (CLas), the causal agent for the citrus greening or Huanglongbing disease which threatens citrus industry worldwide. This vector is the primary target of approaches to stop th...
Working, Welding and Structural Drafting, Drafting--Intermediate: 9255.03.
ERIC Educational Resources Information Center
Dade County Public Schools, Miami, FL.
The course introduces the student to working welding drawings, both detail and assembly, as related to all fields of drafting and structural drafting, and provides him with the opportunity to work with various types of tools and equipment. Prior to entry in this course, the vocational student must display mastery of the skills indicated in…
Gupta, Sonal; Nawaz, Kashif; Parween, Sabiha; Roy, Riti; Sahu, Kamlesh; Kumar Pole, Anil; Khandal, Hitaishi; Srivastava, Rishi; Kumar Parida, Swarup; Chattopadhyay, Debasis
2017-02-01
Cicer reticulatum L. is the wild progenitor of the fourth most important legume crop chickpea (C. arietinum L.). We assembled short-read sequences into 416 Mb draft genome of C. reticulatum and anchored 78% (327 Mb) of this assembly to eight linkage groups. Genome annotation predicted 25,680 protein-coding genes covering more than 90% of predicted gene space. The genome assembly shared a substantial synteny and conservation of gene orders with the genome of the model legume Medicago truncatula. Resistance gene homologs of wild and domesticated chickpeas showed high sequence homology and conserved synteny. Comparison of gene sequences and nucleotide diversity using 66 wild and domesticated chickpea accessions suggested that the desi type chickpea was genetically closer to the wild species than the kabuli type. Comparative analyses predicted gene flow between the wild and the cultivated species during domestication. Molecular diversity and population genetic structure determination using 15,096 genome-wide single nucleotide polymorphisms revealed an admixed domestication pattern among cultivated (desi and kabuli) and wild chickpea accessions belonging to three population groups reflecting significant influence of parentage or geographical origin for their cultivar-specific population classification. The assembly and the polymorphic sequence resources presented here would facilitate the study of chickpea domestication and targeted use of wild Cicer germplasms for agronomic trait improvement in chickpea. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Thompson, Peter C; Zarlenga, Dante S; Liu, Ming-Yuan; Rosenthal, Benjamin M
2017-09-01
Genome assemblies can form the basis of comparative analyses fostering insight into the evolutionary genetics of a parasite's pathogenicity, host-pathogen interactions, environmental constraints and invasion biology; however, the length and complexity of many parasite genomes has hampered the development of well-resolved assemblies. In order to improve Trichinella genome assemblies, the genome of the sylvatic encapsulated species Trichinella murrelli was sequenced using third-generation, long-read technology and, using syntenic comparisons, scaffolded to a reference genome assembly of Trichinella spiralis, markedly improving both. A high-quality draft assembly for T. murrelli was achieved that totalled 63.2 Mbp, half of which was condensed into 26 contigs each longer than 571 000 bp. When compared with previous assemblies for parasites in the genus, ours required 10-fold fewer contigs, which were five times longer, on average. Better assembly across repetitive regions also enabled resolution of 8 Mbp of previously indeterminate sequence. Furthermore, syntenic comparisons identified widespread scaffold misassemblies in the T. spiralis reference genome. The two new assemblies, organized for the first time into three chromosomal scaffolds, will be valuable resources for future studies linking phenotypic traits within each species to their underlying genetic bases.
1988-01-01
This document contains portions of the text of a 1988 UN Resolution on measures to improve the situation and ensure the human rights and dignity of all migrant workers. In this resolution, the General Assembly reaffirms international instruments protecting human rights but articulates a further need to improve the protection of human rights for migrant workers and their families. The General Assembly then noted the two most recent reports of the Working Group on the Drafting of an International Convention on the Protection of the Rights of All Migrant Workers and Their Families and took measures to enable the Working Group to complete its task.
USDA-ARS's Scientific Manuscript database
Clostridium perfringens strain LLY_N11 is a commensal bacterial isolate from a healthy chicken that produced a necrotic enteritis in experimental studies. Here we present the assembly and annotation of its genome, which may provide further insights into improved understanding of the molecular mechan...
Fathead minnow genome sequencing and assembly
The dataset provides the URLs for accessing the genome sequence data and two draft assemblies as well as fathead minnow genotyping data associated with estimating the heterozygosity of the inbred line. This dataset is associated with the following publication: Burns, F., L. Cogburn, G. Ankley, D. Villeneuve, E. Waits, Y. Chang, V. Llaca, S. Deschamps, R. Jackson, and R. Hoke. Sequencing and De novo Draft Assemblies of the Fathead Minnow (Pimephales promelas) Reference Genome. ENVIRONMENTAL TOXICOLOGY AND CHEMISTRY. Society of Environmental Toxicology and Chemistry, Pensacola, FL, USA, 35(1): 212-217, (2016).
Two low coverage bird genomes and a comparison of reference-guided versus de novo genome assemblies.
Card, Daren C; Schield, Drew R; Reyes-Velasco, Jacobo; Fujita, Matthew K; Andrew, Audra L; Oyler-McCance, Sara J; Fike, Jennifer A; Tomback, Diana F; Ruggiero, Robert P; Castoe, Todd A
2014-01-01
As a greater number and diversity of high-quality vertebrate reference genomes become available, it is increasingly feasible to use these references to guide new draft assemblies for related species. Reference-guided assembly approaches may substantially increase the contiguity and completeness of a new genome using only low levels of genome coverage that might otherwise be insufficient for de novo genome assembly. We used low-coverage (∼3.5-5.5x) Illumina paired-end sequencing to assemble draft genomes of two bird species (the Gunnison Sage-Grouse, Centrocercus minimus, and the Clark's Nutcracker, Nucifraga columbiana). We used these data to estimate de novo genome assemblies and reference-guided assemblies, and compared the information content and completeness of these assemblies by comparing CEGMA gene set representation, repeat element content, simple sequence repeat content, and GC isochore structure among assemblies. Our results demonstrate that even lower-coverage genome sequencing projects are capable of producing informative and useful genomic resources, particularly through the use of reference-guided assemblies.
Two low coverage bird genomes and a comparison of reference-guided versus de novo genome assemblies
Card, Daren C.; Schield, Drew R.; Reyes-Velasco, Jacobo; Fujita, Matthew K.; Andrew, Audra L.; Oyler-McCance, Sara J.; Fike, Jennifer A.; Tomback, Diana F.; Ruggiero, Robert P.; Castoe, Todd A.
2014-01-01
As a greater number and diversity of high-quality vertebrate reference genomes become available, it is increasingly feasible to use these references to guide new draft assemblies for related species. Reference-guided assembly approaches may substantially increase the contiguity and completeness of a new genome using only low levels of genome coverage that might otherwise be insufficient for de novo genome assembly. We used low-coverage (~3.5–5.5x) Illumina paired-end sequencing to assemble draft genomes of two bird species (the Gunnison Sage-Grouse, Centrocercus minimus, and the Clark's Nutcracker, Nucifraga columbiana). We used these data to estimate de novo genome assemblies and reference-guided assemblies, and compared the information content and completeness of these assemblies by comparing CEGMA gene set representation, repeat element content, simple sequence repeat content, and GC isochore structure among assemblies. Our results demonstrate that even lower-coverage genome sequencing projects are capable of producing informative and useful genomic resources, particularly through the use of reference-guided assemblies.
Lineage-Specific Biology Revealed by a Finished Genome Assembly of the Mouse
Hillier, LaDeana W.; Zody, Michael C.; Goldstein, Steve; She, Xinwei; Bult, Carol J.; Agarwala, Richa; Cherry, Joshua L.; DiCuccio, Michael; Hlavina, Wratko; Kapustin, Yuri; Meric, Peter; Maglott, Donna; Birtle, Zoë; Marques, Ana C.; Graves, Tina; Zhou, Shiguo; Teague, Brian; Potamousis, Konstantinos; Churas, Christopher; Place, Michael; Herschleb, Jill; Runnheim, Ron; Forrest, Daniel; Amos-Landgraf, James; Schwartz, David C.; Cheng, Ze; Lindblad-Toh, Kerstin; Eichler, Evan E.; Ponting, Chris P.
2009-01-01
The mouse (Mus musculus) is the premier animal model for understanding human disease and development. Here we show that a comprehensive understanding of mouse biology is only possible with the availability of a finished, high-quality genome assembly. The finished clone-based assembly of the mouse strain C57BL/6J reported here has over 175,000 fewer gaps and over 139 Mb more of novel sequence, compared with the earlier MGSCv3 draft genome assembly. In a comprehensive analysis of this revised genome sequence, we are now able to define 20,210 protein-coding genes, over a thousand more than predicted in the human genome (19,042 genes). In addition, we identified 439 long, non–protein-coding RNAs with evidence for transcribed orthologs in human. We analyzed the complex and repetitive landscape of 267 Mb of sequence that was missing or misassembled in the previously published assembly, and we provide insights into the reasons for its resistance to sequencing and assembly by whole-genome shotgun approaches. Duplicated regions within newly assembled sequence tend to be of more recent ancestry than duplicates in the published draft, correcting our initial understanding of recent evolution on the mouse lineage. These duplicates appear to be largely composed of sequence regions containing transposable elements and duplicated protein-coding genes; of these, some may be fixed in the mouse population, but at least 40% of segmentally duplicated sequences are copy number variable even among laboratory mouse strains. Mouse lineage-specific regions contain 3,767 genes drawn mainly from rapidly-changing gene families associated with reproductive functions. The finished mouse genome assembly, therefore, greatly improves our understanding of rodent-specific biology and allows the delineation of ancestral biological functions that are shared with human from derived functions that are not. PMID:19468303
De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds.
Dudchenko, Olga; Batra, Sanjit S; Omer, Arina D; Nyquist, Sarah K; Hoeger, Marie; Durand, Neva C; Shamim, Muhammad S; Machol, Ido; Lander, Eric S; Aiden, Aviva Presser; Aiden, Erez Lieberman
2017-04-07
The Zika outbreak, spread by the Aedes aegypti mosquito, highlights the need to create high-quality assemblies of large genomes in a rapid and cost-effective way. Here we combine Hi-C data with existing draft assemblies to generate chromosome-length scaffolds. We validate this method by assembling a human genome, de novo, from short reads alone (67× coverage). We then combine our method with draft sequences to create genome assemblies of the mosquito disease vectors Ae. aegypti and Culex quinquefasciatus, each consisting of three scaffolds corresponding to the three chromosomes in each species. These assemblies indicate that almost all genomic rearrangements among these species occur within, rather than between, chromosome arms. The genome assembly procedure we describe is fast, inexpensive, and accurate, and can be applied to many species. Copyright © 2017, American Association for the Advancement of Science.
The draft genome of sweet orange (Citrus sinensis).
Xu, Qiang; Chen, Ling-Ling; Ruan, Xiaoan; Chen, Dijun; Zhu, Andan; Chen, Chunli; Bertrand, Denis; Jiao, Wen-Biao; Hao, Bao-Hai; Lyon, Matthew P; Chen, Jiongjiong; Gao, Song; Xing, Feng; Lan, Hong; Chang, Ji-Wei; Ge, Xianhong; Lei, Yang; Hu, Qun; Miao, Yin; Wang, Lun; Xiao, Shixin; Biswas, Manosh Kumar; Zeng, Wenfang; Guo, Fei; Cao, Hongbo; Yang, Xiaoming; Xu, Xi-Wen; Cheng, Yun-Jiang; Xu, Juan; Liu, Ji-Hong; Luo, Oscar Junhong; Tang, Zhonghui; Guo, Wen-Wu; Kuang, Hanhui; Zhang, Hong-Yu; Roose, Mikeal L; Nagarajan, Niranjan; Deng, Xiu-Xin; Ruan, Yijun
2013-01-01
Oranges are an important nutritional source for human health and have immense economic value. Here we present a comprehensive analysis of the draft genome of sweet orange (Citrus sinensis). The assembled sequence covers 87.3% of the estimated orange genome, which is relatively compact, as 20% is composed of repetitive elements. We predicted 29,445 protein-coding genes, half of which are in the heterozygous state. With additional sequencing of two more citrus species and comparative analyses of seven citrus genomes, we present evidence to suggest that sweet orange originated from a backcross hybrid between pummelo and mandarin. Focused analysis on genes involved in vitamin C metabolism showed that GalUR, encoding the rate-limiting enzyme of the galacturonate pathway, is significantly upregulated in orange fruit, and the recent expansion of this gene family may provide a genomic basis. This draft genome represents a valuable resource for understanding and improving many important citrus traits in the future.
Guida, Brandon S; Garcia-Pichel, Ferran
2016-01-28
Mastigocoleus testarum strain BC008 is a model organism used to study marine photoautotrophic carbonate dissolution. It is a multicellular, filamentous, diazotrophic, euendolithic cyanobacterium ubiquitously found in marine benthic environments. We present an accurate draft genome assembly of 172 contigs spanning 12,700,239 bp with 9,131 annotated genes with an average G+C% of 37.3. Copyright © 2016 Guida and Garcia-Pichel.
ERIC Educational Resources Information Center
Franken, Ken; And Others
A multidisciplinary research team was assembled to review existing computer-aided drafting (CAD) systems for the purpose of enabling staff in the Design Drafting Department at Linn Technical College (Missouri) to select the best system out of the many CAD systems in existence. During the initial stage of the evaluation project, researchers…
Two Low Coverage Bird Genomes and a Comparison of Reference-Guided versus De Novo Genome Assemblies
Card, Daren C.; Schield, Drew R.; Reyes-Velasco, Jacobo; Fujita, Matthew K.; Andrew, Audra L.; Oyler-McCance, Sara J.; Fike, Jennifer A.; Tomback, Diana F.; Ruggiero, Robert P.; Castoe, Todd A.
2014-01-01
As a greater number and diversity of high-quality vertebrate reference genomes become available, it is increasingly feasible to use these references to guide new draft assemblies for related species. Reference-guided assembly approaches may substantially increase the contiguity and completeness of a new genome using only low levels of genome coverage that might otherwise be insufficient for de novo genome assembly. We used low-coverage (∼3.5–5.5x) Illumina paired-end sequencing to assemble draft genomes of two bird species (the Gunnison Sage-Grouse, Centrocercus minimus, and the Clark's Nutcracker, Nucifraga columbiana). We used these data to estimate de novo genome assemblies and reference-guided assemblies, and compared the information content and completeness of these assemblies by comparing CEGMA gene set representation, repeat element content, simple sequence repeat content, and GC isochore structure among assemblies. Our results demonstrate that even lower-coverage genome sequencing projects are capable of producing informative and useful genomic resources, particularly through the use of reference-guided assemblies. PMID:25192061
Draft genome sequence of Lactobacillus mali KCTC 3596.
Kim, Dong-Wook; Choi, Sang-Haeng; Kang, Aram; Nam, Seong-Hyeuk; Kim, Dae-Soo; Kim, Ryong Nam; Kim, Aeri; Park, Hong-Seog
2011-09-01
We announce the draft genome sequence of the type strain Lactobacillus mali KCTC 3596 (2,652,969 bp, with a G+C content of 36.0%), which is one of the most prevalent lactic acid bacteria present during the manufacturing process of apple juice. The genome consists of 122 large contigs (>100 bp). All of the contigs were assembled by Newbler Assembler 2.3 (454 Life Science). Copyright © 2011, American Society for Microbiology. All Rights Reserved.
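The G+C content quoted in draft genome announcements such as the one above is the share of G and C bases among all called bases. A minimal sketch of that calculation follows; the function name `gc_content` and the choice to skip ambiguous bases such as N are illustrative assumptions, not details taken from any of the listed papers.

```python
def gc_content(sequence):
    """Return the G+C percentage of a DNA sequence.

    Only A, C, G and T are counted; ambiguous bases such as N are
    skipped (an assumption made for this sketch).
    """
    seq = sequence.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    called = gc + at
    if called == 0:
        raise ValueError("sequence contains no A, C, G or T bases")
    return 100.0 * gc / called


if __name__ == "__main__":
    # Toy sequence, not real genome data.
    print(gc_content("ATGCGCNNAT"))
```

For a multi-contig draft assembly, the same counts would be summed over all contigs before dividing.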
Yasui, Yasuo; Hirakawa, Hideki; Ueno, Mariko; Matsui, Katsuhiro; Katsube-Tanaka, Tomoyuki; Yang, Soo Jung; Aii, Jotaro; Sato, Shingo; Mori, Masashi
2016-01-01
Buckwheat (Fagopyrum esculentum Moench; 2n = 2x = 16) is a nutritionally dense annual crop widely grown in temperate zones. To accelerate molecular breeding programmes of this important crop, we generated a draft assembly of the buckwheat genome using short reads obtained by next-generation sequencing (NGS), and constructed the Buckwheat Genome DataBase. After assembling short reads, we determined 387,594 scaffolds as the draft genome sequence (FES_r1.0). The total length of FES_r1.0 was 1,177,687,305 bp, and the N50 of the scaffolds was 25,109 bp. Gene prediction analysis revealed 286,768 coding sequences (CDSs; FES_r1.0_cds) including those related to transposable elements. The total length of FES_r1.0_cds was 212,917,911 bp, and the N50 was 1,101 bp. Of these, the functions of 35,816 CDSs excluding those for transposable elements were annotated by BLAST analysis. To demonstrate the utility of the database, we conducted several test analyses using BLAST and keyword searches. Furthermore, we used the draft genome as a reference sequence for NGS-based markers, and successfully identified novel candidate genes controlling heteromorphic self-incompatibility of buckwheat. The database and draft genome sequence provide a valuable resource that can be used in efforts to develop buckwheat cultivars with superior agronomic traits. PMID:27037832
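Several records in this list report a contig or scaffold N50. Under the usual definition, the N50 is the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly length. The sketch below assumes that definition; the function name `n50` and the toy lengths are hypothetical and do not come from any listed assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths."""
    if not lengths:
        raise ValueError("no sequence lengths given")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down until half the total is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


if __name__ == "__main__":
    # Toy lengths in bp, for illustration only.
    print(n50([2, 3, 4, 5, 6]))
```

Because the N50 depends only on the length distribution, it says nothing about whether the sequences are assembled correctly, which is why the abstracts also report completeness measures.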
Draft Genome Sequence of Tolypothrix boutellei Strain VB521301
Chandrababunaidu, Mathu Malar; Singh, Deeksha; Sen, Diya; Bhan, Sushma; Das, Subhadeep; Gupta, Akash
2015-01-01
We report here the draft genome sequence of the filamentous nitrogen-fixing cyanobacterium Tolypothrix boutellei strain VB521301. The organism is lipid rich and hydrophobic and produces polyunsaturated fatty acids which can be harnessed for industrial purpose. The draft genome sequence assembled into 11,572,263 bp with 70 scaffolds and 7,777 protein coding genes. PMID:25700407
Wang, Anqi; Wang, Zhanyu; Li, Zheng; Li, Lei M
2018-06-15
It is highly desirable to assemble genomes of high continuity and consistency at low cost. The current bottleneck of draft genome continuity using the second generation sequencing (SGS) reads is primarily caused by uncertainty among repetitive sequences. Even though the single-molecule real-time sequencing technology is very promising to overcome the uncertainty issue, its relatively high cost and error rate add burden on budget or computation. Many long-read assemblers take the overlap-layout-consensus (OLC) paradigm, which is less sensitive to sequencing errors, heterozygosity and variability of coverage. However, current assemblers of SGS data do not sufficiently take advantage of the OLC approach. Aiming at minimizing uncertainty, the proposed method, BAUM, breaks the whole genome into regions by adaptive unique mapping; then the local OLC is used to assemble each region in parallel. BAUM can (i) perform reference-assisted assembly based on the genome of a closely related species or (ii) improve the results of existing assemblies that are obtained based on short or long sequencing reads. The tests on two eukaryote genomes, a wild rice Oryza longistaminata and a parrot Melopsittacus undulatus, show that BAUM achieved substantial improvement on genome size and continuity. Besides, BAUM reconstructed a considerable amount of repetitive regions that failed to be assembled by existing short read assemblers. We also propose statistical approaches to control the uncertainty in different steps of BAUM. http://www.zhanyuwang.xin/wordpress/index.php/2017/07/21/baum. Supplementary data are available at Bioinformatics online.
Draft Genome Sequence of Lactobacillus plantarum Strain IPLA 88
Ladero, Victor; Alvarez-Sieiro, Patricia; Redruello, Begoña; del Rio, Beatriz; Linares, Daniel M.; Martin, M. Cruz; Fernández, María
2013-01-01
Here, we report a 3.2-Mbp draft assembly for the genome of Lactobacillus plantarum IPLA 88. The sequence of this sourdough isolate provides insight into the adaptation of this versatile species to different environments. PMID:23887921
Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum.
VanBuren, Robert; Bryant, Doug; Edger, Patrick P; Tang, Haibao; Burgess, Diane; Challabathula, Dinakar; Spittle, Kristi; Hall, Richard; Gu, Jenny; Lyons, Eric; Freeling, Michael; Bartels, Dorothea; Ten Hallers, Boudewijn; Hastie, Alex; Michael, Todd P; Mockler, Todd C
2015-11-26
Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetium genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a 'near-complete' draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. The Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.
Das, Subhadeep; Singh, Deeksha; Madduluri, Madhavi; Chandrababunaidu, Mathu Malar; Gupta, Akash
2015-01-01
We report here the draft genome sequence of Tolypothrix campylonemoides VB511288, isolated from building facades in Santiniketan, India. The members of this genus produce several compounds of commercial importance. The draft assembly is 10,627,177 bases in 135 scaffolds, and it contains 7,886 protein-coding genes, 994 pseudogenes, 18 rRNA genes, and 76 tRNA genes. PMID:25838485
Draft Genome Sequence of Tolypothrix boutellei Strain VB521301.
Chandrababunaidu, Mathu Malar; Singh, Deeksha; Sen, Diya; Bhan, Sushma; Das, Subhadeep; Gupta, Akash; Adhikary, Siba Prasad; Tripathy, Sucheta
2015-02-19
We report here the draft genome sequence of the filamentous nitrogen-fixing cyanobacterium Tolypothrix boutellei strain VB521301. The organism is lipid rich and hydrophobic and produces polyunsaturated fatty acids which can be harnessed for industrial purpose. The draft genome sequence assembled into 11,572,263 bp with 70 scaffolds and 7,777 protein coding genes. Copyright © 2015 Chandrababunaidu et al.
Torres, Andrea C; Suárez, Nadia E; Font, Graciela; Saavedra, Lucila; Taranto, María Pía
2016-08-25
We report here the draft genome sequence of Lactobacillus reuteri strain CRL 1098. This strain represents an interesting candidate for functional food development because of its proven probiotic properties. The draft genome sequence is composed of 1,969,471 bp assembled into 45 contigs and an average G+C content of 38.8%. Copyright © 2016 Torres et al.
Draft Genome Sequence of Gordonia sp. Strain UCD-TK1 (Phylum Actinobacteria)
Koenigsaecker, Tynisha M.; Coil, David A.
2016-01-01
Here, we present the draft genome of Gordonia sp. strain UCD-TK1. The assembly contains 5,470,576 bp in 98 contigs. This strain was isolated from a disinfected ambulatory surgery center. PMID:27738036
Draft genome of the lined seahorse, Hippocampus erectus.
Lin, Qiang; Qiu, Ying; Gu, Ruobo; Xu, Meng; Li, Jia; Bian, Chao; Zhang, Huixian; Qin, Geng; Zhang, Yanhong; Luo, Wei; Chen, Jieming; You, Xinxin; Fan, Mingjun; Sun, Min; Xu, Pao; Venkatesh, Byrappa; Xu, Junming; Fu, Hongtuo; Shi, Qiong
2017-06-01
The lined seahorse, Hippocampus erectus, is an Atlantic species and mainly inhabits shallow sea beds or coral reefs. It has become very popular in China for its wide use in traditional Chinese medicine. In order to improve the aquaculture yield of this valuable fish species, we are trying to develop genomic resources for assistant selection in genetic breeding. Here, we provide whole genome sequencing, assembly, and gene annotation of the lined seahorse, which can enrich genome resource and further application for its molecular breeding. A total of 174.6 Gb (Gigabase) raw DNA sequences were generated by the Illumina Hiseq2500 platform. The final assembly of the lined seahorse genome is around 458 Mb, representing 94% of the estimated genome size (489 Mb by k-mer analysis). The contig N50 and scaffold N50 reached 14.57 kb and 1.97 Mb, respectively. Quality of the assembled genome was assessed by BUSCO with prediction of 85% of the known vertebrate genes and evaluated using the de novo assembled RNA-seq transcripts to prove a high mapping ratio (more than 99% transcripts could be mapped to the assembly). Using homology-based, de novo and transcriptome-based prediction methods, we predicted 20 788 protein-coding genes in the generated assembly, which is less than our previously reported gene number (23 458) of the tiger tail seahorse (H. comes). We report a draft genome of the lined seahorse. These generated genomic data are going to enrich genome resource of this economically important fish, and also provide insights into the genetic mechanisms of its iconic morphology and male pregnancy behavior. © The Authors 2017. Published by Oxford University Press.
Draft genome of the lined seahorse, Hippocampus erectus
Lin, Qiang; Qiu, Ying; Gu, Ruobo; Xu, Meng; Li, Jia; Bian, Chao; Zhang, Huixian; Qin, Geng; Zhang, Yanhong; Luo, Wei; Chen, Jieming; You, Xinxin; Fan, Mingjun; Sun, Min; Xu, Pao; Venkatesh, Byrappa
2017-01-01
Background: The lined seahorse, Hippocampus erectus, is an Atlantic species and mainly inhabits shallow sea beds or coral reefs. It has become very popular in China for its wide use in traditional Chinese medicine. In order to improve the aquaculture yield of this valuable fish species, we are trying to develop genomic resources for assistant selection in genetic breeding. Here, we provide whole genome sequencing, assembly, and gene annotation of the lined seahorse, which can enrich genome resource and further application for its molecular breeding. Findings: A total of 174.6 Gb (Gigabase) raw DNA sequences were generated by the Illumina Hiseq2500 platform. The final assembly of the lined seahorse genome is around 458 Mb, representing 94% of the estimated genome size (489 Mb by k-mer analysis). The contig N50 and scaffold N50 reached 14.57 kb and 1.97 Mb, respectively. Quality of the assembled genome was assessed by BUSCO with prediction of 85% of the known vertebrate genes and evaluated using the de novo assembled RNA-seq transcripts to prove a high mapping ratio (more than 99% transcripts could be mapped to the assembly). Using homology-based, de novo and transcriptome-based prediction methods, we predicted 20 788 protein-coding genes in the generated assembly, which is less than our previously reported gene number (23 458) of the tiger tail seahorse (H. comes). Conclusion: We report a draft genome of the lined seahorse. These generated genomic data are going to enrich genome resource of this economically important fish, and also provide insights into the genetic mechanisms of its iconic morphology and male pregnancy behavior. PMID:28444302
Mavromatis, Konstantinos; Land, Miriam L; Brettin, Thomas S; Quest, Daniel J; Copeland, Alex; Clum, Alicia; Goodwin, Lynne; Woyke, Tanja; Lapidus, Alla; Klenk, Hans Peter; Cottingham, Robert W; Kyrpides, Nikos C
2012-01-01
The emergence of next generation sequencing (NGS) has provided the means for rapid and high throughput sequencing and data generation at low cost, while concomitantly creating a new set of challenges. The number of available assembled microbial genomes continues to grow rapidly and their quality reflects the quality of the sequencing technology used, but also of the analysis software employed for assembly and annotation. In this work, we have explored the quality of the microbial draft genomes across various sequencing technologies. We have compared the draft and finished assemblies of 133 microbial genomes sequenced at the Department of Energy-Joint Genome Institute and finished at the Los Alamos National Laboratory using a variety of combinations of sequencing technologies, reflecting the transition of the institute from Sanger-based sequencing platforms to NGS platforms. The quality of the public assemblies and of the associated gene annotations was evaluated using various metrics. Results obtained with the different sequencing technologies, as well as their effects on downstream processes, were analyzed. Our results demonstrate that the Illumina HiSeq 2000 sequencing system, the primary sequencing technology currently used for de novo genome sequencing and assembly at JGI, has various advantages in terms of total sequence throughput and cost, but it also introduces challenges for the downstream analyses. In all cases, assembly results, although of high quality on average, need to be viewed critically, and sources of error in them should be considered prior to analysis. These data follow the evolution of microbial sequencing and downstream processing at the JGI from draft genome sequences with large gaps corresponding to missing genes of significant biological role to assemblies with multiple small gaps (Illumina) and finally to assemblies that generate almost complete genomes (Illumina+PacBio).
The genome of Theobroma cacao.
Argout, Xavier; Salse, Jerome; Aury, Jean-Marc; Guiltinan, Mark J; Droc, Gaetan; Gouzy, Jerome; Allegre, Mathilde; Chaparro, Cristian; Legavre, Thierry; Maximova, Siela N; Abrouk, Michael; Murat, Florent; Fouet, Olivier; Poulain, Julie; Ruiz, Manuel; Roguet, Yolande; Rodier-Goud, Maguy; Barbosa-Neto, Jose Fernandes; Sabot, Francois; Kudrna, Dave; Ammiraju, Jetty Siva S; Schuster, Stephan C; Carlson, John E; Sallet, Erika; Schiex, Thomas; Dievart, Anne; Kramer, Melissa; Gelley, Laura; Shi, Zi; Bérard, Aurélie; Viot, Christopher; Boccara, Michel; Risterucci, Ange Marie; Guignon, Valentin; Sabau, Xavier; Axtell, Michael J; Ma, Zhaorong; Zhang, Yufan; Brown, Spencer; Bourge, Mickael; Golser, Wolfgang; Song, Xiang; Clement, Didier; Rivallan, Ronan; Tahi, Mathias; Akaza, Joseph Moroh; Pitollat, Bertrand; Gramacho, Karina; D'Hont, Angélique; Brunel, Dominique; Infante, Diogenes; Kebe, Ismael; Costet, Pierre; Wing, Rod; McCombie, W Richard; Guiderdoni, Emmanuel; Quetier, Francis; Panaud, Olivier; Wincker, Patrick; Bocs, Stephanie; Lanaud, Claire
2011-02-01
We sequenced and assembled the draft genome of Theobroma cacao, an economically important tropical-fruit tree crop that is the source of chocolate. This assembly corresponds to 76% of the estimated genome size and contains almost all previously described genes, with 82% of these genes anchored on the 10 T. cacao chromosomes. Analysis of this sequence information highlighted specific expansion of some gene families during evolution, for example, flavonoid-related genes. It also provides a major source of candidate genes for T. cacao improvement. Based on the inferred paleohistory of the T. cacao genome, we propose an evolutionary scenario whereby the ten T. cacao chromosomes were shaped from an ancestor through eleven chromosome fusions.
The Transcriptomics of Secondary Growth and Wood Formation in Conifers
Carvalho, Ana; Paiva, Jorge; Louzada, José; Lima-Brito, José
2013-01-01
In the last years, forestry scientists have adapted genomics and next-generation sequencing (NGS) technologies to the search for candidate genes related to the transcriptomics of secondary growth and wood formation in several tree species. Gymnosperms, in particular, the conifers, are ecologically and economically important, namely, for the production of wood and other forestry end products. Until very recently, no whole genome sequencing of a conifer genome was available. Due to the gradual improvement of the NGS technologies and inherent bioinformatics tools, two draft assemblies of the whole genomes sequence of Picea abies and Picea glauca arose in the current year. These draft genome assemblies will bring new insights about the structure, content, and evolution of the conifer genomes. Furthermore, new directions in the forestry, breeding and research of conifers will be discussed in the following. The identification of genes associated with the xylem transcriptome and the knowledge of their regulatory mechanisms will provide less time-consuming breeding cycles and a high accuracy for the selection of traits related to wood production and quality. PMID:24288610
Yasui, Yasuo; Hirakawa, Hideki; Ueno, Mariko; Matsui, Katsuhiro; Katsube-Tanaka, Tomoyuki; Yang, Soo Jung; Aii, Jotaro; Sato, Shingo; Mori, Masashi
2016-06-01
Buckwheat (Fagopyrum esculentum Moench; 2n = 2x = 16) is a nutritionally dense annual crop widely grown in temperate zones. To accelerate molecular breeding programmes of this important crop, we generated a draft assembly of the buckwheat genome using short reads obtained by next-generation sequencing (NGS), and constructed the Buckwheat Genome DataBase. After assembling short reads, we determined 387,594 scaffolds as the draft genome sequence (FES_r1.0). The total length of FES_r1.0 was 1,177,687,305 bp, and the N50 of the scaffolds was 25,109 bp. Gene prediction analysis revealed 286,768 coding sequences (CDSs; FES_r1.0_cds) including those related to transposable elements. The total length of FES_r1.0_cds was 212,917,911 bp, and the N50 was 1,101 bp. Of these, the functions of 35,816 CDSs excluding those for transposable elements were annotated by BLAST analysis. To demonstrate the utility of the database, we conducted several test analyses using BLAST and keyword searches. Furthermore, we used the draft genome as a reference sequence for NGS-based markers, and successfully identified novel candidate genes controlling heteromorphic self-incompatibility of buckwheat. The database and draft genome sequence provide a valuable resource that can be used in efforts to develop buckwheat cultivars with superior agronomic traits. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Draft genome of the Peruvian scallop Argopecten purpuratus.
Li, Chao; Liu, Xiao; Liu, Bo; Ma, Bin; Liu, Fengqiao; Liu, Guilong; Shi, Qiong; Wang, Chunde
2018-04-01
The Peruvian scallop, Argopecten purpuratus, is mainly cultured in southern Chile and Peru and was introduced into China in the last century. Unlike other Argopecten scallops, the Peruvian scallop normally has a long life span of up to 7 to 10 years. Therefore, researchers have been using it to develop hybrid vigor. Here, we performed whole genome sequencing, assembly, and gene annotation of the Peruvian scallop, with an important aim to develop genomic resources for genetic breeding in scallops. A total of 463.19-Gb raw DNA reads were sequenced. A draft genome assembly of 724.78 Mb was generated (accounting for 81.87% of the estimated genome size of 885.29 Mb), with a contig N50 size of 80.11 kb and a scaffold N50 size of 1.02 Mb. Repeat sequences were calculated to reach 33.74% of the whole genome, and 26,256 protein-coding genes and 3,057 noncoding RNAs were predicted from the assembly. We generated a high-quality draft genome assembly of the Peruvian scallop, which will provide a solid resource for further genetic breeding and for the analysis of the evolutionary history of this economically important scallop.
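The completeness figure in the abstract above is the assembled length divided by the estimated genome size. The short sketch below reproduces that arithmetic from the two sizes stated in the abstract; the function name `percent_assembled` is a hypothetical helper introduced only for illustration.

```python
def percent_assembled(assembly_mb, estimated_genome_mb):
    """Return the assembled share of the estimated genome size, in percent."""
    if estimated_genome_mb <= 0:
        raise ValueError("estimated genome size must be positive")
    return 100.0 * assembly_mb / estimated_genome_mb


if __name__ == "__main__":
    # Sizes in Mb as quoted in the Peruvian scallop abstract.
    print(round(percent_assembled(724.78, 885.29), 2))  # 81.87
```

The result depends on how the genome size was estimated, so percentages from different projects are only roughly comparable.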
Reducing assembly complexity of microbial genomes with single-molecule sequencing
USDA-ARS's Scientific Manuscript database
Genome assembly algorithms cannot fully reconstruct microbial chromosomes from the DNA reads output by first or second-generation sequencing instruments. Therefore, most genomes are left unfinished due to the significant resources required to manually close gaps left in the draft assemblies. Single-...
Das, Subhadeep; Singh, Deeksha; Madduluri, Madhavi; Chandrababunaidu, Mathu Malar; Gupta, Akash; Adhikary, Siba Prasad; Tripathy, Sucheta
2015-04-02
We report here the draft genome sequence of Tolypothrix campylonemoides VB511288, isolated from building facades in Santiniketan, India. The members of this genus produce several compounds of commercial importance. The draft assembly is 10,627,177 bases in 135 scaffolds, and it contains 7,886 protein-coding genes, 994 pseudogenes, 18 rRNA genes, and 76 tRNA genes. Copyright © 2015 Das et al.
Sen, Diya; Chandrababunaidu, Mathu Malar; Singh, Deeksha; Sanghi, Neha; Ghorai, Arpita; Mishra, Gyan Prakash; Madduluri, Madhavi
2015-01-01
We report here the draft genome sequence of Scytonema millei VB511283, a cyanobacterium isolated from biofilms on the exterior of stone monuments in Santiniketan, eastern India. The draft genome is 11,627,246 bp long (11.63 Mb), with 118 scaffolds. About 9,011 protein-coding genes, 117 tRNAs, and 12 rRNAs are predicted from this assembly. PMID:25744984
USDA-ARS's Scientific Manuscript database
We announce the draft genome assembly of Lactococcus garvieae str. PAQ102015-99, a recently isolated strain from an outbreak of lactococcosis at a commercial trout farm in the Northwestern US. The draft genome comprises 14 contigs totaling 2,068,357 bp with an N50 of 496,618 bp and average G+C conte...
Li, Gang; Hillier, LaDeana W; Grahn, Robert A; Zimin, Aleksey V; David, Victor A; Menotti-Raymond, Marilyn; Middleton, Rondo; Hannah, Steven; Hendrickson, Sher; Makunin, Alex; O'Brien, Stephen J; Minx, Pat; Wilson, Richard K; Lyons, Leslie A; Warren, Wesley C; Murphy, William J
2016-06-01
High-resolution genetic and physical maps are invaluable tools for building accurate genome assemblies, and interpreting results of genome-wide association studies (GWAS). Previous genetic and physical maps anchored good quality draft assemblies of the domestic cat genome, enabling the discovery of numerous genes underlying hereditary disease and phenotypes of interest to the biomedical science and breeding communities. However, these maps lacked sufficient marker density to order thousands of shorter scaffolds in earlier assemblies, which instead relied heavily on comparative mapping with related species. A high-resolution map would aid in validating and ordering chromosome scaffolds from existing and new genome assemblies. Here, we describe a high-resolution genetic linkage map of the domestic cat genome based on genotyping 453 domestic cats from several multi-generational pedigrees on the Illumina 63K SNP array. The final maps include 58,055 SNP markers placed relative to 6637 markers with unique positions, distributed across all autosomes and the X chromosome. Our final sex-averaged maps span a total autosomal length of 4464 cM, the longest described linkage map for any mammal, confirming length estimates from a previous microsatellite-based map. The linkage map was used to order and orient the scaffolds from a substantially more contiguous domestic cat genome assembly (Felis catus v8.0), which incorporated ∼20 × coverage of Illumina fragment reads. The new genome assembly shows substantial improvements in contiguity, with a nearly fourfold increase in N50 scaffold size to 18 Mb. We use this map to report probable structural errors in previous maps and assemblies, and to describe features of the recombination landscape, including a massive (∼50 Mb) recombination desert (of virtually zero recombination) on the X chromosome that parallels a similar desert on the porcine X chromosome in both size and physical location. Copyright © 2016 Li et al.
The draft genome of a diploid cotton Gossypium raimondii
USDA-ARS's Scientific Manuscript database
We have sequenced and assembled the draft genome of Gossypium raimondii, whose progenitor is considered the contributor of the D-subgenome to the economically important natural textile fiber producer, G. hirsutum. Next-generation Illumina pair-end (PE) sequencing strategies were employed to obtain ...
Reducing assembly complexity of microbial genomes with single-molecule sequencing.
Koren, Sergey; Harhay, Gregory P; Smith, Timothy P L; Bono, James L; Harhay, Dayna M; Mcvey, Scott D; Radune, Diana; Bergman, Nicholas H; Phillippy, Adam M
2013-01-01
The short reads output by first- and second-generation DNA sequencing instruments cannot completely reconstruct microbial chromosomes. Therefore, most genomes have been left unfinished due to the significant resources required to manually close gaps in draft assemblies. Third-generation, single-molecule sequencing addresses this problem by greatly increasing sequencing read length, which simplifies the assembly problem. To measure the benefit of single-molecule sequencing on microbial genome assembly, we sequenced and assembled the genomes of six bacteria and analyzed the repeat complexity of 2,267 complete bacteria and archaea. Our results indicate that the majority of known bacterial and archaeal genomes can be assembled without gaps, at finished-grade quality, using a single PacBio RS sequencing library. These single-library assemblies are also more accurate than typical short-read assemblies and hybrid assemblies of short and long reads. Automated assembly of long, single-molecule sequencing data reduces the cost of microbial finishing to $1,000 for most genomes, and future advances in this technology are expected to drive the cost lower. This is expected to increase the number of completed genomes, improve the quality of microbial genome databases, and enable high-fidelity, population-scale studies of pan-genomes and chromosomal organization.
Collaboration between Writers and Graphic Designers in Documentation Projects.
ERIC Educational Resources Information Center
Mirel, Barbara; And Others
1995-01-01
Analyzes collaborations between software manual writers and graphic designers to discover how their processes of collaboration directly affect the form of a finished manual. Identifies three models of collaboration: assembly line (linear drafting), swap meet (iterative drafting and joint problem solving), and symphony (codevelopment in every…
Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum
DOE Office of Scientific and Technical Information (OSTI.GOV)
VanBuren, Robert; Bryant, Doug; Edger, Patrick P.
Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetium genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a ‘near-complete’ draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. As a result, the Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.
Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum
VanBuren, Robert; Bryant, Doug; Edger, Patrick P.; ...
2015-11-11
Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetium genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a ‘near-complete’ draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. As a result, the Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.
Sen, Diya; Chandrababunaidu, Mathu Malar; Singh, Deeksha; Sanghi, Neha; Ghorai, Arpita; Mishra, Gyan Prakash; Madduluri, Madhavi; Adhikary, Siba Prasad; Tripathy, Sucheta
2015-03-05
We report here the draft genome sequence of Scytonema millei VB511283, a cyanobacterium isolated from biofilms on the exterior of stone monuments in Santiniketan, eastern India. The draft genome is 11,627,246 bp long (11.63 Mb), with 118 scaffolds. About 9,011 protein-coding genes, 117 tRNAs, and 12 rRNAs are predicted from this assembly. Copyright © 2015 Sen et al.
Tamariz, Jesus; Llanos, Carlos; Seas, Carlos; Montenegro, Paola; Lagos, Jose; Fernandes, Miriam R; Cerdeira, Louise; Lincopan, Nilton
2018-03-29
We present here the draft genome sequence of the first New Delhi metallo-β-lactamase (NDM-1)-producing Escherichia coli strain, belonging to sequence type 155 (ST155), isolated in Peru. Assembly of this draft genome resulted in 5,061,184 bp, revealing a clinically significant resistome for β-lactams, aminoglycosides, tetracyclines, phenicols, sulfonamides, trimethoprim, and fluoroquinolones. Copyright © 2018 Tamariz et al.
Draft Genome Sequence of the Putrescine-Producing Strain Lactococcus lactis subsp. lactis 1AA59
del Rio, Beatriz; Linares, Daniel M.; Fernandez, María; Mayo, Baltasar; Martín, M. Cruz
2015-01-01
We report here the annotated draft assembly sequence of the 2,576,542-bp genome of Lactococcus lactis subsp. lactis 1AA59. This strain, isolated from a traditional cheese, produces putrescine, one of the biogenic amines most frequently found in dairy products. PMID:26089428
Single haplotype assembly of the human genome from a hydatidiform mole.
Steinberg, Karyn Meltz; Schneider, Valerie A; Graves-Lindsay, Tina A; Fulton, Robert S; Agarwala, Richa; Huddleston, John; Shiryev, Sergey A; Morgulis, Aleksandr; Surti, Urvashi; Warren, Wesley C; Church, Deanna M; Eichler, Evan E; Wilson, Richard K
2014-12-01
A complete reference assembly is essential for accurately interpreting individual genomes and associating variation with phenotypes. While the current human reference genome sequence is of very high quality, gaps and misassemblies remain due to biological and technical complexities. Large repetitive sequences and complex allelic diversity are the two main drivers of assembly error. Although increasing the length of sequence reads and library fragments can improve assembly, even the longest available reads do not resolve all regions. In order to overcome the issue of allelic diversity, we used genomic DNA from an essentially haploid hydatidiform mole, CHM1. We utilized several resources from this DNA including a set of end-sequenced and indexed BAC clones and 100× Illumina whole-genome shotgun (WGS) sequence coverage. We used the WGS sequence and the GRCh37 reference assembly to create an assembly of the CHM1 genome. We subsequently incorporated 382 finished BAC clone sequences to generate a draft assembly, CHM1_1.1 (NCBI AssemblyDB GCA_000306695.2). Analysis of gene, repetitive element, and segmental duplication content shows this assembly to be of excellent quality and contiguity. However, comparison to assembly-independent resources, such as BAC clone end sequences and PacBio long reads, indicates misassembled regions. Most of these regions are enriched for structural variation and segmental duplication, and can be resolved in the future. This publicly available assembly will be integrated into the Genome Reference Consortium curation framework for further improvement, with the ultimate goal being a completely finished gap-free assembly. © 2014 Steinberg et al.; Published by Cold Spring Harbor Laboratory Press.
Single haplotype assembly of the human genome from a hydatidiform mole
Steinberg, Karyn Meltz; Schneider, Valerie A.; Graves-Lindsay, Tina A.; Fulton, Robert S.; Agarwala, Richa; Huddleston, John; Shiryev, Sergey A.; Morgulis, Aleksandr; Surti, Urvashi; Warren, Wesley C.; Church, Deanna M.; Eichler, Evan E.; Wilson, Richard K.
2014-01-01
A complete reference assembly is essential for accurately interpreting individual genomes and associating variation with phenotypes. While the current human reference genome sequence is of very high quality, gaps and misassemblies remain due to biological and technical complexities. Large repetitive sequences and complex allelic diversity are the two main drivers of assembly error. Although increasing the length of sequence reads and library fragments can improve assembly, even the longest available reads do not resolve all regions. In order to overcome the issue of allelic diversity, we used genomic DNA from an essentially haploid hydatidiform mole, CHM1. We utilized several resources from this DNA including a set of end-sequenced and indexed BAC clones and 100× Illumina whole-genome shotgun (WGS) sequence coverage. We used the WGS sequence and the GRCh37 reference assembly to create an assembly of the CHM1 genome. We subsequently incorporated 382 finished BAC clone sequences to generate a draft assembly, CHM1_1.1 (NCBI AssemblyDB GCA_000306695.2). Analysis of gene, repetitive element, and segmental duplication content shows this assembly to be of excellent quality and contiguity. However, comparison to assembly-independent resources, such as BAC clone end sequences and PacBio long reads, indicates misassembled regions. Most of these regions are enriched for structural variation and segmental duplication, and can be resolved in the future. This publicly available assembly will be integrated into the Genome Reference Consortium curation framework for further improvement, with the ultimate goal being a completely finished gap-free assembly. PMID:25373144
Oldeschulte, David L; Halley, Yvette A; Wilson, Miranda L; Bhattarai, Eric K; Brashear, Wesley; Hill, Joshua; Metz, Richard P; Johnson, Charles D; Rollins, Dale; Peterson, Markus J; Bickhart, Derek M; Decker, Jared E; Sewell, John F; Seabury, Christopher M
2017-09-07
Northern bobwhite (Colinus virginianus; hereafter bobwhite) and scaled quail (Callipepla squamata) populations have suffered precipitous declines across most of their US ranges. Illumina-based first- (v1.0) and second- (v2.0) generation draft genome assemblies for the scaled quail and the bobwhite produced N50 scaffold sizes of 1.035 and 2.042 Mb, thereby producing a 45-fold improvement in contiguity over the existing bobwhite assembly, and ≥90% of the assembled genomes were captured within 1313 and 8990 scaffolds, respectively. The scaled quail assembly (v1.0 = 1.045 Gb) was ∼20% smaller than the bobwhite (v2.0 = 1.254 Gb), which was supported by kmer-based estimates of genome size. Nevertheless, estimates of GC content (41.72%; 42.66%), genome-wide repetitive content (10.40%; 10.43%), and MAKER-predicted protein coding genes (17,131; 17,165) were similar for the scaled quail (v1.0) and bobwhite (v2.0) assemblies, respectively. BUSCO analyses utilizing 3023 single-copy orthologs revealed a high level of assembly completeness for the scaled quail (v1.0; 84.8%) and the bobwhite (v2.0; 82.5%), as verified by comparison with well-established avian genomes. We also detected 273 putative segmental duplications in the scaled quail genome (v1.0), and 711 in the bobwhite genome (v2.0), including some that were shared among both species. Autosomal variant prediction revealed ∼2.48 and 4.17 heterozygous variants per kilobase within the scaled quail (v1.0) and bobwhite (v2.0) genomes, respectively, and estimates of historic effective population size were uniformly higher for the bobwhite across all time points in a coalescent model. However, large-scale declines were predicted for both species beginning ∼15-20 KYA. Copyright © 2017 Oldeschulte et al.
Nowrousian, Minou; Stajich, Jason E.; Chu, Meiling; Engh, Ines; Espagne, Eric; Halliday, Karen; Kamerewerd, Jens; Kempken, Frank; Knab, Birgit; Kuo, Hsiao-Che; Osiewacz, Heinz D.; Pöggeler, Stefanie; Read, Nick D.; Seiler, Stephan; Smith, Kristina M.; Zickler, Denise; Kück, Ulrich; Freitag, Michael
2010-01-01
Filamentous fungi are of great importance in ecology, agriculture, medicine, and biotechnology. Thus, it is not surprising that genomes for more than 100 filamentous fungi have been sequenced, most of them by Sanger sequencing. While next-generation sequencing techniques have revolutionized genome resequencing, e.g. for strain comparisons, genetic mapping, or transcriptome and ChIP analyses, de novo assembly of eukaryotic genomes still presents significant hurdles, because of their large size and stretches of repetitive sequences. Filamentous fungi contain few repetitive regions in their 30–90 Mb genomes and thus are suitable candidates to test de novo genome assembly from short sequence reads. Here, we present a high-quality draft sequence of the Sordaria macrospora genome that was obtained by a combination of Illumina/Solexa and Roche/454 sequencing. Paired-end Solexa sequencing of genomic DNA to 85-fold coverage and an additional 10-fold coverage by single-end 454 sequencing resulted in ∼4 Gb of DNA sequence. Reads were assembled to a 40 Mb draft version (N50 of 117 kb) with the Velvet assembler. Comparative analysis with Neurospora genomes increased the N50 to 498 kb. The S. macrospora genome contains even fewer repeat regions than its closest sequenced relative, Neurospora crassa. Comparison with genomes of other fungi showed that S. macrospora, a model organism for morphogenesis and meiosis, harbors duplications of several genes involved in self/nonself-recognition. Furthermore, S. macrospora contains more polyketide biosynthesis genes than N. crassa. Phylogenetic analyses suggest that some of these genes may have been acquired by horizontal gene transfer from a distantly related ascomycete group. 
Our study shows that, for typical filamentous fungi, de novo assembly of genomes from short sequence reads alone is feasible, that a mixture of Solexa and 454 sequencing substantially improves the assembly, and that the resulting data can be used for comparative studies to address basic questions of fungal biology. PMID:20386741
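The sequencing yield quoted in this abstract can be checked with simple coverage arithmetic. The sketch below assumes that the 40 Mb assembly size is a fair stand-in for the true genome size; the function name is illustrative only:

```python
def total_sequence_bp(genome_size_bp, fold_coverages):
    """Total bases sequenced = genome size x summed fold coverage."""
    return genome_size_bp * sum(fold_coverages)


# Figures from the abstract: 85-fold Solexa plus 10-fold 454 coverage.
# Assumption: genome size approximated by the 40 Mb draft assembly.
sordaria_bp = total_sequence_bp(40_000_000, [85, 10])
print(f"{sordaria_bp / 1e9:.1f} Gb")
```

Under that assumption the result is about 3.8 Gb, which is consistent with the approximately 4 Gb of sequence reported.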
Nowrousian, Minou; Stajich, Jason E; Chu, Meiling; Engh, Ines; Espagne, Eric; Halliday, Karen; Kamerewerd, Jens; Kempken, Frank; Knab, Birgit; Kuo, Hsiao-Che; Osiewacz, Heinz D; Pöggeler, Stefanie; Read, Nick D; Seiler, Stephan; Smith, Kristina M; Zickler, Denise; Kück, Ulrich; Freitag, Michael
2010-04-08
Filamentous fungi are of great importance in ecology, agriculture, medicine, and biotechnology. Thus, it is not surprising that genomes for more than 100 filamentous fungi have been sequenced, most of them by Sanger sequencing. While next-generation sequencing techniques have revolutionized genome resequencing, e.g. for strain comparisons, genetic mapping, or transcriptome and ChIP analyses, de novo assembly of eukaryotic genomes still presents significant hurdles, because of their large size and stretches of repetitive sequences. Filamentous fungi contain few repetitive regions in their 30-90 Mb genomes and thus are suitable candidates to test de novo genome assembly from short sequence reads. Here, we present a high-quality draft sequence of the Sordaria macrospora genome that was obtained by a combination of Illumina/Solexa and Roche/454 sequencing. Paired-end Solexa sequencing of genomic DNA to 85-fold coverage and an additional 10-fold coverage by single-end 454 sequencing resulted in approximately 4 Gb of DNA sequence. Reads were assembled to a 40 Mb draft version (N50 of 117 kb) with the Velvet assembler. Comparative analysis with Neurospora genomes increased the N50 to 498 kb. The S. macrospora genome contains even fewer repeat regions than its closest sequenced relative, Neurospora crassa. Comparison with genomes of other fungi showed that S. macrospora, a model organism for morphogenesis and meiosis, harbors duplications of several genes involved in self/nonself-recognition. Furthermore, S. macrospora contains more polyketide biosynthesis genes than N. crassa. Phylogenetic analyses suggest that some of these genes may have been acquired by horizontal gene transfer from a distantly related ascomycete group. 
Our study shows that, for typical filamentous fungi, de novo assembly of genomes from short sequence reads alone is feasible, that a mixture of Solexa and 454 sequencing substantially improves the assembly, and that the resulting data can be used for comparative studies to address basic questions of fungal biology.
ABACAS: algorithm-based automatic contiguation of assembled sequences
Assefa, Samuel; Keane, Thomas M.; Otto, Thomas D.; Newbold, Chris; Berriman, Matthew
2009-01-01
Summary: Due to the availability of new sequencing technologies, we are now increasingly interested in sequencing closely related strains of existing finished genomes. Recently a number of de novo and mapping-based assemblers have been developed to produce high quality draft genomes from new sequencing technology reads. New tools are necessary to take contigs from a draft assembly through to a fully contiguated genome sequence. ABACAS is intended as a tool to rapidly contiguate (align, order, orientate), visualize and design primers to close gaps on shotgun assembled contigs based on a reference sequence. The input to ABACAS is a set of contigs which will be aligned to the reference genome, ordered and orientated, visualized in the ACT comparative browser, and optimal primer sequences are automatically generated. Availability and Implementation: ABACAS is implemented in Perl and is freely available for download from http://abacas.sourceforge.net Contact: sa4@sanger.ac.uk PMID:19497936
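The core idea of reference-based contiguation (align, order, orientate) can be illustrated with a short sketch. This is not ABACAS's Perl implementation; the `Hit` record, the rule of keeping each contig's longest alignment, and the example coordinates are all assumptions made for illustration:

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """One hypothetical alignment of a contig to the reference."""
    contig: str
    ref_start: int
    ref_end: int
    strand: str  # "+" for forward, "-" for reverse complement


def order_and_orient(hits):
    """Order contigs by reference position using each contig's longest hit."""
    best = {}
    for hit in hits:
        span = hit.ref_end - hit.ref_start
        kept = best.get(hit.contig)
        if kept is None or span > kept.ref_end - kept.ref_start:
            best[hit.contig] = hit
    placed = sorted(best.values(), key=lambda h: h.ref_start)
    return [(h.contig, h.strand) for h in placed]


# Hypothetical alignments; contig_1 has a short spurious second hit.
hits = [
    Hit("contig_3", 9000, 12000, "+"),
    Hit("contig_1", 100, 4000, "-"),
    Hit("contig_2", 4500, 8000, "+"),
    Hit("contig_1", 20000, 20300, "+"),
]
print(order_and_orient(hits))
```

Contigs reported on the "-" strand would be reverse-complemented before joining, and the gaps between consecutive placements are where gap-closing primers would be designed.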
Whole genome sequencing of Chinese clearhead icefish, Protosalanx hyalocranius.
Liu, Kai; Xu, Dongpo; Li, Jia; Bian, Chao; Duan, Jinrong; Zhou, Yanfeng; Zhang, Minying; You, Xinxin; You, Yang; Chen, Jieming; Yu, Hui; Xu, Gangchun; Fang, Di-An; Qiang, Jun; Jiang, Shulun; He, Jie; Xu, Junmin; Shi, Qiong; Zhang, Zhiyong; Xu, Pao
2017-04-01
Chinese clearhead icefish, Protosalanx hyalocranius, is a representative icefish species with economic importance and special appearance. Due to its great economic value in China, the fish was introduced into Lake Dianchi and several other lakes from Lake Taihu half a century ago. Similar to the Sinocyclocheilus cavefish, the clearhead icefish has certain cavefish-like traits, such as a transparent body and nearly scaleless skin. Here, we provide the whole genome sequence of this surface-dwelling fish and generate a draft genome assembly, aiming at exploring the molecular mechanisms of biological interest. A total of 252.1 Gb of raw reads were sequenced. Subsequently, a novel draft genome assembly was generated, with the scaffold N50 reaching 1.163 Mb. The genome completeness was estimated to be 98.39% by using the CEGMA evaluation. Finally, we annotated 19 884 protein-coding genes and observed that repeat sequences account for 24.43% of the genome assembly. We report the first draft genome of the Chinese clearhead icefish. The genome assembly will provide a solid foundation for further molecular breeding and germplasm resource protection in Chinese clearhead icefish, as well as other icefishes. It is also a valuable genetic resource for revealing the molecular mechanisms for the cavefish-like characters. © The Authors 2017. Published by Oxford University Press.
Draft genome of the Northern snakehead, Channa argus.
Xu, Jian; Bian, Chao; Chen, Kunci; Liu, Guiming; Jiang, Yanliang; Luo, Qing; You, Xinxin; Peng, Wenzhu; Li, Jia; Huang, Yu; Yi, Yunhai; Dong, Chuanju; Deng, Hua; Zhang, Songhao; Zhang, Hanyuan; Shi, Qiong; Xu, Peng
2017-04-01
The Northern snakehead (Channa argus), a member of the Channidae family of the Perciformes, is an economically important freshwater fish native to East Asia. In North America, it has become notorious as an intentionally released invasive species. Its ability to breathe air with gills and migrate short distances over land makes it a good model for bimodal breath research. Therefore, recent research has focused on the identification of relevant candidate genes. Here, we performed whole genome sequencing of C. argus to construct its draft genome, aiming to offer useful information for further functional studies and identification of target genes related to its unusual facultative air breathing. Findings: We assembled the C. argus genome with a total of 140.3 Gb of raw reads, which were sequenced using the Illumina HiSeq2000 platform. The final draft genome assembly was approximately 615.3 Mb, with a contig N50 of 81.4 kb and scaffold N50 of 4.5 Mb. The identified repeat sequences account for 18.9% of the whole genome. The 19 877 protein-coding genes were predicted from the genome assembly, with an average of 10.5 exons per gene. Conclusion: We generated a high-quality draft genome of C. argus, which will provide a valuable genetic resource for further biomedical investigations of this economically important teleost fish. © The Author 2017. Published by Oxford University Press.
A physical map of the bovine genome
Snelling, Warren M; Chiu, Readman; Schein, Jacqueline E; Hobbs, Matthew; Abbey, Colette A; Adelson, David L; Aerts, Jan; Bennett, Gary L; Bosdet, Ian E; Boussaha, Mekki; Brauning, Rudiger; Caetano, Alexandre R; Costa, Marcos M; Crawford, Allan M; Dalrymple, Brian P; Eggen, André; Everts-van der Wind, Annelie; Floriot, Sandrine; Gautier, Mathieu; Gill, Clare A; Green, Ronnie D; Holt, Robert; Jann, Oliver; Jones, Steven JM; Kappes, Steven M; Keele, John W; de Jong, Pieter J; Larkin, Denis M; Lewin, Harris A; McEwan, John C; McKay, Stephanie; Marra, Marco A; Mathewson, Carrie A; Matukumalli, Lakshmi K; Moore, Stephen S; Murdoch, Brenda; Nicholas, Frank W; Osoegawa, Kazutoyo; Roy, Alice; Salih, Hanni; Schibler, Laurent; Schnabel, Robert D; Silveri, Licia; Skow, Loren C; Smith, Timothy PL; Sonstegard, Tad S; Taylor, Jeremy F; Tellam, Ross; Van Tassell, Curtis P; Williams, John L; Womack, James E; Wye, Natasja H; Yang, George; Zhao, Shaying
2007-01-01
Background: Cattle are important agriculturally and relevant as a model organism. Previously described genetic and radiation hybrid (RH) maps of the bovine genome have been used to identify genomic regions and genes affecting specific traits. Application of these maps to identify influential genetic polymorphisms will be enhanced by integration with each other and with bacterial artificial chromosome (BAC) libraries. The BAC libraries and clone maps are essential for the hybrid clone-by-clone/whole-genome shotgun sequencing approach taken by the bovine genome sequencing project. Results: A bovine BAC map was constructed with HindIII restriction digest fragments of 290,797 BAC clones from animals of three different breeds. Comparative mapping of 422,522 BAC end sequences assisted with BAC map ordering and assembly. Genotypes and pedigree from two genetic maps and marker scores from three whole-genome RH panels were consolidated on a 17,254-marker composite map. Sequence similarity allowed integrating the BAC and composite maps with the bovine draft assembly (Btau3.1), establishing a comprehensive resource describing the bovine genome. Agreement between the marker and BAC maps and the draft assembly is high, although discrepancies exist. The composite and BAC maps are more similar than either is to the draft assembly. Conclusion: Further refinement of the maps and greater integration into the genome assembly process may contribute to a high quality assembly. The maps provide resources to associate phenotypic variation with underlying genomic variation, and are crucial resources for understanding the biology underpinning this important ruminant species so closely associated with humans. PMID:17697342
O'Brien, Heath E; Gong, Yunchen; Fung, Pauline; Wang, Pauline W; Guttman, David S
2011-01-01
Next-generation genomic technology has both greatly accelerated the pace of genome research and increased our reliance on draft genome sequences. While groups such as the Genomics Standards Consortium have made strong efforts to promote genome standards, there is still a general lack of uniformity among published draft genomes, leading to challenges for downstream comparative analyses. This lack of uniformity is a particular problem when using standard draft genomes that frequently have large numbers of low-quality sequencing tracts. Here we present a proposal for an "enhanced-quality draft" genome that identifies at least 95% of the coding sequences, thereby effectively providing a full accounting of the genic component of the genome. Enhanced-quality draft genomes are easily attainable through a combination of small- and large-insert next-generation, paired-end sequencing. We illustrate the generation of an enhanced-quality draft genome by re-sequencing the plant pathogenic bacterium Pseudomonas syringae pv. phaseolicola 1448A (Pph 1448A), which has a published, closed genome sequence of 5.93 Mbp. We use a combination of Illumina paired-end and mate-pair sequencing, and surprisingly find that de novo assemblies with 100x paired-end coverage and mate-pair sequencing with as low as 2-5x coverage are substantially better than assemblies based on higher coverage. The rapid and low-cost generation of large numbers of enhanced-quality draft genome sequences will be of particular value for microbial diagnostics and biosecurity, which rely on precise discrimination of potentially dangerous clones from closely related benign strains.
Draft genome sequence of Dactylonectria macrodidyma, a plant pathogenic fungus in the Nectriaceae
USDA-ARS's Scientific Manuscript database
Dactylonectria macrodidyma is part of the Nectriaceae, a family containing important plant pathogens. This species possesses the ability to induce disease on grapevine, avocado and olive. Here, we report the first draft genome of D. macrodidyma isolate JAC15-08. The assembled genome was 58 Mbp and c...
USDA-ARS's Scientific Manuscript database
Intense artificial selection over the last 100 years has produced elite maize (Zea mays) inbred lines that combine to produce high-yielding hybrids. To further our understanding of how genome and transcriptome variation contribute to the production of high-yielding hybrids, we generated a draft geno...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Starkenburg, S. R.; Polle, J. E. W.; Hovde, B.
The green alga Scenedesmus obliquus is an emerging platform species for the industrial production of biofuels. Here, we report the draft assembly and annotation for the nuclear, plastid, and mitochondrial genomes of S. obliquus strain DOE0152z.
Project Hill-Climb: Drafting and Design in Motion
ERIC Educational Resources Information Center
Crowl, William F.
2008-01-01
This article describes the Hill-Climb project of a second level Computer-Aided Drafting and Design (CADD) class. The author primarily designed the activity to increase student understanding of the assembly drawing process and its components. The emphasis on problem solving adds a dimension that can aid students in their other classes as well. By…
Draft Genomes for Eight Burkholderia mallei Isolates from Turkey
DOE Office of Scientific and Technical Information (OSTI.GOV)
Daligault, H. E.; Johnson, Shannon L.; Davenport, K. W.
Burkholderia mallei, the etiologic agent of glanders, is a Gram-negative, nonmotile, facultative intracellular pathogen. Though glanders have been eradicated from many parts of the world, the threat of B. mallei being used as a weapon is very real. We, then, present draft genome assemblies of 8 Burkholderia mallei strains that were isolated in Turkey.
Starkenburg, S. R.; Polle, J. E. W.; Hovde, B.; ...
2017-08-10
The green alga Scenedesmus obliquus is an emerging platform species for the industrial production of biofuels. Here, we report the draft assembly and annotation for the nuclear, plastid, and mitochondrial genomes of S. obliquus strain DOE0152z.
Draft Genomes for Eight Burkholderia mallei Isolates from Turkey
Daligault, H. E.; Johnson, Shannon L.; Davenport, K. W.; ...
2016-01-07
Burkholderia mallei, the etiologic agent of glanders, is a Gram-negative, nonmotile, facultative intracellular pathogen. Though glanders have been eradicated from many parts of the world, the threat of B. mallei being used as a weapon is very real. We, then, present draft genome assemblies of 8 Burkholderia mallei strains that were isolated in Turkey.
The draft genome sequence of cork oak
Ramos, António Marcos; Usié, Ana; Barbosa, Pedro; Barros, Pedro M.; Capote, Tiago; Chaves, Inês; Simões, Fernanda; Abreu, Isabl; Carrasquinho, Isabel; Faro, Carlos; Guimarães, Joana B.; Mendonça, Diogo; Nóbrega, Filomena; Rodrigues, Leandra; Saibo, Nelson J. M.; Varela, Maria Carolina; Egas, Conceição; Matos, José; Miguel, Célia M.; Oliveira, M. Margarida; Ricardo, Cândido P.; Gonçalves, Sónia
2018-01-01
Cork oak (Quercus suber) is native to southwest Europe and northwest Africa where it plays a crucial environmental and economical role. To tackle the cork oak production and industrial challenges, advanced research is imperative but dependent on the availability of a sequenced genome. To address this, we produced the first draft version of the cork oak genome. We followed a de novo assembly strategy based on high-throughput sequence data, which generated a draft genome comprising 23,347 scaffolds and 953.3 Mb in size. A total of 79,752 genes and 83,814 transcripts were predicted, including 33,658 high-confidence genes. An InterPro signature assignment was detected for 69,218 transcripts, which represented 82.6% of the total. Validation studies demonstrated the genome assembly and annotation completeness and highlighted the usefulness of the draft genome for read mapping of high-throughput sequence data generated using different protocols. All data generated is available through the public databases where it was deposited, being therefore ready to use by the academic and industry communities working on cork oak and/or related species. PMID:29786699
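The annotation percentages in this abstract follow directly from the reported counts. A small check, using only figures stated above (the helper name is illustrative, and the high-confidence share is a derived value that the abstract itself does not state):

```python
def percent(part, whole):
    """Return part as a percentage of whole, rounded to one decimal place."""
    return round(100 * part / whole, 1)


# Counts taken from the abstract.
transcripts_total = 83_814
transcripts_with_interpro = 69_218
genes_total = 79_752
genes_high_confidence = 33_658

print(percent(transcripts_with_interpro, transcripts_total))  # reported as 82.6%
print(percent(genes_high_confidence, genes_total))            # derived, not reported
```

The first value reproduces the reported 82.6%; the second suggests that roughly 42% of predicted genes fall in the high-confidence set.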
The draft genome sequence of cork oak.
Ramos, António Marcos; Usié, Ana; Barbosa, Pedro; Barros, Pedro M; Capote, Tiago; Chaves, Inês; Simões, Fernanda; Abreu, Isabl; Carrasquinho, Isabel; Faro, Carlos; Guimarães, Joana B; Mendonça, Diogo; Nóbrega, Filomena; Rodrigues, Leandra; Saibo, Nelson J M; Varela, Maria Carolina; Egas, Conceição; Matos, José; Miguel, Célia M; Oliveira, M Margarida; Ricardo, Cândido P; Gonçalves, Sónia
2018-05-22
Cork oak (Quercus suber) is native to southwest Europe and northwest Africa where it plays a crucial environmental and economic role. To tackle the cork oak production and industrial challenges, advanced research is imperative but dependent on the availability of a sequenced genome. To address this, we produced the first draft version of the cork oak genome. We followed a de novo assembly strategy based on high-throughput sequence data, which generated a draft genome comprising 23,347 scaffolds and 953.3 Mb in size. A total of 79,752 genes and 83,814 transcripts were predicted, including 33,658 high-confidence genes. An InterPro signature assignment was detected for 69,218 transcripts, which represented 82.6% of the total. Validation studies demonstrated the genome assembly and annotation completeness and highlighted the usefulness of the draft genome for read mapping of high-throughput sequence data generated using different protocols. All data generated is available through the public databases where it was deposited, being therefore ready to use by the academic and industry communities working on cork oak and/or related species.
Draft genome sequence of the silver pomfret fish, Pampus argenteus.
AlMomin, Sabah; Kumar, Vinod; Al-Amad, Sami; Al-Hussaini, Mohsen; Dashti, Talal; Al-Enezi, Khaznah; Akbar, Abrar
2016-01-01
Silver pomfret, Pampus argenteus, is a fish species from coastal waters. Despite its high commercial value, this edible fish has not been sequenced. Hence, its genetic and genomic studies have been limited. We report the first draft genome sequence of the silver pomfret obtained using a Next Generation Sequencing (NGS) technology. We assembled 38.7 Gb of nucleotides into scaffolds of 350 Mb with N50 of about 1.5 kb, using high quality paired end reads. These scaffolds represent 63.7% of the estimated silver pomfret genome length. The newly sequenced and assembled genome has 11.06% repetitive DNA regions, and this percentage is comparable to that of the tilapia genome. The genome analysis predicted 16 322 genes. About 91% of these genes showed homology with known proteins. Many gene clusters were annotated to protein and fatty-acid metabolism pathways that may be important in the context of the meat texture and immune system developmental processes. The reference genome can pave the way for the identification of many other genomic features that could improve breeding and population-management strategies, and it can also help characterize the genetic diversity of P. argenteus.
Edger, Patrick P; VanBuren, Robert; Colle, Marivi; Poorten, Thomas J; Wai, Ching Man; Niederhuth, Chad E; Alger, Elizabeth I; Ou, Shujun; Acharya, Charlotte B; Wang, Jie; Callow, Pete; McKain, Michael R; Shi, Jinghua; Collier, Chad; Xiong, Zhiyong; Mower, Jeffrey P; Slovin, Janet P; Hytönen, Timo; Jiang, Ning; Childs, Kevin L; Knapp, Steven J
2018-02-01
Although draft genomes are available for most agronomically important plant species, the majority are incomplete, highly fragmented, and often riddled with assembly and scaffolding errors. These assembly issues hinder advances in tool development for functional genomics and systems biology. Here we utilized a robust, cost-effective approach to produce high-quality reference genomes. We report a near-complete genome of diploid woodland strawberry (Fragaria vesca) using single-molecule real-time sequencing from Pacific Biosciences (PacBio). This assembly has a contig N50 length of ∼7.9 million base pairs (Mb), representing a ∼300-fold improvement of the previous version. The vast majority (>99.8%) of the assembly was anchored to 7 pseudomolecules using 2 sets of optical maps from Bionano Genomics. We obtained ∼24.96 Mb of sequence not present in the previous version of the F. vesca genome and produced an improved annotation that includes 1496 new genes. Comparative syntenic analyses uncovered numerous, large-scale scaffolding errors present in each chromosome in the previously published version of the F. vesca genome. Our results highlight the need to improve existing short-read based reference genomes. Furthermore, we demonstrate how genome quality impacts commonly used analyses for addressing both fundamental and applied biological questions. © The Authors 2017. Published by Oxford University Press.
Batista, Thiago M; Moreira, Rennan G; Hilário, Heron O; Morais, Camila G; Franco, Glória R; Rosa, Luiz H; Rosa, Carlos A
2017-03-01
We present the draft genome sequence of the type strain of the yeast Sugiyamaella xylanicola UFMG-CM-Y1884ᵀ (= UFMG-CA-32.1ᵀ = CBS 12683ᵀ), a xylan-degrading species capable of fermenting D-xylose to ethanol. The assembled genome has a size of ~13.7 Mb and a GC content of 33.8% and contains 5971 protein-coding genes. We identified 15 genes with significant similarity to the D-xylose reductase gene from several other fungal species. The draft genome assembled from whole-genome shotgun sequencing of the yeast Sugiyamaella xylanicola UFMG-CM-Y1884ᵀ (= UFMG-CA-32.1ᵀ = CBS 12683ᵀ) has been deposited at DDBJ/ENA/GenBank under the accession number MQSX00000000 under version MQSX01000000.
Martin, Guillaume; Baurens, Franc-Christophe; Droc, Gaëtan; Rouard, Mathieu; Cenci, Alberto; Kilian, Andrzej; Hastie, Alex; Doležel, Jaroslav; Aury, Jean-Marc; Alberti, Adriana; Carreel, Françoise; D'Hont, Angélique
2016-03-16
Recent advances in genomics indicate functional significance of a majority of genome sequences and their long range interactions. As a detailed examination of genome organization and function requires very high quality genome sequence, the objective of this study was to improve reference genome assembly of banana (Musa acuminata). We have developed a modular bioinformatics pipeline to improve genome sequence assemblies, which can handle various types of data. The pipeline comprises several semi-automated tools. However, unlike classical automated tools that are based on global parameters, the semi-automated tools proposed an expert mode for a user who can decide on suggested improvements through local compromises. The pipeline was used to improve the draft genome sequence of Musa acuminata. Genotyping by sequencing (GBS) of a segregating population and paired-end sequencing were used to detect and correct scaffold misassemblies. Long insert size paired-end reads identified scaffold junctions and fusions missed by automated assembly methods. GBS markers were used to anchor scaffolds to pseudo-molecules with a new bioinformatics approach that avoids the tedious step of marker ordering during genetic map construction. Furthermore, a genome map was constructed and used to assemble scaffolds into super scaffolds. Finally, a consensus gene annotation was projected on the new assembly from two pre-existing annotations. This approach reduced the total Musa scaffold number from 7513 to 1532 (i.e. by 80%), with an N50 that increased from 1.3 Mb (65 scaffolds) to 3.0 Mb (26 scaffolds). 89.5% of the assembly was anchored to the 11 Musa chromosomes compared to the previous 70%. Unknown sites (N) were reduced from 17.3 to 10.0%. The release of the Musa acuminata reference genome version 2 provides a platform for detailed analysis of banana genome variation, function and evolution. Bioinformatics tools developed in this work can be used to improve genome sequence assemblies in other species.
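Note on the contiguity metrics quoted in this and several other records: N50 is commonly defined as the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the assembly, and the number of sequences in that set is often called L50. The scaffold counts given in parentheses above appear to be of that second kind, although the abstract does not say so. The Python sketch below is illustrative only: the function names and the toy lengths are assumptions, not part of any cited pipeline. It also checks the scaffold reduction quoted above.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a list of contig or scaffold lengths."""
    if not lengths:
        raise ValueError("need at least one sequence length")
    ordered = sorted(lengths, reverse=True)   # longest first
    half_total = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half_total:
            # 'length' is N50; 'count' is the number of sequences used (L50)
            return length, count


def percent_reduction(before, after):
    """Percentage by which a count dropped from 'before' to 'after'."""
    return 100.0 * (before - after) / before


# Hypothetical toy assembly (lengths in arbitrary units), not real data.
toy_lengths = [80, 70, 50, 40, 30, 20, 10]
print(n50_l50(toy_lengths))

# Scaffold counts quoted in the banana abstract: 7513 before, 1532 after.
print(round(percent_reduction(7513, 1532), 1))
```

For the toy lengths the total is 300, so the two longest sequences (80 + 70) reach the halfway mark, giving an N50 of 70 and an L50 of 2. The banana figures work out to a reduction of about 79.6%, consistent with the rounded 80% in the abstract.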
Recovering complete and draft population genomes from metagenome datasets
Sangwan, Naseer; Xia, Fangfang; Gilbert, Jack A.
2016-03-08
Assembly of metagenomic sequence data into microbial genomes is of fundamental value to improving our understanding of microbial ecology and metabolism by elucidating the functional potential of hard-to-culture microorganisms. Here, we provide a synthesis of available methods to bin metagenomic contigs into species-level groups and highlight how genetic diversity, sequencing depth, and coverage influence binning success. Despite the computational cost on application to deeply sequenced complex metagenomes (e.g., soil), covarying patterns of contig coverage across multiple datasets significantly improves the binning process. We also discuss and compare current genome validation methods and reveal how these methods tackle the problem of chimeric genome bins i.e., sequences from multiple species. Finally, we explore how population genome assembly can be used to uncover biogeographic trends and to characterize the effect of in situ functional constraints on the genome-wide evolution.
Recovering complete and draft population genomes from metagenome datasets
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sangwan, Naseer; Xia, Fangfang; Gilbert, Jack A.
Assembly of metagenomic sequence data into microbial genomes is of fundamental value to improving our understanding of microbial ecology and metabolism by elucidating the functional potential of hard-to-culture microorganisms. Here, we provide a synthesis of available methods to bin metagenomic contigs into species-level groups and highlight how genetic diversity, sequencing depth, and coverage influence binning success. Despite the computational cost on application to deeply sequenced complex metagenomes (e.g., soil), covarying patterns of contig coverage across multiple datasets significantly improves the binning process. We also discuss and compare current genome validation methods and reveal how these methods tackle the problem of chimeric genome bins i.e., sequences from multiple species. Finally, we explore how population genome assembly can be used to uncover biogeographic trends and to characterize the effect of in situ functional constraints on the genome-wide evolution.
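Note on coverage covariation: the abstract above states that covarying contig coverage across multiple datasets improves binning. One way to picture the idea is that contigs from the same population should rise and fall in coverage together across samples. The Python sketch below is a deliberately simplified illustration under that assumption. The greedy seed-based grouping, the 0.95 threshold, the function names and the toy coverage values are all hypothetical and do not reproduce any of the binning methods the review surveys.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length coverage profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a flat profile carries no covariation signal
    return cov / sqrt(var_x * var_y)


def bin_by_covariation(coverage, threshold=0.95):
    """Greedily group contigs whose coverage profile correlates with a bin's first member."""
    bins = []
    for name, profile in coverage.items():
        for members in bins:
            seed_profile = coverage[members[0]]
            if pearson(profile, seed_profile) >= threshold:
                members.append(name)
                break
        else:
            bins.append([name])  # no existing bin matched: start a new one
    return bins


# Hypothetical per-sample coverage for four contigs across four datasets.
toy_coverage = {
    "contig_a": [10, 40, 5, 20],
    "contig_b": [12, 38, 6, 22],
    "contig_c": [30, 5, 25, 8],
    "contig_d": [28, 6, 27, 9],
}
print(bin_by_covariation(toy_coverage))
```

With these toy values, contig_a and contig_b track each other closely and end up in one group, while contig_c and contig_d form a second group. Real binning tools typically combine coverage with sequence composition and use more robust clustering than this single-pass comparison.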
Kisand, Veljo; Lettieri, Teresa
2013-04-01
De novo genome sequencing of previously uncharacterized microorganisms has the potential to open up new frontiers in microbial genomics by providing insight into both functional capabilities and biodiversity. Until recently, Roche 454 pyrosequencing was the NGS method of choice for de novo assembly because it generates hundreds of thousands of long reads (<450 bps), which are presumed to aid in the analysis of uncharacterized genomes. The array of tools for processing NGS data are increasingly free and open source and are often adopted for both their high quality and role in promoting academic freedom. The error rate of pyrosequencing the Alcanivorax borkumensis genome was such that thousands of insertions and deletions were artificially introduced into the finished genome. Despite a high coverage (~30 fold), it did not allow the reference genome to be fully mapped. Reads from regions with errors had low quality, low coverage, or were missing. The main defect of the reference mapping was the introduction of artificial indels into contigs through lower than 100% consensus and distracting gene calling due to artificial stop codons. No assembler was able to perform de novo assembly comparable to reference mapping. Automated annotation tools performed similarly on reference mapped and de novo draft genomes, and annotated most CDSs in the de novo assembled draft genomes. Free and open source software (FOSS) tools for assembly and annotation of NGS data are being developed rapidly to provide accurate results with less computational effort. Usability is not high priority and these tools currently do not allow the data to be processed without manual intervention. Despite this, genome assemblers now readily assemble medium short reads into long contigs (>97-98% genome coverage). A notable gap in pyrosequencing technology is the quality of base pair calling and conflicting base pairs between single reads at the same nucleotide position. Regardless, using draft whole genomes that are not finished and remain fragmented into tens of contigs allows one to characterize unknown bacteria with modest effort.
Li, Qili; Bu, Junyan; Yu, Zhihe; Tang, Lihua; Huang, Suiping; Guo, Tangxun; Mo, Jianyou; Hsiang, Tom
2018-02-22
Here, we present a draft genome sequence of isolate 15060 of Colletotrichum fructicola, a causal agent of mango anthracnose. The final assembly consists of 1,048 scaffolds totaling 56,493,063 bp (G+C content, 53.38%) and 15,180 predicted genes. Copyright © 2018 Li et al.
Draft Genome Sequence of “Cohnella kolymensis” B-2846
Kudryashova, Ekaterina B.; Ariskina, Elena V.
2016-01-01
A draft genome sequence of “Cohnella kolymensis” strain B-2846 was derived using IonTorrent sequencing technology. The size of the assembly and G+C content were in agreement with those of other species of this genus. Characterization of the genome of a novel species of Cohnella will assist in bacterial systematics. PMID:26769947
Federal Register 2010, 2011, 2012, 2013, 2014
2013-08-23
... Dichloromethane and N-Methylpyrrolidone.'' Dichloromethane and N-Methylpyrrolidone (DCM and NMP) (CASRN 75-09-2... represent and should not be construed to represent any Agency determination or policy. The draft DCM and NMP..., to assemble a panel of experts to evaluate the draft DCM and NMP TSCA risk assessment report for...
Yakym, Christopher J.; Helmkampf, Martin; Hagiwara, Kehau; Ip, Courtney G.; Antonio, Brandi J.; Armstrong, Ellie; Ulloa, Wesley J.; Awaya, Jonathan D.
2016-01-01
We report here the 6.0-Mb draft genome assembly of Pseudoalteromonas luteoviolacea strain IPB1 that was isolated from the Hawaiian marine sponge Iotrochota protea. Genome mining complemented with bioassay studies will elucidate secondary metabolite biosynthetic pathways and will help explain the ecological interaction between host sponge and microorganism. PMID:27660784
Singh, Nitin Kumar; Blachowicz, Adriana; Checinska, Aleksandra; Wang, Clay; Venkateswaran, Kasthuri
2016-07-14
Draft genome sequences of Aspergillus fumigatus strains (ISSFT-021 and IF1SW-F4), opportunistic pathogens isolated from the International Space Station (ISS), were assembled to facilitate investigations of the nature of the virulence characteristics of the ISS strains compared with those of other clinical strains isolated on Earth. Copyright © 2016 Singh et al.
Improved hybrid de novo genome assembly of domesticated apple (Malus x domestica).
Li, Xuewei; Kui, Ling; Zhang, Jing; Xie, Yinpeng; Wang, Liping; Yan, Yan; Wang, Na; Xu, Jidi; Li, Cuiying; Wang, Wen; van Nocker, Steve; Dong, Yang; Ma, Fengwang; Guan, Qingmei
2016-08-08
Domesticated apple (Malus × domestica Borkh) is a popular temperate fruit with high nutrient levels and diverse flavors. In 2012, global apple production accounted for at least one tenth of all harvested fruits. A high-quality apple genome assembly is crucial for the selection and breeding of new cultivars. Currently, a single reference genome is available for apple, assembled from 16.9× genome coverage short reads via Sanger and 454 sequencing technologies. Although a useful resource, this assembly covers only ~89% of the non-repetitive portion of the genome, and has a relatively short (16.7 kb) contig N50 length. These downsides make it difficult to apply this reference in transcriptive or whole-genome re-sequencing analyses. Here we present an improved hybrid de novo genomic assembly of apple (Golden Delicious), which was obtained from 76 Gb (~102× genome coverage) Illumina HiSeq data and 21.7 Gb (~29× genome coverage) PacBio data. The final draft genome is approximately 632.4 Mb, representing ~90% of the estimated genome. The contig N50 size is 111,619 bp, representing a 7-fold improvement. Further annotation analyses predicted 53,922 protein-coding genes and 2,765 non-coding RNA genes. The new apple genome assembly will serve as a valuable resource for investigating complex apple traits at the genomic level. It is not only suitable for genome editing and gene cloning, but also for RNA-seq and whole-genome re-sequencing studies.
Sequencing and assembly of the 22-gb loblolly pine genome.
Zimin, Aleksey; Stevens, Kristian A; Crepeau, Marc W; Holtz-Morris, Ann; Koriabine, Maxim; Marçais, Guillaume; Puiu, Daniela; Roberts, Michael; Wegrzyn, Jill L; de Jong, Pieter J; Neale, David B; Salzberg, Steven L; Yorke, James A; Langley, Charles H
2014-03-01
Conifers are the predominant gymnosperm. The size and complexity of their genomes has presented formidable technical challenges for whole-genome shotgun sequencing and assembly. We employed novel strategies that allowed us to determine the loblolly pine (Pinus taeda) reference genome sequence, the largest genome assembled to date. Most of the sequence data were derived from whole-genome shotgun sequencing of a single megagametophyte, the haploid tissue of a single pine seed. Although that constrained the quantity of available DNA, the resulting haploid sequence data were well-suited for assembly. The haploid sequence was augmented with multiple linking long-fragment mate pair libraries from the parental diploid DNA. For the longest fragments, we used novel fosmid DiTag libraries. Sequences from the linking libraries that did not match the megagametophyte were identified and removed. Assembly of the sequence data was aided by condensing the enormous number of paired-end reads into a much smaller set of longer "super-reads," rendering subsequent assembly with an overlap-based assembly algorithm computationally feasible. To further improve the contiguity and biological utility of the genome sequence, additional scaffolding methods utilizing independent genome and transcriptome assemblies were implemented. The combination of these strategies resulted in a draft genome sequence of 20.15 billion bases, with an N50 scaffold size of 66.9 kbp.
Sharma, Sandeep; Zaccaron, Alex Z; Ridenour, John B; Allen, Tom W; Conner, Kassie; Doyle, Vinson P; Price, Trey; Sikora, Edward; Singh, Raghuwinder; Spurlock, Terry; Tomaso-Peterson, Maria; Wilkerson, Tessie; Bluhm, Burton H
2018-04-01
The draft genome of Xylaria sp. isolate MSU_SB201401, causal agent of taproot decline of soybean in the southern U.S., is presented here. The genome assembly was 56.7 Mb in size with an L50 of 246. A total of 10,880 putative protein-encoding genes were predicted, including 647 genes encoding carbohydrate-active enzymes and 1053 genes encoding secreted proteins. This is the first draft genome of a plant-pathogenic Xylaria sp. associated with soybean. The draft genome of Xylaria sp. isolate MSU_SB201401 will provide an important resource for future experiments to determine the molecular basis of pathogenesis.
Draft Genome Sequence of Microbacterium sp. Strain UCD-TDU (Phylum Actinobacteria)
Bendiks, Zachary A.; Lang, Jenna M.; Darling, Aaron E.; Coil, David A.
2013-01-01
Here, we present the draft genome sequence of Microbacterium sp. strain UCD-TDU, a member of the phylum Actinobacteria. The assembly contains 3,746,321 bp (in 8 scaffolds). This strain was isolated from a residential toilet as part of an undergraduate student research project to sequence reference genomes of microbes from the built environment. PMID:23516225
Draft Genome Sequence of Lactobacillus paracasei DmW181, a Bacterium Isolated from Wild Drosophila.
Hammer, Austin J; Walters, Amber; Carroll, Courtney; Newell, Peter D; Chaston, John M
2017-07-06
The draft genome sequence of Lactobacillus paracasei DmW181, an anaerobic bacterium isolated from wild Drosophila flies, is reported here. Strain DmW181 possesses genes for sialic acid and mannose metabolism. The assembled genome is 3,201,429 bp, with 3,454 predicted genes. Copyright © 2017 Hammer et al.
Draft Genome Sequence of Hafnia paralvei Strain GTA-HAF03.
Kohlman, Melissa E; Carrillo, Catherine D; Wong, Alex
2015-02-19
Hafnia paralvei is a Gram-negative member of the Enterobacteriaceae family, closely related to the opportunistic pathogen Hafnia alvei. We report here the first draft genome sequence of H. paralvei, from the beef trim isolate GTA-HAF03, consisting of a 5.0-Mbp assembly encoding 4,382 proteins and 90 predicted RNAs. Copyright © 2015 Kohlman et al.
Draft Genome Sequence of the Tyramine Producer Enterococcus durans Strain IPLA 655
Ladero, Victor; Linares, Daniel M.; del Rio, Beatriz; Fernandez, Maria; Martin, M. Cruz
2013-01-01
We here report a 3.059-Mbp draft assembly for the genome of Enterococcus durans strain IPLA 655. This dairy isolate provides a model for studying the regulation of the biosynthesis of tyramine (a toxic compound). These results should aid our understanding of tyramine production and allow tyramine accumulation in food to be reduced. PMID:23682153
Draft genome sequence and genetic transformation of the oleaginous alga Nannochloropis gaditana
Radakovits, Randor; Jinkerson, Robert E.; Fuerstenberg, Susan I.; Tae, Hongseok; Settlage, Robert E.; Boore, Jeffrey L.; Posewitz, Matthew C.
2012-01-01
The potential use of algae in biofuels applications is receiving significant attention. However, none of the current algal model species are competitive production strains. Here we present a draft genome sequence and a genetic transformation method for the marine microalga Nannochloropsis gaditana CCMP526. We show that N. gaditana has highly favourable lipid yields, and is a promising production organism. The genome assembly includes nuclear (~29 Mb) and organellar genomes, and contains 9,052 gene models. We define the genes required for glycerolipid biogenesis and detail the differential regulation of genes during nitrogen-limited lipid biosynthesis. Phylogenomic analysis identifies genetic attributes of this organism, including unique stramenopile photosynthesis genes and gene expansions that may explain the distinguishing photoautotrophic phenotypes observed. The availability of a genome sequence and transformation methods will facilitate investigations into N. gaditana lipid biosynthesis and permit genetic engineering strategies to further improve this naturally productive alga. PMID:22353717
Draft genome sequence and genetic transformation of the oleaginous alga Nannochloropis gaditana.
Radakovits, Randor; Jinkerson, Robert E; Fuerstenberg, Susan I; Tae, Hongseok; Settlage, Robert E; Boore, Jeffrey L; Posewitz, Matthew C
2012-02-21
The potential use of algae in biofuels applications is receiving significant attention. However, none of the current algal model species are competitive production strains. Here we present a draft genome sequence and a genetic transformation method for the marine microalga Nannochloropsis gaditana CCMP526. We show that N. gaditana has highly favourable lipid yields, and is a promising production organism. The genome assembly includes nuclear (~29 Mb) and organellar genomes, and contains 9,052 gene models. We define the genes required for glycerolipid biogenesis and detail the differential regulation of genes during nitrogen-limited lipid biosynthesis. Phylogenomic analysis identifies genetic attributes of this organism, including unique stramenopile photosynthesis genes and gene expansions that may explain the distinguishing photoautotrophic phenotypes observed. The availability of a genome sequence and transformation methods will facilitate investigations into N. gaditana lipid biosynthesis and permit genetic engineering strategies to further improve this naturally productive alga.
Genome sequence and genetic diversity of the common carp, Cyprinus carpio.
Xu, Peng; Zhang, Xiaofeng; Wang, Xumin; Li, Jiongtang; Liu, Guiming; Kuang, Youyi; Xu, Jian; Zheng, Xianhu; Ren, Lufeng; Wang, Guoliang; Zhang, Yan; Huo, Linhe; Zhao, Zixia; Cao, Dingchen; Lu, Cuiyun; Li, Chao; Zhou, Yi; Liu, Zhanjiang; Fan, Zhonghua; Shan, Guangle; Li, Xingang; Wu, Shuangxiu; Song, Lipu; Hou, Guangyuan; Jiang, Yanliang; Jeney, Zsigmond; Yu, Dan; Wang, Li; Shao, Changjun; Song, Lai; Sun, Jing; Ji, Peifeng; Wang, Jian; Li, Qiang; Xu, Liming; Sun, Fanyue; Feng, Jianxin; Wang, Chenghui; Wang, Shaolin; Wang, Baosen; Li, Yan; Zhu, Yaping; Xue, Wei; Zhao, Lan; Wang, Jintu; Gu, Ying; Lv, Weihua; Wu, Kejing; Xiao, Jingfa; Wu, Jiayan; Zhang, Zhang; Yu, Jun; Sun, Xiaowen
2014-11-01
The common carp, Cyprinus carpio, is one of the most important cyprinid species and globally accounts for 10% of freshwater aquaculture production. Here we present a draft genome of domesticated C. carpio (strain Songpu), whose current assembly contains 52,610 protein-coding genes and approximately 92.3% coverage of its paleotetraploidized genome (2n = 100). The latest round of whole-genome duplication has been estimated to have occurred approximately 8.2 million years ago. Genome resequencing of 33 representative individuals from worldwide populations demonstrates a single origin for C. carpio in 2 subspecies (C. carpio haematopterus and C. carpio carpio). Integrative genomic and transcriptomic analyses were used to identify loci potentially associated with traits including scaling patterns and skin color. In combination with the high-resolution genetic map, the draft genome paves the way for better molecular studies and improved genome-assisted breeding of C. carpio and other closely related species.
An Annotated Draft Genome for Radix auricularia (Gastropoda, Mollusca)
Feldmeyer, Barbara; Schmidt, Hanno; Greshake, Bastian; Tills, Oliver; Truebano, Manuela; Rundle, Simon D.; Paule, Juraj; Ebersberger, Ingo; Pfenninger, Markus
2017-01-01
Molluscs are the second most species-rich phylum in the animal kingdom, yet only 11 genomes of this group have been published so far. Here, we present the draft genome sequence of the pulmonate freshwater snail Radix auricularia. Six whole genome shotgun libraries with different layouts were sequenced. The resulting assembly comprises 4,823 scaffolds with a cumulative length of 910 Mb and an overall read coverage of 72×. The assembly contains 94.6% of a metazoan core gene collection, indicating an almost complete coverage of the coding fraction. The discrepancy of ∼690 Mb compared with the estimated genome size of R. auricularia (1.6 Gb) results from a high repeat content of 70% mainly comprising DNA transposons. The annotation of 17,338 protein coding genes was supported by the use of publicly available transcriptome data. This draft will serve as starting point for further genomic and population genetic research in this scientifically important phylum. PMID:28204581
Draft sequencing and analysis of the genome of pufferfish Takifugu flavidus.
Gao, Yang; Gao, Qiang; Zhang, Huan; Wang, Lingling; Zhang, Fuchong; Yang, Chuanyan; Song, Linsheng
2014-12-01
The pufferfish Takifugu flavidus is an important economic species due to its outstanding flavour and high market value. It has been regarded as an excellent model of genetic study for decades as well. In the present study, three mate-pair libraries of T. flavidus genome were sequenced by the SOLiD 4 next-generation sequencing platform, and the draft genome was constructed with the short reads using an assisted assembly strategy. The draft consists of 50,947 scaffolds with an N50 value of 305.7 kb, and the average GC content was 45.2%. The combined length of repetitive sequences was 26.5 Mb, which accounted for 6.87% of the genome, indicating that the compactness of the T. flavidus genome was comparable to that of the T. rubripes genome. A total of 1,253 non-coding RNA genes and 30,285 protein-encoding genes were assigned to the genome. There were 132, 775, and 394 presumptive genes playing roles in the colour pattern variation, the relatively slow growth and the lipid metabolism, respectively. Among them, genes involved in the microtubule-dependent transport system, angiogenesis, decapentaplegic pathway and lipid mobilization were significantly expanded in the T. flavidus genome. This draft genome provides a valuable resource for understanding and improving both fundamental and applied research with pufferfish in the future. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Sutcliffe, Brodie; Rosewarne, Carly P.; Greenfield, Paul; Li, Dongmei
2013-01-01
The draft genome sequence of Thermotoga maritima A7A was obtained from a metagenomic assembly obtained from a high-temperature hydrocarbon reservoir in the Gippsland Basin, Australia. The organism is predicted to be a motile anaerobe with an array of catabolic enzymes for the degradation of numerous carbohydrates. PMID:24009120
O'Hair, Joshua A.; Li, Hui; Thapa, Santosh; Scholz, Matthew B.
2017-01-01
Novel cellulolytic microorganisms can potentially influence second-generation biofuel production. This paper reports the draft genome sequence of Bacillus licheniformis strain YNP1-TSU, isolated from hydrothermal-vegetative microbiomes inside Yellowstone National Park. The assembled sequence contigs predicted 4,230 coding genes, 66 tRNAs, and 10 rRNAs through automated annotation. PMID:28254968
Singh, Deeksha; Chandrababunaidu, Mathu Malar; Panda, Arijit; Sen, Diya; Bhattacharyya, Sourav
2015-01-01
The draft genome assembly of Hassallia byssoidea strain VB512170 with a genome size of ~13 Mb and 10,183 protein-coding genes in 62 scaffolds is reported here for the first time. This is a terrestrial hydrophobic cyanobacterium isolated from monuments in India. We report several copies of luciferase and antibiotic genes in this organism. PMID:25745001
Tanjung, Zulfikar Achmad; Aditama, Redi; Buana, Rika Fithri Nurani; Pratomo, Antonius Dony Madu; Tryono, Reno; Liwang, Tony
2018-01-01
Ganoderma boninense is the dominant fungal pathogen of basal stem rot (BSR) disease on Elaeis guineensis. We sequenced the nuclear genome of mycelia using both Illumina and Pacific Biosciences platforms for assembly of scaffolds. The draft genome comprised 79.24 Mb, 495 scaffolds, and 26,226 predicted coding sequences. PMID:29700132
Hyson, Peter; Shapiro, Joshua A; Wien, Michelle W
2015-10-08
Exiguobacterium sp. strain BMC-KP was isolated as part of a student environmental sampling project at Bryn Mawr College, PA. Sequencing of bacterial DNA assembled a 3.32-Mb draft genome. Analysis suggests the presence of genes for tolerance to cold and toxic metals, broad carbohydrate metabolism, and genes derived from phage. Copyright © 2015 Hyson et al.
Sakai-Kawada, Francis E; Yakym, Christopher J; Helmkampf, Martin; Hagiwara, Kehau; Ip, Courtney G; Antonio, Brandi J; Armstrong, Ellie; Ulloa, Wesley J; Awaya, Jonathan D
2016-09-22
We report here the 6.0-Mb draft genome assembly of Pseudoalteromonas luteoviolacea strain IPB1 that was isolated from the Hawaiian marine sponge Iotrochota protea. Genome mining complemented with bioassay studies will elucidate secondary metabolite biosynthetic pathways and will help explain the ecological interaction between host sponge and microorganism. Copyright © 2016 Sakai-Kawada et al.
Draft Genome Sequence of Escherichia coli K-12 (ATCC 10798).
Dimitrova, Daniela; Engelbrecht, Kathleen C; Putonti, Catherine; Koenig, David W; Wolfe, Alan J
2017-07-06
Here, we present the draft genome sequence of Escherichia coli ATCC 10798. E. coli ATCC 10798 is a K-12 strain, one of the most well-studied model microorganisms. The size of the genome was 4,685,496 bp, with a G+C content of 50.70%. This assembly consists of 62 contigs and the F plasmid. Copyright © 2017 Dimitrova et al.
Draft Genome Sequence of Escherichia coli K-12 (ATCC 10798)
Dimitrova, Daniela; Engelbrecht, Kathleen C.; Koenig, David W.; Wolfe, Alan J.
2017-01-01
Here, we present the draft genome sequence of Escherichia coli ATCC 10798. E. coli ATCC 10798 is a K-12 strain, one of the most well-studied model microorganisms. The size of the genome was 4,685,496 bp, with a G+C content of 50.70%. This assembly consists of 62 contigs and the F plasmid. PMID:28684574
A high-coverage draft genome of the mycalesine butterfly Bicyclus anynana.
Nowell, Reuben W; Elsworth, Ben; Oostra, Vicencio; Zwaan, Bas J; Wheat, Christopher W; Saastamoinen, Marjo; Saccheri, Ilik J; Van't Hof, Arjen E; Wasik, Bethany R; Connahs, Heidi; Aslam, Muhammad L; Kumar, Sujai; Challis, Richard J; Monteiro, Antónia; Brakefield, Paul M; Blaxter, Mark
2017-07-01
The mycalesine butterfly Bicyclus anynana, the "Squinting bush brown," is a model organism in the study of lepidopteran ecology, development, and evolution. Here, we present a draft genome sequence for B. anynana to serve as a genomics resource for current and future studies of this important model species. Seven libraries with insert sizes ranging from 350 bp to 20 kb were constructed using DNA from an inbred female and sequenced using both Illumina and PacBio technology; 128 Gb of raw Illumina data was filtered to 124 Gb and assembled to a final size of 475 Mb (∼×260 assembly coverage). Contigs were scaffolded using mate-pair, transcriptome, and PacBio data into 10 800 sequences with an N50 of 638 kb (longest scaffold 5 Mb). The genome is comprised of 26% repetitive elements and encodes a total of 22 642 predicted protein-coding genes. Recovery of a BUSCO set of core metazoan genes was almost complete (98%). Overall, these metrics compare well with other recently published lepidopteran genomes. We report a high-quality draft genome sequence for Bicyclus anynana. The genome assembly and annotated gene models are available at LepBase (http://ensembl.lepbase.org/index.html). © The Authors 2017. Published by Oxford University Press.
A high-coverage draft genome of the mycalesine butterfly Bicyclus anynana
Elsworth, Ben; Oostra, Vicencio; Zwaan, Bas J.; Wheat, Christopher W.; Saastamoinen, Marjo; Saccheri, Ilik J.; van’t Hof, Arjen E.; Wasik, Bethany R.; Connahs, Heidi; Aslam, Muhammad L.; Kumar, Sujai; Challis, Richard J.; Monteiro, Antónia; Brakefield, Paul M.
2017-01-01
The mycalesine butterfly Bicyclus anynana, the “Squinting bush brown,” is a model organism in the study of lepidopteran ecology, development, and evolution. Here, we present a draft genome sequence for B. anynana to serve as a genomics resource for current and future studies of this important model species. Seven libraries with insert sizes ranging from 350 bp to 20 kb were constructed using DNA from an inbred female and sequenced using both Illumina and PacBio technology; 128 Gb of raw Illumina data was filtered to 124 Gb and assembled to a final size of 475 Mb (∼×260 assembly coverage). Contigs were scaffolded using mate-pair, transcriptome, and PacBio data into 10 800 sequences with an N50 of 638 kb (longest scaffold 5 Mb). The genome is comprised of 26% repetitive elements and encodes a total of 22 642 predicted protein-coding genes. Recovery of a BUSCO set of core metazoan genes was almost complete (98%). Overall, these metrics compare well with other recently published lepidopteran genomes. We report a high-quality draft genome sequence for Bicyclus anynana. The genome assembly and annotated gene models are available at LepBase (http://ensembl.lepbase.org/index.html). PMID:28486658
Melo, Ricardo Rodrigues de; Persinoti, Gabriela Felix; Paixão, Douglas Antonio Alvaredo; Squina, Fábio Márcio; Ruller, Roberto; Sato, Helia Harumi
Here, we show the draft genome sequence of Streptomyces sp. F1, a strain isolated from soil with great potential for secretion of hydrolytic enzymes used to deconstruct cellulosic biomass. The draft genome assembly of Streptomyces sp. strain F1 has 69 contigs with a total genome size of 8,142,296 bp and a G+C content of 72.65%. Preliminary genome analysis identified 175 proteins as Carbohydrate-Active Enzymes, including 85 glycoside hydrolases organized in 33 distinct families. This draft genome information provides new insights into the key genes encoding hydrolytic enzymes involved in biomass deconstruction employed by soil bacteria. Copyright © 2017 Sociedade Brasileira de Microbiologia. Published by Elsevier Editora Ltda. All rights reserved.
Gschloessl, B; Dorkeld, F; Berges, H; Beydon, G; Bouchez, O; Branco, M; Bretaudeau, A; Burban, C; Dubois, E; Gauthier, P; Lhuillier, E; Nichols, J; Nidelet, S; Rocha, S; Sauné, L; Streiff, R; Gautier, M; Kerdelhué, C
2018-05-01
The pine processionary moth Thaumetopoea pityocampa (Lepidoptera: Notodontidae) is the main pine defoliator in the Mediterranean region. Its urticating larvae cause severe human and animal health concerns in the invaded areas. This species shows a high phenotypic variability for various traits, such as phenology, fecundity and tolerance to extreme temperatures. This study presents the construction and analysis of extensive genomic and transcriptomic resources, which are an obligate prerequisite to understand their underlying genetic architecture. Using a well-studied population from Portugal with peculiar phenological characteristics, the karyotype was first determined and a first draft genome of 537 Mb total length was assembled into 68,292 scaffolds (N50 = 164 kb). From this genome assembly, 29,415 coding genes were predicted. To circumvent some limitations for fine-scale physical mapping of genomic regions of interest, a 3X coverage BAC library was also developed. In particular, 11 BACs from this library were individually sequenced to assess the assembly quality. Additionally, de novo transcriptomic resources were generated from various developmental stages sequenced with HiSeq and MiSeq Illumina technologies. The reads were de novo assembled into 62,376 and 63,175 transcripts, respectively. Then, a robust subset of the genome-predicted coding genes, the de novo transcriptome assemblies and previously published 454/Sanger data were clustered to obtain a high-quality and comprehensive reference transcriptome consisting of 29,701 bona fide unigenes. These sequences covered 99% of the CEGMA and 88% of the BUSCO highly conserved eukaryotic genes and 84% of the BUSCO arthropod gene set. Moreover, 90% of these transcripts could be localized on the draft genome. The described information is available via a genome annotation portal (http://bipaa.genouest.org/sp/thaumetopoea_pityocampa/). © 2018 John Wiley & Sons Ltd.
USDA-ARS's Scientific Manuscript database
The genome of the cattle tick R. microplus, an ectoparasite with global distribution, is estimated to be 7.1 Gbp and consists of ~70% repetitive DNA. We report the first assembly of a tick genome that utilized a hybrid sequencing and assembly approach to capture the repetitive fractions of the genom...
USDA-ARS's Scientific Manuscript database
The current pig reference genome sequence (Sscrofa10.2) was established using Sanger sequencing and following the clone-by-clone hierarchical shotgun sequencing approach used in the public human genome project. However, as sequence coverage was low (4-6x) the resulting assembly was only of draft qua...
The genome of black cottonwood, Populus trichocarpa (Torr. & Gray)
G.A. Tuskan; S. DiFazio; S. Jansson; J. Bohlmann; I. Grigoriev; U. Hellsten; N. Putnam; S. Ralph; S. Rombauts; A. Salamov; J. Schein; L. Sterck; A. Aerts; R.R. Bhalerao; R.P. Bhalerao; D. Blaudez; W. Boerjan; A. Brun; A. Brunner; V. Busov; M. Campbell; J. Carlson; M. Chalot; J. Chapman; G.-L. Chen; D. Cooper; P.M. Coutinho; J. Couturier; S. Covert; Q. Cronk; R. Cunningham; J. Davis; S. Degroeve; A. Dejardin; C. dePamphilis; J. Detter; B. Dirks; U. Dubchak; S. Duplessis; J. Ehlting; B. Ellis; K. Gendler; D. Goodstein; M. Gribskov; J. Grimwood; A. Groover; L. Gunter; B. Hamberger; B. Heinze; Y. Helariutta; B. Henrissat; D. Holligan; R. Holt; W. Huang; N. Islam-Faridi; S. Jones; M. Jones-Rhoades; R. Jorgensen; C. Joshi; J. Kangasjarvi; J. Karlsson; C. Kelleher; R. Kirkpatrick; M. Kirst; A. Kohler; U. Kalluri; F. Larimer; J. Leebens-Mack; J.-C. Leple; P. Locascio; Y. Lou; S. Lucas; F. Martin; B. Montanini; C. Napoli; D.R. Nelson; C. Nelson; K. Nieminen; O. Nilsson; V. Pereda; G. Peter; R. Philippe; G. Pilate; A. Poliakov; J. Razumovskaya; P. Richardson; C. Rinaldi; K. Ritland; P. Rouze; D. Ryaboy; J. Schumtz; J. Schrader; B. Segerman; H. Shin; A. Siddiqui; F. Sterky; A. Terry; C.-J. Tsai; E. Uberbacher; P. Unneberg; J. Vahala; K. Wall; S. Wessler; G. Yang; T. Yin; C. Douglas; M. Marra; G. Sandberg; Y. Van de Peer; D. Rokhsar
2006-01-01
We report the draft genome of the black cottonwood tree, Populus trichocarpa. Integration of shotgun sequence assembly with genetic mapping enabled chromosome-scale reconstruction of the genome. More than 45,000 putative protein-coding genes were identified. Analysis of the assembled genome revealed a whole-genome duplication event; about 8000 pairs...
Riveros-Mckay, Fernando; Campos, Itzia; Giles-Gómez, Martha; Bolívar, Francisco
2014-01-01
Leuconostoc mesenteroides P45 was isolated from the traditional Mexican pulque beverage. We report its draft genome sequence, assembled in 6 contigs consisting of 1,874,188 bp and no plasmids. Genome annotation predicted a total of 1,800 genes, 1,687 coding sequences, 52 pseudogenes, 9 rRNAs, 51 tRNAs, 1 noncoding RNA, and 44 frameshifted genes. PMID:25377708
Singh, Deeksha; Chandrababunaidu, Mathu Malar; Panda, Arijit; Sen, Diya; Bhattacharyya, Sourav; Adhikary, Siba Prasad; Tripathy, Sucheta
2015-03-05
The draft genome assembly of Hassallia byssoidea strain VB512170 with a genome size of ~13 Mb and 10,183 protein-coding genes in 62 scaffolds is reported here for the first time. This is a terrestrial hydrophobic cyanobacterium isolated from monuments in India. We report several copies of luciferase and antibiotic genes in this organism. Copyright © 2015 Singh et al.
Moreno-Avitia, Fabian; Lozano, Luis; Utrilla, Jose; Bolívar, Francisco; Escalante, Adelfo
2017-06-08
Pseudomonas chlororaphis strain ATCC 9446 is a biocontrol-related organism. We report here its draft genome sequence assembled into 35 contigs consisting of 6,783,030 bp. Genome annotation predicted a total of 6,200 genes, 6,128 coding sequences, 81 pseudogenes, 58 tRNAs, 4 noncoding RNAs (ncRNAs), and 41 frameshifted genes. Copyright © 2017 Moreno-Avitia et al.
Ortiz, Elio M.; Berretta, Marcelo F.; Benintende, Graciela B.; Zandomeni, Rubén O.
2015-01-01
Geobacillus sp. isolate T6 was collected from a thermal spring in Salta, Argentina. The draft genome sequence (3,767,773 bp) of this isolate is represented by one major scaffold of 3.46 Mbp, a second one of 207 kbp, and 20 scaffolds of <13 kbp. The assembled sequences revealed 3,919 protein-coding genes. PMID:26184933
Utomo, Condro; Tanjung, Zulfikar Achmad; Aditama, Redi; Buana, Rika Fithri Nurani; Pratomo, Antonius Dony Madu; Tryono, Reno; Liwang, Tony
2018-04-26
Ganoderma boninense is the dominant fungal pathogen of basal stem rot (BSR) disease on Elaeis guineensis. We sequenced the nuclear genome of mycelia using both Illumina and Pacific Biosciences platforms for assembly of scaffolds. The draft genome comprised 79.24 Mb, 495 scaffolds, and 26,226 predicted coding sequences. Copyright © 2018 Utomo et al.
Nasser, Kother; Mustafa, Abu Salim; Khan, Mohd Wasif; Purohit, Prashant; Al-Obaid, Inaam; Dhar, Rita; Al-Fouzan, Wadha
2018-04-19
Acinetobacter baumannii is an important opportunistic pathogen in global health care settings. Its dissemination and multidrug resistance pose an issue with treatment and outbreak control. Here, we present draft genome assemblies of six multidrug-resistant clinical strains of A. baumannii isolated from patients admitted to one of two major hospitals in Kuwait. Copyright © 2018 Nasser et al.
Pilkington, Sarah M; Crowhurst, Ross; Hilario, Elena; Nardozza, Simona; Fraser, Lena; Peng, Yongyan; Gunaseelan, Kularajathevan; Simpson, Robert; Tahir, Jibran; Deroles, Simon C; Templeton, Kerry; Luo, Zhiwei; Davy, Marcus; Cheng, Canhong; McNeilage, Mark; Scaglione, Davide; Liu, Yifei; Zhang, Qiong; Datson, Paul; De Silva, Nihal; Gardiner, Susan E; Bassett, Heather; Chagné, David; McCallum, John; Dzierzon, Helge; Deng, Cecilia; Wang, Yen-Yi; Barron, Lorna; Manako, Kelvina; Bowen, Judith; Foster, Toshi M; Erridge, Zoe A; Tiffin, Heather; Waite, Chethi N; Davies, Kevin M; Grierson, Ella P; Laing, William A; Kirk, Rebecca; Chen, Xiuyin; Wood, Marion; Montefiori, Mirco; Brummell, David A; Schwinn, Kathy E; Catanach, Andrew; Fullerton, Christina; Li, Dawei; Meiyalaghan, Sathiyamoorthy; Nieuwenhuizen, Niels; Read, Nicola; Prakash, Roneel; Hunter, Don; Zhang, Huaibi; McKenzie, Marian; Knäbel, Mareike; Harris, Alastair; Allan, Andrew C; Gleave, Andrew; Chen, Angela; Janssen, Bart J; Plunkett, Blue; Ampomah-Dwamena, Charles; Voogd, Charlotte; Leif, Davin; Lafferty, Declan; Souleyre, Edwige J F; Varkonyi-Gasic, Erika; Gambi, Francesco; Hanley, Jenny; Yao, Jia-Long; Cheung, Joey; David, Karine M; Warren, Ben; Marsh, Ken; Snowden, Kimberley C; Lin-Wang, Kui; Brian, Lara; Martinez-Sanchez, Marcela; Wang, Mindy; Ileperuma, Nadeesha; Macnee, Nikolai; Campin, Robert; McAtee, Peter; Drummond, Revel S M; Espley, Richard V; Ireland, Hilary S; Wu, Rongmei; Atkinson, Ross G; Karunairetnam, Sakuntala; Bulley, Sean; Chunkath, Shayhan; Hanley, Zac; Storey, Roy; Thrimawithana, Amali H; Thomson, Susan; David, Charles; Testolin, Raffaele; Huang, Hongwen; Hellens, Roger P; Schaffer, Robert J
2018-04-16
Most published genome sequences are drafts, and most are dominated by computational gene prediction. Draft genomes typically incorporate considerable sequence data that are not assigned to chromosomes, and predicted genes without quality confidence measures. The current Actinidia chinensis (kiwifruit) 'Hongyang' draft genome has 164 Mb of sequences unassigned to pseudo-chromosomes, and omissions have been identified in the gene models. A second genome of an A. chinensis (genotype Red5) was fully sequenced. This new sequence resulted in a 554.0 Mb assembly with all but 6 Mb assigned to pseudo-chromosomes. Pseudo-chromosomal comparisons showed a considerable number of translocation events have occurred following a whole genome duplication (WGD) event some consistent with centromeric Robertsonian-like translocations. RNA sequencing data from 12 tissues and ab initio analysis informed a genome-wide manual annotation, using the WebApollo tool. In total, 33,044 gene loci represented by 33,123 isoforms were identified, named and tagged for quality of evidential support. Of these 3114 (9.4%) were identical to a protein within 'Hongyang' The Kiwifruit Information Resource (KIR v2). Some proportion of the differences will be varietal polymorphisms. However, as most computationally predicted Red5 models required manual re-annotation this proportion is expected to be small. The quality of the new gene models was tested by fully sequencing 550 cloned 'Hort16A' cDNAs and comparing with the predicted protein models for Red5 and both the original 'Hongyang' assembly and the revised annotation from KIR v2. Only 48.9% and 63.5% of the cDNAs had a match with 90% identity or better to the original and revised 'Hongyang' annotation, respectively, compared with 90.9% to the Red5 models. Our study highlights the need to take a cautious approach to draft genomes and computationally predicted genes. 
Our use of the manual annotation tool WebApollo facilitated manual checking and correction of gene models enabling improvement of computational prediction. This utility was especially relevant for certain types of gene families such as the EXPANSIN like genes. Finally, this high quality gene set will supply the kiwifruit and general plant community with a new tool for genomics and other comparative analysis.
Moura, Quézia; Fernandes, Miriam R; Cerdeira, Louise; Santos, Ana Carolina M; de Souza, Tiago A; Ienne, Susan; Pignatari, Antonio Carlos C; Gales, Ana C; Silva, Rosa M; Lincopan, Nilton
2017-09-01
Here we report the draft genome sequence of a multidrug-resistant (MDR) Aeromonas hydrophila strain belonging to sequence type 508 (ST508) isolated from a human bloodstream infection. Assembly and annotation of this draft genome resulted in 5,028,498 bp and revealed the presence of 16S rRNA methylase rmtD and blaCTX-M-131 genes encoding high-level resistance to aminoglycosides and cephalosporins, respectively, as well as multiple virulence genes. This draft genome can provide significant information for understanding mechanisms on the establishment and treatment of infections caused by this pathogen. Copyright © 2017 International Society for Chemotherapy of Infection and Cancer. Published by Elsevier Ltd. All rights reserved.
Chiriac, Cecilia; Baricz, Andreea
2018-01-01
The draft genome assembly of Janthinobacterium sp. strain ROICE36 has 207 contigs, with a total genome size of 5,977,006 bp and a G+C content of 62%. Preliminary genome analysis identified 5,363 protein-coding genes and a total of 7 secondary metabolic gene clusters (encoding bacteriocins, nonribosomal peptide-synthetase [NRPS], terpene, hserlactone, and other ketide synthases). PMID:29650588
Kim, Jung A; Jeon, Jongbum; Kim, Ki-Tae; Choi, Gobong; Park, Sook-Young; Lee, Hyun-Jung; Shim, Sang-Hee; Lee, Yong-Hwan; Kim, Soonok
2017-08-03
An endophytic fungus, Gaeumannomyces sp. strain JS-464, is capable of producing a number of secondary metabolites which showed significant nitric oxide reduction activity. The draft genome assembly has a size of 53,151,282 bp, with a G+C content of 53.11%, consisting of 80 scaffolds with an N50 of 7.46 Mbp. Copyright © 2017 Kim et al.
Auffret, Pauline; Segura, Audrey; Klopp, Christophe; Bouchez, Olivier; Kérourédan, Monique; Bibbal, Delphine; Brugère, Hubert; Forano, Evelyne
2017-01-01
Enterohemorrhagic Escherichia coli (EHEC) with serotype O157:H7 is a major foodborne pathogen. Here, we report the draft genome sequence of EHEC O157:H7 strain MC2 isolated from cattle in France. The assembly contains 5,400,376 bp that encoded 5,914 predicted genes (5,805 protein-encoding genes and 109 RNA genes). PMID:28983004
Draft genome sequence of field isolate Brucella melitensis strain 2007BM/1 from India.
Singh, D K; Kumar, Bablu; Shrinet, Garima; Singh, R P; Das, Aparajita; Mantur, B G; Abhishek; Pandey, Aruna; Mondal, Piyali; Sajjanar, B K; Doimari, Soni; Singh, Vijayata; Kumari, Reena; Tiwari, A K; Gandham, Ravi Kumar
2018-04-21
Brucellosis is one of the most widespread and important global zoonotic diseases and is endemic in many parts of India. Brucella melitensis is considered to be the most pathogenic species for humans. Here we report the draft genome sequence of B. melitensis strain 2007BM/1 isolated from a human in India. Genomic DNA was extracted from Brucella culture and was sequenced using an Illumina MiSeq platform. The generated reads were assembled using three de novo assemblers and the draft genome was annotated. This monoisolate, with a genome length of 3,268,756 bp, was found to be resistant to azithromycin and trimethoprim/sulfamethoxazole but susceptible to tetracycline, ofloxacin, rifampicin, ciprofloxacin and doxycycline. The presence of virulence genes in the strain was identified. The results obtained will help in understanding drug resistance mechanisms and virulence factors in highly zoonotic B. melitensis and suggest the need for judicious use of antibiotics in livestock health and management practices. Copyright © 2018 International Society for Chemotherapy of Infection and Cancer. Published by Elsevier Ltd. All rights reserved.
A draft annotation and overview of the human genome
Wright, Fred A; Lemon, William J; Zhao, Wei D; Sears, Russell; Zhuo, Degen; Wang, Jian-Ping; Yang, Hee-Yung; Baer, Troy; Stredney, Don; Spitzner, Joe; Stutz, Al; Krahe, Ralf; Yuan, Bo
2001-01-01
Background The recent draft assembly of the human genome provides a unified basis for describing genomic structure and function. The draft is sufficiently accurate to provide useful annotation, enabling direct observations of previously inferred biological phenomena. Results We report here a functionally annotated human gene index placed directly on the genome. The index is based on the integration of public transcript, protein, and mapping information, supplemented with computational prediction. We describe numerous global features of the genome and examine the relationship of various genetic maps with the assembly. In addition, initial sequence analysis reveals highly ordered chromosomal landscapes associated with paralogous gene clusters and distinct functional compartments. Finally, these annotation data were synthesized to produce observations of gene density and number that accord well with historical estimates. Such a global approach had previously been described only for chromosomes 21 and 22, which together account for 2.2% of the genome. Conclusions We estimate that the genome contains 65,000-75,000 transcriptional units, with exon sequences comprising 4%. The creation of a comprehensive gene index requires the synthesis of all available computational and experimental evidence. PMID:11516338
Zhang, Yunzeng; Barthe, Gary; Grosser, Jude W; Wang, Nian
2016-07-08
Citrus blight is a citrus tree overall decline disease and causes serious losses in the citrus industry worldwide. Although it was described more than one hundred years ago, its causal agent remains unknown and its pathophysiology is not well determined, which hampers our understanding of the disease and design of suitable disease management. In this study, we sequenced and assembled the draft genome for Swingle citrumelo, one important citrus rootstock. The draft genome is approximately 280 Mb, which covers 74% of the estimated Swingle citrumelo genome, and the average coverage is around 15X. The draft genome of Swingle citrumelo enabled us to conduct transcriptome analysis of roots of blight and healthy Swingle citrumelo using RNA-seq. The RNA-seq was reliable as evidenced by the high consistency of RNA-seq analysis and quantitative reverse transcription PCR results (R² = 0.966). Comparison of the gene expression profiles between blight and healthy root samples revealed the molecular mechanism underneath the characteristic blight phenotypes including decline, starch accumulation, and drought stress. The JA and ET biosynthesis and signaling pathways showed decreased transcript abundance, whereas SA-mediated defense-related genes showed increased transcript abundance in blight trees, suggesting that an unclassified biotrophic pathogen was involved in this disease. Overall, the Swingle citrumelo draft genome generated in this study will advance our understanding of plant biology and contribute to the citrus breeding. Transcriptome analysis of blight and healthy trees deepened our understanding of the pathophysiology of citrus blight.
Bhattacharyya, Anamitra; Stilwagen, Stephanie; Reznik, Gary; Feil, Helene; Feil, William S; Anderson, Iain; Bernal, Axel; D'Souza, Mark; Ivanova, Natalia; Kapatral, Vinayak; Larsen, Niels; Los, Tamara; Lykidis, Athanasios; Selkov, Eugene; Walunas, Theresa L; Purcell, Alexander; Edwards, Rob A; Hawkins, Trevor; Haselkorn, Robert; Overbeek, Ross; Kyrpides, Nikos C; Predki, Paul F
2002-10-01
Draft sequencing is a rapid and efficient method for determining the near-complete sequence of microbial genomes. Here we report a comparative analysis of one complete and two draft genome sequences of the phytopathogenic bacterium, Xylella fastidiosa, which causes serious disease in plants, including citrus, almond, and oleander. We present highlights of an in silico analysis based on a comparison of reconstructions of core biological subsystems. Cellular pathway reconstructions have been used to identify a small number of genes, which are likely to reside within the draft genomes but are not captured in the draft assembly. These represented only a small fraction of all genes and were predominantly large and small ribosomal subunit protein components. By using this approach, some of the inherent limitations of draft sequence can be significantly reduced. Despite the incomplete nature of the draft genomes, it is possible to identify several phage-related genes, which appear to be absent from the draft genomes and not the result of insufficient sequence sampling. This region may therefore identify potential host-specific functions. Based on this first functional reconstruction of a phytopathogenic microbe, we spotlight an unusual respiration machinery as a potential target for biological control. We also predicted and developed a new defined growth medium for Xylella.
Riveros-Mckay, Fernando; Campos, Itzia; Giles-Gómez, Martha; Bolívar, Francisco; Escalante, Adelfo
2014-11-06
Leuconostoc mesenteroides P45 was isolated from the traditional Mexican pulque beverage. We report its draft genome sequence, assembled in 6 contigs consisting of 1,874,188 bp and no plasmids. Genome annotation predicted a total of 1,800 genes, 1,687 coding sequences, 52 pseudogenes, 9 rRNAs, 51 tRNAs, 1 noncoding RNA, and 44 frameshifted genes. Copyright © 2014 Riveros-Mckay et al.
Peacekeeper Ballistic Missile System Fiscal Impact Analysis of Deployment in Wyoming and Nebraska
1984-05-01
12 Cheyenne Land Use 1-13 Laramie County Economic and Demographic Data 1-15 Rural Wyoming Counties 1-15 Nebraska Counties 1-25 Colorado Counties 1-25...baseline data for the study, and prepared draft materials, is acknowledged gratefully. William Eldred assisted willingly in data preparation and drafting...Assembly and Check-out (A&CO) of missile components and support equipment, the operational startup of the Peacekeeper system and transition to its
Ross, Daniel E; Marshall, Christopher W; May, Harold D; Norman, R Sean
2017-09-07
Draft genome sequences of Acetobacterium sp. strain MES1 and Desulfovibrio sp. strain MES5 were obtained from the metagenome of a cathode-associated community enriched within a microbial electrosynthesis system (MES). The draft genome sequences provide insight into the functional potential of these microorganisms within an MES and a foundation for future comparative analyses. Copyright © 2017 Ross et al.
Urasaki, Naoya; Takagi, Hiroki; Natsume, Satoshi; Uemura, Aiko; Taniai, Naoki; Miyagi, Norimichi; Fukushima, Mai; Suzuki, Shouta; Tarora, Kazuhiko; Tamaki, Moritoshi; Sakamoto, Moriaki; Terauchi, Ryohei; Matsumura, Hideo
2017-02-01
Bitter gourd (Momordica charantia) is an important vegetable and medicinal plant in tropical and subtropical regions globally. In this study, the draft genome sequence of a monoecious bitter gourd inbred line, OHB3-1, was analyzed. Through Illumina sequencing and de novo assembly, scaffolds of 285.5 Mb in length were generated, corresponding to ∼84% of the estimated genome size of bitter gourd (339 Mb). In this draft genome sequence, 45,859 protein-coding gene loci were identified, and transposable elements accounted for 15.3% of the whole genome. According to synteny mapping and phylogenetic analysis of conserved genes, bitter gourd was more related to watermelon (Citrullus lanatus) than to cucumber (Cucumis sativus) or melon (C. melo). Using RAD-seq analysis, 1507 marker loci were genotyped in an F2 progeny of two bitter gourd lines, resulting in an improved linkage map, comprising 11 linkage groups. By anchoring RAD tag markers, 255 scaffolds were assigned to the linkage map. Comparative analysis of genome sequences and predicted genes determined that putative trypsin-inhibitor and ribosome-inactivating genes were distinctive in the bitter gourd genome. These genes could characterize the bitter gourd as a medicinal plant. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
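The "∼84% of the estimated genome size" figure above is the ratio of assembled scaffold length to the estimated genome size. A short sketch of that arithmetic follows, using the two values quoted in the abstract; the function name `assembled_percent` is hypothetical and chosen only for illustration.

```python
def assembled_percent(assembly_mb: float, estimated_genome_mb: float) -> float:
    """Percentage of the estimated genome represented by the assembly.

    Hypothetical helper; both arguments must use the same unit (Mb here).
    """
    if estimated_genome_mb <= 0:
        raise ValueError("estimated genome size must be positive")
    return 100.0 * assembly_mb / estimated_genome_mb


if __name__ == "__main__":
    # Values quoted in the abstract: 285.5 Mb of scaffolds, 339 Mb estimated.
    print(f"{assembled_percent(285.5, 339.0):.1f}%")
```

The ratio describes how much sequence was assembled, not how correct or contiguous it is, so it is usually read alongside contiguity and gene-recovery metrics.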
Ferreira, Dalila Souza Santos; Kato, Rodrigo Bentes; Miranda, Fábio Malcher; da Costa Pinheiro, Kenny; Fonseca, Paula Luize Camargos; Tomé, Luiz Marcelo Ribeiro; Vaz, Aline Bruna Martins; Badotti, Fernanda; Ramos, Rommel Thiago Jucá; Brenig, Bertram; Azevedo, Vasco Ariston de Carvalho; Benevides, Raquel Guimarães; Góes-Neto, Aristóteles
2018-06-01
Herein, we present the draft genome of Trametes villosa isolate CCMB561, a wood-decaying Basidiomycota commonly found in tropical semiarid climate. The genome assembly was 57.98 Mb in size with an L50 of 691. A total of 16,711 putative protein-encoding genes was predicted, including 590 genes coding for carbohydrate-active enzymes (CAZy), directly involved in the decomposition of lignocellulosic materials. This is the first genome of this species of high interest in bioenergy research. The draft genome of Trametes villosa isolate CCMB561 will provide an important resource for future investigations in biofuel production, bioremediation and other green technologies.
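The L50 quoted above, and the N50 values quoted in several neighbouring abstracts, are contiguity statistics derived from the sorted sequence lengths. The sketch below follows the common definition (N50 is the length of the shortest sequence in the smallest set of longest sequences that together cover at least half the assembly; L50 is the number of sequences in that set). The function name `n50_l50` is an assumption for illustration, and individual assemblers may apply slightly different conventions, such as minimum-length cutoffs.

```python
from typing import Sequence, Tuple


def n50_l50(lengths: Sequence[int]) -> Tuple[int, int]:
    """Return (N50, L50) for a collection of contig or scaffold lengths.

    Hypothetical helper using the common definition: walk the lengths from
    longest to shortest until the running total reaches half the assembly.
    """
    if not lengths:
        raise ValueError("at least one sequence length is required")
    if any(length <= 0 for length in lengths):
        raise ValueError("sequence lengths must be positive")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half_total:
            return length, count
    # Unreachable: the running total always reaches the full sum.
    raise RuntimeError("running total never reached half the assembly")


if __name__ == "__main__":
    # Toy lengths (arbitrary units), not taken from any assembly above.
    n50, l50 = n50_l50([80, 70, 50, 40, 30, 20, 10])
    print(n50, l50)
```

A higher N50 and a lower L50 both indicate a more contiguous assembly, which is why an L50 of 691 for a 57.98-Mb genome signals a fairly fragmented draft.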
Jo, Jihoon; Oh, Jooseong; Lee, Hyun-Gwan; Hong, Hyun-Hee; Lee, Sung-Gwon; Cheon, Seongmin; Kern, Elizabeth M A; Jin, Soyeong; Cho, Sung-Jin; Park, Joong-Ki; Park, Chungoo
2017-01-01
The Japanese sea cucumber (Apostichopus japonicus Selenka 1867) is an economically important species as a source of seafood and ingredient in traditional medicine. It is mainly found off the coasts of northeast Asia. Recently, substantial exploitation and widespread biotic diseases in A. japonicus have generated increasing conservation concern. However, the genomic knowledge base and resources available for researchers to use in managing this natural resource and to establish genetically based breeding systems for sea cucumber aquaculture are still in a nascent stage. A total of 312 Gb of raw sequences were generated using the Illumina HiSeq 2000 platform and assembled to a final size of 0.66 Gb, which is about 80.5% of the estimated genome size (0.82 Gb). We observed nucleotide-level heterozygosity within the assembled genome to be 0.986%. The resulting draft genome assembly comprising 132 607 scaffolds with an N50 value of 10.5 kb contains a total of 21 771 predicted protein-coding genes. We identified 6.6-14.5 million heterozygous single nucleotide polymorphisms in the assembled genome of the three natural color variants (green, red, and black), resulting in an estimated nucleotide diversity of 0.00146. We report the first draft genome of A. japonicus and provide a general overview of the genetic variation in the three major color variants of A. japonicus. These data will help provide a comprehensive view of the genetic, physiological, and evolutionary relationships among color variants in A. japonicus, and will be invaluable resources for sea cucumber genomic research. © The Author 2017. Published by Oxford University Press.
Draft genome of tule elk Cervus canadensis nannodes.
Mizzi, Jessica E; Lounsberry, Zachary T; Brown, C Titus; Sacks, Benjamin N
2017-01-01
This paper presents the first draft genome of the tule elk (Cervus elaphus nannodes), a subspecies native to California that underwent an extreme genetic bottleneck in the late 1800s. The genome was generated from Illumina HiSeq 3000 whole genome sequencing of four individuals, resulting in the assembly of 2.395 billion base pairs (Gbp) over 602,862 contigs over 500 bp and N50 = 6,885 bp. This genome provides a resource to facilitate future genomic research on elk and other cervids.
Garza-Ramos, Ulises; Tamayo-Legorreta, Elsa; Arellano-Quintanilla, Doris María; Rodriguez-Medina, Nadia; Silva-Sanchez, Jesús; Catalan-Najera, Juan; Rocha-Martínez, Marisol Karina; Bravo-Díaz, María Asunción
2018-01-01
A colistin-resistant mcr-1-carrying Escherichia coli strain, RC2-007, was isolated from a swine farm in Mexico. This extraintestinal and uropathogenic strain of E. coli belongs to serotype O89:H9 and sequence type 744. Assembly and annotation resulted in a 4.9-Mb draft genome that revealed the presence of plasmid-mediated mcr-1-ISApI1 genes as part of a prophage. PMID:29519827
Mets, David G; Brainard, Michael S
2018-01-01
Abstract Background Vocal learning in songbirds has emerged as a powerful model for sensorimotor learning. Neurobehavioral studies of Bengalese finch (Lonchura striata domestica) song, naturally more variable and plastic than songs of other finch species, have demonstrated the importance of behavioral variability for initial learning, maintenance, and plasticity of vocalizations. However, the molecular and genetic underpinnings of this variability and the learning it supports are poorly understood. Findings To establish a platform for the molecular analysis of behavioral variability and plasticity, we generated an initial draft assembly of the Bengalese finch genome from a single male animal to 151× coverage and an N50 of 3.0 MB. Furthermore, we developed an initial set of gene models using RNA-seq data from 8 samples that comprise liver, muscle, cerebellum, brainstem/midbrain, and forebrain tissue from juvenile and adult Bengalese finches of both sexes. Conclusions We provide a draft Bengalese finch genome and gene annotation to facilitate the study of the molecular-genetic influences on behavioral variability and the process of vocal learning. These data will directly support many avenues for the identification of genes involved in learning, including differential expression analysis, comparative genomic analysis (through comparison to existing avian genome assemblies), and derivation of genetic maps for linkage analysis. Bengalese finch gene models and sequences will be essential for subsequent manipulation (molecular or genetic) of genes and gene products, enabling novel mechanistic investigations into the role of variability in learned behavior. PMID:29618046
Colquitt, Bradley M; Mets, David G; Brainard, Michael S
2018-03-01
Vocal learning in songbirds has emerged as a powerful model for sensorimotor learning. Neurobehavioral studies of Bengalese finch (Lonchura striata domestica) song, naturally more variable and plastic than songs of other finch species, have demonstrated the importance of behavioral variability for initial learning, maintenance, and plasticity of vocalizations. However, the molecular and genetic underpinnings of this variability and the learning it supports are poorly understood. To establish a platform for the molecular analysis of behavioral variability and plasticity, we generated an initial draft assembly of the Bengalese finch genome from a single male animal to 151× coverage and an N50 of 3.0 Mb. Furthermore, we developed an initial set of gene models using RNA-seq data from 8 samples that comprise liver, muscle, cerebellum, brainstem/midbrain, and forebrain tissue from juvenile and adult Bengalese finches of both sexes. We provide a draft Bengalese finch genome and gene annotation to facilitate the study of the molecular-genetic influences on behavioral variability and the process of vocal learning. These data will directly support many avenues for the identification of genes involved in learning, including differential expression analysis, comparative genomic analysis (through comparison to existing avian genome assemblies), and derivation of genetic maps for linkage analysis. Bengalese finch gene models and sequences will be essential for subsequent manipulation (molecular or genetic) of genes and gene products, enabling novel mechanistic investigations into the role of variability in learned behavior.
Sharma, Monikankana; Dasappa, S
2017-12-15
Cooking with biomass fuel is common practice in rural India, and about 700 million people use traditional stoves to meet their energy demand. However, the thermal and combustion efficiencies of these stoves are very low, leading to inefficient use of biomass and significant indoor air pollution. Research has, however, led to the development of improved stoves, viz., natural draft and forced draft, for both domestic and large-scale cooking applications, and the government is trying to promote them. Forced draft stoves using processed biomass fuels (pellets) have received more prominence due to their superior performance; however, higher initial cost and limited fuel distribution networks remain the key challenges. Improved natural draft stoves have also gained attention for being relatively inexpensive, and they are more likely to reach rural households. In this paper, we examine the environmental benefits of improved stoves for two scenarios: traditional stoves are replaced (i) by improved natural draft stoves and (ii) by improved natural draft as well as forced draft stoves. In the best-case scenario (case ii), i.e., shifting 111 million households that currently use wood to forced draft stoves and another 45 million households that depend on dung cake and agro residues to improved natural draft stoves, the achievable emission reductions are as follows: particulate matter (PM) 875 kT, black carbon (BC) 229 kT, organic carbon (OC) 525 kT, methane (CH4) 1178 kT, and non-methane hydrocarbons (NMHC) 564 kT. With the promotion of only natural draft improved stoves, the total reductions would be ∼12% lower than with the combined promotion. The CO2 equivalent reduction is estimated to be ∼70-80 MT per year. Copyright © 2017 Elsevier Ltd. All rights reserved.
Bringing the fathead minnow (Pimephales promelas) into the ...
The fathead minnow (Pimephales promelas) is a well-established ecotoxicological model organism that has been widely used for regulatory ecotoxicity testing and research for over a half century. Throughout this time, a lot of knowledge has been gained about the fathead minnow’s biological responses to various xenobiotics. However, despite its importance as a model organism, the fathead minnow still has few publicly available gene sequences. Recently, Burns et al. (2015; Environ. Toxicol. Chem. 35:212) described the sequencing and de-novo assembly of the fathead minnow genome. Two draft genome assemblies are now publicly available on the GenBank database. However, on their own the draft assemblies remain of limited use to researchers who are primarily interested in the functional units of the genome, i.e. the genes. In the present study, an annotation pipeline, consisting of gene prediction, evidence alignment, and data synthesis, was applied to the fathead minnow SOAPdenovo assembly. Ab initio gene prediction was performed using AUGUSTUS, which provided a starting point of 43,345 gene predictions. Fathead minnow Expressed Sequence Tags (ESTs) and zebrafish protein-coding sequences (CDSs) were then aligned to the assembly using the corresponding spliced alignment methods of the program Exonerate. Of the over 240,000 EST alignments, 73% were successfully aligned with 90% or greater sequence identity and query coverage. Similarly, 39% of nearly 45,000 zebrafish co
Verwaaijen, Bart; Wibberg, Daniel; Nelkner, Johanna; Gordin, Miriam; Rupp, Oliver; Winkler, Anika; Bremges, Andreas; Blom, Jochen; Grosch, Rita; Pühler, Alfred; Schlüter, Andreas
2018-02-10
Lettuce (Lactuca sativa, L.) is an important annual plant of the family Asteraceae (Compositae). The commercial lettuce cultivar Tizian has been used in various scientific studies investigating the interaction of the plant with phytopathogens or biological control agents. Here, we present the de novo draft genome sequencing and gene prediction for this specific cultivar derived from transcriptome sequence data. The assembled scaffolds amount to a size of 2.22 Gb. Based on RNAseq data, 31,112 transcript isoforms were identified. Functional predictions for these transcripts were determined within the GenDBE annotation platform. Comparison with the cv. Salinas reference genome revealed a high degree of sequence similarity on genome and transcriptome levels, with an average amino acid identity of 99%. Furthermore, it was observed that two large regions are either missing or are highly divergent within the cv. Tizian genome compared to cv. Salinas. One of these regions covers the major resistance complex 1 region of cv. Salinas. The cv. Tizian draft genome sequence provides a valuable resource for future functional and transcriptome analyses focused on this lettuce cultivar. Copyright © 2017 Elsevier B.V. All rights reserved.
Baptista, Rodrigo P; Reis-Cunha, Joao Luis; DeBarry, Jeremy D; Chiari, Egler; Kissinger, Jessica C; Bartholomeu, Daniella C; Macedo, Andrea M
2018-02-14
Next-generation sequencing (NGS) methods are low-cost high-throughput technologies that produce thousands to millions of sequence reads. Despite the high number of raw sequence reads, their short length, relative to Sanger, PacBio or Nanopore reads, complicates the assembly of genomic repeats. Many genome tools are available, but the assembly of highly repetitive genome sequences using only NGS short reads remains challenging. Genome assembly of organisms responsible for important neglected diseases such as Trypanosoma cruzi, the aetiological agent of Chagas disease, is known to be challenging because of their repetitive nature. Only three of six recognized discrete typing units (DTUs) of the parasite have their draft genomes published and therefore genome evolution analyses in the taxon are limited. In this study, we developed a computational workflow to assemble highly repetitive genomes via a combination of de novo and reference-based assembly strategies to better overcome the intrinsic limitations of each, based on Illumina reads. The highly repetitive genome of the human-infecting parasite T. cruzi 231 strain was used as a test subject. The combined-assembly approach shown in this study benefits from the reference-based assembly ability to resolve highly repetitive sequences and from the de novo capacity to assemble genome-specific regions, improving the quality of the assembly. The acceptable confidence obtained by analyzing our results showed that our combined approach is an attractive option to assemble highly repetitive genomes with NGS short reads. Phylogenomic analysis including the 231 strain, the first representative of DTU III whose genome was sequenced, was also performed and provides new insights into T. cruzi genome evolution.
HopBase: a unified resource for Humulus genomics
Hill, Steven T.; Sudarsanam, Ramcharan
2017-01-01
Abstract Hop (Humulus lupulus L. var lupulus) is a dioecious plant of worldwide significance, used primarily for bittering and flavoring in brewing beer. Studies on the medicinal properties of several unique compounds produced by hop have led to additional interest from pharmacy and healthcare industries as well as livestock production as a natural antibiotic. Genomic research in hop has resulted in a published draft genome and transcriptome assemblies. As research into the genomics of hop has gained interest, there is a critical need for centralized online genomic resources. To support the growing research community, we report the development of an online resource "HopBase.org." In addition to providing a gene annotation to the existing Shinsuwase draft genome, HopBase makes available genome assemblies and annotations for both the cultivar “Teamaker” and male hop accession number USDA 21422M. These genome assemblies and gene annotations, along with other common data, coupled with a genome browser and BLAST database, enable the hop community to enter the genomic age. The HopBase genomic resource is accessible at http://hopbase.org and http://hopbase.cgrb.oregonstate.edu. PMID:28415075
Simões-Araújo, Jean Luiz; Rumjanek, Norma Gouvêa; Xavier, Gustavo Ribeiro; Zilli, Jerri Édson
The strain BR 3351T (Bradyrhizobium manausense) was obtained from nodules of cowpea (Vigna unguiculata L. Walp) growing in soil collected from the Amazon rainforest. Furthermore, the strain was observed to have a high capacity to fix nitrogen in symbiosis with cowpea. We report here the draft genome sequence of strain BR 3351T. The information presented will be important for comparative analysis of nodulation and nitrogen fixation in diazotrophic bacteria. A draft genome of 9,145,311 bp with 62.9% GC content was assembled into 127 scaffolds from 100-bp paired-end Illumina MiSeq reads. RAST annotation identified 8603 coding sequences and 51 RNA genes, classified in 504 subsystems. Published by Elsevier Editora Ltda.
Lin, Hsin-Hung; Liao, Yu-Chieh
2015-01-01
Despite the ever-increasing output of next-generation sequencing data along with developing assemblers, dozens to hundreds of gaps still exist in de novo microbial assemblies due to uneven coverage and large genomic repeats. Third-generation single-molecule, real-time (SMRT) sequencing technology avoids amplification artifacts and generates kilobase-long reads with the potential to complete microbial genome assembly. However, due to the low accuracy (~85%) of third-generation sequences, a considerable amount of long reads (>50X) are required for self-correction and for subsequent de novo assembly. Recently-developed hybrid approaches, using next-generation sequencing data and as few as 5X long reads, have been proposed to improve the completeness of microbial assembly. In this study we have evaluated the contemporary hybrid approaches and demonstrated that assembling corrected long reads (by runCA) produced the best assembly compared to long-read scaffolding (e.g., AHA, Cerulean and SSPACE-LongRead) and gap-filling (SPAdes). For generating corrected long reads, we further examined long-read correction tools, such as ECTools, LSC, LoRDEC, PBcR pipeline and proovread. We have demonstrated that three microbial genomes including Escherichia coli K12 MG1655, Meiothermus ruber DSM1279 and Pedobacter heparinus DSM2366 were successfully hybrid assembled by runCA into near-perfect assemblies using ECTools-corrected long reads. In addition, we developed a tool, Patch, which implements corrected long reads and pre-assembled contigs as inputs, to enhance microbial genome assemblies. With the additional 20X long reads, short reads of S. cerevisiae W303 were hybrid assembled into 115 contigs using the verified strategy, ECTools + runCA. Patch was subsequently applied to upgrade the assembly to a 35-contig draft genome.
Our evaluation of the hybrid approaches shows that assembling the ECTools-corrected long reads via runCA generates near complete microbial genomes, suggesting that genome assembly could benefit from re-analyzing the available hybrid datasets that were not assembled in an optimal fashion.
Endogenous hepadnaviruses, bornaviruses and circoviruses in snakes
Gilbert, C.; Meik, J. M.; Dashevsky, D.; Card, D. C.; Castoe, T. A.; Schaack, S.
2014-01-01
We report the discovery of endogenous viral elements (EVEs) from Hepadnaviridae, Bornaviridae and Circoviridae in the speckled rattlesnake, Crotalus mitchellii, the first viperid snake for which a draft whole genome sequence assembly is available. Analysis of the draft assembly reveals genome fragments from the three virus families were inserted into the genome of this snake over the past 50 Myr. Cross-species PCR screening of orthologous loci and computational scanning of the python and king cobra genomes reveals that circoviruses integrated most recently (within the last approx. 10 Myr), whereas bornaviruses and hepadnaviruses integrated at least approximately 13 and approximately 50 Ma, respectively. This is, to our knowledge, the first report of circo-, borna- and hepadnaviruses in snakes and the first characterization of non-retroviral EVEs in non-avian reptiles. Our study provides a window into the historical dynamics of viruses in these host lineages and shows that their evolution involved multiple host-switches between mammals and reptiles. PMID:25080342
Liu, Xianghui; Arumugam, Krithika; Natarajan, Gayathri; Seviour, Thomas W; Drautz-Moses, Daniela I; Wuertz, Stefan; Law, Yingyu; Williams, Rohan B H
2018-05-10
Here, we present the draft genome sequence of an anaerobic ammonium-oxidizing bacterium (AnAOB), "Candidatus Brocadia," which was enriched in an anammox reactor. A 3.2-Mb genome sequence comprising 168 contigs was assembled, in which 2,765 protein-coding genes, 47 tRNAs, and one each of 5S, 16S, and 23S rRNAs were annotated. No evidence for the presence of a nitric oxide-forming nitrite reductase was found. Copyright © 2018 Liu et al.
Garza-Ramos, Ulises; Tamayo-Legorreta, Elsa; Arellano-Quintanilla, Doris María; Rodriguez-Medina, Nadia; Silva-Sanchez, Jesús; Catalan-Najera, Juan; Rocha-Martínez, Marisol Karina; Bravo-Díaz, María Asunción; Alpuche-Aranda, Celia
2018-03-08
A colistin-resistant mcr-1-carrying Escherichia coli strain, RC2-007, was isolated from a swine farm in Mexico. This extraintestinal and uropathogenic strain of E. coli belongs to serotype O89:H9 and sequence type 744. Assembly and annotation resulted in a 4.9-Mb draft genome that revealed the presence of plasmid-mediated mcr-1-ISApl1 genes as part of a prophage. Copyright © 2018 Garza-Ramos et al.
Argout, X; Martin, G; Droc, G; Fouet, O; Labadie, K; Rivals, E; Aury, J M; Lanaud, C
2017-09-15
Theobroma cacao L., native to the Amazonian basin of South America, is an economically important fruit tree crop for tropical countries as a source of chocolate. The first draft genome of the species, from a Criollo cultivar, was published in 2011. Although a useful resource, some improvements are possible, including identifying misassemblies, reducing the number of scaffolds and gaps, and anchoring un-anchored sequences to the 10 chromosomes. We used an NGS-based approach to significantly improve the assembly of the Belizean Criollo B97-61/B2 genome. We combined four Illumina large-insert-size mate-paired libraries with 52x of Pacific Biosciences long reads to correct misassembled regions and reduce the number of scaffolds. We then used genotyping by sequencing (GBS) methods to increase the proportion of the assembly anchored to chromosomes. The scaffold number decreased from 4,792 in assembly V1 to 554 in V2, while the scaffold N50 size increased from 0.47 Mb in V1 to 6.5 Mb in V2. A total of 96.7% of the assembly was anchored to the 10 chromosomes compared to 66.8% in the previous version. Unknown sites (Ns) were reduced from 10.8% to 5.7%. In addition, we updated the functional annotations and performed a new RefSeq structural annotation based on RNAseq evidence. Theobroma cacao Criollo genome version 2 will be a valuable resource for the investigation of complex traits at the genomic level and for future comparative genomics and genetics studies in the cacao tree. New functional tools and annotations are available on the Cocoa Genome Hub ( http://cocoa-genome-hub.southgreen.fr ).
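Many records in this listing summarize assembly contiguity with an N50 value. As a minimal sketch of the usual definition (the largest length L such that sequences of length at least L together cover at least half of the assembly), the following Python function may help when comparing such figures. The function name `n50` and the toy lengths are illustrative assumptions and are not taken from any of the cited studies.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths.

    N50 is taken here as the length of the sequence at which the running
    total, accumulated from longest to shortest, first reaches at least
    half of the summed assembly length.
    """
    lengths = list(lengths)
    if not lengths:
        raise ValueError("no sequence lengths given")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Compare doubled running total to avoid fractional halves.
        if 2 * running >= total:
            return length


# Toy example (hypothetical lengths): total is 32, and the two longest
# sequences (8 + 8 = 16) already cover half of it.
print(n50([2, 2, 2, 3, 3, 4, 8, 8]))  # -> 8
```

Because N50 depends on the total assembly length, values from assemblies of different sizes or from contigs versus scaffolds are not directly comparable.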
Genome Sequence of the Necrotrophic Plant Pathogen Alternaria brassicicola Abra43
Belmas, Elodie; Briand, Martial; Kwasiborski, Anthony; Colou, Justine; N’Guyen, Guillaume; Iacomi, Béatrice; Grappin, Philippe; Campion, Claire; Simoneau, Philippe; Barret, Matthieu
2018-01-01
ABSTRACT Alternaria brassicicola causes dark spot (or black spot) disease, which is one of the most common and destructive fungal diseases of Brassicaceae spp. worldwide. Here, we report the draft genome sequence of strain Abra43. The assembly comprises 29 scaffolds, with an N50 value of 2.1 Mb. The assembled genome was 31,036,461 bp in length, with a G+C content of 50.85%. PMID:29439047
Genome Sequence of an Endophytic Fungus, Fusarium solani JS-169, Which Has Antifungal Activity.
Kim, Jung A; Jeon, Jongbum; Park, Sook-Young; Kim, Ki-Tae; Choi, Gobong; Lee, Hyun-Jung; Kim, Yangsun; Yang, Hee-Sun; Yeo, Joo-Hong; Lee, Yong-Hwan; Kim, Soonok
2017-10-19
An endophytic fungus, Fusarium solani strain JS-169, isolated from a mulberry twig, showed considerable antifungal activity. Here, we report the draft genome sequence of this strain. The assembly comprises 17 scaffolds, with an N50 value of 4.93 Mb. The assembled genome was 45,813,297 bp in length, with a G+C content of 49.91%. Copyright © 2017 Kim et al.
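Genome announcements such as the two above report a G+C content alongside assembly length. As a hedged illustration of how such a percentage is typically derived from sequence data, the sketch below counts G and C among unambiguous bases only. The function name `gc_content`, the choice to ignore N and other ambiguity codes, and the toy sequence are assumptions for illustration, not the method of any cited paper.

```python
def gc_content(sequence):
    """Return the G+C percentage of a nucleotide sequence.

    Only unambiguous bases (A, C, G, T) are counted in the denominator,
    so gap characters such as N do not dilute the result.
    """
    seq = sequence.upper()
    gc = seq.count("G") + seq.count("C")
    acgt = gc + seq.count("A") + seq.count("T")
    if acgt == 0:
        raise ValueError("sequence contains no A, C, G or T bases")
    return 100.0 * gc / acgt


# Toy example (hypothetical sequence): 2 of 4 unambiguous bases are G or C.
print(gc_content("ATNNGC"))  # -> 50.0
```

For a multi-scaffold assembly, the counts would be summed over all scaffolds before dividing, rather than averaging per-scaffold percentages.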
NASA Technical Reports Server (NTRS)
Brewer, W. V.; Rasis, E. P.; Shih, H. R.
1993-01-01
Results from NASA/HBCU Grant No. NAG-1-1125 are summarized. Designs developed for model fabrication, exploratory concepts drafted, interface of computer with robot and end-effector, and capability enhancement are discussed.
Gleaning evolutionary insights from the genome sequence of a probiotic yeast Saccharomyces boulardii
2013-01-01
Background The yeast Saccharomyces boulardii is used worldwide as a probiotic to alleviate the effects of several gastrointestinal diseases and control antibiotics-associated diarrhea. While many studies report the probiotic effects of S. boulardii, no genome information for this yeast is currently available in the public domain. Results We report the 11.4 Mbp draft genome of this probiotic yeast. The draft genome was obtained by assembling Roche 454 FLX + shotgun data into 194 contigs with an N50 of 251 Kbp. We compare our draft genome with all other Saccharomyces cerevisiae genomes. Conclusions Our analysis confirms the close similarity of S. boulardii to S. cerevisiae strains and provides a framework to understand the probiotic effects of this yeast, which exhibits unique physiological and metabolic properties. PMID:24148866
Khatri, Indu; Akhtar, Akil; Kaur, Kamaldeep; Tomar, Rajul; Prasad, Gandham Satyanarayana; Ramya, Thirumalai Nallan Chakravarthy; Subramanian, Srikrishna
2013-10-22
The yeast Saccharomyces boulardii is used worldwide as a probiotic to alleviate the effects of several gastrointestinal diseases and control antibiotics-associated diarrhea. While many studies report the probiotic effects of S. boulardii, no genome information for this yeast is currently available in the public domain. We report the 11.4 Mbp draft genome of this probiotic yeast. The draft genome was obtained by assembling Roche 454 FLX + shotgun data into 194 contigs with an N50 of 251 Kbp. We compare our draft genome with all other Saccharomyces cerevisiae genomes. Our analysis confirms the close similarity of S. boulardii to S. cerevisiae strains and provides a framework to understand the probiotic effects of this yeast, which exhibits unique physiological and metabolic properties.
Draft Genome of the Pearl Oyster Pinctada fucata: A Platform for Understanding Bivalve Biology
Takeuchi, Takeshi; Kawashima, Takeshi; Koyanagi, Ryo; Gyoja, Fuki; Tanaka, Makiko; Ikuta, Tetsuro; Shoguchi, Eiichi; Fujiwara, Mayuki; Shinzato, Chuya; Hisata, Kanako; Fujie, Manabu; Usami, Takeshi; Nagai, Kiyohito; Maeyama, Kaoru; Okamoto, Kikuhiko; Aoki, Hideo; Ishikawa, Takashi; Masaoka, Tetsuji; Fujiwara, Atushi; Endo, Kazuyoshi; Endo, Hirotoshi; Nagasawa, Hiromichi; Kinoshita, Shigeharu; Asakawa, Shuichi; Watabe, Shugo; Satoh, Nori
2012-01-01
The study of the pearl oyster Pinctada fucata is key to increasing our understanding of the molecular mechanisms involved in pearl biosynthesis and biology of bivalve molluscs. We sequenced ∼1150-Mb genome at ∼40-fold coverage using the Roche 454 GS-FLX and Illumina GAIIx sequencers. The sequences were assembled into contigs with N50 = 1.6 kb (total contig assembly reached to 1024 Mb) and scaffolds with N50 = 14.5 kb. The pearl oyster genome is AT-rich, with a GC content of 34%. DNA transposons, retrotransposons, and tandem repeat elements occupied 0.4, 1.5, and 7.9% of the genome, respectively (a total of 9.8%). Version 1.0 of the P. fucata draft genome contains 23 257 complete gene models, 70% of which are supported by the corresponding expressed sequence tags. The genes include those reported to have an association with bio-mineralization. Genes encoding transcription factors and signal transduction molecules are present in numbers comparable with genomes of other metazoans. Genome-wide molecular phylogeny suggests that the lophotrochozoan represents a distinct clade from ecdysozoans. Our draft genome of the pearl oyster thus provides a platform for the identification of selection markers and genes for calcification, knowledge of which will be important in the pearl industry. PMID:22315334
Patil, Yogita; Müller, Nicolai; Schink, Bernhard; ...
2017-02-20
Anaerobium acetethylicum strain GluBS11T belongs to the family Lachnospiraceae within the order Clostridiales. It is a Gram-positive, non-motile and strictly anaerobic bacterium isolated from biogas slurry that was originally enriched with gluconate as carbon source (Patil, et al., Int J Syst Evol Microbiol 65:3289-3296, 2015). Here we describe the draft genome sequence of strain GluBS11T and provide a detailed insight into its physiological and metabolic features. The draft genome sequence generated 4,609,043 bp, distributed among 105 scaffolds assembled using the SPAdes genome assembler method. It comprises in total 4,132 genes, of which 4,008 were predicted to be protein coding genes, 124 RNA genes and 867 pseudogenes. The G+C content was 43.51 mol %. The annotated genome of strain GluBS11T contains putative genes coding for the pentose phosphate pathway, the Embden-Meyerhof-Parnas pathway, the Entner-Doudoroff pathway and the tricarboxylic acid cycle. The genome revealed the presence of most of the necessary genes required for the fermentation of glucose and gluconate to acetate, ethanol, and hydrogen gas. However, a candidate gene for production of formate was not identified.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Patil, Yogita; Müller, Nicolai; Schink, Bernhard
Anaerobium acetethylicum strain GluBS11T belongs to the family Lachnospiraceae within the order Clostridiales. It is a Gram-positive, non-motile and strictly anaerobic bacterium isolated from biogas slurry that was originally enriched with gluconate as carbon source (Patil, et al., Int J Syst Evol Microbiol 65:3289-3296, 2015). Here we describe the draft genome sequence of strain GluBS11T and provide a detailed insight into its physiological and metabolic features. The draft genome sequence generated 4,609,043 bp, distributed among 105 scaffolds assembled using the SPAdes genome assembler method. It comprises in total 4,132 genes, of which 4,008 were predicted to be protein coding genes, 124 RNA genes and 867 pseudogenes. The G+C content was 43.51 mol %. The annotated genome of strain GluBS11T contains putative genes coding for the pentose phosphate pathway, the Embden-Meyerhof-Parnas pathway, the Entner-Doudoroff pathway and the tricarboxylic acid cycle. The genome revealed the presence of most of the necessary genes required for the fermentation of glucose and gluconate to acetate, ethanol, and hydrogen gas. However, a candidate gene for production of formate was not identified.
Draft transcriptome of Globodera ellingtonae
USDA-ARS's Scientific Manuscript database
The recently described cyst nematode species, Globodera ellingtonae, is a phylogenetic intermediary between the potato cyst nematodes (PCN), G. rostochiensis and G. pallida, and as such provides a new avenue for understanding the evolution and biology of PCN. Given that assembled genomes and transcri...
Courtland Target Assembly Facility Environmental Assessment
2006-10-01
Draft Environmental Assessment 2-17 tributyl phosphate (TBP)6, diatomaceous earth, talcum powder, cornmeal, water, steel, and plastic. 2.2.2... cornmeal, water, steel, and plastic that would not qualify as hazardous materials. TBP is non-explosive, non-flammable, and stable under normal
Incentivizing Multiple Revisions Improves Student Writing without Increasing Instructor Workload
ERIC Educational Resources Information Center
Stellmack, Mark A.; Sandidge, Rita R.; Sippl, Amy L.; Miller, Danneka J.
2015-01-01
Previous research has shown that when students are required to submit a draft and a revision of their writing, large proportions of students do not improve across drafts. We implemented a writing assignment in which students were permitted to submit up to four optional drafts. To encourage substantive revisions, students were awarded additional…
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hirsch, Candice N.; Hirsch, Cory D.; Brohammer, Alex B.
Intense artificial selection over the last 100 years has produced elite maize (Zea mays) inbred lines that combine to produce high-yielding hybrids. To further our understanding of how genome and transcriptome variation contribute to the production of high-yielding hybrids, we generated a draft genome assembly of the inbred line PH207 to complement and compare with the existing B73 reference sequence. B73 is a founder of the Stiff Stalk germplasm pool, while PH207 is a founder of Iodent germplasm, both of which have contributed substantially to the production of temperate commercial maize and are combined to make heterotic hybrids. Comparison of these two assemblies revealed over 2500 genes present in only one of the two genotypes and 136 gene families that have undergone extensive expansion or contraction. Transcriptome profiling revealed extensive expression variation, with as many as 10,564 differentially expressed transcripts and 7128 transcripts expressed in only one of the two genotypes in a single tissue. Genotype-specific genes were more likely to have tissue/condition-specific expression and lower transcript abundance. The availability of a high-quality genome assembly for the elite maize inbred PH207 expands our knowledge of the breadth of natural genome and transcriptome variation in elite maize inbred lines across heterotic pools.
Hirsch, Candice N.; Hirsch, Cory D.; Brohammer, Alex B.; ...
2016-11-01
Intense artificial selection over the last 100 years has produced elite maize (Zea mays) inbred lines that combine to produce high-yielding hybrids. To further our understanding of how genome and transcriptome variation contribute to the production of high-yielding hybrids, we generated a draft genome assembly of the inbred line PH207 to complement and compare with the existing B73 reference sequence. B73 is a founder of the Stiff Stalk germplasm pool, while PH207 is a founder of Iodent germplasm, both of which have contributed substantially to the production of temperate commercial maize and are combined to make heterotic hybrids. Comparison of these two assemblies revealed over 2500 genes present in only one of the two genotypes and 136 gene families that have undergone extensive expansion or contraction. Transcriptome profiling revealed extensive expression variation, with as many as 10,564 differentially expressed transcripts and 7128 transcripts expressed in only one of the two genotypes in a single tissue. Genotype-specific genes were more likely to have tissue/condition-specific expression and lower transcript abundance. The availability of a high-quality genome assembly for the elite maize inbred PH207 expands our knowledge of the breadth of natural genome and transcriptome variation in elite maize inbred lines across heterotic pools.
Soifer, Ilya; Barad, Omer; Shem-Tov, Doron; Baruch, Kobi; Lu, Fei; Hernandez, Alvaro G.; Wright, Chris L.; Koehler, Klaus; Buell, C. Robin; de Leon, Natalia
2016-01-01
Intense artificial selection over the last 100 years has produced elite maize (Zea mays) inbred lines that combine to produce high-yielding hybrids. To further our understanding of how genome and transcriptome variation contribute to the production of high-yielding hybrids, we generated a draft genome assembly of the inbred line PH207 to complement and compare with the existing B73 reference sequence. B73 is a founder of the Stiff Stalk germplasm pool, while PH207 is a founder of Iodent germplasm, both of which have contributed substantially to the production of temperate commercial maize and are combined to make heterotic hybrids. Comparison of these two assemblies revealed over 2500 genes present in only one of the two genotypes and 136 gene families that have undergone extensive expansion or contraction. Transcriptome profiling revealed extensive expression variation, with as many as 10,564 differentially expressed transcripts and 7128 transcripts expressed in only one of the two genotypes in a single tissue. Genotype-specific genes were more likely to have tissue/condition-specific expression and lower transcript abundance. The availability of a high-quality genome assembly for the elite maize inbred PH207 expands our knowledge of the breadth of natural genome and transcriptome variation in elite maize inbred lines across heterotic pools. PMID:27803309
SmedGD 2.0: The Schmidtea mediterranea genome database
Robb, Sofia M.C.; Gotting, Kirsten; Ross, Eric; Sánchez Alvarado, Alejandro
2016-01-01
Planarians have emerged as excellent models for the study of key biological processes such as stem cell function and regulation, axial polarity specification, regeneration, and tissue homeostasis among others. The most widely used organism for these studies is the free-living flatworm Schmidtea mediterranea. In 2007, the Schmidtea mediterranea Genome Database (SmedGD) was first released to provide a much needed resource for the small, but growing planarian community. SmedGD 1.0 has been a depository for genome sequence, a draft assembly, and related experimental data (e.g., RNAi phenotypes, in situ hybridization images, and differential gene expression results). We report here a comprehensive update to SmedGD (SmedGD 2.0) that aims to expand its role as an interactive community resource. The new database includes more recent, and up-to-date transcription data, provides tools that enhance interconnectivity between different genome assemblies and transcriptomes, including next generation assemblies for both the sexual and asexual biotypes of S. mediterranea. SmedGD 2.0 (http://smedgd.stowers.org) not only provides significantly improved gene annotations, but also tools for data sharing, attributes that will help both the planarian and biomedical communities to more efficiently mine the genomics and transcriptomics of S. mediterranea. PMID:26138588
The draft genome of Globodera ellingtonae
USDA-ARS's Scientific Manuscript database
Globodera ellingtonae is a newly described potato cyst nematode found in Idaho, Oregon, and Argentina. Here we present a genome assembly for G. ellingtonae, a relative of the quarantine nematodes G. pallida and G. rostochiensis, produced using data from Illumina and Pacific Biosciences sequencing te...
Engineering Documentation and Data Control
NASA Technical Reports Server (NTRS)
Matteson, Michael J.; Bramley, Craig; Ciaruffoli, Veronica
2001-01-01
Mississippi Space Services (MSS), the facility services contractor for NASA's John C. Stennis Space Center (SSC), is utilizing technology to improve engineering documentation and data control. Two identified improvement areas, labor-intensive documentation research and outdated drafting standards, were targeted as top priority. MSS selected AutoManager(R) WorkFlow from Cyco software to manage engineering documentation. The software is currently installed on over 150 desktops. The outdated SSC drafting standard was written for pre-CADD drafting methods, in other words, board drafting. Implementation of COTS software solutions to manage engineering documentation and update the drafting standard resulted in significant increases in productivity by reducing the time spent searching for documents.
Jiang, Yujia; Lu, Jiasheng; Chen, Tianpeng; Yan, Wei; Dong, Weiliang; Zhou, Jie; Zhang, Wenming; Ma, Jiangfeng; Jiang, Min; Xin, Fengxue
2018-05-23
A novel butanogenic Clostridium sp. NJ4 was successfully isolated and characterized, which could directly produce a relatively high titer of butanol from inulin through consolidated bioprocessing (CBP). The assembled draft genome of strain NJ4 is 4.09 Mb, containing 3891 encoded protein sequences with a G+C content of 30.73%. Among these annotated genes, a levanase, a hypothetical inulinase, and two bifunctional alcohol/aldehyde dehydrogenases (AdhE) were found to play key roles in the achievement of ABE production from inulin through CBP.
Feldmesser, Ester; Rosenwasser, Shilo; Vardi, Assaf; Ben-Dor, Shifra
2014-02-22
The advent of Next Generation Sequencing technologies and corresponding bioinformatics tools allows the definition of transcriptomes in non-model organisms. Non-model organisms are of great ecological and biotechnological significance, and consequently the understanding of their unique metabolic pathways is essential. Several methods that integrate de novo assembly with genome-based assembly have been proposed. Yet, there are many open challenges in defining genes, particularly where genomes are not available or incomplete. Despite the large numbers of transcriptome assemblies that have been performed, quality control of the transcript building process, particularly on the protein level, is rarely performed if ever. To test and improve the quality of the automated transcriptome reconstruction, we used manually defined and curated genes, several of them experimentally validated. Several approaches to transcript construction were utilized, based on the available data: a draft genome, high quality RNAseq reads, and ESTs. In order to maximize the contribution of the various data, we integrated methods including de novo and genome based assembly, as well as EST clustering. After each step a set of manually curated genes was used for quality assessment of the transcripts. The interplay between the automated pipeline and the quality control indicated which additional processes were required to improve the transcriptome reconstruction. We discovered that E. huxleyi has a very high percentage of non-canonical splice junctions, and relatively high rates of intron retention, which caused unique issues with the currently available tools. While individual tools missed genes and artificially joined overlapping transcripts, combining the results of several tools improved the completeness and quality considerably. 
The final collection, created from the integration of several quality control and improvement rounds, was compared to the manually defined set both on the DNA and protein levels, and resulted in an improvement of 20% versus any of the read-based approaches alone. To the best of our knowledge, this is the first time that an automated transcript definition is subjected to quality control using manually defined and curated genes and thereafter the process is improved. We recommend using a set of manually curated genes to troubleshoot transcriptome reconstruction.
Draft genome sequence of non-shiga toxin-producing Escherichia coli O157 NCCP15738.
Kwon, Taesoo; Kim, Jung-Beom; Bak, Young-Seok; Yu, Young-Bin; Kwon, Ki Sung; Kim, Won; Cho, Seung-Hak
2016-01-01
The non-shiga toxin-producing Escherichia coli (non-STEC) O157 is a pathogenic strain that causes diarrhea but does not cause hemolytic-uremic syndrome or hemorrhagic colitis. Here, we present the 5-Mb draft genome sequence of non-STEC O157 NCCP15738, which was isolated from the feces of a Korean patient with diarrhea, and describe its features and the structural basis for its genome evolution. A total of 565-Mbp paired-end reads were generated using the Illumina-HiSeq 2000 platform. The reads were assembled into 135 scaffolds throughout the de novo assembly. The assembled genome size of NCCP15738 was 5,005,278 bp with an N50 value of 142,450 bp and 50.65 % G+C content. Using Rapid Annotation using Subsystem Technology analysis, we predicted 4780 ORFs and 31 RNA genes. The evolutionary tree was inferred from multiple sequence alignment of 45 E. coli species. The most closely related neighbor of NCCP15738 indicated by whole-genome phylogeny was E. coli UMNK88, but that indicated by multilocus sequence analysis was E. coli DH1(ME8569). A comparison between the NCCP15738 genome and those of reference strains, E. coli K-12 substr. MG1655 and EHEC O157:H7 EDL933 by bioinformatics analyses revealed unique genes in NCCP15738 associated with lysis protein S, two-component signal transduction system, conjugation, the flagellum, nucleotide-binding proteins, and metal-ion binding proteins. Notably, NCCP15738 has a dual flagella system like that in Vibrio parahaemolyticus, Aeromonas spp., and Rhodospirillum centenum. The draft genome sequence and the results of bioinformatics analysis of NCCP15738 provide the basis for understanding the genomic evolution of this strain.
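Assembly statistics such as the N50 value and G+C content quoted in the record above are simple to compute from scaffold sequences. The following is a minimal sketch, not the authors' pipeline: the function names `n50` and `gc_percent` are hypothetical, N50 is taken in its usual sense (the length of the scaffold at which the running total, counted from the longest scaffold down, first reaches half the assembly size), and ambiguous bases such as N are assumed to be excluded from the G+C denominator.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold or contig lengths."""
    if not lengths:
        raise ValueError("no sequence lengths given")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down until half the assembly is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


def gc_percent(seq):
    """Return G+C content as a percentage of unambiguous (A/C/G/T) bases."""
    seq = seq.upper()
    counted = sum(seq.count(base) for base in "ACGT")
    if counted == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return 100.0 * (seq.count("G") + seq.count("C")) / counted


if __name__ == "__main__":
    # Toy scaffold lengths, not data from any assembly listed here.
    print(n50([2, 3, 4, 5, 6]))
    print(gc_percent("ACGTNN"))
```

Whether gaps and ambiguous bases count toward assembly size or G+C content varies between tools, so values computed this way may differ slightly from published figures.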
Yong, Hoi-Sen; Eamsobhana, Praphathip; Lim, Phaik-Eem; Razali, Rozaimi; Aziz, Farhanah Abdul; Rosli, Nurul Shielawati Mohamed; Poole-Johnson, Johan; Anwar, Arif
2015-08-01
Angiostrongylus cantonensis is a bursate nematode parasite that causes eosinophilic meningitis (or meningoencephalitis) in humans in many parts of the world. The genomic data from A. cantonensis will form a useful resource for comparative genomic and chemogenomic studies to aid the development of diagnostics and therapeutics. We have sequenced, assembled and annotated the genome of A. cantonensis. The genome size is estimated to be ∼260 Mb, with 17,280 genomic scaffolds, 91X coverage, 81.45% for complete and 93.95% for partial score based on CEGMA analysis of genome completeness. The number of predicted genes of ≥300 bp was 17,482. A total of 7737 predicted protein-coding genes of ≥50 amino acids were identified in the assembled genome. Among the proteins of known function, kinases are the most abundant followed by transferases. The draft genome contains 34 excretory-secretory proteins (ES), a minimum of 44 Nematode Astacin (NAS) metalloproteases, 12 Homeobox (HOX) genes, and 30 neurotransmitters. The assembled genome size (260 Mb) is larger than those of Pristionchus pacificus, Caenorhabditis elegans, Necator americanus, Caenorhabditis briggsae, Trichinella spiralis, Brugia malayi and Loa loa, but smaller than Haemonchus contortus and Ascaris suum. The repeat content (25%) is similar to H. contortus. The GC content (41.17%) is lower compared to P. pacificus (42.7%) and H. contortus (43.1%) but higher compared to C. briggsae (37.69%), A. suum (37.9%) and N. americanus (40.2%) while the scaffold N50 is 42,191. This draft genome will facilitate the understanding of many unresolved issues on the parasite and the disorder it causes. Copyright © 2015 Elsevier B.V. All rights reserved.
Zheng, Zequn; Zhang, Qisen; Zhou, Gaofeng; Sweetingham, Mark W.; Howieson, John G.; Li, Chengdao
2013-01-01
Lupin (Lupinus angustifolius L.) is the most recently domesticated crop in major agricultural cultivation. Its seeds are high in protein and dietary fibre, but low in oil and starch. Medical and dietetic studies have shown that consuming lupin-enriched food has significant health benefits. We report the draft assembly from a whole genome shotgun sequencing dataset for this legume species with 26.9x coverage of the genome, which is predicted to contain 57,807 genes. Analysis of the annotated genes with metabolic pathways provided a partial understanding of some key features of lupin, such as the amino acid profile of storage proteins in seeds. Furthermore, we applied the NGS-based RAD-sequencing technology to obtain 8,244 sequence-defined markers for anchoring the genomic sequences. A total of 4,214 scaffolds from the genome sequence assembly were aligned into the genetic map. The combination of the draft assembly and a sequence-defined genetic map made it possible to locate and study functional genes of agronomic interest. The identification of co-segregating SNP markers, scaffold sequences and gene annotation facilitated the identification of a candidate R gene associated with resistance to the major lupin disease anthracnose. We demonstrated that the combination of medium-depth genome sequencing and a high-density genetic linkage map by application of NGS technology is a cost-effective approach to generating genome sequence data and a large number of molecular markers to study the genomics, genetics and functional genes of lupin, and to apply them to molecular plant breeding. This strategy does not require prior genome knowledge, which potentiates its application to a wide range of non-model species. PMID:23734219
Yang, Huaan; Tao, Ye; Zheng, Zequn; Zhang, Qisen; Zhou, Gaofeng; Sweetingham, Mark W; Howieson, John G; Li, Chengdao
2013-01-01
Lupin (Lupinus angustifolius L.) is the most recently domesticated crop in major agricultural cultivation. Its seeds are high in protein and dietary fibre, but low in oil and starch. Medical and dietetic studies have shown that consuming lupin-enriched food has significant health benefits. We report the draft assembly from a whole genome shotgun sequencing dataset for this legume species with 26.9x coverage of the genome, which is predicted to contain 57,807 genes. Analysis of the annotated genes with metabolic pathways provided a partial understanding of some key features of lupin, such as the amino acid profile of storage proteins in seeds. Furthermore, we applied the NGS-based RAD-sequencing technology to obtain 8,244 sequence-defined markers for anchoring the genomic sequences. A total of 4,214 scaffolds from the genome sequence assembly were aligned into the genetic map. The combination of the draft assembly and a sequence-defined genetic map made it possible to locate and study functional genes of agronomic interest. The identification of co-segregating SNP markers, scaffold sequences and gene annotation facilitated the identification of a candidate R gene associated with resistance to the major lupin disease anthracnose. We demonstrated that the combination of medium-depth genome sequencing and a high-density genetic linkage map by application of NGS technology is a cost-effective approach to generating genome sequence data and a large number of molecular markers to study the genomics, genetics and functional genes of lupin, and to apply them to molecular plant breeding. This strategy does not require prior genome knowledge, which potentiates its application to a wide range of non-model species.
Barrero, Roberto A; Guerrero, Felix D; Black, Michael; McCooke, John; Chapman, Brett; Schilkey, Faye; Pérez de León, Adalberto A; Miller, Robert J; Bruns, Sara; Dobry, Jason; Mikhaylenko, Galina; Stormo, Keith; Bell, Callum; Tao, Quanzhou; Bogden, Robert; Moolhuijzen, Paula M; Hunter, Adam; Bellgard, Matthew I
2017-08-01
The genome of the cattle tick Rhipicephalus microplus, an ectoparasite with global distribution, is estimated to be 7.1Gbp in length and consists of approximately 70% repetitive DNA. We report the draft assembly of a tick genome that utilized a hybrid sequencing and assembly approach to capture the repetitive fractions of the genome. Our hybrid approach produced an assembly consisting of 2.0Gbp represented in 195,170 scaffolds with a N50 of 60,284bp. The Rmi v2.0 assembly is 51.46% repetitive with a large fraction of unclassified repeats, short interspersed elements, long interspersed elements and long terminal repeats. We identified 38,827 putative R. microplus gene loci, of which 24,758 were protein coding genes (≥100 amino acids). OrthoMCL comparative analysis against 11 selected species including insects and vertebrates identified 10,835 and 3,423 protein coding gene loci that are unique to R. microplus or common to both R. microplus and Ixodes scapularis ticks, respectively. We identified 191 microRNA loci, of which 168 have similarity to known miRNAs and 23 represent novel miRNA families. We identified the genomic loci of several highly divergent R. microplus esterases with sequence similarity to acetylcholinesterase. Additionally we report the finding of a novel cytochrome P450 CYP41 homolog that shows similar protein folding structures to known CYP41 proteins known to be involved in acaricide resistance. Copyright © 2017 Australian Society for Parasitology. Published by Elsevier Ltd. All rights reserved.
Want Superstar Teachers? Scout for Talent, and Recruit Like Crazy.
ERIC Educational Resources Information Center
Bateman, C. Fred
1986-01-01
A school can assemble a winning teaching team by taking lessons from sports talent recruitment programs. Schools should search for early talent and ask education professors to identify promising student teachers. Contracts should be offered immediately to final round draft choices. (CJH)
77 FR 22247 - Veterinary Feed Directive; Draft Text for Proposed Regulation
Federal Register 2010, 2011, 2012, 2013, 2014
2012-04-13
.... FDA-2010-N-0155] Veterinary Feed Directive; Draft Text for Proposed Regulation AGENCY: Food and Drug Administration, HHS. ACTION: Notification; draft text for proposed regulation. SUMMARY: The Food and Drug Administration (FDA) is announcing the availability of draft text for a proposed regulation intended to improve...
Sánchez-Nieves, Rubén; Facciotti, Marc; Saavedra-Collado, Sofía; Dávila-Santiago, Lizbeth; Rodríguez-Carrero, Roy; Montalvo-Rodríguez, Rafael
2016-03-01
The genus Haloarcula belongs to the family Halobacteriaceae which currently has 10 valid species. Here we report the draft genome sequence of strain SL3, a new species within this genus, isolated from the Solar Salterns of Cabo Rojo, Puerto Rico. Genome assembly performed using NGEN Assembler resulted in 18 contigs (N50 = 601,911 bp), the largest of which contains 1,023,775 bp. The genome consists of 3.97 MB and has a GC content of 61.97%. Like all species of Haloarcula, the genome encodes heterogeneous copies of the small subunit ribosomal RNA. In addition, the genome includes 6 rRNAs, 48 tRNAs, and 3797 protein coding sequences. Several carbohydrate-active enzymes genes were found, as well as enzymes involved in the dihydroxyacetone processing pathway which are not found in other Haloarcula species. The NCBI accession number for this genome is LIUF00000000 and the strain deposit number is CECT9001.
Peng, Xinxia; Alföldi, Jessica; Gori, Kevin; Eisfeld, Amie J; Tyler, Scott R; Tisoncik-Go, Jennifer; Brawand, David; Law, G Lynn; Skunca, Nives; Hatta, Masato; Gasper, David J; Kelly, Sara M; Chang, Jean; Thomas, Matthew J; Johnson, Jeremy; Berlin, Aaron M; Lara, Marcia; Russell, Pamela; Swofford, Ross; Turner-Maier, Jason; Young, Sarah; Hourlier, Thibaut; Aken, Bronwen; Searle, Steve; Sun, Xingshen; Yi, Yaling; Suresh, M; Tumpey, Terrence M; Siepel, Adam; Wisely, Samantha M; Dessimoz, Christophe; Kawaoka, Yoshihiro; Birren, Bruce W; Lindblad-Toh, Kerstin; Di Palma, Federica; Engelhardt, John F; Palermo, Robert E; Katze, Michael G
2014-12-01
The domestic ferret (Mustela putorius furo) is an important animal model for multiple human respiratory diseases. It is considered the 'gold standard' for modeling human influenza virus infection and transmission. Here we describe the 2.41 Gb draft genome assembly of the domestic ferret, constituting 2.28 Gb of sequence plus gaps. We annotated 19,910 protein-coding genes on this assembly using RNA-seq data from 21 ferret tissues. We characterized the ferret host response to two influenza virus infections by RNA-seq analysis of 42 ferret samples from influenza time-course data and showed distinct signatures in ferret trachea and lung tissues specific to 1918 or 2009 human pandemic influenza virus infections. Using microarray data from 16 ferret samples reflecting cystic fibrosis disease progression, we showed that transcriptional changes in the CFTR-knockout ferret lung reflect pathways of early disease that cannot be readily studied in human infants with cystic fibrosis disease.
Endogenous hepadnaviruses, bornaviruses and circoviruses in snakes.
Gilbert, C; Meik, J M; Dashevsky, D; Card, D C; Castoe, T A; Schaack, S
2014-09-22
We report the discovery of endogenous viral elements (EVEs) from Hepadnaviridae, Bornaviridae and Circoviridae in the speckled rattlesnake, Crotalus mitchellii, the first viperid snake for which a draft whole genome sequence assembly is available. Analysis of the draft assembly reveals genome fragments from the three virus families were inserted into the genome of this snake over the past 50 Myr. Cross-species PCR screening of orthologous loci and computational scanning of the python and king cobra genomes reveals that circoviruses integrated most recently (within the last approx. 10 Myr), whereas bornaviruses and hepadnaviruses integrated at least approximately 13 and approximately 50 Ma, respectively. This is, to our knowledge, the first report of circo-, borna- and hepadnaviruses in snakes and the first characterization of non-retroviral EVEs in non-avian reptiles. Our study provides a window into the historical dynamics of viruses in these host lineages and shows that their evolution involved multiple host-switches between mammals and reptiles. © 2014 The Author(s) Published by the Royal Society. All rights reserved.
Peng, Xinxia; Alföldi, Jessica; Gori, Kevin; Eisfeld, Amie J.; Tyler, Scott R.; Tisoncik-Go, Jennifer; Brawand, David; Law, G. Lynn; Skunca, Nives; Hatta, Masato; Gasper, David J.; Kelly, Sara M.; Chang, Jean; Thomas, Matthew J.; Johnson, Jeremy; Berlin, Aaron M.; Lara, Marcia; Russell, Pamela; Swofford, Ross; Turner-Maier, Jason; Young, Sarah; Hourlier, Thibaut; Aken, Bronwen; Searle, Steve; Sun, Xingshen; Yi, Yaling; Suresh, M.; Tumpey, Terrence M.; Siepel, Adam; Wisely, Samantha M.; Dessimoz, Christophe; Kawaoka, Yoshihiro; Birren, Bruce W.; Lindblad-Toh, Kerstin; Di Palma, Federica; Engelhardt, John F.; Palermo, Robert E.; Katze, Michael G.
2014-01-01
The domestic ferret (Mustela putorius furo) is an important animal model for multiple human respiratory diseases. It is considered the ‘gold standard’ for modeling human influenza virus infection and transmission. Here we describe the 2.41 Gb draft genome assembly of the domestic ferret, constituting 2.28 Gb of sequence plus gaps. We annotate 19,910 protein-coding genes on this assembly using RNA-seq data from 21 ferret tissues. We characterize the ferret host response to two influenza virus infections by RNA-seq analysis of 42 ferret samples from influenza time courses, and show distinct signatures in ferret trachea and lung tissues specific to 1918 or 2009 human pandemic influenza virus infections. Using microarray data from 16 ferret samples reflecting cystic fibrosis (CF) disease progression, we show that transcriptional changes in the CFTR-knockout ferret lung reflect pathways of early disease that cannot be readily studied in human infants with CF disease. PMID:25402615
The draft genome of tropical fruit durian (Durio zibethinus).
Teh, Bin Tean; Lim, Kevin; Yong, Chern Han; Ng, Cedric Chuan Young; Rao, Sushma Ramesh; Rajasegaran, Vikneswari; Lim, Weng Khong; Ong, Choon Kiat; Chan, Ki; Cheng, Vincent Kin Yuen; Soh, Poh Sheng; Swarup, Sanjay; Rozen, Steven G; Nagarajan, Niranjan; Tan, Patrick
2017-11-01
Durian (Durio zibethinus) is a Southeast Asian tropical plant known for its hefty, spine-covered fruit and sulfury and onion-like odor. Here we present a draft genome assembly of D. zibethinus, representing the third plant genus in the Malvales order and first in the Helicteroideae subfamily to be sequenced. Single-molecule sequencing and chromosome contact maps enabled assembly of the highly heterozygous durian genome at chromosome-scale resolution. Transcriptomic analysis showed upregulation of sulfur-, ethylene-, and lipid-related pathways in durian fruits. We observed paleopolyploidization events shared by durian and cotton and durian-specific gene expansions in MGL (methionine γ-lyase), associated with production of volatile sulfur compounds (VSCs). MGL and the ethylene-related gene ACS (aminocyclopropane-1-carboxylic acid synthase) were upregulated in fruits concomitantly with their downstream metabolites (VSCs and ethylene), suggesting a potential association between ethylene biosynthesis and methionine regeneration via the Yang cycle. The durian genome provides a resource for tropical fruit biology and agronomy.
Draft genome of the gayal, Bos frontalis
Wang, Ming-Shan; Zeng, Yan; Wang, Xiao; Nie, Wen-Hui; Wang, Jin-Huan; Su, Wei-Ting; Xiong, Zi-Jun; Wang, Sheng; Qu, Kai-Xing; Yan, Shou-Qing; Yang, Min-Min; Wang, Wen; Dong, Yang; Zhang, Ya-Ping
2017-01-01
Gayal (Bos frontalis), also known as mithan or mithun, is a large endangered semi-domesticated bovine that has a limited geographical distribution in the hill-forests of China, Northeast India, Bangladesh, Myanmar, and Bhutan. Many questions about the gayal such as its origin, population history, and genetic basis of local adaptation remain largely unresolved. De novo sequencing and assembly of the whole gayal genome provides an opportunity to address these issues. We report a high-depth sequencing, de novo assembly, and annotation of a female Chinese gayal genome. Based on the Illumina genomic sequencing platform, we have generated 350.38 Gb of raw data from 16 different insert-size libraries. A total of 276.86 Gb of clean data is retained after quality control. The assembled genome is about 2.85 Gb with scaffold and contig N50 sizes of 2.74 Mb and 14.41 kb, respectively. Repetitive elements account for 48.13% of the genome. Gene annotation has yielded 26 667 protein-coding genes, of which 97.18% have been functionally annotated. BUSCO assessment shows that our assembly captures 93% (3183 of 4104) of the core eukaryotic genes and 83.1% of vertebrate universal single-copy orthologs. We provide the first comprehensive de novo genome of the gayal. This genetic resource is integral for investigating the origin of the gayal and performing comparative genomic studies to improve understanding of the speciation and divergence of bovine species. The assembled genome could be used as reference in future population genetic studies of gayal. PMID:29048483
In-field experiment of electro-hydraulic tillage depth draft-position mixed control on tractor
NASA Astrophysics Data System (ADS)
Han, Jiangyi; Xia, Changgao; Shang, Gaogao; Gao, Xiang
2017-12-01
Soil condition and plow condition affect the tillage resistance and the maximum traction of a tractor. To improve the adaptability of tractor tillage depth control, a multi-parameter control strategy is proposed that includes a tillage depth target, a draft force target, and a draft-position mixed ratio. In the strategy, the resistance coefficient is used to adjust the draft force target. Then, based on a JINMA1204 tractor, an electro-hydraulic hitch prototype that can set the control parameters is constructed, and a fuzzy controller for draft-position mixed control is designed. In-field experiments of position control were then carried out, and the results show that the error of tillage depth was less than ±20 mm. The experiment of draft-position control showed that the draft force and the tillage depth could be adjusted by multiple parameters such as tillage depth, resistance coefficient, and draft-position mixed coefficient. The multi-parameter control strategy can therefore improve the adaptability of tillage depth control under various soil and plow conditions.
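One common way to express draft-position mixed control is as a weighted blend of a normalized draft-force error and a normalized depth error. The sketch below illustrates that idea only; it is not the fuzzy controller described in the record above. The function name `mixed_error`, the normalization by the targets, and the assumption that the draft force target is the resistance coefficient multiplied by a base draft force are all hypothetical choices made for illustration.

```python
def mixed_error(depth_mm, draft_kn, depth_target_mm, base_draft_kn,
                resistance_coeff, mix_ratio):
    """Blend draft and position errors into one control error.

    A positive result means the implement may go deeper; a negative
    result means it should be raised. mix_ratio = 0 gives pure position
    control and mix_ratio = 1 gives pure draft control (assumed convention).
    """
    if not 0.0 <= mix_ratio <= 1.0:
        raise ValueError("mix_ratio must lie in [0, 1]")
    # Assumption: the resistance coefficient scales a base draft force target.
    draft_target_kn = resistance_coeff * base_draft_kn
    depth_error = (depth_target_mm - depth_mm) / depth_target_mm
    draft_error = (draft_target_kn - draft_kn) / draft_target_kn
    return mix_ratio * draft_error + (1.0 - mix_ratio) * depth_error


if __name__ == "__main__":
    # Toy numbers, not measurements from the experiment.
    for ratio in (0.0, 0.5, 1.0):
        print(ratio, mixed_error(180.0, 10.0, 200.0, 10.0, 1.2, ratio))
```

In a real hitch controller this blended error would feed whatever control law drives the hydraulic valve; the blend itself only decides how much weight each measurement receives.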
The Genome of the Cucumber, Cucumis Sativus L
USDA-ARS's Scientific Manuscript database
Cucumber is an economically important crop as well as a model system for sex determination studies and plant vascular biology. Here we report the draft genome sequence of Cucumis sativus var. sativus L., assembled using a novel combination of traditional Sanger and next-generation Illumina GA sequen...
76 FR 10917 - Draft Regulatory Guide: Issuance, Availability
Federal Register 2010, 2011, 2012, 2013, 2014
2011-02-28
... in the agency's ``Regulatory Guide'' series. This series was developed to describe and make available... connection assemblies can perform their safety functions during and after a design-basis event. Title 10 of... Reprocessing Plants,'' Criterion III, ``Design Control,'' requires, in part, that test programs used to verify...
The aquatic animals' transcriptome resource for comparative functional analysis.
Chou, Chih-Hung; Huang, Hsi-Yuan; Huang, Wei-Chih; Hsu, Sheng-Da; Hsiao, Chung-Der; Liu, Chia-Yu; Chen, Yu-Hung; Liu, Yu-Chen; Huang, Wei-Yun; Lee, Meng-Lin; Chen, Yi-Chang; Huang, Hsien-Da
2018-05-09
Aquatic animals have great economic and ecological importance. Among them, non-model organisms have been studied regarding eco-toxicity, stress biology, and environmental adaptation. Due to recent advances in next-generation sequencing techniques, large amounts of RNA-seq data for aquatic animals are publicly available. However, there is currently no comprehensive resource for the analysis, unification, and integration of these datasets. This study utilizes computational approaches to build a new resource of transcriptomic maps for aquatic animals. This aquatic animal transcriptome map database, dbATM, provides de novo transcriptome assembly, gene annotation, and comparative analysis for more than twenty aquatic organisms that lack a draft genome. To improve the assembly quality, three computational tools (Trinity, Oases and SOAPdenovo-Trans) were employed to enhance individual transcriptome assembly, and CAP3 and CD-HIT-EST software were then used to merge these three assembled transcriptomes. In addition, functional annotation analysis provides valuable clues to gene characteristics, including full-length transcript coding regions, conserved domains, gene ontology and KEGG pathways. Furthermore, all aquatic animal genes are essential for comparative genomics tasks such as constructing homologous gene groups and blast databases and phylogenetic analysis. In conclusion, we establish a resource for non-model aquatic animals, which are of great economic and ecological importance, and provide transcriptomic information including functional annotation and comparative transcriptome analysis. The database is now publicly accessible through the URL http://dbATM.mbc.nctu.edu.tw/ .
Design optimization of hydraulic turbine draft tube based on CFD and DOE method
NASA Astrophysics Data System (ADS)
Nam, Mun chol; Dechun, Ba; Xiangji, Yue; Mingri, Jin
2018-03-01
In order to improve the performance of the hydraulic turbine draft tube in its design process, this paper optimizes the draft tube on a multi-disciplinary collaborative design optimization platform by combining computational fluid dynamics (CFD) and design of experiments (DOE). The geometrical design variables are the median section of the draft tube and the cross section of its exit diffuser, and the objective function is to maximize the pressure recovery factor (Cp). The sample matrices required for the shape optimization of the draft tube are generated by the optimal Latin hypercube (OLH) method of the DOE technique, and their performance is evaluated through CFD numerical simulation. Subsequently, the main effect analysis and the sensitivity analysis of the geometrical parameters of the draft tube are carried out. Then, the optimal values of the geometrical design variables are determined using the response surface method. The optimized draft tube shows a marked performance improvement over the original.
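The workflow in the record above (sample the design space, evaluate each sample, fit a response surface, then optimize on the surface) can be sketched in a few lines. This is an illustration under stated assumptions, not the authors' method: it uses a plain random Latin hypercube rather than an optimal Latin hypercube, a made-up quadratic function `toy_cp` stands in for the CFD-evaluated pressure recovery factor, and the two design variables are scaled to the unit square. All function names are hypothetical.

```python
import numpy as np


def latin_hypercube(n_samples, n_vars, rng):
    """Plain random Latin hypercube on the unit cube (not the optimal variant)."""
    # Each column is a random permutation of the strata 0..n-1,
    # jittered uniformly inside its stratum.
    strata = np.array([rng.permutation(n_samples) for _ in range(n_vars)]).T
    return (strata + rng.random((n_samples, n_vars))) / n_samples


def quadratic_features(x):
    """Design matrix columns: 1, x1, x2, x1^2, x2^2, x1*x2."""
    x1, x2 = x[:, 0], x[:, 1]
    return np.column_stack([np.ones(len(x)), x1, x2, x1**2, x2**2, x1 * x2])


def toy_cp(x):
    """Made-up stand-in for a CFD result; its maximum is at (0.6, 0.3)."""
    return 0.8 - (x[:, 0] - 0.6) ** 2 - 2.0 * (x[:, 1] - 0.3) ** 2


rng = np.random.default_rng(0)
design = latin_hypercube(20, 2, rng)

# Least-squares fit of the second-order response surface.
coef, *_ = np.linalg.lstsq(quadratic_features(design), toy_cp(design), rcond=None)

# Stationary point of the fitted quadratic: set its gradient to zero.
hessian = np.array([[2 * coef[3], coef[5]],
                    [coef[5], 2 * coef[4]]])
optimum = np.linalg.solve(hessian, -coef[1:3])
print(optimum)
```

Because the stand-in objective is itself quadratic, the fitted surface reproduces it almost exactly; with real CFD results the surface is only an approximation, and the predicted optimum would normally be confirmed with a further simulation.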
Chen, Chaoyang; Sun, Chongran; Wu, Yi-Rui
2018-03-21
A wild-type solventogenic strain, Clostridium diolis WST, isolated from mangrove sediments, was found to produce high amounts of butanol and acetone with negligible levels of ethanol and acids from glucose via a unique acetone-butanol (AB) fermentation pathway. Through genomic sequencing, the assembled draft genome of strain WST is calculated to be 5.85 Mb with a GC content of 29.69% and contains 5263 genes that contribute to the annotation of 5049 protein-coding sequences. Within these annotated genes, the butanol dehydrogenase gene (bdh) was determined to be present in a higher amount in strain WST than in other clostridial strains, which is positively related to its highly efficient production of butanol. Therefore, we present a draft genome sequence analysis of strain WST in this article that should facilitate further understanding of the solventogenic mechanism of this special microorganism.
BLAST Ring Image Generator (BRIG): simple prokaryote genome comparisons
2011-01-01
Background: Visualisation of genome comparisons is invaluable for helping to determine genotypic differences between closely related prokaryotes. New visualisation and abstraction methods are required in order to improve the validation, interpretation and communication of genome sequence information; especially with the increasing amount of data arising from next-generation sequencing projects. Visualising a prokaryote genome as a circular image has become a powerful means of displaying informative comparisons of one genome to a number of others. Several programs, imaging libraries and internet resources already exist for this purpose, however, most are either limited in the number of comparisons they can show, are unable to adequately utilise draft genome sequence data, or require a knowledge of command-line scripting for implementation. Currently, there is no freely available desktop application that enables users to rapidly visualise comparisons between hundreds of draft or complete genomes in a single image. Results: BLAST Ring Image Generator (BRIG) can generate images that show multiple prokaryote genome comparisons, without an arbitrary limit on the number of genomes compared. The output image shows similarity between a central reference sequence and other sequences as a set of concentric rings, where BLAST matches are coloured on a sliding scale indicating a defined percentage identity. Images can also include draft genome assembly information to show read coverage, assembly breakpoints and collapsed repeats. In addition, BRIG supports the mapping of unassembled sequencing reads against one or more central reference sequences. Many types of custom data and annotations can be shown using BRIG, making it a versatile approach for visualising a range of genomic comparison data. BRIG is readily accessible to any user, as it assumes no specialist computational knowledge and will perform all required file parsing and BLAST comparisons automatically.
Conclusions: There is a clear need for a user-friendly program that can produce genome comparisons for a large number of prokaryote genomes with an emphasis on rapidly utilising unfinished or unassembled genome data. Here we present BRIG, a cross-platform application that enables the interactive generation of comparative genomic images via a simple graphical-user interface. BRIG is freely available for all operating systems at http://sourceforge.net/projects/brig/. PMID:21824423
BLAST Ring Image Generator (BRIG): simple prokaryote genome comparisons.
Alikhan, Nabil-Fareed; Petty, Nicola K; Ben Zakour, Nouri L; Beatson, Scott A
2011-08-08
Visualisation of genome comparisons is invaluable for helping to determine genotypic differences between closely related prokaryotes. New visualisation and abstraction methods are required in order to improve the validation, interpretation and communication of genome sequence information; especially with the increasing amount of data arising from next-generation sequencing projects. Visualising a prokaryote genome as a circular image has become a powerful means of displaying informative comparisons of one genome to a number of others. Several programs, imaging libraries and internet resources already exist for this purpose, however, most are either limited in the number of comparisons they can show, are unable to adequately utilise draft genome sequence data, or require a knowledge of command-line scripting for implementation. Currently, there is no freely available desktop application that enables users to rapidly visualise comparisons between hundreds of draft or complete genomes in a single image. BLAST Ring Image Generator (BRIG) can generate images that show multiple prokaryote genome comparisons, without an arbitrary limit on the number of genomes compared. The output image shows similarity between a central reference sequence and other sequences as a set of concentric rings, where BLAST matches are coloured on a sliding scale indicating a defined percentage identity. Images can also include draft genome assembly information to show read coverage, assembly breakpoints and collapsed repeats. In addition, BRIG supports the mapping of unassembled sequencing reads against one or more central reference sequences. Many types of custom data and annotations can be shown using BRIG, making it a versatile approach for visualising a range of genomic comparison data. BRIG is readily accessible to any user, as it assumes no specialist computational knowledge and will perform all required file parsing and BLAST comparisons automatically. 
There is a clear need for a user-friendly program that can produce genome comparisons for a large number of prokaryote genomes with an emphasis on rapidly utilising unfinished or unassembled genome data. Here we present BRIG, a cross-platform application that enables the interactive generation of comparative genomic images via a simple graphical-user interface. BRIG is freely available for all operating systems at http://sourceforge.net/projects/brig/.
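As an illustration of the "sliding scale" colouring described in the BRIG abstract, the following minimal sketch maps a BLAST percentage identity to a ring colour by linear interpolation between white and a base colour. It is not BRIG's actual implementation: the function name `identity_to_rgb`, the default thresholds (70% and 100%) and the linear blend are all assumptions made for the example.

```python
def identity_to_rgb(identity, lower=70.0, upper=100.0, base=(0, 0, 255)):
    """Map a percent identity to an RGB colour (hypothetical scheme).

    Matches below `lower` are not drawn (None). Between `lower` and
    `upper` the colour fades linearly from white to the `base` colour.
    """
    if identity < lower:
        return None  # below the lower threshold: leave the ring blank
    # Fraction of the way from the lower to the upper threshold, capped at 1.
    frac = (min(identity, upper) - lower) / (upper - lower)
    # Blend each channel from white (255) towards the base colour.
    return tuple(round(255 - frac * (255 - channel)) for channel in base)


if __name__ == "__main__":
    for pid in (60.0, 70.0, 90.0, 100.0):
        print(pid, identity_to_rgb(pid))
```

With these assumed defaults, a match at the upper threshold is drawn in the full base colour and a match at the lower threshold is drawn in white.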
Phase IIIA Crew Interface Specifications Development for Inflight Maintenance and Stowage Functions
NASA Technical Reports Server (NTRS)
Carl, John G.
1973-01-01
This report presents the findings and data products developed during the Phase IIIA Crew Interface Specification Study for Inflight Maintenance and Stowage Functions, performed by General Electric for the NASA Johnson Space Center. The study provides a set of documentation that can be used as definitive guidelines to improve the present process of defining, controlling and managing flight crew interface requirements related to inflight maintenance (including assembly and servicing) and stowage functions. During the Phase IIIA contract period, the following data products were developed: 1) Projected NASA Crew Procedures/Flight Data File Development Process. 2) Inflight Maintenance Management Process Description. 3) Preliminary Draft, General Specification, Inflight Maintenance Management Requirements. 4) Inflight Maintenance Operational Process Description. 5) Preliminary Draft, General Specification, Inflight Maintenance Task and Support Requirements Analysis. 6) Suggested IFM Data Processing Reports for Logistics Management. These Inflight Maintenance data products were developed after review of Space Shuttle Program documentation, including the Level II Integrated Logistics Requirements and other DOD and NASA data relating to Payloads Accommodations and Satellite On-Orbit Servicing, and were developed to be in consonance with Space Shuttle Program technical and management requirements.
Single-Case Designs Technical Documentation
ERIC Educational Resources Information Center
Kratochwill, T. R.; Hitchcock, J.; Horner, R. H.; Levin, J. R.; Odom, S. L.; Rindskopf, D. M.; Shadish, W. R.
2010-01-01
In an effort to expand the pool of scientific evidence available for review, the What Works Clearinghouse (WWC) assembled a panel of national experts in single-case design (SCD) and analysis to draft SCD Standards. SCDs are adaptations of interrupted time-series designs and can provide a rigorous experimental evaluation of intervention effects.…
The genome of the fire ant Solenopsis invicta
USDA-ARS's Scientific Manuscript database
Ants have evolved very complex societies and are key ecosystem members. Some of them are also major pests, as exemplified by the fire ant Solenopsis invicta. We present here the draft genome of S. invicta, assembled from 454 and Illumina reads obtained from a focal haploid male and his brothers. In ...
Das, Abhishek; Panda, Arijit; Singh, Deeksha; Chandrababunaidu, Mathu Malar; Mishra, Gyan Prakash; Bhan, Sushma
2015-01-01
Scytonema tolypothrichoides VB-61278, a terrestrial cyanobacterium, can be exploited to produce commercially important products. Here, we report for the first time a 10-Mb draft genome assembly of S. tolypothrichoides VB-61278, with 214 scaffolds and 7,148 putative protein-coding genes. PMID:25838486
Eastman, Alexander W.; Yuan, Ze-Chun
2015-01-01
Advances in sequencing technology have drastically increased the depth and feasibility of bacterial genome sequencing. However, little information is available that details the specific techniques and procedures employed during genome sequencing despite the large numbers of published genomes. Shotgun approaches employed by second-generation sequencing platforms have necessitated the development of robust bioinformatics tools for in silico assembly, and complete assembly is limited by the presence of repetitive DNA sequences and multi-copy operons. Typically, re-sequencing with multiple platforms and laborious, targeted Sanger sequencing are employed to finish a draft bacterial genome. Here we describe a novel strategy based on the identification and targeted sequencing of repetitive rDNA operons to expedite bacterial genome assembly and finishing. Our strategy was validated by finishing the genome of Paenibacillus polymyxa strain CR1, a bacterium with potential in sustainable agriculture and bio-based processes. An analysis of the 38 contigs contained in the P. polymyxa strain CR1 draft genome revealed 12 repetitive rDNA operons with varied intragenic and flanking regions of variable length, unanimously located at contig boundaries and within contig gaps. These highly similar but not identical rDNA operons were experimentally verified and sequenced simultaneously with multiple, specially designed primer sets. This approach also identified and corrected significant sequence rearrangement generated during the initial in silico assembly of sequencing reads. Our approach reduces the required effort associated with blind primer walking for contig assembly, increasing both the speed and feasibility of genome finishing. Our study further reinforces the notion that repetitive DNA elements are major limiting factors for genome finishing. Moreover, we provide a step-by-step workflow for genome finishing, which may guide future bacterial genome finishing projects.
PMID:25653642
The Draft Assembly of the Radically Organized Stylonychia lemnae Macronuclear Genome
Aeschlimann, Samuel H.; Jönsson, Franziska; Postberg, Jan; Stover, Nicholas A.; Petera, Robert L.; Lipps, Hans-Joachim; Nowacki, Mariusz; Swart, Estienne C.
2014-01-01
Stylonychia lemnae is a classical model single-celled eukaryote, and a quintessential ciliate typified by dimorphic nuclei: A small, germline micronucleus and a massive, vegetative macronucleus. The genome within Stylonychia’s macronucleus has a very unusual architecture, comprising variably and highly amplified “nanochromosomes,” each usually encoding a single gene with a minimal amount of surrounding noncoding DNA. As only a tiny fraction of the Stylonychia genes has been sequenced, and to promote research using this organism, we sequenced its macronuclear genome. We report the analysis of the 50.2-Mb draft S. lemnae macronuclear genome assembly, containing in excess of 16,000 complete nanochromosomes, assembled as fewer than 20,000 contigs. We found considerable conservation of fundamental genomic properties between S. lemnae and its close relative, Oxytricha trifallax, including nanochromosomal gene synteny, alternative fragmentation, and copy number. Protein domain searches in Stylonychia revealed two new telomere-binding protein homologs and the presence of linker histones. Among the diverse histone variants of S. lemnae and O. trifallax, we found divergent, coexpressed variants corresponding to four of the five core nucleosomal proteins (H1.2, H2A.6, H2B.4, and H3.7) suggesting that these ciliates may possess specialized nucleosomes involved in genome processing during nuclear differentiation. The assembly of the S. lemnae macronuclear genome demonstrates that largely complete, well-assembled highly fragmented genomes of similar size and complexity may be produced from one library and lane of Illumina HiSeq 2000 shotgun sequencing. The provision of the S. lemnae macronuclear genome sets the stage for future detailed experimental studies of chromatin-mediated, RNA-guided developmental genome rearrangements. PMID:24951568
Brown, Nathan M; Mueller, Ryan S; Shepardson, Jonathan W; Landry, Zachary C; Morré, Jeffrey T; Maier, Claudia S; Hardy, F Joan; Dreher, Theo W
2016-06-13
Very few closed genomes of the cyanobacteria that commonly produce toxic blooms in lakes and reservoirs are available, limiting our understanding of the properties of these organisms. A new anatoxin-a-producing member of the Nostocaceae, Anabaena sp. WA102, was isolated from a freshwater lake in Washington State, USA, in 2013 and maintained in non-axenic culture. The Anabaena sp. WA102 5.7 Mbp genome assembly has been closed with long-read, single-molecule sequencing and separately a draft genome assembly has been produced with short-read sequencing technology. The closed and draft genome assemblies are compared, showing a correlation between long repeats in the genome and the many gaps in the short-read assembly. Anabaena sp. WA102 encodes anatoxin-a biosynthetic genes, as does its close relative Anabaena sp. AL93 (also introduced in this study). These strains are distinguished by differences in the genes for light-harvesting phycobilins, with Anabaena sp. AL93 possessing a phycoerythrocyanin operon. Biologically relevant structural variants in the Anabaena sp. WA102 genome were detected only by long-read sequencing: a tandem triplication of the anaBCD promoter region in the anatoxin-a synthase gene cluster (not triplicated in Anabaena sp. AL93) and a 5-kbp deletion variant present in two-thirds of the population. The genome has a large number of mobile elements (160). Strikingly, there was no synteny with the genome of its nearest fully assembled relative, Anabaena sp. 90. Structural and functional genome analyses indicate that Anabaena sp. WA102 has a flexible genome. Genome closure, which can be readily achieved with long-read sequencing, reveals large scale (e.g., gene order) and local structural features that should be considered in understanding genome evolution and function.
Draft genomes of two blister beetles Hycleus cichorii and Hycleus phaleratus
Wu, Yuan-Ming; Li, Jiang
2018-01-01
Abstract Background Commonly known as blister beetles or Spanish fly, there are more than 1500 species in the Meloidae family (Hexapoda: Coleoptera: Tenebrionoidea) that produce the potent defensive blistering agent cantharidin. Cantharidin and its derivatives have been used to treat cancers such as liver, stomach, lung, and esophageal cancers. Hycleus cichorii and Hycleus phaleratus are the most commercially important blister beetles in China due to their ability to biosynthesize this potent vesicant. However, there is a lack of genome reference, which has hindered development of studies on the biosynthesis of cantharidin and a better understanding of its biology and pharmacology. Results We report 2 draft genomes and quantified gene sets for the blister beetles H. cichorii and H. phaleratus, 2 complex genomes with >72% repeats and approximately 1% heterozygosity, using Illumina sequencing data. An integrated assembly pipeline was performed for assembly, and most of the coding regions were obtained. Benchmarking universal single-copy orthologs (BUSCO) assessment showed that our assembly obtained more than 98% of the Endopterygota universal single-copy orthologs. Comparison analysis showed that the completeness of coding genes in our assembly was comparable to other beetle genomes such as Dendroctonus ponderosae and Agrilus planipennis. Gene annotation yielded 13 813 and 13 725 protein-coding genes in H. cichorii and H. phaleratus, of which approximately 89% were functionally annotated. BUSCO assessment showed that approximately 86% and 84% of the Endopterygota universal single-copy orthologs were annotated completely in these 2 gene sets, whose completeness is comparable to that of D. ponderosae and A. planipennis. Conclusions Assembly of both blister beetle genomes provides a valuable resource for future biosynthesis of cantharidin and comparative genomic studies of blister beetles and other beetles. PMID:29444297
Draft genomes of two blister beetles Hycleus cichorii and Hycleus phaleratus.
Wu, Yuan-Ming; Li, Jiang; Chen, Xiang-Sheng
2018-03-01
Commonly known as blister beetles or Spanish fly, there are more than 1500 species in the Meloidae family (Hexapoda: Coleoptera: Tenebrionoidea) that produce the potent defensive blistering agent cantharidin. Cantharidin and its derivatives have been used to treat cancers such as liver, stomach, lung, and esophageal cancers. Hycleus cichorii and Hycleus phaleratus are the most commercially important blister beetles in China due to their ability to biosynthesize this potent vesicant. However, there is a lack of genome reference, which has hindered development of studies on the biosynthesis of cantharidin and a better understanding of its biology and pharmacology. We report 2 draft genomes and quantified gene sets for the blister beetles H. cichorii and H. phaleratus, 2 complex genomes with >72% repeats and approximately 1% heterozygosity, using Illumina sequencing data. An integrated assembly pipeline was performed for assembly, and most of the coding regions were obtained. Benchmarking universal single-copy orthologs (BUSCO) assessment showed that our assembly obtained more than 98% of the Endopterygota universal single-copy orthologs. Comparison analysis showed that the completeness of coding genes in our assembly was comparable to other beetle genomes such as Dendroctonus ponderosae and Agrilus planipennis. Gene annotation yielded 13 813 and 13 725 protein-coding genes in H. cichorii and H. phaleratus, of which approximately 89% were functionally annotated. BUSCO assessment showed that approximately 86% and 84% of the Endopterygota universal single-copy orthologs were annotated completely in these 2 gene sets, whose completeness is comparable to that of D. ponderosae and A. planipennis. Assembly of both blister beetle genomes provides a valuable resource for future biosynthesis of cantharidin and comparative genomic studies of blister beetles and other beetles.
Humble, E; Martinez-Barrio, A; Forcada, J; Trathan, P N; Thorne, M A S; Hoffmann, M; Wolf, J B W; Hoffman, J I
2016-07-01
Custom genotyping arrays provide a flexible and accurate means of genotyping single nucleotide polymorphisms (SNPs) in a large number of individuals of essentially any organism. However, validation rates, defined as the proportion of putative SNPs that are verified to be polymorphic in a population, are often very low. A number of potential causes of assay failure have been identified, but none have been explored systematically. In particular, as SNPs are often developed from transcriptomes, parameters relating to the genomic context are rarely taken into account. Here, we assembled a draft Antarctic fur seal (Arctocephalus gazella) genome (assembly size: 2.41 Gb; scaffold/contig N50 : 3.1 Mb/27.5 kb). We then used this resource to map the probe sequences of 144 putative SNPs genotyped in 480 individuals. The number of probe-to-genome mappings and alignment length together explained almost a third of the variation in validation success, indicating that sequence uniqueness and proximity to intron-exon boundaries play an important role. The same pattern was found after mapping the probe sequences to the Walrus and Weddell seal genomes, suggesting that the genomes of species divergent by as much as 23 million years can hold information relevant to SNP validation outcomes. Additionally, reanalysis of genotyping data from seven previous studies found the same two variables to be significantly associated with SNP validation success across a variety of taxa. Finally, our study reveals considerable scope for validation rates to be improved, either by simply filtering for SNPs whose flanking sequences align uniquely and completely to a reference genome, or through predictive modelling. © 2015 John Wiley & Sons Ltd.
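Several records in this listing report scaffold or contig N50 values (for example, 3.1 Mb/27.5 kb for the fur seal assembly above). As a reminder of what that statistic measures, here is a minimal sketch of the usual definition: the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the assembly. The function name `n50` and the toy lengths are assumptions for illustration and are not taken from any of the cited studies.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths."""
    if not lengths:
        raise ValueError("at least one sequence length is required")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down, accumulating covered bases.
    for length in sorted(lengths, reverse=True):
        running += length
        # Stop once the accumulated length reaches half the assembly.
        if 2 * running >= total:
            return length


if __name__ == "__main__":
    toy_contigs = [2, 2, 2, 3, 3, 4, 8, 8]  # made-up lengths, total 32
    print(n50(toy_contigs))
```

For the toy lengths above, the two longest sequences (8 + 8 = 16) already cover half of the 32 total bases, so the N50 is 8.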
Sellera, Fábio P; Fernandes, Miriam R; Moura, Quézia; Souza, Tiago A; Nascimento, Cristiane L; Cerdeira, Louise; Lincopan, Nilton
2018-03-01
The incidence of multidrug-resistant bacteria in wildlife animals has been investigated to improve our knowledge of the spread of clinically relevant antimicrobial resistance genes. The aim of this study was to report the first draft genome sequence of an extensively drug-resistant (XDR) Pseudomonas aeruginosa ST644 isolate recovered from a Magellanic penguin with a footpad infection (bumblefoot) undergoing rehabilitation. The genome was sequenced on an Illumina NextSeq® platform using 150-bp paired-end reads. De novo genome assembly was performed using Velvet v.1.2.10, and the whole genome sequence was evaluated using bioinformatics approaches from the Center of Genomic Epidemiology, whereas an in-house method (mapping of raw whole genome sequence reads) was used to identify chromosomal point mutations. The genome size was calculated at 6,436,450 bp, with 6357 protein-coding sequences and the presence of genes conferring resistance to aminoglycosides, β-lactams, phenicols, sulphonamides, tetracyclines, quinolones and fosfomycin; in addition, mutations in the genes gyrA (Thr83Ile), parC (Ser87Leu), phoQ (Arg61His) and pmrB (Tyr345His), conferring resistance to quinolones and polymyxins, respectively, were confirmed. This draft genome sequence can provide useful information for comparative genomic analysis regarding the dissemination of clinically significant antibiotic resistance genes and XDR bacterial species at the human-animal interface. Copyright © 2017 International Society for Chemotherapy of Infection and Cancer. Published by Elsevier Ltd. All rights reserved.
Wang, Zhiping; Liu, Lili; Guo, Feng; Zhang, Tong
2015-10-01
Biotreatment processes fed with coking wastewater often encounter insufficient removal of pollutants, such as ammonia, phenols, and polycyclic aromatic hydrocarbons (PAHs), especially for cyanides. However, only a limited number of bacterial species in pure cultures have been confirmed to metabolize cyanides, which hinders the improvement of these processes. In this study, a microbial community of activated sludge enriched in a coking wastewater treatment plant was analyzed using 454 pyrosequencing and Illumina sequencing to characterize the potential cyanide-degrading bacteria. According to the classification of these pyro-tags, targeting V3/V4 regions of 16S rRNA gene, half of them were assigned to the family Xanthomonadaceae, implying that Xanthomonadaceae bacteria are well-adapted to coking wastewater. A nearly complete draft genome of the dominant bacterium was reconstructed from metagenome of this community to explore cyanide metabolism based on analysis of the genome. The assembled 16S rRNA gene from this draft genome showed that this bacterium was a novel species of Thermomonas within Xanthomonadaceae, which was further verified by comparative genomics. The annotation using KEGG and Pfam identified genes related to cyanide metabolism, including genes responsible for the iron-harvesting system, cyanide-insensitive terminal oxidase, cyanide hydrolase/nitrilase, and thiosulfate:cyanide transferase. Phylogenetic analysis showed that these genes had homologs in previously identified genomes of bacteria within Xanthomonadaceae and even presented similar gene cassettes, thus implying an inherent cyanide-decomposing potential. The findings of this study expand our knowledge about the bacterial degradation of cyanide compounds and will be helpful in the remediation of cyanides contamination.
Decoding the massive genome of loblolly pine using haploid DNA and novel assembly strategies
2014-01-01
Background The size and complexity of conifer genomes has, until now, prevented full genome sequencing and assembly. The large research community and economic importance of loblolly pine, Pinus taeda L., made it an early candidate for reference sequence determination. Results We develop a novel strategy to sequence the genome of loblolly pine that combines unique aspects of pine reproductive biology and genome assembly methodology. We use a whole genome shotgun approach relying primarily on next generation sequence generated from a single haploid seed megagametophyte from a loblolly pine tree, 20-1010, that has been used in industrial forest tree breeding. The resulting sequence and assembly was used to generate a draft genome spanning 23.2 Gbp and containing 20.1 Gbp with an N50 scaffold size of 66.9 kbp, making it a significant improvement over available conifer genomes. The long scaffold lengths allow the annotation of 50,172 gene models with intron lengths averaging over 2.7 kbp and sometimes exceeding 100 kbp in length. Analysis of orthologous gene sets identifies gene families that may be unique to conifers. We further characterize and expand the existing repeat library based on the de novo analysis of the repetitive content, estimated to encompass 82% of the genome. Conclusions In addition to its value as a resource for researchers and breeders, the loblolly pine genome sequence and assembly reported here demonstrates a novel approach to sequencing the large and complex genomes of this important group of plants that can now be widely applied. PMID:24647006
Integrated genome sequence and linkage map of physic nut (Jatropha curcas L.), a biodiesel plant.
Wu, Pingzhi; Zhou, Changpin; Cheng, Shifeng; Wu, Zhenying; Lu, Wenjia; Han, Jinli; Chen, Yanbo; Chen, Yan; Ni, Peixiang; Wang, Ying; Xu, Xun; Huang, Ying; Song, Chi; Wang, Zhiwen; Shi, Nan; Zhang, Xudong; Fang, Xiaohua; Yang, Qing; Jiang, Huawu; Chen, Yaping; Li, Meiru; Wang, Ying; Chen, Fan; Wang, Jun; Wu, Guojiang
2015-03-01
The family Euphorbiaceae includes some of the most efficient biomass accumulators. Whole genome sequencing and the development of genetic maps of these species are important components in molecular breeding and genetic improvement. Here we report the draft genome of physic nut (Jatropha curcas L.), a biodiesel plant. The assembled genome has a total length of 320.5 Mbp and contains 27,172 putative protein-coding genes. We established a linkage map containing 1208 markers and anchored the genome assembly (81.7%) to this map to produce 11 pseudochromosomes. After gene family clustering, 15,268 families were identified, of which 13,887 existed in the castor bean genome. Analysis of the genome highlighted specific expansion and contraction of a number of gene families during the evolution of this species, including the ribosome-inactivating proteins and oil biosynthesis pathway enzymes. The genomic sequence and linkage map provide a valuable resource not only for fundamental and applied research on physic nut but also for evolutionary and comparative genomics analysis, particularly in the Euphorbiaceae. © 2015 The Authors The Plant Journal © 2015 John Wiley & Sons Ltd.
Li, Chien-Feng; Tang, Hui-Ling; Chiou, Chien-Shun; Tung, Kwong-Chung; Lu, Min-Chi; Lai, Yi-Chyi
2018-03-01
Klebsiella spp. are regarded as major pathogens causing infections in humans and various animals. Here we report the draft genome sequence of a CTX-M-type β-lactamase-producing Klebsiella quasipneumoniae subsp. similipneumoniae strain CHKP0062 isolated from a Yellow-margined Box turtle. An Illumina-Solexa platform was used to sequence the genome of CHKP0062. Qualified reads were assembled de novo using Velvet. The draft genome was annotated by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). The resistome and virulome of the strain were investigated. A total of 5423 protein-coding sequences, 87 tRNAs, 24 rRNAs and 12 ncRNAs were identified in the 5 699 275-bp genome. CHKP0062 was assigned to sequence type ST2131 with the K-loci type as KL67. No virulence-associated genes were identified. However, numerous antimicrobial resistance genes were present in this strain. Plasmid contigs were assembled and revealed homology to the multidrug resistance plasmids pC15-K, pCTX-M3 and pKF3-94, with the carriage of the class A β-lactamase genes blaTEM-1b and blaCTX-M-3. The genome sequence reported in this study will be useful for comparative genomic analysis regarding the dissemination of clinically important antibiotic resistance genes among Klebsiella spp. isolated from humans and animals. Copyright © 2017 International Society for Chemotherapy of Infection and Cancer. Published by Elsevier Ltd. All rights reserved.
Das, Abhishek; Panda, Arijit; Singh, Deeksha; Chandrababunaidu, Mathu Malar; Mishra, Gyan Prakash; Bhan, Sushma; Adhikary, Siba Prasad; Tripathy, Sucheta
2015-04-02
Scytonema tolypothrichoides VB-61278, a terrestrial cyanobacterium, can be exploited to produce commercially important products. Here, we report for the first time a 10-Mb draft genome assembly of S. tolypothrichoides VB-61278, with 214 scaffolds and 7,148 putative protein-coding genes. Copyright © 2015 Das et al.
Federal Register 2010, 2011, 2012, 2013, 2014
2013-11-05
... the proposed Project may include access roads, wind turbine assembly lay down areas, overhead and... to an incidental take permit (ITP) application that Champlin Hawaii Wind Holdings, LLC (Champlin...) near Kahuku, Hawaii, for production of wind-generated electrical energy on the island of Oahu. In...
Payment to Creators for Library Loans (Public Lending Right).
ERIC Educational Resources Information Center
Faulds, M.
These recommendations and report on Public Lending Right (PLR) drafted by the Parliamentary Assembly of the Council of Europe were designed to encourage the recognition of the principle of PLR and the setting up of compatible PLR schemes throughout Europe. It discusses why an agreement for PLR is necessary, and describes several methods of…
USDA-ARS's Scientific Manuscript database
Premise of the study: Microsatellite markers were developed for Plasmopara obducens, the causal agent of the newly emergent downy mildew disease of Impatiens walleriana. Methods and Results: A 151.2 Mb draft genome assembly was generated from P. obducens using Illumina technology and mined to identi...
76 FR 41527 - Draft Regulatory Guide: Re-Issuance and Availability
Federal Register 2010, 2011, 2012, 2013, 2014
2011-07-14
... and methods that are acceptable to the NRC staff for implementing specific parts of the NRC's..., and fabrication of mixed-oxide fuel or fuel assemblies. DG-3037 provides guidance on how to meet the... publicly disclosed. You may submit comments by any one of the following methods: Federal Rulemaking Web...
Whole-Genome Sequence of the Soil Bacterium Micrococcus sp. KBS0714.
Kuo, V; Shoemaker, W R; Muscarella, M E; Lennon, J T
2017-08-10
We present here a draft genome assembly of Micrococcus sp. KBS0714, which was isolated from agricultural soil. The genome provides insight into the strategies that Micrococcus spp. use to contend with environmental stressors such as desiccation and starvation in environmental and host-associated ecosystems. Copyright © 2017 Kuo et al.
Seabury, Christopher M.; Dowd, Scot E.; Seabury, Paul M.; Raudsepp, Terje; Brightsmith, Donald J.; Liboriussen, Poul; Halley, Yvette; Fisher, Colleen A.; Owens, Elaine; Viswanathan, Ganesh; Tizard, Ian R.
2013-01-01
Data deposition to NCBI Genomes This Whole Genome Shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession AMXX00000000 (SMACv1.0, unscaffolded genome assembly). The version described in this paper is the first version (AMXX01000000). The scaffolded assembly (SMACv1.1) has been deposited at DDBJ/EMBL/GenBank under the accession AOUJ00000000, and is also the first version (AOUJ01000000). Strong biological interest in traits such as the acquisition and utilization of speech, cognitive abilities, and longevity catalyzed the utilization of two next-generation sequencing platforms to provide the first-draft de novo genome assembly for the large, new world parrot Ara macao (Scarlet Macaw). Despite the challenges associated with genome assembly for an outbred avian species, including 951,507 high-quality putative single nucleotide polymorphisms, the final genome assembly (>1.035 Gb) includes more than 997 Mb of unambiguous sequence data (excluding N’s). Cytogenetic analyses including ZooFISH revealed complex rearrangements associated with two scarlet macaw macrochromosomes (AMA6, AMA7), which supports the hypothesis that translocations, fusions, and intragenomic rearrangements are key factors associated with karyotype evolution among parrots. In silico annotation of the scarlet macaw genome provided robust evidence for 14,405 nuclear gene annotation models, their predicted transcripts and proteins, and a complete mitochondrial genome. Comparative analyses involving the scarlet macaw, chicken, and zebra finch genomes revealed high levels of nucleotide-based conservation as well as evidence for overall genome stability among the three highly divergent species. Application of a new whole-genome analysis of divergence involving all three species yielded prioritized candidate genes and noncoding regions for parrot traits of interest (i.e., speech, intelligence, longevity) which were independently supported by the results of previous human GWAS studies. 
We also observed evidence for genes and noncoding loci that displayed extreme conservation across the three avian lineages, thereby reflecting their likely biological and developmental importance among birds. PMID:23667475
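The scarlet macaw abstract above distinguishes total assembly size (>1.035 Gb) from unambiguous sequence (more than 997 Mb, excluding N's). The sketch below shows one simple way such a tally could be made from FASTA-formatted text. It is an assumption-laden illustration, not the authors' pipeline: the function name `count_unambiguous` and the toy sequences are hypothetical, and only the character N (in either case) is treated as ambiguous.

```python
def count_unambiguous(fasta_text):
    """Return (total_bases, non_N_bases) for FASTA-formatted text."""
    total = 0
    unambiguous = 0
    for line in fasta_text.splitlines():
        line = line.strip()
        # Skip blank lines and FASTA header lines.
        if not line or line.startswith(">"):
            continue
        total += len(line)
        # Count every base that is not an N placeholder (case-insensitive).
        unambiguous += len(line) - line.upper().count("N")
    return total, unambiguous


if __name__ == "__main__":
    toy_fasta = ">scaffold1\nACGTNNNN\nacgn\n>scaffold2\nNNAC\n"  # made-up data
    print(count_unambiguous(toy_fasta))
```

Reporting both numbers side by side makes it clear how much of a scaffolded assembly is gap filler rather than called bases.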
Draft EU regulation on paediatric medicines: some improvements but still far from perfect.
2006-02-01
(1) In 2004, the European Commission proposed a draft European Regulation on paediatric medicines. This draft was more closely oriented towards defending drug companies' interests than with meeting children's medical needs. (2) Despite pressure from drug companies and their allies, several major improvements were made to the draft at its first reading in the European Parliament, thanks especially to the efforts of the Medicines in Europe Forum. (3) In particular, European deputies pushed for a better definition of children's needs and paediatric research priorities, greater transparency at various important stages of the market authorization procedure, and strengthened pharmacovigilance. (4) Yet the incentives and rewards offered to companies fail to take into account the notion of true therapeutic advantages and R&D expenditure. (5) Unfortunately the Commission refused some important amendments and published a new draft proposal, which was accepted by the Council of Health Ministers at the end of 2005. The new draft will come before the Parliament for a second reading in 2006.
Capture envelopes of rectangular hoods in cross drafts.
Huang, R F; Sir, S Y; Chen, Y K; Yeh, W Y; Chen, C W; Chen, C C
2001-01-01
The suction fields of rectangular hoods with aspect ratios from 0.1 to 10, subject to the influence of cross drafts, were studied experimentally in an apparatus consisting of a hood model/wind tunnel assembly. The velocity field on the symmetry plane was measured with a two-component laser Doppler anemometer. Under the influence of a cross draft, the suction field presents a characteristic capture envelope, which is described by a dividing streamline. The characteristics of the capture envelope were found to be determined by the cross-draft to hood-suction velocity ratio R and the hood-opening aspect ratio AR. The flow characteristics of the hoods with aspect ratios less than unity were dramatically different from those with aspect ratios greater than one. When the hood openings had equal areas, the hydraulic-diameter-normalized characteristic length scales of the capture zone of the square hood were the same as those of the circular hood. When the diameter of a circular hood was equal to the width of a square hood, the physical dimensions of the capture zones created by these two hoods coincided with each other.
Huang, Jing; Qiao, Zi Xu; Tang, Jing Wei; Wang, Gejiao
2015-01-01
Pontibacillus yanchengensis Y32(T) is an aerobic, motile, Gram-positive, endospore-forming, and moderately halophilic bacterium isolated from a salt field. In this study, we describe the features of P. yanchengensis strain Y32(T) together with a comparison with four other Pontibacillus genomes. The 4,281,464 bp high-quality-draft genome of strain Y32(T) is arranged into 153 contigs containing 3,965 protein-coding genes and 77 RNA-encoding genes. The genome of strain Y32(T) possesses many genes related to its halophilic character, flagellar assembly and chemotaxis, which support its survival in a salt-rich environment.
Draft genome sequence of ramie, Boehmeria nivea (L.) Gaudich.
Luan, Ming-Bao; Jian, Jian-Bo; Chen, Ping; Chen, Jun-Hui; Chen, Jian-Hua; Gao, Qiang; Gao, Gang; Zhou, Ju-Hong; Chen, Kun-Mei; Guang, Xuan-Min; Chen, Ji-Kang; Zhang, Qian-Qian; Wang, Xiao-Fei; Fang, Long; Sun, Zhi-Min; Bai, Ming-Zhou; Fang, Xiao-Dong; Zhao, Shan-Cen; Xiong, He-Ping; Yu, Chun-Ming; Zhu, Ai-Guo
2018-05-01
Ramie, Boehmeria nivea (L.) Gaudich, family Urticaceae, is a plant native to eastern Asia, and one of the world's oldest fibre crops. It is also used as animal feed and for the phytoremediation of heavy metal-contaminated farmlands. Thus, the genome sequence of ramie was determined to explore the molecular basis of its fibre quality, protein content and phytoremediation. To further understand the ramie genome, different paired-end and mate-pair libraries were combined to generate 134.31 Gb of raw DNA sequences using the Illumina whole-genome shotgun sequencing approach. The highly heterozygous B. nivea genome was assembled using the Platanus Genome Assembler, which is an effective tool for the assembly of highly heterozygous genome sequences. The final length of the draft genome of this species was approximately 341.9 Mb (contig N50 = 22.62 kb, scaffold N50 = 1,126.36 kb). Based on ramie genome annotations, 30,237 protein-coding genes were predicted, and the repetitive element content was 46.3%. The completeness of the final assembly was evaluated by benchmarking universal single-copy orthologous genes (BUSCO); 90.5% of the 1,440 expected embryophytic genes were identified as complete, and 4.9% were identified as fragmented. Phylogenetic analysis based on single-copy gene families and one-to-one orthologous genes placed ramie with mulberry and cannabis, within the clade of urticalean rosids. Genome information of ramie will be a valuable resource for the conservation of endangered Boehmeria species and for future studies on the biogeography and characteristic evolution of members of Urticaceae. © 2018 John Wiley & Sons Ltd.
NASA Astrophysics Data System (ADS)
Wilhelm, S.; Balarac, G.; Métais, O.; Ségoufin, C.
2016-11-01
Flow prediction in a bulb turbine draft tube is conducted for two operating points using Unsteady RANS (URANS) simulations and Large Eddy Simulations (LES). The inlet boundary condition of the draft tube calculation is a rotating two-dimensional velocity profile exported from a RANS guide vane-runner calculation. Numerical results are compared with experimental data in order to validate the predicted flow field and head losses. Velocity profile prediction in the center of the draft tube is improved with LES compared to URANS results. Moreover, more complex flow structures are obtained with LES. A local analysis of the predicted flow field using the energy balance in the draft tube is then introduced in order to detect the hydrodynamic instabilities responsible for head losses in the draft tube. In particular, the production of turbulent kinetic energy next to the draft tube wall and in the central vortex structure is found to be responsible for a large part of the mean kinetic energy dissipation in the draft tube and thus for head losses. This analysis is used in order to understand the differences in head losses for different operating points. The numerical methodology could then be improved thanks to an in-depth understanding of the local flow topology.
76 FR 36542 - Draft Guidance for Industry and Food and Drug Administration Staff: The Content of...
Federal Register 2010, 2011, 2012, 2013, 2014
2011-06-22
...The Food and Drug Administration (FDA) is announcing the availability of the draft guidance document entitled ``Draft Guidance for Industry and Food and Drug Administration Staff: The Content of Investigational Device Exemption (IDE) and Premarket Approval (PMA) Applications for Low Glucose Suspend (LGS) Device Systems.'' This draft guidance document provides industry and Agency staff with recommendations that are intended to improve the safety and effectiveness of LGS Device Systems. This draft guidance is not final nor is it in effect at this time.
1993-11-01
navigation improvements for Neah Bay, Clallam Bay, and Port Angeles was begun under the Puget Sound and Adjacent Waters, General Investigations authority. The...Reconnaissance Report and Plan of Study. Puget Sound and Adjacent Waters. Washington. Northern Olympic Peninsula Shallow-Draft Navigation Study, August 1983...operators from having to make long trips from the fishing grounds near Neah Bay to ports farther east in the Strait of Juan de Fuca or in Puget Sound.
Mars Sample Handling Protocol Workshop Series: Workshop 4
NASA Technical Reports Server (NTRS)
Race, Margaret S. (Editor); DeVincenzi, Donald L. (Editor); Rummel, John D. (Editor); Acevedo, Sara E. (Editor)
2001-01-01
In preparation for missions to Mars that will involve the return of samples to Earth, it will be necessary to prepare for the receiving, handling, testing, distributing, and archiving of martian materials here on Earth. Previous groups and committees have studied selected aspects of sample return activities, but specific detailed protocols for the handling and testing of returned samples must still be developed. To further refine the requirements for sample hazard testing and to develop the criteria for subsequent release of sample materials from quarantine, the NASA Planetary Protection Officer convened a series of workshops in 2000-2001. The overall objective of the Workshop Series was to produce a Draft Protocol by which returned martian sample materials can be assessed for biological hazards and examined for evidence of life (extant or extinct) while safeguarding the purity of the samples from possible terrestrial contamination. This report also provides a record of the proceedings of Workshop 4, the final Workshop of the Series, which was held in Arlington, Virginia, June 5-7, 2001. During Workshop 4, the sub-groups were provided with a draft of the protocol compiled in May 2001 from the work done at prior Workshops in the Series. Then eight sub-groups were formed to discuss the following assigned topics: (1) Review and Assess the Draft Protocol for Physical/Chemical Testing; (2) Review and Assess the Draft Protocol for Life Detection Testing; (3) Review and Assess the Draft Protocol for Biohazard Testing; (4) Environmental and Health/Monitoring and Safety Issues; (5) Requirements of the Draft Protocol for Facilities and Equipment; (6) Contingency Planning for Different Outcomes of the Draft Protocol; (7) Personnel Management Considerations in Implementation of the Draft Protocol; (8) Draft Protocol Implementation Process and Update Concepts. This report provides the first complete presentation of the Draft Protocol for Mars Sample Handling to meet planetary protection needs.
This Draft Protocol, which was compiled from deliberations and recommendations from earlier Workshops in the Series, represents a consensus that emerged from the discussions of all the sub-groups assembled over the course of the five Workshops of the Series. These discussions converged on a conceptual approach to sample handling, as well as on specific analytical requirements. Discussions also identified important issues requiring attention, as well as research and development needed for protocol implementation.
Draft genome sequence of the fish pathogen Flavobacterium columnare strain CSF-298-10
USDA-ARS's Scientific Manuscript database
We announce the genome assembly of Flavobacterium columnare strain CSF-298-10, a strain isolated from an outbreak of Columnaris disease at a commercial trout farm in Snake River Valley, Idaho, USA. The complete genome consists of 13 contigs totaling 3,284,579 bp, average G+C content of 31.5% and 2933...
Genomics - the new rock and roll?
Dunham, I
2000-10-01
The end of the beginning of the Human Genome Project came on 26 June, when the working draft, or first assembly, was announced. Here, Ian Dunham, who led the group at the Sanger Centre that produced the first complete sequence of a human chromosome, reflects on how it felt to be with the genome project from the beginning.
How Learners Use Automated Computer-Based Feedback to Produce Revised Drafts of Essays
ERIC Educational Resources Information Center
Laing, Jonny; El Ebyary, Khaled; Windeatt, Scott
2012-01-01
Our previous results suggest that the use of "Criterion", an automatic writing evaluation (AWE) system, is particularly successful in encouraging learners to produce amended drafts of their essays, and that those amended drafts generally represent an improvement on the original submission. Our analysis of the submitted essays and the…
Federal Register 2010, 2011, 2012, 2013, 2014
2010-11-12
...] Draft Guidance for Industry and Food and Drug Administration Staff on Dear Health Care Provider Letters... a draft guidance for industry and FDA staff entitled ``Dear Health Care Provider Letters: Improving Communication of Important Safety Information.'' Dear Health Care Provider (DHCP) Letters are correspondence...
Federal Register 2010, 2011, 2012, 2013, 2014
2012-10-23
... DEIS for Proposed Runway Safety Area Improvements at the Kodiak Airport, Kodiak, AK AGENCY: Federal... advise the public that a Draft Environmental Impact Statement (DEIS) for proposed Runway Safety Area... the DEIS can be submitted to the individual listed in the section, FOR FURTHER INFORMATION CONTACT. A...
The sequence and de novo assembly of the giant panda genome
Li, Ruiqiang; Fan, Wei; Tian, Geng; Zhu, Hongmei; He, Lin; Cai, Jing; Huang, Quanfei; Cai, Qingle; Li, Bo; Bai, Yinqi; Zhang, Zhihe; Zhang, Yaping; Wang, Wen; Li, Jun; Wei, Fuwen; Li, Heng; Jian, Min; Li, Jianwen; Zhang, Zhaolei; Nielsen, Rasmus; Li, Dawei; Gu, Wanjun; Yang, Zhentao; Xuan, Zhaoling; Ryder, Oliver A.; Leung, Frederick Chi-Ching; Zhou, Yan; Cao, Jianjun; Sun, Xiao; Fu, Yonggui; Fang, Xiaodong; Guo, Xiaosen; Wang, Bo; Hou, Rong; Shen, Fujun; Mu, Bo; Ni, Peixiang; Lin, Runmao; Qian, Wubin; Wang, Guodong; Yu, Chang; Nie, Wenhui; Wang, Jinhuan; Wu, Zhigang; Liang, Huiqing; Min, Jiumeng; Wu, Qi; Cheng, Shifeng; Ruan, Jue; Wang, Mingwei; Shi, Zhongbin; Wen, Ming; Liu, Binghang; Ren, Xiaoli; Zheng, Huisong; Dong, Dong; Cook, Kathleen; Shan, Gao; Zhang, Hao; Kosiol, Carolin; Xie, Xueying; Lu, Zuhong; Zheng, Hancheng; Li, Yingrui; Steiner, Cynthia C.; Lam, Tommy Tsan-Yuk; Lin, Siyuan; Zhang, Qinghui; Li, Guoqing; Tian, Jing; Gong, Timing; Liu, Hongde; Zhang, Dejin; Fang, Lin; Ye, Chen; Zhang, Juanbin; Hu, Wenbo; Xu, Anlong; Ren, Yuanyuan; Zhang, Guojie; Bruford, Michael W.; Li, Qibin; Ma, Lijia; Guo, Yiran; An, Na; Hu, Yujie; Zheng, Yang; Shi, Yongyong; Li, Zhiqiang; Liu, Qing; Chen, Yanling; Zhao, Jing; Qu, Ning; Zhao, Shancen; Tian, Feng; Wang, Xiaoling; Wang, Haiyin; Xu, Lizhi; Liu, Xiao; Vinar, Tomas; Wang, Yajun; Lam, Tak-Wah; Yiu, Siu-Ming; Liu, Shiping; Zhang, Hemin; Li, Desheng; Huang, Yan; Wang, Xia; Yang, Guohua; Jiang, Zhi; Wang, Junyi; Qin, Nan; Li, Li; Li, Jingxiang; Bolund, Lars; Kristiansen, Karsten; Wong, Gane Ka-Shu; Olson, Maynard; Zhang, Xiuqing; Li, Songgang; Yang, Huanming; Wang, Jian; Wang, Jun
2013-01-01
Using next-generation sequencing technology alone, we have successfully generated and assembled a draft sequence of the giant panda genome. The assembled contigs (2.25 gigabases (Gb)) cover approximately 94% of the whole genome, and the remaining gaps (0.05 Gb) seem to contain carnivore-specific repeats and tandem repeats. Comparisons with the dog and human showed that the panda genome has a lower divergence rate. The assessment of panda genes potentially underlying some of its unique traits indicated that its bamboo diet might be more dependent on its gut microbiome than its own genetic composition. We also identified more than 2.7 million heterozygous single nucleotide polymorphisms in the diploid genome. Our data and analyses provide a foundation for promoting mammalian genetic research, and demonstrate the feasibility for using next-generation sequencing technologies for accurate, cost-effective and rapid de novo assembly of large eukaryotic genomes. PMID:20010809
Strategies and tools for whole genome alignments
DOE Office of Scientific and Technical Information (OSTI.GOV)
Couronne, Olivier; Poliakov, Alexander; Bray, Nicolas
2002-11-25
The availability of the assembled mouse genome makes possible, for the first time, an alignment and comparison of two large vertebrate genomes. We have investigated different strategies of alignment for the subsequent analysis of conservation of genomes that are effective for different quality assemblies. These strategies were applied to the comparison of the working draft of the human genome with the Mouse Genome Sequencing Consortium assembly, as well as other intermediate mouse assemblies. Our methods are fast and the resulting alignments exhibit a high degree of sensitivity, covering more than 90 percent of known coding exons in the human genome. We have obtained such coverage while preserving specificity. With a view towards the end user, we have developed a suite of tools and websites for automatically aligning, and subsequently browsing and working with whole genome comparisons. We describe the use of these tools to identify conserved non-coding regions between the human and mouse genomes, some of which have not been identified by other methods.
Chenoll, E; Codoñer, F M; Silva, A; Martinez-Blanch, J F; Martorell, P; Ramón, D; Genovés, S
2014-03-27
Bifidobacterium animalis subsp. lactis strain CECT 8145 is able to reduce body fat content and improve metabolic syndrome biomarkers. Here, we report the draft genome sequence of this strain, which may provide insights into its safety status and functional role.
Federal Register 2010, 2011, 2012, 2013, 2014
2011-12-13
..., is a fundamental tool used to harmonize our environmental, economic, and social aspirations and is a... COUNCIL ON ENVIRONMENTAL QUALITY Draft Guidance on Improving the Process for Preparing Efficient and Timely Environmental Reviews under the National Environmental Policy Act AGENCY: Council on...
Development of an Official Guideline for the Economic Evaluation of Drugs/Medical Devices in Japan.
Shiroiwa, Takeru; Fukuda, Takashi; Ikeda, Shunya; Takura, Tomoyuki; Moriwaki, Kensuke
2017-03-01
In Japan, cost-effectiveness evaluation was implemented on a trial basis from fiscal year 2016. The results will be applied to the future repricing of drugs and medical devices. On the basis of a request from the Central Social Insurance Medical Council (Chuikyo), our research team drafted the official methodological guideline for trial implementation. Here, we report the process of developing the official guideline for cost-effectiveness evaluation and its contents. The guideline reflects discussions at the Chuikyo subcommittee (e.g., the role of quality-adjusted life-year) and incorporates our academic perspective. Team members generated research questions for each section of the guideline and discussions on these questions were carried out. A draft guideline was prepared and submitted to the Ministry of Health, Labour and Welfare (MHLW), and then to the subcommittee. The draft guideline was revised, where appropriate, on the basis of the discussions at the subcommittee. Although the "public health care payer's perspective" is standard in this guideline, other perspectives can be applied as necessary depending on the objective of analysis. On the basis of the discussions at the subcommittee, quality-adjusted life-year will be used as the basic outcome. A discount rate of 2% per annum for costs and outcomes is recommended. The final guideline was officially approved by the Chuikyo general assembly in February 2016. This is the first officially approved guideline for the economic evaluation of drugs and medical devices in Japan. The guideline is expected to improve the quality and comparability of submitted cost-effectiveness data for decision making. Copyright © 2017 International Society for Pharmacoeconomics and Outcomes Research (ISPOR). Published by Elsevier Inc. All rights reserved.
Cannarozzi, Gina; Plaza-Wüthrich, Sonia; Esfeld, Korinna; Larti, Stéphanie; Wilson, Yi Song; Girma, Dejene; de Castro, Edouard; Chanyalew, Solomon; Blösch, Regula; Farinelli, Laurent; Lyons, Eric; Schneider, Michel; Falquet, Laurent; Kuhlemeier, Cris; Assefa, Kebebew; Tadele, Zerihun
2014-07-09
Tef (Eragrostis tef), an indigenous cereal critical to food security in the Horn of Africa, is rich in minerals and protein, resistant to many biotic and abiotic stresses and safe for diabetics as well as sufferers of immune reactions to wheat gluten. We present the genome of tef, the first species in the grass subfamily Chloridoideae and the first allotetraploid assembled de novo. We sequenced the tef genome for marker-assisted breeding, to shed light on the molecular mechanisms conferring tef's desirable nutritional and agronomic properties, and to make its genome publicly available as a community resource. The draft genome contains 672 Mbp representing 87% of the genome size estimated from flow cytometry. We also sequenced two transcriptomes, one from a normalized RNA library and another from unnormalized RNASeq data. The normalized RNA library revealed around 38000 transcripts that were then annotated by the SwissProt group. The CoGe comparative genomics platform was used to compare the tef genome to other genomes, notably sorghum. Scaffolds comprising approximately half of the genome size were ordered by syntenic alignment to sorghum producing tef pseudo-chromosomes, which were sorted into A and B genomes as well as compared to the genetic map of tef. The draft genome was used to identify novel SSR markers, investigate target genes for abiotic stress resistance studies, and understand the evolution of the prolamin family of proteins that are responsible for the immune response to gluten. It is highly plausible that breeding targets previously identified in other cereal crops will also be valuable breeding targets in tef. The draft genome and transcriptome will be of great use for identifying these targets for genetic improvement of this orphan crop that is vital for feeding 50 million people in the Horn of Africa.
A reference genome of the European beech (Fagus sylvatica L.).
Mishra, Bagdevi; Gupta, Deepak K; Pfenninger, Markus; Hickler, Thomas; Langer, Ewald; Nam, Bora; Paule, Juraj; Sharma, Rahul; Ulaszewski, Bartosz; Warmbier, Joanna; Burczyk, Jaroslaw; Thines, Marco
2018-06-01
The European beech is arguably the most important climax broad-leaved tree species in Central Europe, widely planted for its valuable wood. Here, we report the 542 Mb draft genome sequence of an up to 300-year-old individual (Bhaga) from an undisturbed stand in the Kellerwald-Edersee National Park in central Germany. Using a hybrid assembly approach, Illumina reads with short- and long-insert libraries, coupled with long Pacific Biosciences reads, we obtained an assembled genome size of 542 Mb, in line with flow cytometric genome size estimation. The largest scaffold was of 1.15 Mb, the N50 length was 145 kb, and the L50 count was 983. The assembly contained 0.12% of Ns. A Benchmarking with Universal Single-Copy Orthologs (BUSCO) analysis retrieved 94% complete BUSCO genes, well in the range of other high-quality draft genomes of trees. A total of 62,012 protein-coding genes were predicted, assisted by transcriptome sequencing. In addition, we are reporting an efficient method for extracting high-molecular-weight DNA from dormant buds, by which contamination by environmental bacteria and fungi was kept at a minimum. The assembled genome will be a valuable resource and reference for future population genomics studies on the evolution and past climate change adaptation of beech and will be helpful for identifying genes, e.g., involved in drought tolerance, in order to select and breed individuals to adapt forestry to climate change in Europe. A continuously updated genome browser and download page can be accessed from beechgenome.net, which will include future genome versions of the reference individual Bhaga, as new sequencing approaches develop.
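The N50 length and L50 count quoted in the beech abstract above follow a standard definition: sort the scaffolds from longest to shortest, accumulate their lengths, and stop at the first scaffold that brings the running total to at least half of the assembly size. That scaffold's length is the N50, and the number of scaffolds used is the L50. The following is a minimal Python sketch of that definition; the function name `n50_l50` and the toy lengths are illustrative assumptions and are not taken from the beech assembly or from the authors' pipeline.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a list of scaffold or contig lengths."""
    if not lengths:
        raise ValueError("need at least one sequence length")
    ordered = sorted(lengths, reverse=True)   # longest first
    half = sum(ordered) / 2                   # half of the total assembly size
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half:
            # 'length' is the N50; 'count' scaffolds were needed, so it is the L50
            return length, count


# Toy lengths in kb (made-up values, NOT the beech scaffolds)
toy_lengths = [100, 80, 60, 40, 20]
print(n50_l50(toy_lengths))  # (80, 2)
```

With the toy input the total is 300 kb, so the threshold is 150 kb; the two longest scaffolds (100 + 80) reach it, giving an N50 of 80 kb and an L50 of 2.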
Assembly, Annotation, and Analysis of Multiple Mycorrhizal Fungal Genomes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Initiative Consortium, Mycorrhizal Genomics; Kuo, Alan; Grigoriev, Igor
Mycorrhizal fungi play critical roles in host plant health, soil community structure and chemistry, and carbon and nutrient cycling, all areas of intense interest to the US Dept. of Energy (DOE) Joint Genome Institute (JGI). To this end we are building on our earlier sequencing of the Laccaria bicolor genome by partnering with INRA-Nancy and the mycorrhizal research community in the MGI to sequence and analyze dozens of mycorrhizal genomes of all Basidiomycota and Ascomycota orders and multiple ecological types (ericoid, orchid, and ectomycorrhizal). JGI has developed and deployed high-throughput sequencing techniques, and Assembly, RNASeq, and Annotation Pipelines. In 2012 alone we sequenced, assembled, and annotated 12 draft or improved genomes of mycorrhizae, and predicted ~232831 genes and ~15011 multigene families. All of this data is publicly available on JGI MycoCosm (http://jgi.doe.gov/fungi/), which provides access to both the genome data and tools with which to analyze the data. Preliminary comparisons of the current total of 14 public mycorrhizal genomes suggest that 1) short secreted proteins potentially involved in symbiosis are more enriched in some orders than in others amongst the mycorrhizal Agaricomycetes, 2) there are wide ranges of numbers of genes involved in certain functional categories, such as signal transduction and post-translational modification, and 3) novel gene families are specific to some ecological types.
Federal Register 2010, 2011, 2012, 2013, 2014
2011-07-14
... and Draft Environmental Impact Statement: Highway 35, Between Norfolk and South Sioux City, NE AGENCY... Environmental Impact Statement. SUMMARY: The FHWA is issuing this notice to advise the public that we are rescinding the Notice of Intent (NOI) and Draft Environmental Impact Statement (DEIS) for improvements that...
Yoshikawa, Toru; Kawakami, Norito; Kogi, Kazutaka; Tsutsumi, Akizumi; Shimazu, Miyuki; Nagami, Makiko; Shimazu, Akihito
2007-07-01
An action checklist for improving the workplace environment by means of enhancing mental health of workers (Mental Health Action Check List: MHACL) was developed. The use of the checklist for primary prevention was examined. MHACL was developed through three steps: (1) review of related references and collection of improvement examples for designing a draft MHACL; (2) pilot application of the draft at industrial workplaces and trials at workshops of occupational health staff; and (3) proposing a new MHACL for general use in industry. Workplace improvement actions related to mental health were listed in eight technical areas. From 84 workplaces in Japan, 201 such actions were collected. Typical improvement action phrases were extracted based on these examples, and a draft MHACL containing 40 generally applicable actions was prepared. This draft was applied to selected workplaces for its use as a tool for group discussion. Then, the utility of the checklist was discussed by 105 occupational health staff working in public service offices. The workshop suggested modifications of the draft MHACL including improved check items and usage procedures and the need to use easy-to-understand actions. The final version of the MHACL comprised 30 items in six technical areas: A) sharing work planning, B) work time and organization, C) ergonomic work methods, D) workplace environment, E) mutual support in the workplace, and F) preparedness and care. A new action checklist was proposed for use as a means of changing existing workplace environments and proposing practical actions for improving them. The checklist was confirmed to be useful for organizing workplace-level discussion for identifying immediate improvements at the workplace. The checklist is expected to be widely applied for promoting primary prevention measures in terms of better mental health.
Lessons for livestock genomics from genome and transcriptome sequencing in cattle and other mammals.
Taylor, Jeremy F; Whitacre, Lynsey K; Hoff, Jesse L; Tizioto, Polyana C; Kim, JaeWoo; Decker, Jared E; Schnabel, Robert D
2016-08-17
Decreasing sequencing costs and development of new protocols for characterizing global methylation, gene expression patterns and regulatory regions have stimulated the generation of large livestock datasets. Here, we discuss experiences in the analysis of whole-genome and transcriptome sequence data. We analyzed whole-genome sequence (WGS) data from 132 individuals from five canid species (Canis familiaris, C. latrans, C. dingo, C. aureus and C. lupus) and 61 breeds, three bison (Bison bison), 64 water buffalo (Bubalus bubalis) and 297 bovines from 17 breeds. By individual, data vary in extent of reference genome depth of coverage from 4.9X to 64.0X. We have also analyzed RNA-seq data for 580 samples representing 159 Bos taurus and Rattus norvegicus animals and 98 tissues. By aligning reads to a reference assembly and calling variants, we assessed effects of average depth of coverage on the actual coverage and on the number of called variants. We examined the identity of unmapped reads by assembling them and querying produced contigs against the non-redundant nucleic acids database. By imputing high-density single nucleotide polymorphism data on 4010 US registered Angus animals to WGS using Run4 of the 1000 Bull Genomes Project and assessing the accuracy of imputation, we identified misassembled reference sequence regions. We estimate that a 24X depth of coverage is required to achieve 99.5 % coverage of the reference assembly and identify 95 % of the variants within an individual's genome. Genomes sequenced to low average coverage (e.g., <10X) may fail to cover 10 % of the reference genome and identify <75 % of variants. About 10 % of genomic DNA or transcriptome sequence reads fail to align to the reference assembly. These reads include loci missing from the reference assembly and misassembled genes and interesting symbionts, commensal and pathogenic organisms. 
Assembly errors and a lack of annotation of functional elements significantly limit the utility of the current draft livestock reference assemblies. The Functional Annotation of Animal Genomes initiative seeks to annotate functional elements, while a 70X Pac-Bio assembly for cow is underway and may result in a significantly improved reference assembly.
Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop of resource-poor farmers.
Varshney, Rajeev K; Chen, Wenbin; Li, Yupeng; Bharti, Arvind K; Saxena, Rachit K; Schlueter, Jessica A; Donoghue, Mark T A; Azam, Sarwar; Fan, Guangyi; Whaley, Adam M; Farmer, Andrew D; Sheridan, Jaime; Iwata, Aiko; Tuteja, Reetu; Penmetsa, R Varma; Wu, Wei; Upadhyaya, Hari D; Yang, Shiaw-Pyng; Shah, Trushar; Saxena, K B; Michael, Todd; McCombie, W Richard; Yang, Bicheng; Zhang, Gengyun; Yang, Huanming; Wang, Jun; Spillane, Charles; Cook, Douglas R; May, Gregory D; Xu, Xun; Jackson, Scott A
2011-11-06
Pigeonpea is an important legume food crop grown primarily by smallholder farmers in many semi-arid tropical regions of the world. We used the Illumina next-generation sequencing platform to generate 237.2 Gb of sequence, which along with Sanger-based bacterial artificial chromosome end sequences and a genetic map, we assembled into scaffolds representing 72.7% (605.78 Mb) of the 833.07 Mb pigeonpea genome. Genome analysis predicted 48,680 genes for pigeonpea and also showed the potential role that certain gene families, for example, drought tolerance-related genes, have played throughout the domestication of pigeonpea and the evolution of its ancestors. Although we found a few segmental duplication events, we did not observe the recent genome-wide duplication events observed in soybean. This reference genome sequence will facilitate the identification of the genetic basis of agronomically important traits, and accelerate the development of improved pigeonpea varieties that could improve food security in many developing countries.
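The pigeonpea abstract above reports the assembled scaffolds as a share of the estimated genome size (605.78 Mb of 833.07 Mb, stated as 72.7%). As a quick consistency check, the percentage can be recomputed from the two sizes. This is a small Python sketch; the helper name `assembled_fraction` is a hypothetical label introduced here for illustration, and only the two megabase figures come from the abstract.

```python
def assembled_fraction(assembled_mb, genome_mb):
    """Fraction of the estimated genome size represented by the assembly."""
    if genome_mb <= 0:
        raise ValueError("genome size must be positive")
    return assembled_mb / genome_mb


# Figures quoted in the pigeonpea abstract (Mb)
fraction = assembled_fraction(605.78, 833.07)
print(f"{fraction:.1%}")  # 72.7%
```

The recomputed value agrees with the 72.7% quoted in the abstract.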
ERIC Educational Resources Information Center
Nelson, F. Howard; Hess, G. Alfred, Jr.
This draft document presents an analysis of all major educational reform proposals before the Illinois General Assembly, assessing the costs of implementation and what benefits might be expected for the funding levels contained in the proposals themselves. The 18 areas examined are (1) student testing; (2) student retention/remediation; (3)…
USDA-ARS's Scientific Manuscript database
‘Bacillus vanillea’ XY18T (=CGMCC 8629 T =NCCB 100507 T) was isolated from cured vanilla beans and is involved in the formation of vanilla aroma compounds. A draft genome of this type strain was assembled and yielded a length of 3.72 Mbp and a GC content of 46.3%. Comparative genomic analysis with its ...
Genome Sequence Analysis of the Biogenic Amine-Degrading Strain Lactobacillus casei 5b
Ladero, Victor; Herrero-Fresno, Ana; Martinez, Noelia; del Río, Beatriz; Linares, Daniel M.; Fernández, María; Martín, María Cruz
2014-01-01
We here report a 3.02-Mbp annotated draft assembly of the Lactobacillus casei 5b genome. The sequence of this biogenic amine-degrading dairy isolate may help identify the mechanisms involved in the catabolism of biogenic amines and perhaps shed light on ways to reduce the presence of these toxic compounds in food. PMID:24435875
Pasquali, Frédérique; Palma, Federica; Guillier, Laurent; Lucchi, Alex; De Cesare, Alessandra; Manfreda, Gerardo
2018-01-01
Listeria monocytogenes is a foodborne pathogen adapted to survive and persist in multiple environments. Following two previous studies on prevalence and virulence of L. monocytogenes ST121 and ST14 repeatedly collected in the same rabbit-meat processing plant, the research questions of the present study were to: (1) assess persistence of L. monocytogenes isolates from the rabbit-meat plant; (2) select genes associated with physiological adaptation to the food-processing environment; (3) compare presence/absence/truncation of these genes in newly sequenced and publicly available ST121 and ST14 genomes. A total of 273 draft genomes including ST121 and ST14 newly sequenced and publicly available draft genomes were analyzed. Whole-genome Single Nucleotide Polymorphism (wgSNP) analysis was performed separately on the assemblies of ST121 and ST14 draft genomes. SNP alignments were used to infer phylogeny. A dataset of L. monocytogenes ecophysiology genes was built based on a comprehensive literature review. The 94 selected genes were screened on the assemblies of all ST121 and ST14 draft genomes. Significant gene enrichments were evaluated by statistical analyses. A persistent ST14 clone, including 23 out of 27 newly sequenced genomes, was circulating in the rabbit-meat plant along with two non-persistent clones. A significant enrichment was observed in ST121 genomes concerning stress survival islet 2 (SSI-2) (alkaline and oxidative stress), qacH gene (resistance to benzalkonium chloride), cadA1C gene cassette (resistance to 70 mg/l of cadmium chloride) and a truncated version of actA gene (biofilm formation). Conversely, ST14 draft genomes were enriched with a full-length version of actA gene along with the Listeria Genomic Island 2 (LGI 2) including the ars operon (arsenic resistance) and the cadA4C gene cassette (resistance to 35 mg/l of cadmium chloride). Phenotypic tests confirmed ST121 as a weak biofilm producer in comparison to ST14.
In conclusion, ST121 carried the qacH gene and was phenotypically resistant to quaternary ammonium compounds. This property might contribute to the high prevalence of ST121 in food processing plants. ST14 showed greater ability to form biofilms, which might contribute to the occasional colonization and persistence on harborage sites where sanitizing procedures are difficult to apply. PMID:29662481
Lee, Chi-Ching; Chen, Yi-Ping Phoebe; Yao, Tzu-Jung; Ma, Cheng-Yu; Lo, Wei-Cheng; Lyu, Ping-Chiang; Tang, Chuan Yi
2013-04-10
Sequencing of microbial genomes is important because microbes carry antibiotic and pathogenic activities. However, even with the help of new assembly software, finishing a whole genome is a time-consuming task. In most bacteria, pathogenic or antibiotic genes are carried in genomic islands. Therefore, a quick genomic island (GI) prediction method is useful for genomes whose sequencing is still ongoing. In this work, we built a Web server called GI-POP (http://gipop.life.nthu.edu.tw) which integrates a sequence assembly tool, a functional annotation pipeline, and a high-performance GI prediction module based on a support vector machine (SVM) method called genomic island genomic profile scanning (GI-GPS). The draft genomes of ongoing genome projects, in contigs or scaffolds, can be submitted to our Web server, which provides functional annotation and highly probable GI predictions. GI-POP is a comprehensive annotation Web server designed for ongoing genome project analysis. Researchers can perform annotation and obtain pre-analytic information, including possible GIs, coding/non-coding sequences and functional analysis, from their draft genomes. This pre-analytic system can provide useful information for finishing a genome sequencing project. Copyright © 2012 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Montgomery, Rose; Scaglione, John M; Bevard, Bruce Balkcom
The High Burnup Spent Fuel Data project pulled 25 sister rods (9 from the project assemblies and 16 from similar HBU assemblies) for characterization. The 25 sister rods are all high burnup and cover the range of modern domestic cladding alloys. The 25 sister rods were shipped to Oak Ridge National Laboratory (ORNL) in early 2016 for detailed non-destructive and destructive examination. Examinations are intended to provide baseline data on the initial physical state of the cladding and fuel prior to the loading, drying, and long-term dry storage process. Further examinations are focused on determining the effects of temperatures encountered during and following drying. Similar tests will be performed on rods taken from the project assemblies at the end of their long-term storage in a TN-32 dry storage cask (the cask rods) to identify any significant changes in the fuel rods that may have occurred during the dry storage period. Additionally, some of the sister rods will be used for separate effects testing to expand the applicability of the project data to the fleet, and to address some of the data-related gaps associated with extended storage and subsequent transportation of high burnup fuel. A draft test plan is being developed that describes the experimental work to be conducted on the sister rods. This paper summarizes the draft test plan and necessary coordination activities for the multi-year experimental program to supply data relevant to the assessment of the safety of long-term storage followed by transportation of high burnup spent fuel.
Draft genome of the protandrous Chinese black porgy, Acanthopagrus schlegelii.
Zhang, Zhiyong; Zhang, Kai; Chen, Shuyin; Zhang, Zhiwei; Zhang, Jinyong; You, Xinxin; Bian, Chao; Xu, Jin; Jia, Chaofeng; Qiang, Jun; Zhu, Fei; Li, Hongxia; Liu, Hailin; Shen, Dehua; Ren, Zhonghong; Chen, Jieming; Li, Jia; Gao, Tianheng; Gu, Ruobo; Xu, Junmin; Shi, Qiong; Xu, Pao
2018-04-01
As one of the most popular and valuable commercial marine fishes in China and East Asian countries, the Chinese black porgy (Acanthopagrus schlegelii), also known as the blackhead seabream, has some attractive characteristics such as fast growth rate, good meat quality, resistance to diseases, and excellent adaptability to various environments. Furthermore, the black porgy is a good model for investigating sex changes in fish due to its protandrous hermaphroditism. Here, we obtained a high-quality genome assembly of this interesting teleost species and performed a genomic survey on potential genes associated with the sex-change phenomenon. We generated 175.4 gigabases (Gb) of clean sequence reads using a whole-genome shotgun sequencing strategy. The final genome assembly is approximately 688.1 megabases (Mb), accounting for 93% of the estimated genome size (739.6 Mb). The achieved scaffold N50 is 7.6 Mb, reaching a relatively high level among sequenced fish species. We identified 19 465 protein-coding genes, which had an average transcript length of 17.3 kb. By performing a comparative genomic analysis, we found 3 types of genes potentially associated with sex change, which are useful for studying the genetic basis of the protandrous hermaphroditism. We provide a draft genome assembly of the Chinese black porgy and discuss the potential genetic mechanisms of sex change. These data are also an important resource for studying the biology and for facilitating breeding of this economically important fish.
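The assembly statistics quoted in this abstract can be reproduced with a few lines of arithmetic. The following is a minimal sketch, not the authors' pipeline: `n50` and `assembled_fraction` are hypothetical helper names, and the scaffold lengths in the example are made-up toy values. N50 is taken here in its usual sense, the length L such that scaffolds of length at least L contain at least half of the assembled bases.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold (or contig) lengths."""
    if not lengths:
        raise ValueError("need at least one length")
    total = sum(lengths)
    running = 0
    # Walk from the longest scaffold down until half the assembly is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


def assembled_fraction(assembly_mb, estimated_genome_mb):
    """Fraction of the estimated genome size represented in the assembly."""
    if estimated_genome_mb <= 0:
        raise ValueError("estimated genome size must be positive")
    return assembly_mb / estimated_genome_mb


if __name__ == "__main__":
    toy_lengths = [8, 8, 4, 3, 3, 2, 2, 2]  # toy values, not real scaffolds
    print("toy N50:", n50(toy_lengths))
    # Figures from the abstract: 688.1 Mb assembled, 739.6 Mb estimated.
    print("assembled fraction:", round(assembled_fraction(688.1, 739.6), 2))
```

With the abstract's figures the assembled fraction rounds to 0.93, matching the reported 93%. The real scaffold N50 of 7.6 Mb cannot be recomputed here because the individual scaffold lengths are not given.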
2014-12-11
Cassava (Manihot esculenta Crantz) is a major staple crop in Africa, Asia, and South America, and its starchy roots provide nourishment for 800 million people worldwide. Although native to South America, cassava was brought to Africa 400-500 years ago and is now widely cultivated across sub-Saharan Africa, but it is subject to biotic and abiotic stresses. To assist in the rapid identification of markers for pathogen resistance and crop traits, and to accelerate breeding programs, we generated a framework map for M. esculenta Crantz from reduced representation sequencing [genotyping-by-sequencing (GBS)]. The composite 2412-cM map integrates 10 biparental maps (comprising 3480 meioses) and organizes 22,403 genetic markers on 18 chromosomes, in agreement with the observed karyotype. We used the map to anchor 71.9% of the draft genome assembly and 90.7% of the predicted protein-coding genes. The chromosome-anchored genome sequence will be useful for breeding improvement by assisting in the rapid identification of markers linked to important traits, and in providing a framework for genomic selection-enhanced breeding of this important crop. Copyright © 2015 International Cassava Genetic Map Consortium (ICGMC).
Lyons, Jessica
2014-12-11
Cassava (Manihot esculenta Crantz) is a major staple crop in Africa, Asia, and South America, and its starchy roots provide nourishment for 800 million people worldwide. Although native to South America, cassava was brought to Africa 400–500 years ago and is now widely cultivated across sub-Saharan Africa, but it is subject to biotic and abiotic stresses. To assist in the rapid identification of markers for pathogen resistance and crop traits, and to accelerate breeding programs, we generated a framework map for M. esculenta Crantz from reduced representation sequencing [genotyping-by-sequencing (GBS)]. The composite 2412-cM map integrates 10 biparental maps (comprising 3480 meioses) and organizes 22,403 genetic markers on 18 chromosomes, in agreement with the observed karyotype. Here, we used the map to anchor 71.9% of the draft genome assembly and 90.7% of the predicted protein-coding genes. The chromosome-anchored genome sequence will be useful for breeding improvement by assisting in the rapid identification of markers linked to important traits, and in providing a framework for genomic selection-enhanced breeding of this important crop.
Gan, Han M; Lee, Yin P; Austin, Christopher M
2017-01-01
We improved upon the previously reported draft genome of Hydrogenophaga intermedia strain PBC, a 4-aminobenzenesulfonate-degrading bacterium, by supplementing the assembly with Nanopore long reads, which enabled the reconstruction of the genome as a single contig. In the complete genome, the major genes responsible for the catabolism of 4-aminobenzenesulfonate in strain PBC are clustered in two distinct genomic regions. Although the catabolic genes for 4-sulfocatechol, the deaminated product of 4-aminobenzenesulfonate, are only found in H. intermedia, the sad operon responsible for the first deamination step of 4-aminobenzenesulfonate is conserved in various Hydrogenophaga strains. The absence of the pabB gene in the complete genome of H. intermedia PBC is consistent with its p-aminobenzoic acid (pABA) auxotrophy, but surprisingly, comparative genomics analysis of 14 Hydrogenophaga genomes indicates that pABA auxotrophy is not an uncommon feature among members of this genus. Of even more interest, several Hydrogenophaga strains do not possess the genomic potential for hydrogen oxidation, calling for a revision to the taxonomic description of Hydrogenophaga as "hydrogen eating bacteria."
Federal Register 2010, 2011, 2012, 2013, 2014
2013-08-23
... and Draft Environmental Impact Statement: I-17 Corridor Improvement Study; Maricopa County, Arizona... Corridor Improvement Study was published in the Federal Register on January 6, 2010. FOR FURTHER... Corridor is located in the city of Phoenix, and the study area limits for the EIS consisted of...
Li, Xi; Zhu, Yongze; Shen, Mengyuan; Du, Jing; Zhang, Lei; Wang, Dairong
2018-03-01
Enterobacter cloacae is one of the major pathogens responsible for a variety of human infections. Here we report the draft genome sequence of multidrug-resistant E. cloacae strain HBY isolated from a female patient in China. Whole genomic DNA of E. cloacae strain HBY was extracted and sequenced using an Illumina HiSeq™ 2000 platform. The generated sequence reads were assembled using CLC Genomics Workbench. The draft genome was annotated using Rapid Annotations using Subsystems Technology (RAST), and the presence of antimicrobial resistance genes was identified. The 5799439-bp genome contains various antimicrobial resistance genes conferring resistance to aminoglycosides, β-lactams, fosfomycin, macrolides, sulphonamides and fluoroquinolones. Notably, the strain was found to carry two main carbapenemase genes (blaKPC-2 and blaNDM-1). The genome sequence reported in this study will provide valuable information for understanding antibiotic resistance mechanisms in this strain. It is important to monitor the spread of Enterobacter sp. strains encoding both of these carbapenemase genes. Copyright © 2017 International Society for Chemotherapy of Infection and Cancer. Published by Elsevier Ltd. All rights reserved.
A draft sequence of the rice genome (Oryza sativa L. ssp. indica).
Yu, Jun; Hu, Songnian; Wang, Jun; Wong, Gane Ka-Shu; Li, Songgang; Liu, Bin; Deng, Yajun; Dai, Li; Zhou, Yan; Zhang, Xiuqing; Cao, Mengliang; Liu, Jing; Sun, Jiandong; Tang, Jiabin; Chen, Yanjiong; Huang, Xiaobing; Lin, Wei; Ye, Chen; Tong, Wei; Cong, Lijuan; Geng, Jianing; Han, Yujun; Li, Lin; Li, Wei; Hu, Guangqiang; Huang, Xiangang; Li, Wenjie; Li, Jian; Liu, Zhanwei; Li, Long; Liu, Jianping; Qi, Qiuhui; Liu, Jinsong; Li, Li; Li, Tao; Wang, Xuegang; Lu, Hong; Wu, Tingting; Zhu, Miao; Ni, Peixiang; Han, Hua; Dong, Wei; Ren, Xiaoyu; Feng, Xiaoli; Cui, Peng; Li, Xianran; Wang, Hao; Xu, Xin; Zhai, Wenxue; Xu, Zhao; Zhang, Jinsong; He, Sijie; Zhang, Jianguo; Xu, Jichen; Zhang, Kunlin; Zheng, Xianwu; Dong, Jianhai; Zeng, Wanyong; Tao, Lin; Ye, Jia; Tan, Jun; Ren, Xide; Chen, Xuewei; He, Jun; Liu, Daofeng; Tian, Wei; Tian, Chaoguang; Xia, Hongai; Bao, Qiyu; Li, Gang; Gao, Hui; Cao, Ting; Wang, Juan; Zhao, Wenming; Li, Ping; Chen, Wei; Wang, Xudong; Zhang, Yong; Hu, Jianfei; Wang, Jing; Liu, Song; Yang, Jian; Zhang, Guangyu; Xiong, Yuqing; Li, Zhijie; Mao, Long; Zhou, Chengshu; Zhu, Zhen; Chen, Runsheng; Hao, Bailin; Zheng, Weimou; Chen, Shouyi; Guo, Wei; Li, Guojie; Liu, Siqi; Tao, Ming; Wang, Jian; Zhu, Lihuang; Yuan, Longping; Yang, Huanming
2002-04-05
We have produced a draft sequence of the rice genome for the most widely cultivated subspecies in China, Oryza sativa L. ssp. indica, by whole-genome shotgun sequencing. The genome was 466 megabases in size, with an estimated 46,022 to 55,615 genes. Functional coverage in the assembled sequences was 92.0%. About 42.2% of the genome was in exact 20-nucleotide oligomer repeats, and most of the transposons were in the intergenic regions between genes. Although 80.6% of predicted Arabidopsis thaliana genes had a homolog in rice, only 49.4% of predicted rice genes had a homolog in A. thaliana. The large proportion of rice genes with no recognizable homologs is due to a gradient in the GC content of rice coding sequences.
Seribelli, Amanda Aparecida; Frazão, Miliane Rodrigues; Gonzales, Júlia Cunha; Cao, Guojie; Leon, Maria Sanchez; Kich, Jalusa Deon; Allard, Marc William; Falcão, Juliana Pfrimer
2018-04-19
Salmonellosis is a disease with a high incidence worldwide, and Salmonella enterica subsp. enterica serovar Typhimurium is one of the most clinically important serovars. We report here the draft genome sequences of 20 S. Typhimurium strains isolated from swine in Santa Catarina, Brazil. These draft genomes will improve our understanding of S. Typhimurium in Brazil.
USDA-ARS's Scientific Manuscript database
We produced and assembled high quality draft genomes (~100X coverage) for 305 Salmonella from a diverse group of over 100 serovars and diverse sources. Of these isolates, 119 were selected to capture a wide variety of different AR patterns. In our subsequent analyses we included 285 additional pub...
Jang, Hyein; Addy, Nicole; Ewing, Laura; Jean-Gilles Beaubrun, Junia; Lee, YouYoung; Woo, JungHa; Negrete, Flavia; Finkelstein, Samantha; Tall, Ben D; Lehner, Angelika; Eshwar, Athmanya; Gopinath, Gopal R
2018-04-12
Here, we present draft genome sequences of 29 Cronobacter sakazakii isolates obtained from foods of plant origin and dried-food manufacturing facilities. Assemblies and annotations resulted in genome sizes ranging from 4.3 to 4.5 Mb and 3,977 to 4,256 gene-coding sequences with G+C contents of ∼57.0%.
USDA-ARS's Scientific Manuscript database
This study reports a de novo assembled draft genome sequence of Xylella fastidiosa subsp. multiplex strain BB01 causing blueberry bacterial leaf scorch in Georgia, USA. The BB01 genome is 2,517,579 bp with a G+C content of 51.8% and 2,943 open reading frames (ORFs) and 48 RNA genes....
Dichosa, Armand E. K.; Davenport, Karen W.; Li, Po-E; ...
2015-03-19
We report here the genome sequence of Thauera sp. strain SWB20, isolated from a Singaporean wastewater treatment facility using gel microdroplets (GMDs) and single-cell genomics (SCG). This approach provided a single clonal microcolony that was sufficient to obtain a 4.9-Mbp genome assembly of an ecologically relevant Thauera species.
Examples of finite element mesh generation using SDRC IDEAS
NASA Technical Reports Server (NTRS)
Zapp, John; Volakis, John L.
1990-01-01
IDEAS (Integrated Design Engineering Analysis Software) offers a comprehensive package for mechanical design engineers. Due to its multifaceted capabilities, however, it can be manipulated to serve the needs of electrical engineers, also. IDEAS can be used to perform the following tasks: system modeling, system assembly, kinematics, finite element pre/post processing, finite element solution, system dynamics, drafting, test data analysis, and project relational database.
REMOTE RECORDING ANNULAR VANE ASSEMBLY
Wehmann, G.
1963-06-25
A weather vane apparatus is described which is capable of movement in horizontal and vertical planes. Associated with the vane are tangent potentiometers, commutators, and other electrical apparatus for deriving electrical output voltages as a function of the wind direction. The apparatus is particularly adapted for use with an anemometer to provide an electrical output indicating the amount and direction of an up or down draft. (AEC)
Sánchez-Rangel, Diana; Hernández-Domínguez, Eric; Pérez-Torres, Claudia-Anahí; Ortiz-Castro, Randy; Villafán, Emanuel; Alonso-Sánchez, Alexandro; Rodríguez-Haas, Benjamín; López-Buenfil, Abel; García-Avila, Clemente; Ramírez-Pool, José-Abrahán
2017-01-01
Here, we report the genome of Fusarium euwallaceae strain HFEW-16-IV-019, an isolate obtained from Kuroshio shot hole borer (a Euwallacea sp.). These beetles were collected in Tijuana, Mexico, from elm trees showing typical symptoms of Fusarium dieback. The final assembly consists of 287 scaffolds spanning 48,274,071 bp and 13,777 genes. PMID:28860245
Draft Genome Sequence of Desulfovibrio BerOc1, a Mercury-Methylating Strain
Gassie, Claire; Bouchez, Oliver; Klopp, Christophe; Guyoneaud, Rémy
2017-01-01
Desulfovibrio BerOc1 is a sulfate-reducing bacterium isolated from the Berre lagoon (French Mediterranean coast). BerOc1 is able to methylate and demethylate mercury. The genome size is 4,081,579 bp assembled into five contigs. We identified the hgcA and hgcB genes involved in mercury methylation, but not those responsible for mercury demethylation. PMID:28104657
Dukić, Marinela; Berner, Daniel; Roesti, Marius; Haag, Christoph R; Ebert, Dieter
2016-10-13
Recombination rate is an essential parameter for many genetic analyses. Recombination rates are highly variable across species, populations, individuals and different genomic regions. Due to the profound influence that recombination can have on intraspecific diversity and interspecific divergence, characterization of recombination rate variation emerges as a key resource for population genomic studies and emphasises the importance of high-density genetic maps as tools for studying genome biology. Here we present such a high-density genetic map for Daphnia magna, and analyse patterns of recombination rate across the genome. An F2 intercross panel was genotyped by Restriction-site Associated DNA sequencing to construct the third-generation linkage map of D. magna. The resulting high-density map included 4037 markers covering 813 scaffolds and contigs that sum up to 77% of the currently available genome draft sequence (v2.4) and 55% of the estimated genome size (238 Mb). Total genetic length of the map presented here is 1614.5 cM and the genome-wide recombination rate is estimated at 6.78 cM/Mb. Merging genetic and physical information, we consistently found that recombination rate estimates are high towards the peripheral parts of the chromosomes, while chromosome centres, harbouring centromeres in D. magna, show very low recombination rate estimates. Due to its high density, the third-generation linkage map for D. magna can be coupled with the draft genome assembly, providing an essential tool for genome investigation in this model organism. Thus, our linkage map can be used for the ongoing improvements of the genome assembly, but more importantly, it has enabled us to characterize variation in recombination rate across the genome of D. magna for the first time. These new insights can provide valuable assistance in future studies of genome evolution, mapping of quantitative traits and population genetic studies.
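The genome-wide rate quoted in this abstract is a total map length divided by a physical genome size. As a quick check, assuming the rate was computed against the estimated genome size of 238 Mb (an inference from the reported figures, not something the abstract states), the arithmetic can be sketched as follows; `recombination_rate` is a hypothetical helper name.

```python
def recombination_rate(map_length_cm, genome_size_mb):
    """Average recombination rate in cM/Mb."""
    if genome_size_mb <= 0:
        raise ValueError("genome size must be positive")
    return map_length_cm / genome_size_mb


if __name__ == "__main__":
    # Figures from the abstract: 1614.5 cM map, 238 Mb estimated genome.
    rate = recombination_rate(1614.5, 238.0)
    print(f"{rate:.2f} cM/Mb")  # prints "6.78 cM/Mb"
```

This is a genome-wide average only. The regional pattern described in the abstract (high rates near chromosome ends, low rates near centres) would require per-marker genetic and physical positions, which are not given here.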
Pajuelo, Mónica J; Eguiluz, María; Dahlstrom, Eric; Requena, David; Guzmán, Frank; Ramirez, Manuel; Sheen, Patricia; Frace, Michael; Sammons, Scott; Cama, Vitaliano; Anzick, Sarah; Bruno, Dan; Mahanty, Siddhartha; Wilkins, Patricia; Nash, Theodore; Gonzalez, Armando; García, Héctor H; Gilman, Robert H; Porcella, Steve; Zimic, Mirko
2015-12-01
Infections with Taenia solium are the most common cause of adult acquired seizures worldwide, and are the leading cause of epilepsy in developing countries. A better understanding of the genetic diversity of T. solium will improve parasite diagnostics and the understanding of transmission pathways in endemic areas, thereby facilitating the design of future control measures and interventions. Microsatellite markers are useful genome features, which enable strain typing and identification in complex pathogen genomes. Here we describe microsatellite identification and characterization in T. solium, providing information that will assist in global efforts to control this important pathogen. For genome sequencing, T. solium cysts and proglottids were collected from Huancayo and Puno in Peru, respectively. Using next generation sequencing (NGS) and de novo assembly, we assembled two draft genomes and one hybrid genome. Microsatellite sequences were identified and 36 of them were selected for further analysis. Twenty T. solium isolates were collected from Tumbes in the northern region, and twenty from Puno in the southern region of Peru. The size polymorphism of the selected microsatellites was determined with multi-capillary electrophoresis. We analyzed the association between microsatellite polymorphism and the geographic origin of the samples. The predicted size of the hybrid (proglottid genome combined with cyst genome) T. solium genome was 111 Mb with a GC content of 42.54%. A total of 7,979 contigs (>1,000 nt) were obtained. We identified 9,129 microsatellites in the Puno-proglottid genome and 9,936 in the Huancayo-cyst genome, with 5 or more repeats, ranging from mono- to hexa-nucleotide. Seven microsatellites were polymorphic and 29 were monomorphic within the analyzed isolates. T. solium tapeworms were classified into two genetic groups that correlated with the North/South geographic origin of the parasites. The availability of draft genomes for T. solium represents a significant step towards understanding the biology of the parasite. We report here a set of T. solium polymorphic microsatellite markers that appear promising for genetic epidemiology studies.
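The search criterion described in this abstract (motifs from mono- to hexa-nucleotide, five or more tandem repeats) can be illustrated with a regular expression. This is a minimal sketch under stated assumptions, not the authors' method: the abstract does not name the tool used, `find_microsatellites` is a hypothetical helper, only perfect uninterrupted repeats are detected, and the test sequence is invented.

```python
import re

# Shortest motif of 1-6 bases (mono- to hexa-nucleotide) followed by at
# least four more copies of itself, i.e. five or more tandem repeats.
# The lazy quantifier makes the engine try the shortest motif first, so
# "ACACACACAC" is reported with motif "AC" rather than "ACAC".
_SSR = re.compile(r"([ACGT]{1,6}?)\1{4,}")


def find_microsatellites(seq):
    """Return (start, motif, repeat_count) for each perfect tandem repeat."""
    hits = []
    for match in _SSR.finditer(seq.upper()):
        motif = match.group(1)
        repeats = len(match.group(0)) // len(motif)
        hits.append((match.start(), motif, repeats))
    return hits


if __name__ == "__main__":
    toy = "GGT" + "AC" * 5 + "TT"  # invented sequence with one (AC)5 repeat
    print(find_microsatellites(toy))  # prints [(3, 'AC', 5)]
```

A scan like this only locates candidate loci. Whether a locus is polymorphic between isolates, as for the seven markers reported above, still has to be established experimentally or by comparing genomes.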
Ghodhbane-Gtari, Faten; Beauchemin, Nicholas; Louati, Moussa; ...
2016-08-04
Here, we report the first genome sequence of a Nocardia plant endophyte, N. casuarinae strain BMG51109, isolated from Casuarina glauca root nodules. The improved high-quality draft genome sequence contains 8,787,999 bp with a 68.90% GC content and 7,307 predicted protein-coding genes.
Ramirez-Gonzalez, Ricardo; Caccamo, Mario; MacLean, Daniel
2011-10-01
Scientists now use high-throughput sequencing technologies and short-read assembly methods to create draft genome assemblies in just days. Tools and pipelines like the assembler, and the workflow management environments make it easy for a non-specialist to implement complicated pipelines to produce genome assemblies and annotations very quickly. Such accessibility results in a proliferation of assemblies and associated files, often for many organisms. These assemblies are used as a working reference by many different workers, from a bioinformatician doing gene prediction to a bench scientist designing primers for PCR. Here we describe Gee Fu, a database tool for genomic assembly and feature data, including next-generation sequence alignments. Gee Fu is an instance of a Ruby-On-Rails web application on a feature database that provides web and console interfaces for input, visualization of feature data via AnnoJ, access to data through a web-service interface, an API for direct data access by Ruby scripts and access to feature data stored in BAM files. Gee Fu provides a platform for storing and sharing different versions of an assembly and associated features that can be accessed and updated by bench biologists and bioinformaticians in ways that are easy and useful for each. http://tinyurl.com/geefu dan.maclean@tsl.ac.uk.
Mindell, J; Sheridan, L; Joffe, M; Samson-Barry, H; Atkinson, S
2004-03-01
To increase the positive and mitigate the negative health impacts of the mayor's draft transport strategy for London. A rapid prospective health impact assessment (HIA) of the penultimate draft of the strategy, using a review commissioned by the regional director of public health; an appraisal of congestion charging; and a participatory workshop. Two audits of changes were performed to assess the impact on policy of the HIA process. Regional government policy development. Recommendations from the rapid HIA were fed back into the drafting process. Changes (a) between the penultimate draft and the draft for public consultation and (b) between that and the final mayoral strategy. The draft transport strategy published for consultation differed in a number of respects from the previous version. Almost all the recommendations from the HIA were incorporated into the final strategy. Significant changes included promoting sustainable travel plans for workplaces and schools; giving priority to infrastructure and services that benefit London's deprived communities; increased emphasis on promoting walking and cycling and reducing reliance on private cars; and a commitment to track the health impacts of the final strategy and its implementation. Specific additions included re-allocating road space. HIA was successful in influencing the transport strategy for London, resulting in several improvements from a health viewpoint. HIA is an effective method both for bringing about significant change in policy proposals and in increasing policy makers' understanding of determinants of health and hence in changing attitudes of policy makers.
Experiment definition phase shuttle laboratory. LDRL-10.6 experiment
NASA Technical Reports Server (NTRS)
1976-01-01
The work completed on the experiment definition phase of the shuttle laboratory LDRL 10.6 micrometers experiment from 27 September 1975 to 26 January 1976 was reported. This work included progress in the following areas: (1) optomechanical system: completion of detail drawings, completion of the beryllium subassembly, fabrication, checking, and weighing of approximately 95% of the detailed parts, dry film lubrication of the bearings and gears, and initiation of assembly of the gimbals; (2) optics: update of the detailed optical layout, receipt of nine mirrors and the pre-expander; (3) miscellaneous: delivery of draft material for the final report, completion of optical testing of the 10.6 micrometers receiver, and receipt, assembly, and checkout of NASA test console.
The pig X and Y Chromosomes: structure, sequence, and evolution
Skinner, Benjamin M.; Sargent, Carole A.; Churcher, Carol; Hunt, Toby; Herrero, Javier; Loveland, Jane E.; Dunn, Matt; Louzada, Sandra; Fu, Beiyuan; Chow, William; Gilbert, James; Austin-Guest, Siobhan; Beal, Kathryn; Carvalho-Silva, Denise; Cheng, William; Gordon, Daria; Grafham, Darren; Hardy, Matt; Harley, Jo; Hauser, Heidi; Howden, Philip; Howe, Kerstin; Lachani, Kim; Ellis, Peter J.I.; Kelly, Daniel; Kerry, Giselle; Kerwin, James; Ng, Bee Ling; Threadgold, Glen; Wileman, Thomas; Wood, Jonathan M.D.; Yang, Fengtang; Harrow, Jen; Affara, Nabeel A.; Tyler-Smith, Chris
2016-01-01
We have generated an improved assembly and gene annotation of the pig X Chromosome, and a first draft assembly of the pig Y Chromosome, by sequencing BAC and fosmid clones from Duroc animals and incorporating information from optical mapping and fiber-FISH. The X Chromosome carries 1033 annotated genes, 690 of which are protein coding. Gene order closely matches that found in primates (including humans) and carnivores (including cats and dogs), which is inferred to be ancestral. Nevertheless, several protein-coding genes present on the human X Chromosome were absent from the pig, and 38 pig-specific X-chromosomal genes were annotated, 22 of which were olfactory receptors. The pig Y-specific Chromosome sequence generated here comprises 30 megabases (Mb). A 15-Mb subset of this sequence was assembled, revealing two clusters of male-specific low copy number genes, separated by an ampliconic region including the HSFY gene family, which together make up most of the short arm. Both clusters contain palindromes with high sequence identity, presumably maintained by gene conversion. Many of the ancestral X-related genes previously reported in at least one mammalian Y Chromosome are represented either as active genes or partial sequences. This sequencing project has allowed us to identify genes—both single copy and amplified—on the pig Y Chromosome, to compare the pig X and Y Chromosomes for homologous sequences, and thereby to reveal mechanisms underlying pig X and Y Chromosome evolution. PMID:26560630
Federal Register 2010, 2011, 2012, 2013, 2014
2012-12-17
... and Draft Environmental Impact Statement: I-10 Corridor Improvement Study; Maricopa County, AZ AGENCY..., Arizona. A NOI to prepare an EIS for the I-10 Corridor Improvement Study was published in the Federal..., Arizona. The I-10 Corridor is in or adjacent to the cities of Phoenix, Tempe, and Chandler, as well as the...
Draft Genome Sequence of Zobellia sp. Strain OII3, Isolated from the Coastal Zone of the Baltic Sea.
Harms, Henrik; Poehlein, Anja; Thürmer, Andrea; König, Gabriele M; Schäberle, Till F
2017-09-07
Zobellia sp. strain OII3 was isolated from a marine environmental sample due to its heterotrophic lifestyle, i.e., using Escherichia coli cells as prey. It shows strong agar-lytic activity. The genome was assembled into 41 contigs with a total size of 5.4 Mb, revealing the genetic basis for natural product biosynthesis. Copyright © 2017 Harms et al.
Addy, Nicole; Ewing, Laura; Jean-Gilles Beaubrun, Junia; Lee, YouYoung; Woo, JungHa; Negrete, Flavia; Finkelstein, Samantha; Tall, Ben D.; Lehner, Angelika; Eshwar, Athmanya; Gopinath, Gopal R.
2018-01-01
Here, we present draft genome sequences of 29 Cronobacter sakazakii isolates obtained from foods of plant origin and dried-food manufacturing facilities. Assemblies and annotations resulted in genome sizes ranging from 4.3 to 4.5 Mb and 3,977 to 4,256 gene-coding sequences with G+C contents of ∼57.0%. PMID:29650569
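The G+C content quoted in the record above is a simple proportion: the share of G and C bases among all unambiguous bases. A minimal Python sketch of that calculation follows. The function name `gc_content` and the toy sequence are illustrative assumptions, not part of the cited work, and real pipelines may treat ambiguous bases differently.

```python
def gc_content(sequence):
    """Return the G+C percentage of a DNA sequence.

    Assumption: ambiguous bases (e.g. N) are ignored rather than
    counted in the denominator; other conventions exist.
    """
    counted = [base for base in sequence.upper() if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no unambiguous bases")
    gc = sum(1 for base in counted if base in "GC")
    return 100.0 * gc / len(counted)


# Toy sequence (hypothetical, not taken from any isolate).
print(gc_content("ATGCGCNNTA"))
```

For a multi-contig draft assembly, the same ratio would be computed over the concatenated contigs, not by averaging per-contig values.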
Ibarra-Laclette, Enrique; Sánchez-Rangel, Diana; Hernández-Domínguez, Eric; Pérez-Torres, Claudia-Anahí; Ortiz-Castro, Randy; Villafán, Emanuel; Alonso-Sánchez, Alexandro; Rodríguez-Haas, Benjamín; López-Buenfil, Abel; García-Avila, Clemente; Ramírez-Pool, José-Abrahán
2017-08-31
Here, we report the genome of Fusarium euwallaceae strain HFEW-16-IV-019, an isolate obtained from Kuroshio shot hole borer (a Euwallacea sp.). These beetles were collected in Tijuana, Mexico, from elm trees showing typical symptoms of Fusarium dieback. The final assembly consists of 287 scaffolds spanning 48,274,071 bp and 13,777 genes. Copyright © 2017 Ibarra-Laclette et al.
Saha, Surya; Hunter, Wayne B.; Reese, Justin; Morgan, J. Kent; Marutani-Hert, Mizuri; Huang, Hong; Lindeberg, Magdalen
2012-01-01
Diaphorina citri (Hemiptera: Psyllidae), the Asian citrus psyllid, is the insect vector of Ca. Liberibacter asiaticus, the causal agent of citrus greening disease. Sequencing of the D. citri metagenome has been initiated to gain better understanding of the biology of this organism and the potential roles of its bacterial endosymbionts. To corroborate candidate endosymbionts previously identified by rDNA amplification, raw reads from the D. citri metagenome sequence were mapped to reference genome sequences. Results of the read mapping provided the most support for Wolbachia and an enteric bacterium most similar to Salmonella. Wolbachia-derived reads were extracted using the complete genome sequences for four Wolbachia strains. Reads were assembled into a draft genome sequence, and the annotation assessed for the presence of features potentially involved in host interaction. Genome alignment with the complete sequences reveals membership of Wolbachia wDi in supergroup B, further supported by phylogenetic analysis of FtsZ. FtsZ and Wsp phylogenies additionally indicate that the Wolbachia strain in the Florida D. citri isolate falls into a sub-clade of supergroup B, distinct from Wolbachia present in Chinese D. citri isolates, supporting the hypothesis that the D. citri introduced into Florida did not originate from China. PMID:23166822
Draft genome of the red harvester ant Pogonomyrmex barbatus.
Smith, Chris R; Smith, Christopher D; Robertson, Hugh M; Helmkampf, Martin; Zimin, Aleksey; Yandell, Mark; Holt, Carson; Hu, Hao; Abouheif, Ehab; Benton, Richard; Cash, Elizabeth; Croset, Vincent; Currie, Cameron R; Elhaik, Eran; Elsik, Christine G; Favé, Marie-Julie; Fernandes, Vilaiwan; Gibson, Joshua D; Graur, Dan; Gronenberg, Wulfila; Grubbs, Kirk J; Hagen, Darren E; Viniegra, Ana Sofia Ibarraran; Johnson, Brian R; Johnson, Reed M; Khila, Abderrahman; Kim, Jay W; Mathis, Kaitlyn A; Munoz-Torres, Monica C; Murphy, Marguerite C; Mustard, Julie A; Nakamura, Rin; Niehuis, Oliver; Nigam, Surabhi; Overson, Rick P; Placek, Jennifer E; Rajakumar, Rajendhran; Reese, Justin T; Suen, Garret; Tao, Shu; Torres, Candice W; Tsutsui, Neil D; Viljakainen, Lumi; Wolschin, Florian; Gadau, Jürgen
2011-04-05
We report the draft genome sequence of the red harvester ant, Pogonomyrmex barbatus. The genome was sequenced using 454 pyrosequencing, and the current assembly and annotation were completed in less than 1 y. Analyses of conserved gene groups (more than 1,200 manually annotated genes to date) suggest a high-quality assembly and annotation comparable to recently sequenced insect genomes using Sanger sequencing. The red harvester ant is a model for studying reproductive division of labor, phenotypic plasticity, and sociogenomics. Although the genome of P. barbatus is similar to other sequenced hymenopterans (Apis mellifera and Nasonia vitripennis) in GC content and compositional organization, and possesses a complete CpG methylation toolkit, its predicted genomic CpG content differs markedly from the other hymenopterans. Gene networks involved in generating key differences between the queen and worker castes (e.g., wings and ovaries) show signatures of increased methylation and suggest that ants and bees may have independently co-opted the same gene regulatory mechanisms for reproductive division of labor. Gene family expansions (e.g., 344 functional odorant receptors) and pseudogene accumulation in chemoreception and P450 genes compared with A. mellifera and N. vitripennis are consistent with major life-history changes during the adaptive radiation of Pogonomyrmex spp., perhaps in parallel with the development of the North American deserts.
Mofiz, Ehtesham; Holt, Deborah C; Seemann, Torsten; Currie, Bart J; Fischer, Katja; Papenfuss, Anthony T
2016-06-02
The scabies mite, Sarcoptes scabiei, is a parasitic arachnid and cause of the infectious skin disease scabies in humans and mange in other animal species. Scabies infections are a major health problem, particularly in remote Indigenous communities in Australia, where secondary group A streptococcal and Staphylococcus aureus infections of scabies sores are thought to drive the high rate of rheumatic heart disease and chronic kidney disease. We sequenced the genome of two samples of Sarcoptes scabiei var. hominis obtained from unrelated patients with crusted scabies located in different parts of northern Australia using the Illumina HiSeq. We also sequenced samples of Sarcoptes scabiei var. suis from a pig model. Because of the small size of the scabies mite, these data are derived from pools of thousands of mites and are metagenomic, including host and microbiome DNA. We performed cleaning and de novo assembly and present Sarcoptes scabiei var. hominis and var. suis draft reference genomes. We have constructed a preliminary annotation of this reference comprising 13,226 putative coding sequences based on sequence similarity to known proteins. We have developed extensive genomic resources for the scabies mite, including reference genomes and a preliminary annotation.
Bian, Chao; Hu, Yinchang; Ravi, Vydianathan; Kuznetsova, Inna S.; Shen, Xueyan; Mu, Xidong; Sun, Ying; You, Xinxin; Li, Jia; Li, Xiaofeng; Qiu, Ying; Tay, Boon-Hui; Thevasagayam, Natascha May; Komissarov, Aleksey S.; Trifonov, Vladimir; Kabilov, Marsel; Tupikin, Alexey; Luo, Jianren; Liu, Yi; Song, Hongmei; Liu, Chao; Wang, Xuejie; Gu, Dangen; Yang, Yexin; Li, Wujiao; Polgar, Gianluca; Fan, Guangyi; Zeng, Peng; Zhang, He; Xiong, Zijun; Tang, Zhujing; Peng, Chao; Ruan, Zhiqiang; Yu, Hui; Chen, Jieming; Fan, Mingjun; Huang, Yu; Wang, Min; Zhao, Xiaomeng; Hu, Guojun; Yang, Huanming; Wang, Jian; Wang, Jun; Xu, Xun; Song, Linsheng; Xu, Gangchun; Xu, Pao; Xu, Junmin; O’Brien, Stephen J.; Orbán, László; Venkatesh, Byrappa; Shi, Qiong
2016-01-01
The Asian arowana (Scleropages formosus), one of the world’s most expensive cultivated ornamental fishes, is an endangered species. It represents an ancient lineage of teleosts: the Osteoglossomorpha. Here, we provide a high-quality chromosome-level reference genome of a female golden-variety arowana using a combination of deep shotgun sequencing and high-resolution linkage mapping. In addition, we have also generated two draft genome assemblies for the red and green varieties. Phylogenomic analysis supports a sister group relationship between Osteoglossomorpha (bonytongues) and Elopomorpha (eels and relatives), with the two clades together forming a sister group of Clupeocephala which includes all the remaining teleosts. The arowana genome retains the full complement of eight Hox clusters unlike the African butterfly fish (Pantodon buchholzi), another bonytongue fish, which possess only five Hox clusters. Differential gene expression among three varieties provides insights into the genetic basis of colour variation. A potential heterogametic sex chromosome is identified in the female arowana karyotype, suggesting that the sex is determined by a ZW/ZZ sex chromosomal system. The high-quality reference genome of the golden arowana and the draft assemblies of the red and green varieties are valuable resources for understanding the biology, adaptation and behaviour of Asian arowanas. PMID:27089831
Opera: reconstructing optimal genomic scaffolds with high-throughput paired-end sequences.
Gao, Song; Sung, Wing-Kin; Nagarajan, Niranjan
2011-11-01
Scaffolding, the problem of ordering and orienting contigs, typically using paired-end reads, is a crucial step in the assembly of high-quality draft genomes. Even as sequencing technologies and mate-pair protocols have improved significantly, scaffolding programs still rely on heuristics, with no guarantees on the quality of the solution. In this work, we explored the feasibility of an exact solution for scaffolding and present a first tractable solution for this problem (Opera). We also describe a graph contraction procedure that allows the solution to scale to large scaffolding problems and demonstrate this by scaffolding several large real and synthetic datasets. In comparisons with existing scaffolders, Opera simultaneously produced longer and more accurate scaffolds demonstrating the utility of an exact approach. Opera also incorporates an exact quadratic programming formulation to precisely compute gap sizes (Availability: http://sourceforge.net/projects/operasf/).
Opera: Reconstructing Optimal Genomic Scaffolds with High-Throughput Paired-End Sequences
Gao, Song; Sung, Wing-Kin
2011-01-01
Scaffolding, the problem of ordering and orienting contigs, typically using paired-end reads, is a crucial step in the assembly of high-quality draft genomes. Even as sequencing technologies and mate-pair protocols have improved significantly, scaffolding programs still rely on heuristics, with no guarantees on the quality of the solution. In this work, we explored the feasibility of an exact solution for scaffolding and present a first tractable solution for this problem (Opera). We also describe a graph contraction procedure that allows the solution to scale to large scaffolding problems and demonstrate this by scaffolding several large real and synthetic datasets. In comparisons with existing scaffolders, Opera simultaneously produced longer and more accurate scaffolds demonstrating the utility of an exact approach. Opera also incorporates an exact quadratic programming formulation to precisely compute gap sizes (Availability: http://sourceforge.net/projects/operasf/). PMID:21929371
De Novo Genome and Transcriptome Assembly of the Canadian Beaver (Castor canadensis).
Lok, Si; Paton, Tara A; Wang, Zhuozhi; Kaur, Gaganjot; Walker, Susan; Yuen, Ryan K C; Sung, Wilson W L; Whitney, Joseph; Buchanan, Janet A; Trost, Brett; Singh, Naina; Apresto, Beverly; Chen, Nan; Coole, Matthew; Dawson, Travis J; Ho, Karen; Hu, Zhizhou; Pullenayegum, Sanjeev; Samler, Kozue; Shipstone, Arun; Tsoi, Fiona; Wang, Ting; Pereira, Sergio L; Rostami, Pirooz; Ryan, Carol Ann; Tong, Amy Hin Yan; Ng, Karen; Sundaravadanam, Yogi; Simpson, Jared T; Lim, Burton K; Engstrom, Mark D; Dutton, Christopher J; Kerr, Kevin C R; Franke, Maria; Rapley, William; Wintle, Richard F; Scherer, Stephen W
2017-02-09
The Canadian beaver (Castor canadensis) is the largest indigenous rodent in North America. We report a draft annotated assembly of the beaver genome, the first for a large rodent and the first mammalian genome assembled directly from uncorrected and moderate coverage (< 30 ×) long reads generated by single-molecule sequencing. The genome size is 2.7 Gb estimated by k-mer analysis. We assembled the beaver genome using the new Canu assembler optimized for noisy reads. The resulting assembly was refined using Pilon supported by short reads (80 ×) and checked for accuracy by congruency against an independent short read assembly. We scaffolded the assembly using the exon-gene models derived from 9805 full-length open reading frames (FL-ORFs) constructed from the beaver leukocyte and muscle transcriptomes. The final assembly comprised 22,515 contigs with an N50 of 278,680 bp and an N50-scaffold of 317,558 bp. Maximum contig and scaffold lengths were 3.3 and 4.2 Mb, respectively, with a combined scaffold length representing 92% of the estimated genome size. The completeness and accuracy of the scaffold assembly was demonstrated by the precise exon placement for 91.1% of the 9805 assembled FL-ORFs and 83.1% of the BUSCO (Benchmarking Universal Single-Copy Orthologs) gene set used to assess the quality of genome assemblies. Well-represented were genes involved in dentition and enamel deposition, defining characteristics of rodents with which the beaver is well-endowed. The study provides insights for genome assembly and an important genomics resource for Castoridae and rodent evolutionary biology. Copyright © 2017 Lok et al.
De Novo Genome and Transcriptome Assembly of the Canadian Beaver (Castor canadensis)
Lok, Si; Paton, Tara A.; Wang, Zhuozhi; Kaur, Gaganjot; Walker, Susan; Yuen, Ryan K. C.; Sung, Wilson W. L.; Whitney, Joseph; Buchanan, Janet A.; Trost, Brett; Singh, Naina; Apresto, Beverly; Chen, Nan; Coole, Matthew; Dawson, Travis J.; Ho, Karen; Hu, Zhizhou; Pullenayegum, Sanjeev; Samler, Kozue; Shipstone, Arun; Tsoi, Fiona; Wang, Ting; Pereira, Sergio L.; Rostami, Pirooz; Ryan, Carol Ann; Tong, Amy Hin Yan; Ng, Karen; Sundaravadanam, Yogi; Simpson, Jared T.; Lim, Burton K.; Engstrom, Mark D.; Dutton, Christopher J.; Kerr, Kevin C. R.; Franke, Maria; Rapley, William; Wintle, Richard F.; Scherer, Stephen W.
2017-01-01
The Canadian beaver (Castor canadensis) is the largest indigenous rodent in North America. We report a draft annotated assembly of the beaver genome, the first for a large rodent and the first mammalian genome assembled directly from uncorrected and moderate coverage (< 30 ×) long reads generated by single-molecule sequencing. The genome size is 2.7 Gb estimated by k-mer analysis. We assembled the beaver genome using the new Canu assembler optimized for noisy reads. The resulting assembly was refined using Pilon supported by short reads (80 ×) and checked for accuracy by congruency against an independent short read assembly. We scaffolded the assembly using the exon–gene models derived from 9805 full-length open reading frames (FL-ORFs) constructed from the beaver leukocyte and muscle transcriptomes. The final assembly comprised 22,515 contigs with an N50 of 278,680 bp and an N50-scaffold of 317,558 bp. Maximum contig and scaffold lengths were 3.3 and 4.2 Mb, respectively, with a combined scaffold length representing 92% of the estimated genome size. The completeness and accuracy of the scaffold assembly was demonstrated by the precise exon placement for 91.1% of the 9805 assembled FL-ORFs and 83.1% of the BUSCO (Benchmarking Universal Single-Copy Orthologs) gene set used to assess the quality of genome assemblies. Well-represented were genes involved in dentition and enamel deposition, defining characteristics of rodents with which the beaver is well-endowed. The study provides insights for genome assembly and an important genomics resource for Castoridae and rodent evolutionary biology. PMID:28087693
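The contig and scaffold N50 values reported in the record above summarize assembly contiguity: N50 is the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly length. A minimal Python sketch of that definition follows. The function name `n50` and the toy lengths are illustrative assumptions and are not drawn from the beaver assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths."""
    if not lengths:
        raise ValueError("need at least one sequence length")
    ordered = sorted(lengths, reverse=True)  # longest first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first length at which the running sum reaches half
        # of the total assembly size is the N50.
        if running >= half_total:
            return length


# Toy lengths in bp (hypothetical): total 300, half 150.
print(n50([100, 80, 60, 40, 20]))  # prints 80
```

Because N50 depends only on the length distribution, it says nothing about correctness; a misassembled genome can have a high N50, which is why the record also reports exon placement and BUSCO recovery.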
Draft genome sequence of the rubber tree Hevea brasiliensis.
Rahman, Ahmad Yamin Abdul; Usharraj, Abhilash O; Misra, Biswapriya B; Thottathil, Gincy P; Jayasekaran, Kandakumar; Feng, Yun; Hou, Shaobin; Ong, Su Yean; Ng, Fui Ling; Lee, Ling Sze; Tan, Hock Siew; Sakaff, Muhd Khairul Luqman Muhd; Teh, Beng Soon; Khoo, Bee Feong; Badai, Siti Suriawati; Aziz, Nurohaida Ab; Yuryev, Anton; Knudsen, Bjarne; Dionne-Laporte, Alexandre; Mchunu, Nokuthula P; Yu, Qingyi; Langston, Brennick J; Freitas, Tracey Allen K; Young, Aaron G; Chen, Rui; Wang, Lei; Najimudin, Nazalan; Saito, Jennifer A; Alam, Maqsudul
2013-02-02
Hevea brasiliensis, a member of the Euphorbiaceae family, is the major commercial source of natural rubber (NR). NR is a latex polymer with high elasticity, flexibility, and resilience that has played a critical role in the world economy since 1876. Here, we report the draft genome sequence of H. brasiliensis. The assembly spans ~1.1 Gb of the estimated 2.15 Gb haploid genome. Overall, ~78% of the genome was identified as repetitive DNA. Gene prediction shows 68,955 gene models, of which 12.7% are unique to Hevea. Most of the key genes associated with rubber biosynthesis, rubberwood formation, disease resistance, and allergenicity have been identified. The knowledge gained from this genome sequence will aid in the future development of high-yielding clones to keep up with the ever increasing need for natural rubber.
Draft genome sequence of the rubber tree Hevea brasiliensis
2013-01-01
Background: Hevea brasiliensis, a member of the Euphorbiaceae family, is the major commercial source of natural rubber (NR). NR is a latex polymer with high elasticity, flexibility, and resilience that has played a critical role in the world economy since 1876. Results: Here, we report the draft genome sequence of H. brasiliensis. The assembly spans ~1.1 Gb of the estimated 2.15 Gb haploid genome. Overall, ~78% of the genome was identified as repetitive DNA. Gene prediction shows 68,955 gene models, of which 12.7% are unique to Hevea. Most of the key genes associated with rubber biosynthesis, rubberwood formation, disease resistance, and allergenicity have been identified. Conclusions: The knowledge gained from this genome sequence will aid in the future development of high-yielding clones to keep up with the ever increasing need for natural rubber. PMID:23375136
Coughlan, Simone; Taylor, Ali Shirley; Feane, Eoghan; Sanders, Mandy; Schonian, Gabriele; Cotton, James A.
2018-01-01
The unicellular protozoan parasite Leishmania causes the neglected tropical disease leishmaniasis, affecting 12 million people in 98 countries. In South America, where the Viannia subgenus predominates, so far only L. (Viannia) braziliensis and L. (V.) panamensis have been sequenced, assembled and annotated as reference genomes. Addressing this deficit in molecular information can inform species typing, epidemiological monitoring and clinical treatment. Here, L. (V.) naiffi and L. (V.) guyanensis genomic DNA was sequenced to assemble these two genomes as draft references from short sequence reads. The methods used were tested using short sequence reads for L. braziliensis M2904 against its published reference as a comparison. This assembly and annotation pipeline identified 70 additional genes not annotated on the original M2904 reference. Phylogenetic and evolutionary comparisons of L. guyanensis and L. naiffi with 10 other Viannia genomes revealed four traits common to all Viannia: aneuploidy, 22 orthologous groups of genes absent in other Leishmania subgenera, elevated TATE transposon copies and a high NADH-dependent fumarate reductase gene copy number. Within the Viannia, there were limited structural changes in genome architecture specific to individual species: a 45 Kb amplification on chromosome 34 was present in all bar L. lainsoni, L. naiffi had a higher copy number of the virulence factor leishmanolysin, and laboratory isolate L. shawi M8408 had a possible minichromosome derived from the 3’ end of chromosome 34. This combination of genome assembly, phylogenetics and comparative analysis across an extended panel of diverse Viannia has uncovered new insights into the origin and evolution of this subgenus and can help improve diagnostics for leishmaniasis surveillance. PMID:29765675
Icarus: visualizer for de novo assembly evaluation.
Mikheenko, Alla; Valin, Gleb; Prjibelski, Andrey; Saveliev, Vladislav; Gurevich, Alexey
2016-11-01
Data visualization plays an increasingly important role in NGS data analysis. With advances in both sequencing and computational technologies, it has become a new bottleneck in genomics studies. Indeed, evaluation of de novo genome assemblies is one of the areas that can benefit from the visualization. However, even though multiple quality assessment methods are now available, existing visualization tools are hardly suitable for this purpose. Here, we present Icarus, a novel genome visualizer for accurate assessment and analysis of genomic draft assemblies, which is based on the tool QUAST. Icarus can be used in studies where a related reference genome is available, as well as for non-model organisms. The tool is available online and as a standalone application: http://cab.spbu.ru/software/icarus. Contact: aleksey.gurevich@spbu.ru. Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved.
Hosmani, Prashant S.; Villalobos-Ayala, Krystal; Miller, Sherry; Shippy, Teresa; Flores, Mirella; Rosendale, Andrew; Cordola, Chris; Bell, Tracey; Mann, Hannah; DeAvila, Gabe; DeAvila, Daniel; Moore, Zachary; Buller, Kyle; Ciolkevich, Kathryn; Nandyal, Samantha; Mahoney, Robert; Van Voorhis, Joshua; Dunlevy, Megan; Farrow, David; Hunter, David; Morgan, Taylar; Shore, Kayla; Guzman, Victoria; Izsak, Allison; Dixon, Danielle E.; Cridge, Andrew; Cano, Liliana; Cao, Xiaolong; Jiang, Haobo; Leng, Nan; Johnson, Shannon; Cantarel, Brandi L.; Richards, Stephen; English, Adam; Shatters, Robert G.; Childers, Chris; Chen, Mei-Ju; Hunter, Wayne; Cilia, Michelle; Mueller, Lukas A.; Munoz-Torres, Monica; Nelson, David; Poelchau, Monica F.; Benoit, Joshua B.; Wiersma-Koch, Helen; D’Elia, Tom; Brown, Susan J.
2017-01-01
The Asian citrus psyllid (Diaphorina citri Kuwayama) is the insect vector of the bacterium Candidatus Liberibacter asiaticus (CLas), the pathogen associated with citrus Huanglongbing (HLB, citrus greening). HLB threatens citrus production worldwide. Suppression or reduction of the insect vector using chemical insecticides has been the primary method to inhibit the spread of citrus greening disease. Accurate structural and functional annotation of the Asian citrus psyllid genome, as well as a clear understanding of the interactions between the insect and CLas, are required for development of new molecular-based HLB control methods. A draft assembly of the D. citri genome has been generated and annotated with automated pipelines. However, knowledge transfer from well-curated reference genomes such as that of Drosophila melanogaster to newly sequenced ones is challenging due to the complexity and diversity of insect genomes. To identify and improve gene models as potential targets for pest control, we manually curated several gene families with a focus on genes that have key functional roles in D. citri biology and CLas interactions. This community effort produced 530 manually curated gene models across developmental, physiological, RNAi regulatory and immunity-related pathways. As previously shown in the pea aphid, RNAi machinery genes putatively involved in the microRNA pathway have been specifically duplicated. A comprehensive transcriptome enabled us to identify a number of gene families that are either missing or misassembled in the draft genome. In order to develop biocuration as a training experience, we included undergraduate and graduate students from multiple institutions, as well as experienced annotators from the insect genomics research community. The resulting gene set (OGS v1.0) combines both automatically predicted and manually curated gene models. Database URL: https://citrusgreening.org/ PMID:29220441
NASA Astrophysics Data System (ADS)
KO, Pohan; MATSUMOTO, Kiyoshi; OHTAKE, Norio; DING, Hua
2016-11-01
For turbomachinery, improving off-design performance is challenging but critical for maximising the usable operating area. In this paper, a curved draft tube for a medium-head Kaplan-type hydro turbine is introduced and discussed for its significant effect on expanding the operating head range. Without adding any extra structure or working fluid for swirl destruction and damping, a carefully designed draft tube outline with selected placement of center piers successfully suppresses the growth of turbulent eddies and the transport of swirl to the outlet. In addition, more kinetic energy is recovered and the head loss is reduced. Model test results are also presented. A clear performance improvement was found in the lower net head area, where the measured efficiency improvement reached up to 20% without compromising the best efficiency point. The design also yields a more compact draft tube, which lowers construction and manufacturing costs for the prototype. The draft tube geometry parameters were designed with regard to the best efficiency point together with off-design points covering various net heads and discharges. The hydraulic performance and flow behavior were predicted and visualized numerically by solving the Reynolds-Averaged Navier-Stokes equations with the Shear Stress Transport turbulence model. The simulation assumed steady-state incompressible turbulent flow inside the flow passage, and the inlet boundary condition was the carefully simulated flow pattern from the runner outlet. For confirmation, the corresponding turbine efficiency over the entire operating area was verified by model test.
The purpose of this draft report is to provide a summary of climate change impacts to selected watersheds and recommendations for how to improve the process of conducting watershed assessments in the future.
Kim, Hyung Jun; Jang, Soojin
2017-12-01
Staphylococcus haemolyticus is the second most frequently isolated coagulase-negative staphylococcus from blood cultures. Moreover, multidrug resistance associated with the genome flexibility of S. haemolyticus has been increasingly reported worldwide. Here we report the draft genome sequence of multidrug-resistant S. haemolyticus IPK_TSA25 isolated from a building surface in South Korea. Genomic DNA of S. haemolyticus IPK_TSA25 was sequenced using the PacBio RS II sequencing platform. Generated reads were assembled using PacBio SMRT Analysis 2.3.0. The draft genome was annotated and antibiotic resistance genes were identified. The genome of 2,517,398 bp contains various antibiotic resistance genes associated with resistance to β-lactams, aminoglycosides and macrolides. Genome analysis also revealed chromosomal integration of the full-length Staphylococcus aureus plasmid pS0385-1 containing a tetracycline resistance gene. The genome sequence reported in this study will provide valuable information to understand the flexibility of the S. haemolyticus genome, which facilitates acquisition of antibiotic resistance genes and contributes to the dissemination of antibiotic resistance by this emerging pathogen. Copyright © 2017 International Society for Chemotherapy of Infection and Cancer. Published by Elsevier Ltd. All rights reserved.
Main photoautotrophic components of biofilms in natural draft cooling towers.
Hauer, Tomáš; Čapek, Petr; Böhmová, Petra
2016-05-01
While photoautotrophic organisms are an important component of biofilms that live in certain regions of natural draft cooling towers, little is known about these communities. We therefore examined 18 towers at nine sites to identify the general patterns of community assembly in three distinct tower parts, and we examined how community structures differ depending on geography. We also compared the newly acquired data with previously published data. The bottom sections of draft cooling towers are mainly settled by large filamentous algae, primarily Cladophora glomerata. The central portions of towers host a small amount of planktic algae biomass originating in the cooling water. The upper fourths of towers are colonized by biofilms primarily dominated by cyanobacteria, e.g., members of the genera Gloeocapsa and Scytonema. A total of 41 taxa of phototrophic microorganisms were identified. Species composition of the upper fourth of all towers was significantly affected by cardinal position. There was different species composition at positions facing north compared to positions facing south. West- and east-facing positions were transitory and highly similar to each other in terms of species composition. Biofilms contribute to the degradation of paint coatings inside towers.
Whole-Genome Sequencing and Assembly with High-Throughput, Short-Read Technologies
Sundquist, Andreas; Ronaghi, Mostafa; Tang, Haixu; Pevzner, Pavel; Batzoglou, Serafim
2007-01-01
While recently developed short-read sequencing technologies may dramatically reduce the sequencing cost and eventually achieve the $1000 goal for re-sequencing, their limitations prevent the de novo sequencing of eukaryotic genomes with the standard shotgun sequencing protocol. We present SHRAP (SHort Read Assembly Protocol), a sequencing protocol and assembly methodology that utilizes high-throughput short-read technologies. We describe a variation on hierarchical sequencing with two crucial differences: (1) we select a clone library from the genome randomly rather than as a tiling path and (2) we sample clones from the genome at high coverage and reads from the clones at low coverage. We assume that 200 bp read lengths with a 1% error rate and inexpensive random fragment cloning on whole mammalian genomes is feasible. Our assembly methodology is based on first ordering the clones and subsequently performing read assembly in three stages: (1) local assemblies of regions significantly smaller than a clone size, (2) clone-sized assemblies of the results of stage 1, and (3) chromosome-sized assemblies. By aggressively localizing the assembly problem during the first stage, our method succeeds in assembling short, unpaired reads sampled from repetitive genomes. We tested our assembler using simulated reads from D. melanogaster and human chromosomes 1, 11, and 21, and produced assemblies with large sets of contiguous sequence and a misassembly rate comparable to other draft assemblies. Tested on D. melanogaster and the entire human genome, our clone-ordering method produces accurate maps, thereby localizing fragment assembly and enabling the parallelization of the subsequent steps of our pipeline. Thus, we have demonstrated that truly inexpensive de novo sequencing of mammalian genomes will soon be possible with high-throughput, short-read technologies using our methodology. PMID:17534434
Radford, Devon R; Leon-Velarde, Carlos G; Chen, Shu; Hamidi Oskouei, Amir M; Balamurugan, Sampathkumar
2018-03-29
The genomes of two strains of Salmonella enterica subsp. enterica serovar Cubana and serovar Muenchen, isolated from dry hazelnuts and chia seeds, respectively, were sequenced using the Illumina MiSeq platform, assembled de novo using the overlap-layout-consensus method, and aligned to their respective most identical sequence genome scaffolds using MUMMER and BLAST searches. Copyright © 2018 Radford et al.
1987-03-01
intelligent way, assemble those documents and data in usable formats, examine the communications tapes available for this project, and to develop a sampling...Lifetime Learning Publications, Belmont, CA, 1982. Rowe, Neil C., Artificial Intelligence, Draft Copy, Class Notes for Winter Quarter, CS 33 10, Naval...AT2 122 BATTLEFIELD MANAGEMENT SYSTEM DATA REQUIREMENTS TO 1/2 SUPPORT PASSAGE OF COMPANY LEVEL TACTICAL INFORMATION (U) NAVAL POSTGRADUATE SCHOOL
Verification of Disarmament or Limitation of Armaments: Instruments, Negotiations, Proposals
1992-05-01
explosions and may complicate the process of detection. An even greater difficulty faced by seismologists is the ambient background of seismic "noise...suspected event would be a complex operation. It would consist of surveys of the area of the presumed nuclear explosion in order to measure ambient ...Draft Resolution to the OAS General Assembly, June 1991 and OAS Resolution "Cooperacion para la seguridad en el hemisferio. Limitacion de la
ERIC Educational Resources Information Center
Traill, David
Planning for "Operation Overlord" had been under way for about a year when General Dwight Eisenhower, commander of all the Allied forces in Europe, was ordered in February 1944 to invade the continent. Thousands of troops from the United States, Great Britain, France, Canada, and other nations were assembled in southern England and…
Draft Genome Sequence of Desulfovibrio BerOc1, a Mercury-Methylating Strain.
Goñi Urriza, Marisol; Gassie, Claire; Bouchez, Oliver; Klopp, Christophe; Guyoneaud, Rémy
2017-01-19
Desulfovibrio BerOc1 is a sulfate-reducing bacterium isolated from the Berre lagoon (French Mediterranean coast). BerOc1 is able to methylate and demethylate mercury. The genome size is 4,081,579 bp assembled into five contigs. We identified the hgcA and hgcB genes involved in mercury methylation, but not those responsible for mercury demethylation. Copyright © 2017 Goñi Urriza et al.
Adding Realism to Technical Drafting Programs
ERIC Educational Resources Information Center
Weaver, Gerald L.
1976-01-01
Suggestions for improved, relevant technical drafting programs are presented: (1) making realistic assignments, (2) viewing real projects, (3) duplicating industrial projects, (4) practicing lettering, (5) conducting research, (6) engaging in teamwork, (7) adapting to change, (8) learning to meet deadlines, and (9) stressing the importance of…
Mancuso, Carol A; Wentzel, Catherine H; Ghomrawi, Hassan M K; Kelly, Bryan T
2017-05-01
To develop a patient-derived expectations survey for hip preservation surgery. Patients were eligible if they were undergoing primary hip surgery and were recruited in person or by telephone. The survey was developed in 3 phases. During phase 1, 64 patients were interviewed preoperatively and asked open-ended questions about their expectations of surgery; a draft survey was assembled by categorizing responses. During phase 2, the survey was administered twice to another group of 50 patients preoperatively to assess test-retest reliability, and concordance was measured with weighted kappa values and intraclass correlations. All patients also completed valid standard hip surveys electronically. During phase 3, final items were selected, factor analysis was performed, and a scoring system was developed. In phase 1, 509 expectations were volunteered from which 21 distinct categories were discerned and became the items for the draft survey. In phase 2, the draft survey was completed twice, 4 days apart. In phase 3, all 21 items were retained for the final survey addressing pain, mobility, sports, resumption of active lifestyles, future function, and psychological well-being. An overall score is calculated from the number of items expected and the amount of improvement expected, and ranges from 0 to 100; a higher score indicates greater expectations. For phase 2 patients, mean scores for both administrations were 82, Cronbach alpha coefficients were 0.88 and 0.91, and the intraclass correlation was 0.92. A higher score (i.e., greater expectations) was associated with worse hip condition measured by standard hip surveys (P ≤ .05). We developed a patient-derived survey that is valid, reliable, and addresses a spectrum of expectations. The survey generates an overall score that is easy to calculate and interpret and offers a practical and comprehensive way to record patients' preoperative expectations. Level II, prognostic study, prospective sample.
Copyright © 2016 Arthroscopy Association of North America. Published by Elsevier Inc. All rights reserved.
Genome assembly and transcriptome resource for river buffalo, Bubalus bubalis (2n = 50)
Iamartino, Daniela; Pruitt, Kim D; Sonstegard, Tad; Smith, Timothy P L; Low, Wai Yee; Biagini, Tommaso; Bomba, Lorenzo; Capomaccio, Stefano; Castiglioni, Bianca; Coletta, Angelo; Corrado, Federica; Ferré, Fabrizio; Iannuzzi, Leopoldo; Lawley, Cynthia; Macciotta, Nicolò; McClure, Matthew; Mancini, Giordano; Matassino, Donato; Mazza, Raffaele; Milanesi, Marco; Moioli, Bianca; Morandi, Nicola; Ramunno, Luigi; Peretti, Vincenzo; Pilla, Fabio; Ramelli, Paola; Schroeder, Steven; Strozzi, Francesco; Thibaud-Nissen, Francoise; Zicarelli, Luigi; Ajmone-Marsan, Paolo; Valentini, Alessio; Chillemi, Giovanni; Zimin, Aleksey
2017-01-01
Water buffalo is a globally important species for agriculture and local economies. A de novo assembled, well-annotated reference sequence for the water buffalo is an important prerequisite for studying the biology of this species, and is necessary to manage genetic diversity and to use modern breeding and genomic selection techniques. However, no such genome assembly has been previously reported. There are 2 species of domestic water buffalo, the river (2n = 50) and the swamp (2n = 48) buffalo. Here we describe a draft quality reference sequence for the river buffalo created from Illumina GA and Roche 454 short read sequences using the MaSuRCA assembler. The assembled sequence is 2.83 Gb, consisting of 366 983 scaffolds with a scaffold N50 of 1.41 Mb and contig N50 of 21 398 bp. Annotation of the genome was supported by transcriptome data from 30 tissues and identified 21 711 predicted protein coding genes. Searches for complete mammalian BUSCO gene groups found 98.6% of curated single copy orthologs present among predicted genes, which suggests a high level of completeness of the genome. The annotated sequence is available from NCBI at accession GCA_000471725.1. PMID:29048578
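The scaffold and contig N50 values quoted in this abstract follow the usual definition: the length L such that sequences of length L or longer together contain at least half of the assembled bases. A minimal sketch of that calculation is given below; the function name `n50` and the toy lengths are illustrative assumptions and are not taken from the buffalo assembly or from MaSuRCA.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L together
    cover at least half of the total number of assembled bases.
    """
    if not lengths:
        raise ValueError("need at least one sequence length")
    ordered = sorted(lengths, reverse=True)   # longest sequences first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Toy example (hypothetical contig lengths, in bp): total = 300, half = 150.
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_contigs))  # 80 + 70 reaches 150, so the N50 is 70
```

Because N50 depends only on the length distribution, a large gap between scaffold N50 and contig N50, as reported here, indicates that many short contigs were linked into much longer scaffolds.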
ERIC Educational Resources Information Center
Hughes, Larry R.
This guide to teaching drafting, one in a series of instructional materials for junior high industrial arts education, is designed to assist teachers as they plan and implement new courses of study and as they make revisions and improvements in existing courses in order to integrate classroom learning with real-life experiences. This drafting…
Draft Genome Sequence of Lactobacillus helveticus ATCC 12046
2018-01-01
Lactobacillus helveticus is a lactic acid bacterium used traditionally in the dairy industry, especially in the manufacture of cheeses. We present here the 2,141,841-bp draft genome sequence of L. helveticus strain ATCC 12046, a potential starter strain for improving cheese production. PMID:29449405
Sánchez-Nieves, Rubén; Facciotti, Marc T; Saavedra-Collado, Sofía; Dávila-Santiago, Lizbeth; Rodríguez-Carrero, Roy; Montalvo-Rodríguez, Rafael
2016-03-01
The genus Halorubrum is a member of the family Halobacteriaceae which currently has the highest number of described species (31) of all the haloarchaea. Here we report the draft genome sequence of strain V5, a new species within this genus that was isolated from the solar salterns of Cabo Rojo, Puerto Rico. Assembly was performed and rendered the genome into 17 contigs (N50 = 515,834 bp), the largest of which contains 1,031,026 bp. The genome is 3.57 Mb in length with a G + C content of 67.6%. In general, the genome includes 4 rRNAs, 52 tRNAs, and 3246 protein-coding sequences. The NCBI accession number for this genome is LIST00000000 and the strain deposit number is CECT9000.
Draft Tier 2 Environmental Impact Statement for International Space Station
NASA Technical Reports Server (NTRS)
1995-01-01
The Draft Tier 2 Environmental Impact Statement (EIS) for the International Space Station (ISS) has been prepared by the National Aeronautics and Space Administration (NASA) and follows NASA's Record of Decision on the Final Tier 1 EIS for the Space Station Freedom. The Tier 2 EIS provides an updated evaluation of the environmental impacts associated with the alternatives considered: the Proposed Action and the No-Action alternative. The Proposed Action is to continue U.S. participation in the assembly and operation of ISS. The No-Action alternative would cancel NASA's participation in the Space Station Program. ISS is an international cooperative venture between NASA, the Canadian Space Agency, the European Space Agency, the Science and Technology Agency of Japan, the Russian Space Agency, and the Italian Space Agency. The purpose of the NASA action would be to further develop a human presence in space; to meet scientific, technological, and commercial research needs; and to foster international cooperation.
Silva, Paula Renata Alves da; Simões-Araújo, Jean Luiz; Vidal, Márcia Soares; Cruz, Leonardo Magalhães; Souza, Emanuel Maltempi de; Baldani, José Ivo
Paraburkholderia tropica (syn. Burkholderia tropica) are nitrogen-fixing bacteria commonly found in sugarcane. The Paraburkholderia tropica strain Ppe8 is part of the sugarcane inoculant consortium that has a beneficial effect on yield. Here, we report a draft genome sequence of this strain elucidating the mechanisms involved in its interaction mainly with Poaceae. A genome size of approximately 8.75 Mb containing 7844 protein coding genes distributed in 526 subsystems was de novo assembled with ABySS and annotated by RAST. Genes related to the nitrogen fixation process, the secretion systems (I, II, III, IV, and VI), and related to a variety of metabolic traits, such as metabolism of carbohydrates, amino acids, vitamins, and proteins, were detected, suggesting a broad metabolic capacity and possible adaptation to plant association. Copyright © 2017 Sociedade Brasileira de Microbiologia. Published by Elsevier Editora Ltda. All rights reserved.
Bain, Peter A; Papanicolaou, Alexie; Kumar, Anupama
2015-01-01
Murray-Darling rainbowfish (Melanotaenia fluviatilis [Castelnau, 1878]; Atheriniformes: Melanotaeniidae) is a small-bodied teleost currently under development in Australasia as a test species for aquatic toxicological studies. To date, efforts towards the development of molecular biomarkers of contaminant exposure have been hindered by the lack of available sequence data. To address this, we sequenced messenger RNA from brain, liver and gonads of mature male and female fish and generated a high-quality draft transcriptome using a de novo assembly approach. 149,742 clusters of putative transcripts were obtained, encompassing 43,841 non-redundant protein-coding regions. Deduced amino acid sequences were annotated by functional inference based on similarity with sequences from manually curated protein sequence databases. The draft assembly contained protein-coding regions homologous to 95.7% of the complete cohort of predicted proteins from the taxonomically related species, Oryzias latipes (Japanese medaka). The mean length of rainbowfish protein-coding sequences relative to their medaka homologues was 92.1%, indicating that despite the limited number of tissues sampled a large proportion of the total expected number of protein-coding genes was captured in the study. Because of our interest in the effects of environmental contaminants on endocrine pathways, we manually curated subsets of coding regions for putative nuclear receptors and steroidogenic enzymes in the rainbowfish transcriptome, revealing 61 candidate nuclear receptors encompassing all known subfamilies, and 41 putative steroidogenic enzymes representing all major steroidogenic enzymes occurring in teleosts. The transcriptome presented here will be a valuable resource for researchers interested in biomarker development, protein structure and function, and contaminant-response genomics in Murray-Darling rainbowfish.
Gupta, Vikas; Estrada, April D; Blakley, Ivory; Reid, Rob; Patel, Ketan; Meyer, Mason D; Andersen, Stig Uggerhøj; Brown, Allan F; Lila, Mary Ann; Loraine, Ann E
2015-01-01
Blueberries are a rich source of antioxidants and other beneficial compounds that can protect against disease. Identifying genes involved in synthesis of bioactive compounds could enable the breeding of berry varieties with enhanced health benefits. Toward this end, we annotated a previously sequenced draft blueberry genome assembly using RNA-Seq data from five stages of berry fruit development and ripening. Genome-guided assembly of RNA-Seq read alignments combined with output from ab initio gene finders produced around 60,000 gene models, of which more than half were similar to proteins from other species, typically the grape Vitis vinifera. Comparison of gene models to the PlantCyc database of metabolic pathway enzymes identified candidate genes involved in synthesis of bioactive compounds, including bixin, an apocarotenoid with potential disease-fighting properties, and defense-related cyanogenic glycosides, which are toxic. Cyanogenic glycoside (CG) biosynthetic enzymes were highly expressed in green fruit, and a candidate CG detoxification enzyme was up-regulated during fruit ripening. Candidate genes for ethylene, anthocyanin, and 400 other biosynthetic pathways were also identified. Homology-based annotation using Blast2GO and InterPro assigned Gene Ontology terms to around 15,000 genes. RNA-Seq expression profiling showed that blueberry growth, maturation, and ripening involve dynamic gene expression changes, including coordinated up- and down-regulation of metabolic pathway enzymes and transcriptional regulators. Analysis of RNA-seq alignments identified developmentally regulated alternative splicing, promoter use, and 3' end formation. We report genome sequence, gene models, functional annotations, and RNA-Seq expression data that provide an important new resource enabling high throughput studies in blueberry.
The value of new genome references.
Worley, Kim C; Richards, Stephen; Rogers, Jeffrey
2017-09-15
Genomic information has become a ubiquitous and almost essential aspect of biological research. Over the last 10-15 years, the cost of generating sequence data from DNA or RNA samples has dramatically declined and our ability to interpret those data increased just as remarkably. Although it is still possible for biologists to conduct interesting and valuable research on species for which genomic data are not available, the impact of having access to a high quality whole genome reference assembly for a given species is nothing short of transformational. Research on a species for which we have no DNA or RNA sequence data is restricted in fundamental ways. In contrast, even access to an initial draft quality genome (see below for definitions) opens a wide range of opportunities that are simply not available without that reference genome assembly. Although a complete discussion of the impact of genome sequencing and assembly is beyond the scope of this short paper, the goal of this review is to summarize the most common and highest impact contributions that whole genome sequencing and assembly has had on comparative and evolutionary biology. Copyright © 2016. Published by Elsevier Inc.
Federal Register 2010, 2011, 2012, 2013, 2014
2012-12-12
... humans. ``Fungistats'' are antimicrobial pesticides intended for aesthetic or cosmetic purposes and only... to improve protection of public health through proper use of mold-related pesticides. III. Do PR... ENVIRONMENTAL PROTECTION AGENCY [EPA-HQ-OPP-2010-0539; FRL-9362-3] Pesticides; Draft Guidance for...
Draft Genome Sequence of Lactobacillus helveticus ATCC 12046.
Palomino, María Mercedes; Burguener, Germán F; Campos, Josefina; Allievi, Mariana; Fina-Martin, Joaquina; Prado Acosta, Mariano; Fernández Do Porto, Darío A; Ruzal, Sandra M
2018-02-15
Lactobacillus helveticus is a lactic acid bacterium used traditionally in the dairy industry, especially in the manufacture of cheeses. We present here the 2,141,841-bp draft genome sequence of L. helveticus strain ATCC 12046, a potential starter strain for improving cheese production. Copyright © 2018 Palomino et al.
Federal Register 2010, 2011, 2012, 2013, 2014
2010-05-10
... Property, Plant, and Equipment. The proposed Exposure Draft represents a first step toward improving... Related to Deferred Maintenance and Repairs: Amending SFFAS 6, Accounting for Property, Plant, and Equipment AGENCY: Federal Accounting Standards Advisory Board. ACTION: Notice. Board Action: Pursuant to 31...
Federal Register 2010, 2011, 2012, 2013, 2014
2013-12-20
... DEPARTMENT OF DEFENSE Department of the Army, Corps of Engineers Intent To Prepare a Draft Supplemental Environmental Impact Statement for the Middle Mississippi River Regulating Works Project, Missouri... stabilization and sediment management to ensure adequate navigation depth and width. Project improvements are...
Improved High-Quality Draft Genome Sequence and Annotation of Burkholderia contaminans LMG 23361T.
Jung, Ji Young; Ahn, Youngbeom; Kweon, Ohgew; LiPuma, John J; Hussong, David; Marasa, Bernard S; Cerniglia, Carl E
2017-04-20
Burkholderia contaminans LMG 23361 is the type strain of the species isolated from the milk of a dairy sheep with mastitis. Some pharmaceutical products contain disinfectants such as benzalkonium chloride (BZK) and previously we reported that B. contaminans LMG 23361T possesses the ability to inactivate BZK with high biodegradation rates. Here, we report an improved high-quality draft genome sequence of this strain. Copyright © 2017 Jung et al.
Jeong, Young-Min; Kim, Namshin; Ahn, Byung Ohg; Oh, Mijin; Chung, Won-Hyong; Chung, Hee; Jeong, Seongmun; Lim, Ki-Byung; Hwang, Yoon-Jung; Kim, Goon-Bo; Baek, Seunghoon; Choi, Sang-Bong; Hyung, Dae-Jin; Lee, Seung-Won; Sohn, Seong-Han; Kwon, Soo-Jin; Jin, Mina; Seol, Young-Joo; Chae, Won Byoung; Choi, Keun Jin; Park, Beom-Seok; Yu, Hee-Ju; Mun, Jeong-Hwan
2016-07-01
This study presents a chromosome-scale draft genome sequence of radish that is assembled into nine chromosomal pseudomolecules. A comprehensive comparative genome analysis with the Brassica genomes provides genomic evidences on the evolution of the mesohexaploid radish genome. Radish (Raphanus sativus L.) is an agronomically important root vegetable crop and its origin and phylogenetic position in the tribe Brassiceae is controversial. Here we present a comprehensive analysis of the radish genome based on the chromosome sequences of R. sativus cv. WK10039. The radish genome was sequenced and assembled into 426.2 Mb spanning >98 % of the gene space, of which 344.0 Mb were integrated into nine chromosome pseudomolecules. Approximately 36 % of the genome was repetitive sequences and 46,514 protein-coding genes were predicted and annotated. Comparative mapping of the tPCK-like ancestral genome revealed that the radish genome has intermediate characteristics between the Brassica A/C and B genomes in the triplicated segments, suggesting an internal origin from the genus Brassica. The evolutionary characteristics shared between radish and other Brassica species provided genomic evidences that the current form of nine chromosomes in radish was rearranged from the chromosomes of hexaploid progenitor. Overall, this study provides a chromosome-scale draft genome sequence of radish as well as novel insight into evolution of the mesohexaploid genomes in the tribe Brassiceae.
De novo genome assembly of the soil-borne fungus and tomato pathogen Pyrenochaeta lycopersici
2014-01-01
Background: Pyrenochaeta lycopersici is a soil-dwelling ascomycete pathogen that causes corky root rot disease in tomato (Solanum lycopersicum) and other Solanaceous crops, reducing fruit yields by up to 75%. Fungal pathogens that infect roots receive less attention than those infecting the aerial parts of crops despite their significant impact on plant growth and fruit production. Results: We assembled a 54.9 Mb P. lycopersici draft genome sequence based on Illumina short reads, and annotated approximately 17,000 genes. The P. lycopersici genome is closely related to hemibiotrophs and necrotrophs, in agreement with the phenotypic characteristics of the fungus and its lifestyle. Several gene families related to host–pathogen interactions are strongly represented, including those responsible for nutrient absorption, the detoxification of fungicides and plant cell wall degradation, the latter confirming that much of the genome is devoted to the pathogenic activity of the fungus. We did not find a MAT gene, which is consistent with the classification of P. lycopersici as an imperfect fungus, but we observed a significant expansion of the gene families associated with heterokaryon incompatibility (HI). Conclusions: The P. lycopersici draft genome sequence provided insight into the molecular and genetic basis of the fungal lifestyle, characterizing previously unknown pathogenic behaviors and defining strategies that allow this asexual fungus to increase genetic diversity and to acquire new pathogenic traits. PMID:24767544
Random Amplification and Pyrosequencing for Identification of Novel Viral Genome Sequences
Hang, Jun; Forshey, Brett M.; Kochel, Tadeusz J.; Li, Tao; Solórzano, Víctor Fiestas; Halsey, Eric S.; Kuschner, Robert A.
2012-01-01
ssRNA viruses have high levels of genomic divergence, which can lead to difficulty in genomic characterization of new viruses using traditional PCR amplification and sequencing methods. In this study, random reverse transcription, anchored random PCR amplification, and high-throughput pyrosequencing were used to identify orthobunyavirus sequences from total RNA extracted from viral cultures of acute febrile illness specimens. Draft genome sequence for the orthobunyavirus L segment was assembled and sequentially extended using de novo assembly contigs from pyrosequencing reads and orthobunyavirus sequences in GenBank as guidance. Accuracy and continuous coverage were achieved by mapping all reads to the L segment draft sequence. Subsequently, RT-PCR and Sanger sequencing were used to complete the genome sequence. The complete L segment was found to be 6936 bases in length, encoding a 2248-aa putative RNA polymerase. The identified L segment was distinct from previously published South American orthobunyaviruses, sharing 63% and 54% identity at the nucleotide and amino acid level, respectively, with the complete Oropouche virus L segment and 73% and 81% identity at the nucleotide and amino acid level, respectively, with a partial Caraparu virus L segment. The result demonstrated the effectiveness of a sequence-independent amplification and next-generation sequencing approach for obtaining complete viral genomes from total nucleic acid extracts and its use in pathogen discovery. PMID:22468136
GapBlaster-A Graphical Gap Filler for Prokaryote Genomes.
de Sá, Pablo H C G; Miranda, Fábio; Veras, Adonney; de Melo, Diego Magalhães; Soares, Siomar; Pinheiro, Kenny; Guimarães, Luis; Azevedo, Vasco; Silva, Artur; Ramos, Rommel T J
2016-01-01
The advent of NGS (Next Generation Sequencing) technologies has resulted in an exponential increase in the number of complete genomes available in biological databases. This advance has allowed the development of several computational tools enabling analyses of large amounts of data in each of the various steps, from processing and quality filtering to gap filling and manual curation. The tools developed for gap closure are very useful as they result in more complete genomes, which will influence downstream analyses of genomic plasticity and comparative genomics. However, the gap filling step remains a challenge for genome assembly, often requiring manual intervention. Here, we present GapBlaster, a graphical application to evaluate and close gaps. GapBlaster was developed in the Java programming language. The software uses contigs obtained in the assembly of the genome to perform an alignment against a draft of the genome/scaffold, using BLAST or Mummer to close gaps. Then, all identified alignments of contigs that extend through the gaps in the draft sequence are presented to the user for further evaluation via the GapBlaster graphical interface. GapBlaster presents significant results compared to other similar software and has the advantage of offering a graphical interface for manual curation of the gaps. The GapBlaster program, the user guide and the test datasets are freely available at https://sourceforge.net/projects/gapblaster2015/. It requires Sun JDK 8 and Blast or Mummer.
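The core idea described above, selecting contig alignments that extend through a gap in the draft sequence, can be sketched in a few lines. The sketch below is not GapBlaster's code (which is written in Java and driven by BLAST or Mummer output). It assumes gaps are runs of N in the scaffold and that each contig alignment has already been reduced to one half-open interval on the scaffold; `Alignment`, `find_gaps`, `spanning_alignments` and the `flank` parameter are hypothetical names introduced for illustration.

```python
import re
from typing import List, NamedTuple, Tuple


class Alignment(NamedTuple):
    """Hypothetical record: where a contig lands on the scaffold."""
    contig: str
    start: int  # 0-based, inclusive
    end: int    # 0-based, exclusive


def find_gaps(scaffold: str, min_len: int = 1) -> List[Tuple[int, int]]:
    """Return (start, end) intervals of N runs, treated here as gaps."""
    return [(m.start(), m.end())
            for m in re.finditer(r"[Nn]+", scaffold)
            if m.end() - m.start() >= min_len]


def spanning_alignments(gap: Tuple[int, int],
                        alignments: List[Alignment],
                        flank: int = 1) -> List[Alignment]:
    """Keep alignments covering the gap plus `flank` bases on each side."""
    gap_start, gap_end = gap
    return [a for a in alignments
            if a.start <= gap_start - flank and a.end >= gap_end + flank]


# Toy scaffold with two gaps: positions 8-13 and 19-21.
scaffold = "ACGTACGT" + "N" * 5 + "TTGACC" + "NN" + "GGA"
hits = [Alignment("contig_a", 2, 17),
        Alignment("contig_b", 10, 24),
        Alignment("contig_c", 0, 8)]

for gap in find_gaps(scaffold):
    names = [a.contig for a in spanning_alignments(gap, hits)]
    print(gap, names)
```

In this toy case contig_a spans only the first gap and contig_b only the second, so each gap has one candidate for manual review. Requiring anchored flanks on both sides is a design choice in the sketch, not a documented GapBlaster setting.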
Federal Register 2010, 2011, 2012, 2013, 2014
2011-03-04
... DEPARTMENT OF HEALTH AND HUMAN SERVICES Call for Comments on the Draft Report of the Adult Immunization Working Group to the National Vaccine Advisory Committee on Adult Immunization: Complex Challenges..., national adult immunization program that will lead to vaccine-preventable disease reduction by improving...
Draft Genome Sequences of 37 Salmonella enterica Strains Isolated from Poultry Sources in Nigeria
Useh, Nicodemus M.; Ngbede, Emmanuel O.; Akange, Nguavese; Thomas, Milton; Foley, Andrew; Keena, Mitchel Chan; Nelson, Eric; Christopher-Hennings, Jane; Tomita, Masaru
2016-01-01
Here, we report the availability of draft genomes of several Salmonella serotypes, isolated from poultry sources from Nigeria. These genomes will help to further understand the biological diversity of S. enterica and will serve as references in microbial trace-back studies to improve food safety. PMID:27151793
Fernandes, Miriam R; Sellera, Fábio P; Moura, Quézia; Souza, Tiago A; Lincopan, Nilton
2018-03-01
Asymptomatic carriers can act as reservoirs of multidrug-resistant (MDR) bacteria. The aim of this study was to describe the draft genome sequence of a MDR Escherichia coli lineage recovered from a faecal sample of a healthy carrier. Genomic DNA was sequenced on an Illumina NextSeq platform. Sequence reads were de novo assembled using CLC Genomics Workbench and the whole genome sequence was evaluated through bioinformatics tools available from the Center of Genomic Epidemiology as well as additional in silico analysis. The genome size was calculated as 5,178,340 bp, with 5442 protein-coding sequences and 5492 total genes. Presence of the blaCTX-M-8, blaCTX-M-55 and fosA3 genes was detected in addition to other antimicrobial resistance genes. Interestingly, the strain was assigned to serotype O8:H4-fimH97 and was classified within the highly virulent phylogroup B2. This draft genome can provide helpful information to elucidate genetic features that contribute to colonisation and adaptation of MDR and virulent pathogens in asymptomatic carriers. Copyright © 2018 International Society for Chemotherapy of Infection and Cancer. Published by Elsevier Ltd. All rights reserved.
Li, Xi; Sun, Long; Zhu, Yongze; Shen, Mengyuan; Tu, Yuexing
2018-04-14
The emergence of carbapenem-resistant Escherichia coli has become a serious challenge to manage in the clinic because of multidrug resistance. Here we report the draft genome sequence of NDM-3-producing E. coli strain NT1 isolated from a bloodstream infection in China. Whole genomic DNA of E. coli strain NT1 was extracted and was sequenced using an Illumina HiSeq™ X Ten platform. The generated sequence reads were assembled using CLC Genomics Workbench. The draft genome was annotated using Rapid Annotation using Subsystem Technology (RAST). Bioinformatics analysis was further performed. The genome size was calculated at 5,353,620 bp, with 5297 protein-coding sequences and the presence of genes conferring resistance to aminoglycosides, β-lactams, quinolones, macrolides, phenicols, sulphonamides, tetracycline and trimethoprim. In addition, genes encoding virulence factors were also identified. To our knowledge, this is the first report of an E. coli strain producing NDM-3 isolated from a human bloodstream infection. The genome sequence will provide valuable information to understand antibiotic resistance mechanisms and pathogenic mechanisms in this strain. Close surveillance is urgently needed to monitor the spread of NDM-3-producing isolates. Copyright © 2018 International Society for Chemotherapy of Infection and Cancer. Published by Elsevier Ltd. All rights reserved.
Pajuelo, Mónica J.; Eguiluz, María; Dahlstrom, Eric; Requena, David; Guzmán, Frank; Ramirez, Manuel; Sheen, Patricia; Frace, Michael; Sammons, Scott; Cama, Vitaliano; Anzick, Sarah; Bruno, Dan; Mahanty, Siddhartha; Wilkins, Patricia; Nash, Theodore; Gonzalez, Armando; García, Héctor H.; Gilman, Robert H.; Porcella, Steve; Zimic, Mirko
2015-01-01
Background: Infections with Taenia solium are the most common cause of adult acquired seizures worldwide, and are the leading cause of epilepsy in developing countries. A better understanding of the genetic diversity of T. solium will improve parasite diagnostics and knowledge of transmission pathways in endemic areas, thereby facilitating the design of future control measures and interventions. Microsatellite markers are useful genome features, which enable strain typing and identification in complex pathogen genomes. Here we describe microsatellite identification and characterization in T. solium, providing information that will assist in global efforts to control this important pathogen. Methods: For genome sequencing, T. solium cysts and proglottids were collected from Huancayo and Puno in Peru, respectively. Using next generation sequencing (NGS) and de novo assembly, we assembled two draft genomes and one hybrid genome. Microsatellite sequences were identified and 36 of them were selected for further analysis. Twenty T. solium isolates were collected from Tumbes in the northern region, and twenty from Puno in the southern region of Peru. The size-polymorphism of the selected microsatellites was determined with multi-capillary electrophoresis. We analyzed the association between microsatellite polymorphism and the geographic origin of the samples. Results: The predicted size of the hybrid (proglottid genome combined with cyst genome) T. solium genome was 111 Mb with a GC content of 42.54%. A total of 7,979 contigs (>1,000 nt) were obtained. We identified 9,129 microsatellites in the Puno-proglottid genome and 9,936 in the Huancayo-cyst genome, with 5 or more repeats, ranging from mono- to hexa-nucleotide. Seven microsatellites were polymorphic and 29 were monomorphic within the analyzed isolates. T. solium tapeworms were classified into two genetic groups that correlated with the North/South geographic origin of the parasites.
Conclusions/Significance: The availability of draft genomes for T. solium represents a significant step towards understanding the biology of the parasite. We report here a set of T. solium polymorphic microsatellite markers that appear promising for genetic epidemiology studies. PMID:26697878
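The search criteria stated in this abstract (mono- to hexa-nucleotide units, 5 or more repeats) can be illustrated with a regular-expression scan. The abstract does not name the tool the authors used, so the following is only a sketch under that stated definition; `find_microsatellites` and the toy sequence are hypothetical.

```python
import re
from typing import Iterator, Tuple


def find_microsatellites(seq: str,
                         min_repeats: int = 5,
                         max_unit: int = 6) -> Iterator[Tuple[int, str, int]]:
    """Yield (start, unit, repeat_count) for perfect tandem repeats.

    The lazy quantifier tries the shortest repeat unit first, so a run
    such as ACACACACAC is reported with unit AC rather than ACAC.
    """
    if min_repeats < 2:
        raise ValueError("min_repeats must be at least 2")
    # One captured unit followed by at least (min_repeats - 1) more copies.
    pattern = re.compile(r"([ACGT]{1,%d}?)\1{%d,}" % (max_unit, min_repeats - 1))
    for match in pattern.finditer(seq.upper()):
        unit = match.group(1)
        yield match.start(), unit, len(match.group(0)) // len(unit)


# Toy sequence: A x6, AC x5, and CAG x4 (too few repeats to be reported).
toy = "GGC" + "A" * 6 + "CGT" + "AC" * 5 + "TTG" + "CAG" * 4
for start, unit, count in find_microsatellites(toy):
    print(start, unit, count)
```

Only perfect repeats are detected here; dedicated microsatellite finders typically also handle imperfect and compound repeats, which this sketch deliberately leaves out.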
1983-12-01
Initializes the data tables shared by both the Local and Network Operating Systems. 3. Invint: Written in Assembly Language. Initializes the Input/Output...connection with an appropriate type and grade of transport service and appropriate security authentication (Ref 6:38). Data Transfer within a session...V.; Kent, S. Security in Higher Level Protocols: Approaches, Alternatives and Recommendations, Draft Report ICST/HLNP-81-19, Washington, D.C.: Dept
Dhar, Hena; Swarnkar, Mohit Kumar; Gulati, Arvind; Singh, Anil Kumar; Kasana, Ramesh Chand
2015-02-19
Paenibacillus sp. strain IHB B 3415 is a cellulase-producing psychrotrophic bacterium isolated from a soil sample from the cold deserts of Himachal Pradesh, India. Here, we report an 8.44-Mb assembly of its genome sequence with a G+C content of 50.77%. The data presented here will provide insights into the mechanisms of cellulose degradation at low temperature. Copyright © 2015 Dhar et al.
First draft genome of an iconic clownfish species (Amphiprion frenatus).
Marcionetti, Anna; Rossier, Victor; Bertrand, Joris A M; Litsios, Glenn; Salamin, Nicolas
2018-02-17
Clownfishes (or anemonefishes) form an iconic group of coral reef fishes, principally known for their mutualistic interaction with sea anemones. They are characterized by particular life history traits, such as a complex social structure and mating system involving sequential hermaphroditism, coupled with an exceptionally long lifespan. Additionally, clownfishes are considered to be one of the rare groups to have experienced an adaptive radiation in the marine environment. Here, we assembled and annotated the first genome of a clownfish species, the tomato clownfish (Amphiprion frenatus). We obtained 17,801 assembled scaffolds, containing a total of 26,917 genes. The completeness of the assembly and annotation was satisfying, with 96.5% of the Actinopterygii Benchmarking Universal Single-Copy Orthologs (BUSCOs) being retrieved in A. frenatus assembly. The quality of the resulting assembly is comparable to other bony fish assemblies. This resource is valuable for advancing studies of the particular life history traits of clownfishes, as well as being useful for population genetic studies and the development of new phylogenetic markers. It will also open the way to comparative genomics. Indeed, future genomic comparison among closely related fishes may provide means to identify genes related to the unique adaptations to different sea anemone hosts, as well as better characterize the genomic signatures of an adaptive radiation. © 2018 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.
Genome assembly and transcriptome resource for river buffalo, Bubalus bubalis (2n = 50).
Williams, John L; Iamartino, Daniela; Pruitt, Kim D; Sonstegard, Tad; Smith, Timothy P L; Low, Wai Yee; Biagini, Tommaso; Bomba, Lorenzo; Capomaccio, Stefano; Castiglioni, Bianca; Coletta, Angelo; Corrado, Federica; Ferré, Fabrizio; Iannuzzi, Leopoldo; Lawley, Cynthia; Macciotta, Nicolò; McClure, Matthew; Mancini, Giordano; Matassino, Donato; Mazza, Raffaele; Milanesi, Marco; Moioli, Bianca; Morandi, Nicola; Ramunno, Luigi; Peretti, Vincenzo; Pilla, Fabio; Ramelli, Paola; Schroeder, Steven; Strozzi, Francesco; Thibaud-Nissen, Francoise; Zicarelli, Luigi; Ajmone-Marsan, Paolo; Valentini, Alessio; Chillemi, Giovanni; Zimin, Aleksey
2017-10-01
Water buffalo is a globally important species for agriculture and local economies. A de novo assembled, well-annotated reference sequence for the water buffalo is an important prerequisite for studying the biology of this species, and is necessary to manage genetic diversity and to use modern breeding and genomic selection techniques. However, no such genome assembly has been previously reported. There are 2 species of domestic water buffalo, the river (2 n = 50) and the swamp (2 n = 48) buffalo. Here we describe a draft quality reference sequence for the river buffalo created from Illumina GA and Roche 454 short read sequences using the MaSuRCA assembler. The assembled sequence is 2.83 Gb, consisting of 366 983 scaffolds with a scaffold N50 of 1.41 Mb and contig N50 of 21 398 bp. Annotation of the genome was supported by transcriptome data from 30 tissues and identified 21 711 predicted protein coding genes. Searches for complete mammalian BUSCO gene groups found 98.6% of curated single copy orthologs present among predicted genes, which suggests a high level of completeness of the genome. The annotated sequence is available from NCBI at accession GCA_000471725.1. © The Author 2017. Published by Oxford University Press.
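Scaffold and contig N50 values such as those quoted above follow the standard definition: the length L such that sequences of length L or longer contain at least half of the assembled bases. A minimal Python sketch of that definition follows; the function name `n50` and the toy lengths are illustrative and are not taken from the buffalo assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Sequences are visited from longest to shortest; the N50 is the length
    of the sequence at which the running total first reaches half of the
    total assembled length.
    """
    if not lengths:
        raise ValueError("n50 requires at least one sequence length")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length

# Toy example: total = 54, and 10 + 9 + 8 = 27 reaches half of it.
toy_n50 = n50([2, 3, 4, 5, 6, 7, 8, 9, 10])
```

Because N50 depends only on the length distribution, it says nothing about whether the sequences are correctly assembled, which is why completeness measures such as BUSCO are reported alongside it.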
Shimizu, Tokurou; Tanizawa, Yasuhiro; Mochizuki, Takako; Nagasaki, Hideki; Yoshioka, Terutaka; Toyoda, Atsushi; Fujiyama, Asao; Kaminuma, Eli; Nakamura, Yasukazu
2017-01-01
Satsuma (Citrus unshiu Marc.) is one of the most abundantly produced mandarin varieties of citrus, known for its seedless fruit production and as a breeding parent of citrus. De novo assembly of the heterozygous diploid genome of Satsuma (“Miyagawa Wase”) was conducted by a hybrid assembly approach using short-read sequences, three mate-pair libraries, and a long-read sequence of PacBio by the PLATANUS assembler. The assembled sequence, with a total size of 359.7 Mb at the N50 length of 386,404 bp, consisted of 20,876 scaffolds. Pseudomolecules of Satsuma constructed by aligning the scaffolds to three genetic maps showed genome-wide synteny to the genomes of Clementine, pummelo, and sweet orange. Gene prediction by modeling with MAKER-P proposed 29,024 genes and 37,970 mRNA; additionally, gene prediction analysis found candidates for novel genes in several biosynthesis pathways for gibberellin and violaxanthin catabolism. BUSCO scores for the assembled scaffold and predicted transcripts, and another analysis by BAC end sequence mapping indicated the assembled genome consistency was close to those of the haploid Clementine, pummelo, and sweet orange genomes. The number of repeat elements and long terminal repeat retrotransposons were comparable to those of the seven citrus genomes; this suggested no significant failure in the assembly at the repeat region. A resequencing application using the assembled sequence confirmed that both kunenbo-A and Satsuma are offspring of Kishu, and Satsuma is a back-crossed offspring of Kishu. These results illustrated the performance of the hybrid assembly approach and its ability to construct an accurate heterozygous diploid genome. PMID:29259619
Shimizu, Tokurou; Tanizawa, Yasuhiro; Mochizuki, Takako; Nagasaki, Hideki; Yoshioka, Terutaka; Toyoda, Atsushi; Fujiyama, Asao; Kaminuma, Eli; Nakamura, Yasukazu
2017-01-01
Satsuma (Citrus unshiu Marc.) is one of the most abundantly produced mandarin varieties of citrus, known for its seedless fruit production and as a breeding parent of citrus. De novo assembly of the heterozygous diploid genome of Satsuma ("Miyagawa Wase") was conducted by a hybrid assembly approach using short-read sequences, three mate-pair libraries, and a long-read sequence of PacBio by the PLATANUS assembler. The assembled sequence, with a total size of 359.7 Mb at the N50 length of 386,404 bp, consisted of 20,876 scaffolds. Pseudomolecules of Satsuma constructed by aligning the scaffolds to three genetic maps showed genome-wide synteny to the genomes of Clementine, pummelo, and sweet orange. Gene prediction by modeling with MAKER-P proposed 29,024 genes and 37,970 mRNA; additionally, gene prediction analysis found candidates for novel genes in several biosynthesis pathways for gibberellin and violaxanthin catabolism. BUSCO scores for the assembled scaffold and predicted transcripts, and another analysis by BAC end sequence mapping indicated the assembled genome consistency was close to those of the haploid Clementine, pummelo, and sweet orange genomes. The number of repeat elements and long terminal repeat retrotransposons were comparable to those of the seven citrus genomes; this suggested no significant failure in the assembly at the repeat region. A resequencing application using the assembled sequence confirmed that both kunenbo-A and Satsuma are offspring of Kishu, and Satsuma is a back-crossed offspring of Kishu. These results illustrated the performance of the hybrid assembly approach and its ability to construct an accurate heterozygous diploid genome.
da Silva, Fábio Daniel Florêncio; Lima, Alex Ranieri Jerônimo; Moraes, Pablo Henrique Gonçalves; Siqueira, Andrei Santos; Dall’Agnol, Leonardo Teixeira; Baraúna, Anna Rafaella Ferreira; Martins, Luisa Carício; Oliveira, Karol Guimarães; de Lima, Clayton Pereira Silva; Nunes, Márcio Roberto Teixeira; Vianez-Júnior, João Lídio Silva Gonçalves
2016-01-01
Ecological interactions between cyanobacteria and heterotrophic prokaryotes are poorly known. To improve the genomic studies of heterotrophic bacterium-cyanobacterium associations, the draft genome sequence (3.2 Mbp) of Limnobacter sp. strain CACIAM 66H1, found in a nonaxenic culture of Synechococcus sp. (cyanobacteria), is presented here. PMID:27198027
Improving Student Drafting Knowledge and Skills through the Use of Technology.
ERIC Educational Resources Information Center
Ramsey, Steven A.; Vedder, Charles V.
A program was developed to infuse technology into drafting strategies to enhance learning of 11th- and 12th-grade students in a middle-class community in central Illinois who exhibited signs of inadequate achievement related to technology use. To document the extent of students' lack of understanding technology, surveys regarding attitudes toward…
Tutor Feedback on Draft Essays: Developing Students' Academic Writing and Subject Knowledge
ERIC Educational Resources Information Center
Court, Krista
2014-01-01
Providing feedback on draft essays is an accepted means of enacting a social-constructivist approach to assessment, aligning with current views on the value of formative feedback and assessment for learning (AFL). However, the use of this process as a means of improving not only content but also students' academic writing skills has not been…
2011-03-01
utilizing aqueous ammonia used to control nitrogen oxide and dry flue gas desulfurization used to control sulfur dioxide) will be included as part of...blowers; boiler combustion air and forced draft fans; boiler flue gas; induced draft fans and stacks; as well as extensions of the plant control
2011-03-01
aqueous ammonia used to control nitrogen oxide and dry flue gas desulfurization used to control sulfur dioxide) will be included as part of the...boiler combustion air and forced draft fans; boiler flue gas; induced draft fans and stacks; as well as extensions of the plant control; electrical
Draft Genome Sequences of 37 Salmonella enterica Strains Isolated from Poultry Sources in Nigeria.
Useh, Nicodemus M; Ngbede, Emmanuel O; Akange, Nguavese; Thomas, Milton; Foley, Andrew; Keena, Mitchel Chan; Nelson, Eric; Christopher-Hennings, Jane; Tomita, Masaru; Suzuki, Haruo; Scaria, Joy
2016-05-05
Here, we report the availability of draft genomes of several Salmonella serotypes, isolated from poultry sources from Nigeria. These genomes will help to further understand the biological diversity of S. enterica and will serve as references in microbial trace-back studies to improve food safety. Copyright © 2016 Useh et al.
Draft Genome Sequence of Fish Pathogen Aeromonas bestiarum GA97-22.
Kumru, Salih; Tekedar, Hasan C; Griffin, Matt J; Waldbieser, Geoffrey C; Liles, Mark R; Sonstegard, Tad; Schroeder, Steven G; Lawrence, Mark L; Karsi, Attila
2018-06-14
Aeromonas bestiarum is a Gram-negative mesophilic motile bacterium causing acute hemorrhagic septicemia or chronic skin ulcers in fish. Here, we report the draft genome sequence of A. bestiarum strain GA97-22, which was isolated from rainbow trout in 1997. This genome sequence will improve our understanding of the complex taxonomy of motile aeromonads.
Federal Register 2010, 2011, 2012, 2013, 2014
2010-02-04
... DEPARTMENT OF DEFENSE Department of the Army; Corps of Engineers Intent To Prepare a Draft... Feasibility Report investigated increased widths and depths in the Pascagoula and Bayou Casotte navigation... Gulf Entrance channel to 44 feet by 550 feet from the 44-foot depth contour in the Gulf of Mexico to...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tiwari, Ravi; Howieson, John; Yates, Ron
Bradyrhizobium sp. WSM1253 is a novel N2-fixing bacterium isolated from a root nodule of the herbaceous annual legume Ornithopus compressus that was growing on the Greek Island of Sifnos. WSM1253 emerged as a strain of interest in an Australian program that was selecting inoculant quality bradyrhizobial strains for inoculation of Mediterranean species of lupins (Lupinus angustifolius, L. princei, L. atlanticus, L. pilosus). In this report we describe, for the first time, the genome sequence information and annotation of this legume microsymbiont. The 8,719,808 bp genome has a G + C content of 63.09% with 71 contigs arranged into two scaffolds. The assembled genome contains 8,432 protein-coding genes, 66 RNA genes and a single rRNA operon. In conclusion, this improved-high-quality draft rhizobial genome is one of 20 sequenced through a DOE Joint Genome Institute 2010 Community Sequencing Project.
Tiwari, Ravi; Howieson, John; Yates, Ron; ...
2015-11-30
Bradyrhizobium sp. WSM1253 is a novel N2-fixing bacterium isolated from a root nodule of the herbaceous annual legume Ornithopus compressus that was growing on the Greek Island of Sifnos. WSM1253 emerged as a strain of interest in an Australian program that was selecting inoculant quality bradyrhizobial strains for inoculation of Mediterranean species of lupins (Lupinus angustifolius, L. princei, L. atlanticus, L. pilosus). In this report we describe, for the first time, the genome sequence information and annotation of this legume microsymbiont. The 8,719,808 bp genome has a G + C content of 63.09% with 71 contigs arranged into two scaffolds. The assembled genome contains 8,432 protein-coding genes, 66 RNA genes and a single rRNA operon. In conclusion, this improved-high-quality draft rhizobial genome is one of 20 sequenced through a DOE Joint Genome Institute 2010 Community Sequencing Project.
SWARM : a scientific workflow for supporting Bayesian approaches to improve metabolic models.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shi, X.; Stevens, R.; Mathematics and Computer Science
2008-01-01
With the exponential growth of complete genome sequences, the analysis of these sequences is becoming a powerful approach to build genome-scale metabolic models. These models can be used to study individual molecular components and their relationships, and eventually study cells as systems. However, constructing genome-scale metabolic models manually is time-consuming and labor-intensive. As a result, far fewer genome-scale metabolic models are available than the hundreds of genome sequences available. To tackle this problem, we design SWARM, a scientific workflow that can be utilized to improve genome-scale metabolic models in a high-throughput fashion. SWARM deals with a range of issues including the integration of data across distributed resources, data format conversions, data update, and data provenance. Taken together, SWARM streamlines the whole modeling process, which includes extracting data from various resources, deriving training datasets to train a set of predictors and applying Bayesian techniques to assemble the predictors, inferring on the ensemble of predictors to insert missing data, and eventually improving draft metabolic networks automatically. By enhancing metabolic model construction, SWARM enables scientists to generate many genome-scale metabolic models within a short period of time and with less effort.
The Development of Statistics Textbook Supported with ICT and Portfolio-Based Assessment
NASA Astrophysics Data System (ADS)
Hendikawati, Putriaji; Yuni Arini, Florentina
2016-02-01
This development research aimed to produce a Statistics textbook model supported by information and communication technology (ICT) and Portfolio-Based Assessment. The book was designed for college mathematics students to improve their ability in mathematical connection and communication. The research had three stages: define, design, and develop. The textbook consisted of 10 chapters, each containing an introduction, core materials, examples, and exercises. The development phase began with the initial design of the book (draft 1), which was then validated by experts. Revision of draft 1 produced draft 2, which underwent a limited readability test. Revision of draft 2 produced draft 3, which was trialled on a small sample to produce a valid model textbook. The data were analysed with descriptive statistics. The analysis showed that the Statistics textbook model supported by ICT and Portfolio-Based Assessment was valid and met the criteria of practicality.
Shearman, Jeremy R.; Sangsrakru, Duangjai; Jomchai, Nukoon; Ruang-areerate, Panthita; Sonthirod, Chutima; Naktang, Chaiwat; Theerawattanasuk, Kanikar; Tragoonrung, Somvong; Tangphatsornruang, Sithichoke
2015-01-01
Hevea brasiliensis, or rubber tree, is an important crop species that accounts for the majority of natural latex production. The rubber tree nuclear genome consists of 18 chromosomes and is roughly 2.15 Gb. The current rubber tree reference genome assembly consists of 1,150,326 scaffolds ranging from 200 to 531,465 bp and totalling 1.1 Gb. Only 143 scaffolds, totalling 7.6 Mb, have been placed into linkage groups. We have performed RNA-seq on 6 varieties of rubber tree to identify SNPs and InDels and used this information to perform target sequence enrichment and high throughput sequencing to genotype a set of SNPs in 149 rubber tree offspring from a cross between RRIM 600 and RRII 105 rubber tree varieties. We used this information to generate a linkage map allowing for the anchoring of 24,424 contigs from 3,009 scaffolds, totalling 115 Mb or 10.4% of the published sequence, into 18 linkage groups. Each linkage group contains between 319 and 1367 SNPs, or 60 to 194 non-redundant marker positions, and ranges from 156 to 336 cM in length. This linkage map includes 20,143 of the 69,300 predicted genes from rubber tree and will be useful for mapping studies and improving the reference genome assembly. PMID:25831195
Shearman, Jeremy R; Sangsrakru, Duangjai; Jomchai, Nukoon; Ruang-Areerate, Panthita; Sonthirod, Chutima; Naktang, Chaiwat; Theerawattanasuk, Kanikar; Tragoonrung, Somvong; Tangphatsornruang, Sithichoke
2015-01-01
Hevea brasiliensis, or rubber tree, is an important crop species that accounts for the majority of natural latex production. The rubber tree nuclear genome consists of 18 chromosomes and is roughly 2.15 Gb. The current rubber tree reference genome assembly consists of 1,150,326 scaffolds ranging from 200 to 531,465 bp and totalling 1.1 Gb. Only 143 scaffolds, totalling 7.6 Mb, have been placed into linkage groups. We have performed RNA-seq on 6 varieties of rubber tree to identify SNPs and InDels and used this information to perform target sequence enrichment and high throughput sequencing to genotype a set of SNPs in 149 rubber tree offspring from a cross between RRIM 600 and RRII 105 rubber tree varieties. We used this information to generate a linkage map allowing for the anchoring of 24,424 contigs from 3,009 scaffolds, totalling 115 Mb or 10.4% of the published sequence, into 18 linkage groups. Each linkage group contains between 319 and 1367 SNPs, or 60 to 194 non-redundant marker positions, and ranges from 156 to 336 cM in length. This linkage map includes 20,143 of the 69,300 predicted genes from rubber tree and will be useful for mapping studies and improving the reference genome assembly.
Federal Register 2010, 2011, 2012, 2013, 2014
2013-07-05
... operating conditions for vehicular and pedestrian traffic; Improve capacity of the local roadway network; Improve local mobility; reduce congestion; improve emergency response times; and Improve evacuation...
Refining a taxonomy for guideline implementation: results of an exercise in abstract classification
2013-01-01
Background To better understand the efficacy of various implementation strategies, improved methods for describing and classifying the nature of these strategies are urgently required. The aim of this study was to develop and pilot the feasibility of a taxonomy to classify the nature and content of implementation strategies. Methods A draft implementation taxonomy was developed based on the Cochrane Effective Practice and Organisation of Care (EPOC) data collection checklist. The draft taxonomy had four domains (professional, financial, organisational and regulatory) covering 49 distinct strategies. We piloted the draft taxonomy by using it to classify the implementation strategies described in the conference abstracts of the implementation stream of the 2010 Guideline International Network Conference. Five authors classified the strategies in each abstract individually. Final categorisation was then carried out in a face-to-face consensus meeting involving three authors. Results The implementation strategies described in 71 conference abstracts were classified. Approximately 15.5% of abstracts utilised strategies that could not be categorised using the draft taxonomy. Of those strategies that could be categorised, the majority were professionally focused (57%). A total of 41% of projects used only one implementation strategy, with 29% using two and 31% three or more. The three most commonly used strategies were changes in quality assurance, quality improvement and/or performance measurement systems, changes in information and communication technology, and distribution of guideline materials (via hard-copy, audio-visual and/or electronic means). Conclusions Further refinement of the draft taxonomy is required to provide hierarchical dimensions and granularity, particularly in the areas of patient-focused interventions, those concerned with audit and feedback and quality improvement, and electronic forms of implementation, including electronic decision support. 
PMID:23497520
Refining a taxonomy for guideline implementation: results of an exercise in abstract classification.
Mazza, Danielle; Bairstow, Phillip; Buchan, Heather; Chakraborty, Samantha Paubrey; Van Hecke, Oliver; Grech, Cathy; Kunnamo, Ilkka
2013-03-15
To better understand the efficacy of various implementation strategies, improved methods for describing and classifying the nature of these strategies are urgently required. The aim of this study was to develop and pilot the feasibility of a taxonomy to classify the nature and content of implementation strategies. A draft implementation taxonomy was developed based on the Cochrane Effective Practice and Organisation of Care (EPOC) data collection checklist. The draft taxonomy had four domains (professional, financial, organisational and regulatory) covering 49 distinct strategies. We piloted the draft taxonomy by using it to classify the implementation strategies described in the conference abstracts of the implementation stream of the 2010 Guideline International Network Conference. Five authors classified the strategies in each abstract individually. Final categorisation was then carried out in a face-to-face consensus meeting involving three authors. The implementation strategies described in 71 conference abstracts were classified. Approximately 15.5% of abstracts utilised strategies that could not be categorised using the draft taxonomy. Of those strategies that could be categorised, the majority were professionally focused (57%). A total of 41% of projects used only one implementation strategy, with 29% using two and 31% three or more. The three most commonly used strategies were changes in quality assurance, quality improvement and/or performance measurement systems, changes in information and communication technology, and distribution of guideline materials (via hard-copy, audio-visual and/or electronic means). Further refinement of the draft taxonomy is required to provide hierarchical dimensions and granularity, particularly in the areas of patient-focused interventions, those concerned with audit and feedback and quality improvement, and electronic forms of implementation, including electronic decision support.
Moskalev, Alexey А; Kudryavtseva, Anna V; Graphodatsky, Alexander S; Beklemisheva, Violetta R; Serdyukova, Natalya A; Krutovsky, Konstantin V; Sharov, Vadim V; Kulakovskiy, Ivan V; Lando, Andrey S; Kasianov, Artem S; Kuzmin, Dmitry A; Putintseva, Yuliya A; Feranchuk, Sergey I; Shaposhnikov, Mikhail V; Fraifeld, Vadim E; Toren, Dmitri; Snezhkina, Anastasia V; Sitnik, Vasily V
2017-12-28
The gray whale, Eschrichtius robustus (E. robustus), is the sole member of the family Eschrichtiidae, which is considered to be the most primitive in the class Cetacea. The gray whale is often described as a "living fossil". It is adapted to extreme marine conditions and has a high life expectancy (77 years). The assembly of a gray whale genome and transcriptome will enable further studies of whale evolution, longevity, and resistance to extreme environments. In this work, we report the first de novo assembly and primary analysis of the E. robustus genome and transcriptome based on kidney and liver samples. The presented draft genome assembly is 55% complete in terms of total genome length, but only 24% complete in terms of BUSCO complete gene groups, although 10,895 genes were identified. Transcriptome annotation and comparison with other whale species revealed robust expression of DNA repair and hypoxia-response genes, which is expected for whales. This preliminary study of the gray whale genome and transcriptome provides new data to better understand whale evolution and the mechanisms of adaptation to hypoxic conditions.
Exome capture from the spruce and pine giga-genomes.
Suren, H; Hodgins, K A; Yeaman, S; Nurkowski, K A; Smets, P; Rieseberg, L H; Aitken, S N; Holliday, J A
2016-09-01
Sequence capture is a flexible tool for generating reduced representation libraries, particularly in species with massive genomes. We used an exome capture approach to sequence the gene space of two of the dominant species in Canadian boreal and montane forests - interior spruce (Picea glauca x engelmanii) and lodgepole pine (Pinus contorta). Transcriptome data generated with RNA-seq were coupled with draft genome sequences to design baits corresponding to 26 824 genes from pine and 28 649 genes from spruce. A total of 579 samples for spruce and 631 samples for pine were included, as well as two pine congeners and six spruce congeners. More than 50% of targeted regions were sequenced at >10× depth in each species, while ~12% captured near-target regions within 500 bp of a bait position were sequenced to a depth >10×. Much of our read data arose from off-target regions, which was likely due to the fragmented and incomplete nature of the draft genome assemblies. Capture in general was successful for the related species, suggesting that baits designed for a single species are likely to successfully capture sequences from congeners. From these data, we called approximately 10 million SNPs and INDELs in each species from coding regions, introns, untranslated and flanking regions, as well as from the intergenic space. Our study demonstrates the utility of sequence capture for resequencing in complex conifer genomes, suggests guidelines for improving capture efficiency and provides a rich resource of genetic variants for studies of selection and local adaptation in these species. © 2016 John Wiley & Sons Ltd.
da Silva, Fábio Daniel Florêncio; Lima, Alex Ranieri Jerônimo; Moraes, Pablo Henrique Gonçalves; Siqueira, Andrei Santos; Dall'Agnol, Leonardo Teixeira; Baraúna, Anna Rafaella Ferreira; Martins, Luisa Carício; Oliveira, Karol Guimarães; de Lima, Clayton Pereira Silva; Nunes, Márcio Roberto Teixeira; Vianez-Júnior, João Lídio Silva Gonçalves; Gonçalves, Evonnildo Costa
2016-05-19
Ecological interactions between cyanobacteria and heterotrophic prokaryotes are poorly known. To improve the genomic studies of heterotrophic bacterium-cyanobacterium associations, the draft genome sequence (3.2 Mbp) of Limnobacter sp. strain CACIAM 66H1, found in a nonaxenic culture of Synechococcus sp. (cyanobacteria), is presented here. Copyright © 2016 da Silva et al.
Do It Right! Requiring Multiple Submissions of Math and NMR Analysis Assignments in the Laboratory
ERIC Educational Resources Information Center
Slade, David J.
2017-01-01
The first-semester introductory organic chemistry laboratory has been adapted to include mini postlab assignments that students must complete correctly, through as many attempts as prove to be necessary. The use of multiple drafts of writing assignments is a standard approach to improving writing, so the system was designed to require drafts for…
Trubitsyn, Denis; Geurink, Corey; Pikuta, Elena; Lefèvre, Christopher T.; McShan, W. Michael; Gillaspy, Allison F.
2014-01-01
Desulfonatronum thiodismutans strain MLF1, an alkaliphilic bacterium capable of sulfate reduction, was isolated from Mono Lake, California. Here we report the 3.92-Mb draft genome sequence comprising 34 contigs and some results of its automated annotation. These data will improve our knowledge of mechanisms by which bacteria withstand extreme environments. PMID:25081260
Questions to Consider When Reviewing Draft WIOA State Plans. WIOA Game Plan for Low-Income People
ERIC Educational Resources Information Center
Center for Law and Social Policy, Inc. (CLASP), 2016
2016-01-01
As states release draft Workforce Innovation and Opportunity Act (WIOA) state plans for public comment, advocates and other stakeholders have an important opportunity to improve them. The Center for Law and Social Policy (CLASP) is offering a list of questions, which is included in this document, to consider when reviewing state plans. These…
Power, Susan E.; Harris, Hugh M. B.; Bottacini, Francesca; Ross, R. Paul; O’Toole, Paul W.
2013-01-01
Here we report the 1.86-Mb draft genome sequence of Lactobacillus crispatus EM-LC1, a fecal isolate with antimicrobial activity. This genome sequence is expected to provide insights into the antimicrobial activity of L. crispatus and improve our knowledge of its potential probiotic traits. PMID:24356836
The Nuclear and Mitochondrial Genomes of the Facultatively Eusocial Orchid Bee Euglossa dilemma
Brand, Philipp; Saleh, Nicholas; Pan, Hailin; Li, Cai; Kapheim, Karen M.; Ramírez, Santiago R.
2017-01-01
Bees provide indispensable pollination services to both agricultural crops and wild plant populations, and several species of bees have become important models for the study of learning and memory, plant–insect interactions, and social behavior. Orchid bees (Apidae: Euglossini) are especially important to the fields of pollination ecology, evolution, and species conservation. Here we report the nuclear and mitochondrial genome sequences of the orchid bee Euglossa dilemma Bembé & Eltz. E. dilemma was selected because it is widely distributed, highly abundant, and it was recently naturalized in the southeastern United States. We provide a high-quality assembly of the 3.3 Gb genome, and an official gene set of 15,904 gene annotations. We find high conservation of gene synteny with the honey bee throughout 80 MY of divergence time. This genomic resource represents the first draft genome of the orchid bee genus Euglossa, and the first draft orchid bee mitochondrial genome, thus representing a valuable resource to the research community. PMID:28701376
The Nuclear and Mitochondrial Genomes of the Facultatively Eusocial Orchid Bee Euglossa dilemma.
Brand, Philipp; Saleh, Nicholas; Pan, Hailin; Li, Cai; Kapheim, Karen M; Ramírez, Santiago R
2017-09-07
Bees provide indispensable pollination services to both agricultural crops and wild plant populations, and several species of bees have become important models for the study of learning and memory, plant-insect interactions, and social behavior. Orchid bees (Apidae: Euglossini) are especially important to the fields of pollination ecology, evolution, and species conservation. Here we report the nuclear and mitochondrial genome sequences of the orchid bee Euglossa dilemma Bembé & Eltz. E. dilemma was selected because it is widely distributed, highly abundant, and it was recently naturalized in the southeastern United States. We provide a high-quality assembly of the 3.3 Gb genome, and an official gene set of 15,904 gene annotations. We find high conservation of gene synteny with the honey bee throughout 80 MY of divergence time. This genomic resource represents the first draft genome of the orchid bee genus Euglossa , and the first draft orchid bee mitochondrial genome, thus representing a valuable resource to the research community. Copyright © 2017 Brand et al.
NASA Astrophysics Data System (ADS)
The U.S. Integrated Ocean Observing System (IOOS) is encouraging public comment on the draft plan for its Data Management and Communications (DMAC) component. The deadline for receipt of comments has been extended to 18 November 2003. The plan can be found at http://www.dmac.ocean.us/dacsc/imp_plan.jsp. The plan was developed by the DMAC Steering Committee, which includes representatives from federal and state agencies, private industry, and academia. This committee was tasked by Ocean.US (the national office for IOOS) with the preparation of a detailed, phased DMAC implementation plan, and initial oversight of its implementation. The scope of the plan includes the IOOS DMAC infrastructure, data archive and access, and basic information products needed for assessing the availability and quality of data within IOOS. Four expert teams (Data Transport, Metadata and Data Discovery, Data Archive and Access, Applications and Products), and two outreach teams (Data Facilities Management, and User Outreach), were assembled to assist in developing material for the plan.
DCT based interpolation filter for motion compensation in HEVC
NASA Astrophysics Data System (ADS)
Alshin, Alexander; Alshina, Elena; Park, Jeong Hoon; Han, Woo-Jin
2012-10-01
The High Efficiency Video Coding (HEVC) draft standard has the challenging goal of doubling coding efficiency compared to H.264/AVC. Many aspects of the traditional hybrid coding framework were improved during development of the new standard. Motion compensated prediction, in particular the interpolation filter, is one area that was improved significantly over H.264/AVC. This paper presents the details of the interpolation filter design of the draft HEVC standard. The coding efficiency improvements over the H.264/AVC interpolation filter are studied and experimental results are presented, which show a 4.0% average bitrate reduction for the Luma component and an 11.3% average bitrate reduction for the Chroma component. The coding efficiency gains are significant for some video sequences and can reach up to 21.7%.
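As a rough illustration of the idea behind a DCT-based interpolation filter, the sketch below derives unscaled filter weights by taking a forward DCT-II of a window of integer-position samples and evaluating the inverse transform at a fractional position. This is a minimal sketch under stated assumptions: the function names, the 8-tap default, and the floating-point weights are illustrative choices, not the filter design of the draft standard, which the abstract does not specify.

```python
import math


def dct_interp_weights(alpha, taps=8):
    """Unscaled DCT-based interpolation weights (hypothetical helper).

    The window holds `taps` integer-position samples indexed 0..taps-1.
    The interpolated position is (taps // 2 - 1) + alpha, i.e. a fraction
    `alpha` in [0, 1) past the sample just left of the window centre.
    """
    n = taps
    x = (n // 2 - 1) + alpha
    weights = []
    for m in range(n):
        # Forward DCT-II followed by the inverse DCT evaluated at position x,
        # collapsed into a single weight for sample m.
        acc = 1.0
        for k in range(1, n):
            acc += 2.0 * math.cos((2 * m + 1) * math.pi * k / (2 * n)) \
                       * math.cos((2 * x + 1) * math.pi * k / (2 * n))
        weights.append(acc / n)
    return weights


def interpolate(samples, alpha):
    """Interpolate one value from a window of samples (illustrative only)."""
    weights = dct_interp_weights(alpha, taps=len(samples))
    return sum(w * s for w, s in zip(weights, samples))


if __name__ == "__main__":
    # Half-sample weights for an 8-sample window.
    print([round(w, 4) for w in dct_interp_weights(0.5)])
    # A constant signal is reproduced at any fractional position.
    print(round(interpolate([10.0] * 8, 0.25), 6))
```

A practical codec would round such weights to integers with a fixed normalisation and may adjust them further; those steps are deliberately left out here.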
Bürgin, M T; Bürkli, P
2002-11-01
At the end of May 2002, the draft of the Swiss "Federal Act on Research on Surplus Embryos and Embryonic Stem Cells" (EFG, Embryonic Research Act) reached the pre-legislative consultation stage. Under certain conditions, it would allow research on "surplus" embryos from in-vitro fertilization, and the derivation of embryonic stem cells from surplus embryos for research purposes. The EFG draft defines an embryo as "the developing organism from the point of nuclear fusion until the completion of organ development". New technological developments show that embryo-like entities can also be created without nuclear fusion having taken place. It remains unclear how to treat embryonic entities that don't fall under the draft's narrow definition of an embryo. Expanding this definition would be a welcome improvement.
NASA Technical Reports Server (NTRS)
Radtke, Robert; Woolley, Charles; Arnold, Lana
1993-01-01
The purpose of the NASA Space Assembly and Servicing Working Group (SASWG) is to study enabling technologies for on-orbit spacecraft maintenance and servicing. One key technology required for effective space logistics activity is the development of standard spacecraft interfaces, including the 'Basic Set' defined by NASA, the U.S. Space Command, and industry panelists to be the following: (1) navigation aids; (2) grasping, berthing, and docking; and (3) utility connections for power, data, and fluids. Draft standards have been prepared and referred to professional standards organizations, including the AIAA, EIA, and SAE space standards committee. The objective of the SASWG is to support these committees with the technical expertise required to prepare standards, guidelines, and recommended practices which will be accepted by the ANSI and international standards organizations, including the ISO, IEC, and PASC.
Dy, Sydney M; Al Hamayel, Nebras Abu; Hannum, Susan M; Sharma, Ritu; Isenberg, Sarina R; Kuchinad, Kamini; Zhu, Junya; Smith, Katherine; Lorenz, Karl A; Kamal, Arif H; Walling, Anne M; Weaver, Sallie J
2017-12-01
Although critical for improving patient outcomes, palliative care quality indicators are not yet widely used. Better understanding of facilitators and barriers to palliative care quality measurement and improvement might improve their use and program quality. Development of a survey tool to assess palliative care team perspectives on facilitators and barriers to quality measurement and improvement in palliative care programs. We used the adapted Consolidated Framework for Implementation Research to define domains and constructs to select instruments. We assembled a draft survey and assessed content validity through pilot testing and cognitive interviews with experts and frontline practitioners for key items. We analyzed responses using a constant comparative process to assess survey item issues and potential solutions. We developed a final survey using these results. The survey includes five published instruments and two additional item sets. Domains include organizational characteristics, individual and team characteristics, intervention characteristics, and process of implementation. Survey modules include Quality Improvement in Palliative Care, Implementing Quality Improvement in the Palliative Care Program, Teamwork and Communication, Measuring the Quality of Palliative Care, and Palliative Care Quality in Your Program. Key refinements from cognitive interviews included item wording on palliative care team members, programs, and quality issues. This novel, adaptable instrument assesses palliative care team perspectives on barriers and facilitators for quality measurement and improvement in palliative care programs. Next steps include evaluation of the survey's construct validity and how survey results correlate with findings from program quality initiatives. Copyright © 2017 American Academy of Hospice and Palliative Medicine. All rights reserved.
Read, Timothy D; Petit, Robert A; Joseph, Sandeep J; Alam, Md Tauqeer; Weil, M Ryan; Ahmad, Maida; Bhimani, Ravila; Vuong, Jocelyn S; Haase, Chad P; Webb, D Harry; Tan, Milton; Dove, Alistair D M
2017-07-14
The whale shark (Rhincodon typus) has by far the largest body size of any elasmobranch (shark or ray) species. Therefore, it is also the largest extant species of the paraphyletic assemblage commonly referred to as fishes. As both a phenotypic extreme and a member of the group Chondrichthyes - the sister group to the remaining gnathostomes, which includes all tetrapods and therefore also humans - its genome is of substantial comparative interest. Whale sharks are also listed as an endangered species on the International Union for Conservation of Nature's Red List of threatened species and are of growing popularity as both a target of ecotourism and as a charismatic conservation ambassador for the pelagic ecosystem. A genome map for this species would aid in defining effective conservation units and understanding global population structure. We characterised the nuclear genome of the whale shark using next generation sequencing (454, Illumina) and de novo assembly and annotation methods, based on material collected from the Georgia Aquarium. The data set consisted of 878,654,233 reads, which yielded a draft assembly of 1,213,200 contigs and 997,976 scaffolds. The estimated genome size was 3.44Gb. As expected, the proteome of the whale shark was most closely related to the only other complete genome of a cartilaginous fish, the holocephalan elephant shark. The whale shark contained a novel Toll-like-receptor (TLR) protein with sequence similarity to both the TLR4 and TLR13 proteins of mammals and TLR21 of teleosts. The data are publicly available on GenBank, FigShare, and from the NCBI Short Read Archive under accession number SRP044374. This represents the first shotgun elasmobranch genome and will aid studies of molecular systematics, biogeography, genetic differentiation, and conservation genetics in this and other shark species, as well as providing comparative data for studies of evolutionary biology and immunology across the jawed vertebrate lineages.
A High Quality Draft Consensus Sequence of the Genome of a Heterozygous Grapevine Variety
Cartwright, Dustin A.; Cestaro, Alessandro; Pruss, Dmitry; Pindo, Massimo; FitzGerald, Lisa M.; Vezzulli, Silvia; Reid, Julia; Malacarne, Giulia; Iliev, Diana; Coppola, Giuseppina; Wardell, Bryan; Micheletti, Diego; Macalma, Teresita; Facci, Marco; Mitchell, Jeff T.; Perazzolli, Michele; Eldredge, Glenn; Gatto, Pamela; Oyzerski, Rozan; Moretto, Marco; Gutin, Natalia; Stefanini, Marco; Chen, Yang; Segala, Cinzia; Davenport, Christine; Demattè, Lorenzo; Mraz, Amy; Battilana, Juri; Stormo, Keith; Costa, Fabrizio; Tao, Quanzhou; Si-Ammour, Azeddine; Harkins, Tim; Lackey, Angie; Perbost, Clotilde; Taillon, Bruce; Stella, Alessandra; Solovyev, Victor; Fawcett, Jeffrey A.; Sterck, Lieven; Vandepoele, Klaas; Grando, Stella M.; Toppo, Stefano; Moser, Claudio; Lanchbury, Jerry; Bogden, Robert; Skolnick, Mark; Sgaramella, Vittorio; Bhatnagar, Satish K.; Fontana, Paolo; Gutin, Alexander; Van de Peer, Yves; Salamini, Francesco; Viola, Roberto
2007-01-01
Background Worldwide, grapes and their derived products have a large market. The cultivated grape species Vitis vinifera has potential to become a model for fruit tree genetics. Like many plant species, it is highly heterozygous, which is an additional challenge to modern whole genome shotgun sequencing. In this paper a high quality draft genome sequence of a cultivated clone of V. vinifera Pinot Noir is presented. Principal Findings We estimate the genome size of V. vinifera to be 504.6 Mb. Genomic sequences corresponding to 477.1 Mb were assembled in 2,093 metacontigs and 435.1 Mb were anchored to the 19 linkage groups (LGs). The number of predicted genes is 29,585, of which 96.1% were assigned to LGs. This assembly of the grape genome provides candidate genes implicated in traits relevant to grapevine cultivation, such as those influencing wine quality, via secondary metabolites, and those connected with the extreme susceptibility of grape to pathogens. Single nucleotide polymorphism (SNP) distribution was consistent with a diffuse haplotype structure across the genome. Of around 2,000,000 SNPs, 1,751,176 were mapped to chromosomes and one or more of them were identified in 86.7% of anchored genes. The relative age of grape duplicated genes was estimated and this made it possible to reveal a relatively recent Vitis-specific large scale duplication event concerning at least 10 chromosomes (duplication not reported before). Conclusions Sanger shotgun sequencing and highly efficient sequencing by synthesis (SBS), together with dedicated assembly programs, resolved a complex heterozygous genome. A consensus sequence of the genome and a set of mapped marker loci were generated. Homologous chromosomes of Pinot Noir differ by 11.2% of their DNA (hemizygous DNA plus chromosomal gaps). SNP markers are offered as a tool with the potential of introducing a new era in the molecular breeding of grape. PMID:18094749
Trubitsyn, Denis; Geurink, Corey; Pikuta, Elena; Lefèvre, Christopher T; McShan, W Michael; Gillaspy, Allison F; Bazylinski, Dennis A
2014-07-31
Desulfonatronum thiodismutans strain MLF1, an alkaliphilic bacterium capable of sulfate reduction, was isolated from Mono Lake, California. Here we report the 3.92-Mb draft genome sequence comprising 34 contigs and some results of its automated annotation. These data will improve our knowledge of mechanisms by which bacteria withstand extreme environments. Copyright © 2014 Trubitsyn et al.
75 FR 78798 - Airport Improvement Program: Proposed Changes to Benefit Cost Analysis (BCA) Threshold
Federal Register 2010, 2011, 2012, 2013, 2014
2010-12-16
...The Federal Aviation Administration (FAA) is issuing this Notice to advise that FAA has developed draft guidance modifying its policy requiring benefit cost analyses (BCA) for capacity projects when applying for Airport Improvement Program (AIP) grants for capacity projects at the discretion of the Secretary of Transportation. This modification proposes to raise the threshold at which BCAs are required, from $5 million to $10 million in AIP Discretionary funds. FAA invites airport sponsors and other interested parties to comment on the draft guidance. FAA will consider these comments in promulgating final BCA guidance for airport sponsors.
NASA Technical Reports Server (NTRS)
1981-01-01
The engineering design, fabrication, assembly, operation, economic analysis, and process support research and development for an Experimental Process System Development Unit for producing semiconductor-grade silicon using the silane-to-silicon process are reported. The design activity was completed. About 95% of purchased equipment was received. The draft of the operations manual was about 50% complete and the design of the free-space system continued. The system using silicon powder transfer, melting, and shotting on a pseudocontinuous basis was demonstrated.
Sedlar, Karel; Kolek, Jan; Skutkova, Helena; Branska, Barbora; Provaznik, Ivo; Patakova, Petra
2015-11-20
The strain Clostridium pasteurianum NRRL B-598 is a non-type, oxygen-tolerant, spore-forming, mesophilic and heterofermentative strain with high hydrogen production and the ability to carry out acetone-butanol fermentation (ethanol production being negligible). Here, we present the annotated complete genome sequence of this bacterium, replacing the previous draft genome assembly. The genome, consisting of a single circular 6,186,879 bp chromosome with no plasmid, was determined using PacBio RSII and Roche 454 sequencing. Copyright © 2015 Elsevier B.V. All rights reserved.
Investigation into Deep-Draft Vessel Berthing Problems at Selected U. S. Naval Facilities.
1980-10-01
AOR I WICHITA Alameda, CA 1-24-75 AOR 2 MILWAUKEE Norfolk, VA 1-01-74 AOR 3 KANSAS CITY Alameda, CA 2-16-74 AOR 4 SAVANNAH Norfolk, VA 12-05-70 AOR...auger-cutter assembly dislodges and delivers the material to the pump suction intake. The slurry is pumped to a pipeline for transmission to a remote...complete loss of control of the course steered. Large current eddies having the same effect are found in the vicinity of the foundation piers of the San
Minogue, T D; Daligault, H E; Davenport, K W; Bishop-Lilly, K A; Bruce, D C; Chain, P S; Coyne, S R; Chertkov, O; Freitas, T; Frey, K G; Jaissle, J; Koroleva, G I; Ladner, J T; Palacios, G F; Redden, C L; Xu, Y; Johnson, S L
2014-10-23
The Enterobacteriaceae are environmental and enteric microbes. We sequenced the genomes of two Enterobacter reference strains, E. aerogenes CDC 6003-71 and E. cloacae CDC 442-68, as well as one near neighbor used as an exclusionary reference for diagnostics, Pantoea agglomerans CDC UA0804-01. The genome sizes range from 4.72 to 5.55 Mbp and have G+C contents from 54.6 to 55.1%. Copyright © 2014 Minogue et al.
Bringing the fathead minnow into the genomic era
The fathead minnow is a well-established ecotoxicological model organism that has been widely used for regulatory ecotoxicity testing and research for over a half century. While a large amount of molecular information has been gathered on the fathead minnow over the years, the lack of genomic sequence data has limited the utility of the fathead minnow for certain applications. To address this limitation, high-throughput Illumina sequencing technology was employed to sequence the fathead minnow genome. Approximately 100X coverage was achieved by sequencing several libraries of paired-end reads with differing genome insert sizes. Two draft genome assemblies were generated using the SOAPdenovo and String Graph Assembler (SGA) methods, respectively. When these were compared, the SOAPdenovo assembly had a higher scaffold N50 value of 60.4 kbp versus 15.4 kbp, and it also performed better in a Core Eukaryotic Genes Mapping Analysis (CEGMA), mapping 91% versus 67% of genes. As such, this assembly was selected for further development and annotation. The foundation for genome annotation was generated using AUGUSTUS, an ab initio method for gene prediction. A total of 43,345 potential coding sequences were predicted on the genome assembly. These predicted sequences were translated to peptides and queried in a BLAST search against all vertebrates, with 28,290 of these sequences corresponding to zebrafish peptides and 5,242 producing no significant alignments. Additional ty
Analysis of the Kaplan turbine draft tube effect
NASA Astrophysics Data System (ADS)
Motycak, L.; Skotak, A.; Obrovsky, J.
2010-08-01
The aim of this paper is to present information about possible problems and errors that can appear during numerical analyses of low head Kaplan turbines with a view to the runner - draft tube interaction. The setting of the numerical model, the grid size, the boundary conditions used, and the interface definition between runner and draft tube are discussed. Data are available from physical model tests, which gives a great opportunity to compare CFD and experimental results and, on the basis of this comparison, to determine the approach to CFD flow modeling. The main purpose of the Kaplan turbine model measurement was to gather information about the real flow field. The model tests were carried out in the new hydraulic laboratory of CKD Blansko Engineering. The model tests were focused on detailed velocity measurements downstream of the runner by differential pressure probe and on the velocity measurement downstream of the draft tube elbow by the Particle Image Velocimetry (PIV) method. The data from the CFD simulation were compared to the velocity measurement results. The paper also discusses the modification of the original draft tube design to improve the flow, in the case of the Kaplan turbine uprating project. The results of the draft tube modification were confirmed by model tests in the hydraulic laboratory as well.
Genome Sequence of the Freshwater Yangtze Finless Porpoise.
Yuan, Yuan; Zhang, Peijun; Wang, Kun; Liu, Mingzhong; Li, Jing; Zheng, Jingsong; Wang, Ding; Xu, Wenjie; Lin, Mingli; Dong, Lijun; Zhu, Chenglong; Qiu, Qiang; Li, Songhai
2018-04-16
The Yangtze finless porpoise (Neophocaena asiaeorientalis ssp. asiaeorientalis) is a subspecies of the narrow-ridged finless porpoise (N. asiaeorientalis). In total, 714.28 gigabases (Gb) of raw reads were generated by whole-genome sequencing of the Yangtze finless porpoise, using an Illumina HiSeq 2000 platform. After filtering the low-quality and duplicated reads, we assembled a draft genome of 2.22 Gb, with contig N50 and scaffold N50 values of 46.69 kilobases (kb) and 1.71 megabases (Mb), respectively. We identified 887.63 Mb of repetitive sequences and predicted 18,479 protein-coding genes in the assembled genome. The phylogenetic tree showed a relationship between the Yangtze finless porpoise and the Yangtze River dolphin, which diverged approximately 20.84 million years ago. In comparisons with the genomes of 10 other mammals, we detected 44 species-specific gene families, 164 expanded gene families, and 313 positively selected genes in the Yangtze finless porpoise genome. The assembled genome sequence and underlying sequence data are available at the National Center for Biotechnology Information under BioProject accession number PRJNA433603.
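The contig N50 and scaffold N50 values quoted above are summary statistics over sequence lengths: the N50 is the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly length. A minimal sketch of that calculation follows; the function name and the toy lengths are illustrative assumptions and are not taken from the porpoise assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths."""
    if not lengths:
        raise ValueError("at least one sequence length is required")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down until half the assembly is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


if __name__ == "__main__":
    toy_contigs = [2, 3, 4, 5, 6, 7, 8, 9, 10]  # hypothetical lengths
    print(n50(toy_contigs))
```

Because the statistic depends only on the length distribution, a higher N50 indicates a more contiguous assembly but says nothing on its own about correctness or gene completeness.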
Genome Sequence of the Freshwater Yangtze Finless Porpoise
Yuan, Yuan; Zhang, Peijun; Wang, Kun; Liu, Mingzhong; Li, Jing; Zheng, Jinsong; Wang, Ding; Xu, Wenjie; Lin, Mingli; Dong, Lijun; Zhu, Chenglong; Qiu, Qiang
2018-01-01
The Yangtze finless porpoise (Neophocaena asiaeorientalis ssp. asiaeorientalis) is a subspecies of the narrow-ridged finless porpoise (N. asiaeorientalis). In total, 714.28 gigabases (Gb) of raw reads were generated by whole-genome sequencing of the Yangtze finless porpoise, using an Illumina HiSeq 2000 platform. After filtering the low-quality and duplicated reads, we assembled a draft genome of 2.22 Gb, with contig N50 and scaffold N50 values of 46.69 kilobases (kb) and 1.71 megabases (Mb), respectively. We identified 887.63 Mb of repetitive sequences and predicted 18,479 protein-coding genes in the assembled genome. The phylogenetic tree showed a relationship between the Yangtze finless porpoise and the Yangtze River dolphin, which diverged approximately 20.84 million years ago. In comparisons with the genomes of 10 other mammals, we detected 44 species-specific gene families, 164 expanded gene families, and 313 positively selected genes in the Yangtze finless porpoise genome. The assembled genome sequence and underlying sequence data are available at the National Center for Biotechnology Information under BioProject accession number PRJNA433603. PMID:29659530
Gilchrist, Anthony Stuart; Shearman, Deborah C A; Frommer, Marianne; Raphael, Kathryn A; Deshpande, Nandan P; Wilkins, Marc R; Sherwin, William B; Sved, John A
2014-12-20
The tephritid fruit flies include a number of economically important pests of horticulture, with a large accumulated body of research on their biology and control. Amongst the Tephritidae, the genus Bactrocera, containing over 400 species, presents various species groups of potential utility for genetic studies of speciation, behaviour or pest control. In Australia, there exists a triad of closely-related, sympatric Bactrocera species which do not mate in the wild but which, despite distinct morphologies and behaviours, can be force-mated in the laboratory to produce fertile hybrid offspring. To exploit the opportunities offered by genomics, such as the efficient identification of genetic loci central to pest behaviour and to the earliest stages of speciation, investigators require genomic resources for future investigations. We produced a draft de novo genome assembly of Australia's major tephritid pest species, Bactrocera tryoni. The male genome (650-700 Mbp) includes approximately 150 Mb of interspersed repetitive DNA sequences and 60 Mb of satellite DNA. Assessment using conserved core eukaryotic sequences indicated 98% completeness. Over 16,000 MAKER-derived gene models showed a large degree of overlap with other Dipteran reference genomes. The sequence of the ribosomal RNA transcribed unit was also determined. Unscaffolded assemblies of B. neohumeralis and B. jarvisi were then produced; comparison with B. tryoni showed that the species are more closely related than any Drosophila species pair. The similarity of the genomes was exploited to identify 4924 potentially diagnostic indels between the species, all of which occur in non-coding regions. This first draft B. tryoni genome resembles other dipteran genomes in terms of size and putative coding sequences. 
For all three species included in this study, we have identified a comprehensive set of non-redundant repetitive sequences, including the ribosomal RNA unit, and have quantified the major satellite DNA families. These genetic resources will facilitate the further investigations of genetic mechanisms responsible for the behavioural and morphological differences between these three species and other tephritids. We have also shown how whole genome sequence data can be used to generate simple diagnostic tests between very closely-related species where only one of the species is scaffolded.
Bitzer, Adam S.; Garbeva, Paolina
2014-01-01
Pedobacter sp. strain V48 participates in an interaction with Pseudomonas fluorescens which elicits interaction-induced phenotypes. We report the draft genome sequence of Pedobacter sp. V48, consisting of 6.46 Mbp. The sequence will contribute to improved understanding of the genus and facilitate genomic analysis of the model interspecies interaction with P. fluorescens. PMID:24578271
Silk Purse from a Sow's Ear? Why Knowledge Matters and Why the Draft History NC Will Not Improve It
ERIC Educational Resources Information Center
Hall, Katie; Counsell, Christine
2013-01-01
Katie Hall and Christine Counsell attempt to construct a Key Stage 3 scheme of work out of the draft National Curriculum for history that was released for consultation in England in February 2013. They explain the process by which they attempted to convert the programme of study into a coherent, workable plan that would fulfil the stated aims.…
ERIC Educational Resources Information Center
McGaughy, Charis; de Gonzalez, Alicia
2012-01-01
The California Department of Education is in the process of revising the Career and Technical Education (CTE) Model Curriculum Standards. The Educational Policy Improvement Center (EPIC) conducted an investigation of the draft version of the Health Sciences and Medical Technology Standards (Health Science). The purpose of the study is to…
Palaeosymbiosis Revealed by Genomic Fossils of Wolbachia in a Strongyloidean Nematode
Koutsovoulos, Georgios; Makepeace, Benjamin; Tanya, Vincent N.; Blaxter, Mark
2014-01-01
Wolbachia are common endosymbionts of terrestrial arthropods, and are also found in nematodes: the animal-parasitic filaria, and the plant-parasite Radopholus similis. Lateral transfer of Wolbachia DNA to the host genome is common. We generated a draft genome sequence for the strongyloidean nematode parasite Dictyocaulus viviparus, the cattle lungworm. In the assembly, we identified nearly 1 Mb of sequence with similarity to Wolbachia. The fragments were unlikely to derive from a live Wolbachia infection: most were short, and the genes were disabled through inactivating mutations. Many fragments were co-assembled with definitively nematode-derived sequence. We found limited evidence of expression of the Wolbachia-derived genes. The D. viviparus Wolbachia genes were most similar to filarial strains and strains from the host-promiscuous clade F. We conclude that D. viviparus was infected by Wolbachia in the past, and that clade F-like symbionts may have been the source of filarial Wolbachia infections. PMID:24901418
Spider genomes provide insight into composition and evolution of venom and silk
Sanggaard, Kristian W.; Bechsgaard, Jesper S.; Fang, Xiaodong; Duan, Jinjie; Dyrlund, Thomas F.; Gupta, Vikas; Jiang, Xuanting; Cheng, Ling; Fan, Dingding; Feng, Yue; Han, Lijuan; Huang, Zhiyong; Wu, Zongze; Liao, Li; Settepani, Virginia; Thøgersen, Ida B.; Vanthournout, Bram; Wang, Tobias; Zhu, Yabing; Funch, Peter; Enghild, Jan J.; Schauser, Leif; Andersen, Stig U.; Villesen, Palle; Schierup, Mikkel H; Bilde, Trine; Wang, Jun
2014-01-01
Spiders are ecologically important predators with complex venom and extraordinarily tough silk that enables capture of large prey. Here we present the assembled genome of the social velvet spider and a draft assembly of the tarantula genome that represent two major taxonomic groups of spiders. The spider genomes are large with short exons and long introns, reminiscent of mammalian genomes. Phylogenetic analyses place spiders and ticks as sister groups supporting polyphyly of the Acari. Complex sets of venom and silk genes/proteins are identified. We find that venom genes evolved by sequential duplication, and that the toxic effect of venom is most likely activated by proteases present in the venom. The set of silk genes reveals a highly dynamic gene evolution, new types of silk genes and proteins, and a novel use of aciniform silk. These insights create new opportunities for pharmacological applications of venom and biomaterial applications of silk. PMID:24801114
Why Assembling Plant Genome Sequences Is So Challenging
Claros, Manuel Gonzalo; Bautista, Rocío; Guerrero-Fernández, Darío; Benzerki, Hicham; Seoane, Pedro; Fernández-Pozo, Noé
2012-01-01
In spite of the biological and economic importance of plants, relatively few plant species have been sequenced. Only the genome sequence of plants with relatively small genomes, most of them angiosperms, in particular eudicots, has been determined. The arrival of next-generation sequencing technologies has allowed the rapid and efficient development of new genomic resources for non-model or orphan plant species. But the sequencing pace of plants is far from that of animals and microorganisms. This review focuses on the typical challenges of plant genomes that can explain why plant genomics is less developed than animal genomics. Explanations about the impact of some confounding factors emerging from the nature of plant genomes are given. As a result of these challenges and confounding factors, the correct assembly and annotation of plant genomes is hindered, genome drafts are produced, and advances in plant genomics are delayed. PMID:24832233
Numerical flow simulation and efficiency prediction for axial turbines by advanced turbulence models
NASA Astrophysics Data System (ADS)
Jošt, D.; Škerlavaj, A.; Lipej, A.
2012-11-01
Numerical prediction of the efficiency of a 6-blade Kaplan turbine is presented. First, the results of steady state analysis performed with different turbulence models for different operating regimes are compared to the measurements. For small and optimal angles of runner blades the efficiency was quite accurately predicted, but for maximal blade angle the discrepancy between calculated and measured values was quite large. By transient analysis, especially when the Scale Adaptive Simulation Shear Stress Transport (SAS SST) model with zonal Large Eddy Simulation (ZLES) in the draft tube was used, the efficiency prediction was significantly improved. The improvement was seen at all operating points, but it was the largest for maximal discharge. The reason was better flow simulation in the draft tube. Details about turbulent structure in the draft tube obtained by SST, SAS SST and SAS SST with ZLES are illustrated in order to explain the reasons for differences in flow energy losses obtained by different turbulence models.
Coyne, Robert S; Thiagarajan, Mathangi; Jones, Kristie M; Wortman, Jennifer R; Tallon, Luke J; Haas, Brian J; Cassidy-Hanley, Donna M; Wiley, Emily A; Smith, Joshua J; Collins, Kathleen; Lee, Suzanne R; Couvillion, Mary T; Liu, Yifan; Garg, Jyoti; Pearlman, Ronald E; Hamilton, Eileen P; Orias, Eduardo; Eisen, Jonathan A; Methé, Barbara A
2008-01-01
Background Tetrahymena thermophila, a widely studied model for cellular and molecular biology, is a binucleated single-celled organism with a germline micronucleus (MIC) and somatic macronucleus (MAC). The recent draft MAC genome assembly revealed low sequence repetitiveness, a result of the epigenetic removal of invasive DNA elements found only in the MIC genome. Such low repetitiveness makes complete closure of the MAC genome a feasible goal, which to achieve would require standard closure methods as well as removal of minor MIC contamination of the MAC genome assembly. Highly accurate preliminary annotation of Tetrahymena's coding potential was hindered by the lack of both comparative genomic sequence information from close relatives and significant amounts of cDNA evidence, thus limiting the value of the genomic information and also leaving unanswered certain questions, such as the frequency of alternative splicing. Results We addressed the problem of MIC contamination using comparative genomic hybridization with purified MIC and MAC DNA probes against a whole genome oligonucleotide microarray, allowing the identification of 763 genome scaffolds likely to contain MIC-limited DNA sequences. We also employed standard genome closure methods to essentially finish over 60% of the MAC genome. For the improvement of annotation, we have sequenced and analyzed over 60,000 verified EST reads from a variety of cellular growth and development conditions. Using this EST evidence, a combination of automated and manual reannotation efforts led to updates that affect 16% of the current protein-coding gene models. By comparing EST abundance, many genes showing apparent differential expression between these conditions were identified. Rare instances of alternative splicing and uses of the non-standard amino acid selenocysteine were also identified. Conclusion We report here significant progress in genome closure and reannotation of Tetrahymena thermophila. 
Our experience to date suggests that complete closure of the MAC genome is attainable. Using the new EST evidence, automated and manual curation has resulted in substantial improvements to the over 24,000 gene models, which will be valuable to researchers studying this model organism as well as for comparative genomics purposes. PMID:19036158
Lang, Eddy S; Artz, Jennifer D; Wilkie, Ryan D; Stiell, Ian G; Topping, Claude; Belanger, François P; Afilalo, Marc; Renouf, Tia; Crocco, Anthony; Wyatt, Kelly; Christenson, Jim
2016-05-01
To describe the current state of academic emergency medicine (EM) funding in Canada and develop recommendations to grow and establish sustainable funding. A panel of eight leaders from different EM academic units was assembled. Using mixed methods (including a literature review, sharing of professional experiences, a survey of current EM academic heads, and data previously collected from an environmental scan), 10 recommendations were drafted and presented at an academic symposium. Attendee feedback was incorporated, and the second set of draft recommendations was further distributed to the Canadian Association Emergency Physicians (CAEP) Academic Section for additional comments before being finalized. Recommendations were developed around the funding challenges identified and solutions developed by academic EM university-based units across Canada. A strategic plan was seen as integral to achieving strong funding of an EM unit, especially when it aligned with departmental and institutional priorities. A business plan, although occasionally overlooked, was deemed an important component for planning and sustaining the academic mission. A number of recommendations surrounding philanthropy consisted of creating partnerships with existing foundations and engaging multiple stakeholders and communities. Synergy between academic and clinical EM departments was also viewed as an opportunity to ensure integration of common missions. Education and networking for current and future leaders were also viewed as invaluable to ensure that opportunities are optimized through strong leadership development and shared experiences to further the EM academic missions across the country. These recommendations were designed to improve the financial circumstances for many Canadian EM units. 
There is a considerable wealth of resources that can contribute to financial stability for an academic unit, and an annual networking meeting and continuing education on these issues will facilitate more rapid implementation of these recommendations.
NASA's Soil Moisture Active and Passive (SMAP) Mission
NASA Technical Reports Server (NTRS)
Kellogg, Kent; Njoku, Eni; Thurman, Sam; Edelstein, Wendy; Jai, Ben; Spencer, Mike; Chen, Gun-Shing; Entekhabi, Dara; O'Neill, Peggy; Piepmeier, Jeffrey;
2010-01-01
The Soil Moisture Active-Passive (SMAP) Mission is one of the first Earth observation satellites being formulated by NASA in response to the 2007 National Research Council's Decadal Survey. SMAP will make global measurements of soil moisture at the Earth's land surface and its freeze-thaw state. These measurements will allow significantly improved estimates of water, energy and carbon transfers between the land and atmosphere. Soil moisture measurements are also of great importance in assessing flooding and monitoring drought. Knowledge gained from SMAP observations can help mitigate these natural hazards, resulting in potentially great economic and social benefits. SMAP observations of soil moisture and freeze/thaw timing over the boreal latitudes will also reduce a major uncertainty in quantifying the global carbon balance and help to resolve an apparent missing carbon sink over land. The SMAP mission concept will utilize an L-band radar and radiometer sharing a rotating 6-meter mesh reflector antenna flying in a 680 km polar orbit with an 8-day exact ground track repeat aboard a 3-axis stabilized spacecraft to provide high-resolution and high-accuracy global maps of soil moisture and freeze/thaw state every two to three days. In addition, the SMAP project will use these surface observations with advanced modeling and data assimilation to provide estimates of deeper root-zone soil moisture and net ecosystem exchange of carbon. SMAP recently completed its Phase A Mission Concept Study Phase for NASA and transitioned into Phase B (Formulation and Detailed Design). A number of significant accomplishments occurred during this initial phase of mission development.
The SMAP project held several open meetings to solicit community feedback on possible science algorithms, prepared preliminary draft Algorithm Theoretical Basis Documents (ATBDs) for each mission science product, and established a prototype algorithm testbed to enable testing and evaluation of the performance of candidate algorithms. SMAP conducted an Applications Workshop in September 2009 to coordinate with potential application users interested in the mission data. A draft Applications Plan describing the Project's planned outreach to potential applications users has been prepared and will be updated during Phase B. SMAP made a significant evaluation of the potential terrestrial radio frequency interference (RFI) source environment and established radiometer and radar flight hardware and ground processing mitigation approaches. SMAP finalized its science orbit and orbit injection approach to optimize launch mass and prepared launch and commissioning scenarios and timeline. A science data communications approach was developed to maximize available science data volume to improve science margins while maintaining moderately short data product latencies to support many potential applications using existing ground assets and with minimum impact to the flight system. SMAP developed rigid multi-body and flexible body dynamics and control models and system designs for the 6-meter rotating instrument reflector-boom assembly (RBA) and flight system to confirm pointing and control performance, and devised strategies to efficiently implement on-orbit balancing if needed. Industry partners were selected for the spin mechanism assembly (SMA) and RBA. Preliminary designs for the radar and radiometer were initiated, including constructing breadboards of key assemblies.
Mundt, Kenneth A; Gentry, P Robinan; Dell, Linda D; Rodricks, Joseph V; Boffetta, Paolo
2018-02-01
Shortly after the International Agency for Research on Cancer (IARC) determined that formaldehyde causes leukemia, the United States Environmental Protection Agency (EPA) released its Draft IRIS Toxicological Review of Formaldehyde ("Draft IRIS Assessment"), also concluding that formaldehyde causes leukemia. Peer review of the Draft IRIS Assessment by a National Academy of Science committee noted that "causal determinations are not supported by the narrative provided in the draft" (NRC 2011). They offered recommendations for improving the Draft IRIS assessment and identified several important research gaps. Over the six years since the NRC peer review, significant new science has been published. We identify and summarize key recommendations made by NRC and map them to this new science, including extended analysis of epidemiological studies, updates of earlier occupational cohort studies, toxicological experiments using a sensitive mouse strain, mechanistic studies examining the role of exogenous versus endogenous formaldehyde in bone marrow, and several critical reviews. With few exceptions, new findings are consistently negative, and integration of all available evidence challenges the earlier conclusions that formaldehyde causes leukemia. Given formaldehyde's commercial importance, environmental ubiquity and endogenous production, accurate hazard classification and risk evaluation of whether exposure to formaldehyde from occupational, residential and consumer products causes leukemia are critical. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
2010-01-01
Background De novo assembly of transcript sequences produced by short-read DNA sequencing technologies offers a rapid approach to obtain expressed gene catalogs for non-model organisms. A draft genome sequence will be produced in 2010 for a Eucalyptus tree species (E. grandis) representing the most important hardwood fibre crop in the world. Genome annotation of this valuable woody plant and genetic dissection of its superior growth and productivity will be greatly facilitated by the availability of a comprehensive collection of expressed gene sequences from multiple tissues and organs. Results We present an extensive expressed gene catalog for a commercially grown E. grandis × E. urophylla hybrid clone constructed using only Illumina mRNA-Seq technology and de novo assembly. A total of 18,894 transcript-derived contigs, a large proportion of which represent full-length protein coding genes, were assembled and annotated. Analysis of assembly quality, length and diversity shows that this dataset represents the most comprehensive expressed gene catalog for any Eucalyptus tree. mRNA-Seq analysis furthermore allowed digital expression profiling of all of the assembled transcripts across diverse xylogenic and non-xylogenic tissues, which is invaluable for ascribing putative gene functions. Conclusions De novo assembly of Illumina mRNA-Seq reads is an efficient approach for transcriptome sequencing and profiling in Eucalyptus and other non-model organisms. The transcriptome resource (Eucspresso, http://eucspresso.bi.up.ac.za/) generated by this study will be of value for genomic analysis of woody biomass production in Eucalyptus and for comparative genomic analysis of growth and development in woody and herbaceous plants. PMID:21122097
Complete genome sequence of the phenanthrene-degrading soil bacterium Delftia acidovorans Cs1-4
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shetty, Ameesha R.; de Gannes, Vidya; Obi, Chioma C.
Polycyclic aromatic hydrocarbons (PAH) are ubiquitous environmental pollutants and microbial biodegradation is an important means of remediation of PAH-contaminated soil. Delftia acidovorans Cs1-4 (formerly Delftia sp. Cs1-4) was isolated by using phenanthrene as the sole carbon source from PAH-contaminated soil in Wisconsin. Its full genome sequence was determined to gain insights into the mechanisms underlying biodegradation of PAH. Three genomic libraries were constructed and sequenced: an Illumina GAii shotgun library (916,416,493 reads), a 454 Titanium standard library (770,171 reads) and one paired-end 454 library (average insert size of 8 kb, 508,092 reads). The initial assembly contained 40 contigs in two scaffolds. The 454 Titanium standard data and the 454 paired end data were assembled together and the consensus sequences were computationally shredded into 2 kb overlapping shreds. Illumina sequencing data was assembled, and the consensus sequence was computationally shredded into 1.5 kb overlapping shreds. Gaps between contigs were closed by editing in Consed, by PCR and by Bubble PCR primer walks. A total of 182 additional reactions were needed to close gaps and to raise the quality of the finished sequence. The final assembly is based on 253.3 Mb of 454 draft data (averaging 38.4 X coverage) and 590.2 Mb of Illumina draft data (averaging 89.4 X coverage). The genome of strain Cs1-4 consists of a single circular chromosome of 6,685,842 bp (66.7 %G+C) containing 6,028 predicted genes; 5,931 of these genes were protein-encoding and 4,425 gene products were assigned to a putative function. Genes encoding phenanthrene degradation were localized to a 232 kb genomic island (termed the phn island), which contained near its 3’ end a bacteriophage P4-like integrase, an enzyme often associated with chromosomal integration of mobile genetic elements.
Other biodegradation pathways reconstructed from the genome sequence included: benzoate (by the acetyl-CoA pathway), styrene, nicotinic acid (by the maleamate pathway) and the pesticides Dicamba and Fenitrothion. Lastly, determination of the complete genome sequence of D. acidovorans Cs1-4 has provided new insights into the microbial mechanisms of PAH biodegradation that may shape the process in the environment.
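As a rough illustration of the "computational shredding" step mentioned in this abstract, the following Python sketch cuts a consensus sequence into fixed-length overlapping pieces. The function name `shred` and the 500 bp overlap are assumptions made for illustration only; the abstract gives the shred lengths (2 kb and 1.5 kb) but not the overlap or the software used.

```python
def shred(sequence, shred_len=2000, overlap=500):
    """Cut a sequence into overlapping fixed-length pieces.

    shred_len follows the 2 kb figure quoted in the abstract; the
    overlap value is an assumption, since the abstract does not state it.
    """
    if not 0 <= overlap < shred_len:
        raise ValueError("overlap must be non-negative and smaller than shred_len")
    step = shred_len - overlap  # distance between consecutive shred starts
    shreds = []
    start = 0
    while start < len(sequence):
        shreds.append(sequence[start:start + shred_len])
        if start + shred_len >= len(sequence):
            break  # this shred already reaches the end of the sequence
        start += step
    return shreds


if __name__ == "__main__":
    consensus = "ACGT" * 1250  # toy 5 kb consensus sequence
    pieces = shred(consensus)
    print(len(pieces), [len(p) for p in pieces])
```

Each shred shares its final `overlap` bases with the start of the next one, which is what lets an assembler stitch pseudo-reads from different technologies back together.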
Complete genome sequence of the phenanthrene-degrading soil bacterium Delftia acidovorans Cs1-4
Shetty, Ameesha R.; de Gannes, Vidya; Obi, Chioma C.; ...
2015-08-15
DOT National Transportation Integrated Search
1997-10-01
The FY 1998-2000 Statewide Transportation Improvement Program (STIP) is a three-year program of highway and transit projects developed to fulfill the requirements set forth in the Intermodal Surface Transportation Efficiency Act of 1991 (ISTEA). The ...
Comparative genomics of two jute species and insight into fibre biogenesis.
Islam, Md Shahidul; Saito, Jennifer A; Emdad, Emdadul Mannan; Ahmed, Borhan; Islam, Mohammad Moinul; Halim, Abdul; Hossen, Quazi Md Mosaddeque; Hossain, Md Zakir; Ahmed, Rasel; Hossain, Md Sabbir; Kabir, Shah Md Tamim; Khan, Md Sarwar Alam; Khan, Md Mursalin; Hasan, Rajnee; Aktar, Nasima; Honi, Ummay; Islam, Rahin; Rashid, Md Mamunur; Wan, Xuehua; Hou, Shaobin; Haque, Taslima; Azam, Muhammad Shafiul; Moosa, Mahdi Muhammad; Elias, Sabrina M; Hasan, A M Mahedi; Mahmood, Niaz; Shafiuddin, Md; Shahid, Saima; Shommu, Nusrat Sharmeen; Jahan, Sharmin; Roy, Saroj; Chowdhury, Amlan; Akhand, Ashikul Islam; Nisho, Golam Morshad; Uddin, Khaled Salah; Rabeya, Taposhi; Hoque, S M Ekramul; Snigdha, Afsana Rahman; Mortoza, Sarowar; Matin, Syed Abdul; Islam, Md Kamrul; Lashkar, M Z H; Zaman, Mahboob; Yuryev, Anton; Uddin, Md Kamal; Rahman, Md Sharifur; Haque, Md Samiul; Alam, Md Monjurul; Khan, Haseena; Alam, Maqsudul
2017-01-30
Jute (Corchorus sp.) is one of the most important sources of natural fibre, covering ∼80% of global bast fibre production [1]. Only Corchorus olitorius and Corchorus capsularis are commercially cultivated, though there are more than 100 Corchorus species [2] in the Malvaceae family. Here we describe high-quality draft genomes of these two species and their comparisons at the functional genomics level to support tailor-designed breeding. The assemblies cover 91.6% and 82.2% of the estimated genome sizes for C. olitorius and C. capsularis, respectively. In total, 37,031 C. olitorius and 30,096 C. capsularis genes are identified, and most of the genes are validated by cDNA and RNA-seq data. Analyses of clustered gene families and gene collinearity show that jute underwent shared whole-genome duplication ∼18.66 million years (Myr) ago prior to speciation. RNA expression analysis from isolated fibre cells reveals the key regulatory and structural genes involved in fibre formation. This work expands our understanding of the molecular basis of fibre formation laying the foundation for the genetic improvement of jute.
Comparative Genomics as a Foundation for Evo-Devo Studies in Birds.
Grayson, Phil; Sin, Simon Y W; Sackton, Timothy B; Edwards, Scott V
2017-01-01
Developmental genomics is a rapidly growing field, and high-quality genomes are a useful foundation for comparative developmental studies. A high-quality genome forms an essential reference onto which the data from numerous assays and experiments, including ChIP-seq, ATAC-seq, and RNA-seq, can be mapped. A genome also streamlines and simplifies the development of primers used to amplify putative regulatory regions for enhancer screens, cDNA probes for in situ hybridization, microRNAs (miRNAs) or short hairpin RNAs (shRNA) for RNA interference (RNAi) knockdowns, mRNAs for misexpression studies, and even guide RNAs (gRNAs) for CRISPR knockouts. Finally, much can be gleaned from comparative genomics alone, including the identification of highly conserved putative regulatory regions. This chapter provides an overview of laboratory and bioinformatics protocols for DNA extraction, library preparation, library quantification, and genome assembly, from fresh or frozen tissue to a draft avian genome. Generating a high-quality draft genome can provide a developmental research group with excellent resources for their study organism, opening the doors to many additional assays and experiments.
Draft De Novo Transcriptome of the Rat Kangaroo Potorous tridactylus as a Tool for Cell Biology
Udy, Dylan B.; Voorhies, Mark; Chan, Patricia P.; Lowe, Todd M.; Dumont, Sophie
2015-01-01
The rat kangaroo (long-nosed potoroo, Potorous tridactylus) is a marsupial native to Australia. Cultured rat kangaroo kidney epithelial cells (PtK) are commonly used to study cell biological processes. These mammalian cells are large, adherent, and flat, and contain large and few chromosomes—and are thus ideal for imaging intra-cellular dynamics such as those of mitosis. Despite this, neither the rat kangaroo genome nor transcriptome have been sequenced, creating a challenge for probing the molecular basis of these cellular dynamics. Here, we present the sequencing, assembly and annotation of the draft rat kangaroo de novo transcriptome. We sequenced 679 million reads that mapped to 347,323 Trinity transcripts and 20,079 Unigenes. We present statistics emerging from transcriptome-wide analyses, and analyses suggesting that the transcriptome covers full-length sequences of most genes, many with multiple isoforms. We also validate our findings with a proof-of-concept gene knockdown experiment. We expect that this high quality transcriptome will make rat kangaroo cells a more tractable system for linking molecular-scale function and cellular-scale dynamics. PMID:26252667
Moura, Quézia; Fernandes, Miriam R; Cerdeira, Louise; Nhambe, Lúcia F; Ienne, Susan; Souza, Tiago A; Lincopan, Nilton
2017-09-01
Multidrug-resistant (MDR) Enterobacter aerogenes strains are frequently associated with nosocomial infections and high mortality rates, representing a serious public health problem. The aim of this study was to present the draft genome sequence of a MDR KPC-2-producing E. aerogenes isolated from a perineal swab of a hospitalised patient in Brazil. Genomic DNA was sequenced using an Illumina MiSeq platform. De novo genome assembly was carried out using the A5-Miseq pipeline, and whole-genome sequence analysis was performed using tools from the Center for Genomic Epidemiology. The strain harboured resistance genes to β-lactams, aminoglycosides, sulphonamides and trimethoprim in addition to genes encoding multidrug efflux system proteins, a quaternary ammonium transporter and heavy metal efflux system proteins. In addition, the strain harboured genes encoding diverse virulence factors. These data might allow a better understanding of the genetic basis of antimicrobial resistance and virulence in E. aerogenes strains. Copyright © 2017 International Society for Chemotherapy of Infection and Cancer. Published by Elsevier Ltd. All rights reserved.
Draft De Novo Transcriptome of the Rat Kangaroo Potorous tridactylus as a Tool for Cell Biology.
Udy, Dylan B; Voorhies, Mark; Chan, Patricia P; Lowe, Todd M; Dumont, Sophie
2015-01-01
The rat kangaroo (long-nosed potoroo, Potorous tridactylus) is a marsupial native to Australia. Cultured rat kangaroo kidney epithelial cells (PtK) are commonly used to study cell biological processes. These mammalian cells are large, adherent, and flat, and contain large and few chromosomes-and are thus ideal for imaging intra-cellular dynamics such as those of mitosis. Despite this, neither the rat kangaroo genome nor transcriptome have been sequenced, creating a challenge for probing the molecular basis of these cellular dynamics. Here, we present the sequencing, assembly and annotation of the draft rat kangaroo de novo transcriptome. We sequenced 679 million reads that mapped to 347,323 Trinity transcripts and 20,079 Unigenes. We present statistics emerging from transcriptome-wide analyses, and analyses suggesting that the transcriptome covers full-length sequences of most genes, many with multiple isoforms. We also validate our findings with a proof-of-concept gene knockdown experiment. We expect that this high quality transcriptome will make rat kangaroo cells a more tractable system for linking molecular-scale function and cellular-scale dynamics.
Shim, Donghwan; Park, Sin-Gi; Kim, Kangmin; Bae, Wonsil; Lee, Gir Won; Ha, Byeong-Suk; Ro, Hyeon-Su; Kim, Myungkil; Ryoo, Rhim; Rhee, Sung-Keun; Nou, Ill-Sup; Koo, Chang-Duck; Hong, Chang Pyo; Ryu, Hojin
2016-04-10
Lentinula edodes, the popular shiitake mushroom, is one of the most important cultivated edible mushrooms. It is used as a food and for medicinal purposes. Here, we present the 46.1 Mb draft genome of L. edodes, comprising 13,028 predicted gene models. The genome assembly consists of 31 scaffolds. Gene annotation provides key information about various signaling pathways and secondary metabolites. This genomic information should help establish the molecular genetic markers for MAS/MAB and increase our understanding of the genome structure and function. Copyright © 2016 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kalamorz, Falk; Keis, Stefanie; Stanton, Jo-Ann
The genes and molecular machines that allow for a thermoalkaliphilic lifestyle have not been defined. To address this goal, we report on the improved high-quality draft genome sequence of Caldalkalibacillus thermarum strain TA2.A1, an obligately aerobic bacterium that grows optimally at pH 9.5 and 65 to 70 °C on a wide variety of carbon and energy sources.
Batalden, Paul; Stevens, David; Ogrinc, Greg; Mooney, Susan
2008-01-01
In 2005 we published draft guidelines for reporting studies of quality improvement interventions as the initial step in a consensus process for development of a more definitive version. The current article contains the revised version, which we refer to as SQUIRE (Standards for QUality Improvement Reporting Excellence). We describe the consensus process, which included informal feedback, formal written commentaries, input from publication guideline developers, review of the literature on the epistemology of improvement and on methods for evaluating complex social programs, and a meeting of stakeholders for critical review of the guidelines’ content and wording, followed by commentary on sequential versions from an expert consultant group. Finally, we examine major differences between SQUIRE and the initial draft, and consider limitations of and unresolved questions about SQUIRE; we also describe ancillary supporting documents and alternative versions under development, and plans for dissemination, testing, and further development of SQUIRE. PMID:18830766
Draft Genome Sequence of Mycobacterium chimaera Type ...
We report the draft genome sequence of the type strain Mycobacterium chimaera Fl-0169T, a member of the Mycobacterium avium complex (MAC). M. chimaera Fl-0169T was isolated from a patient in Italy and is highly similar to strains of M. chimaera isolated in Ireland, though Fl-0169T possesses unique virulence genes. Evidence suggests that M. avium, M. intracellulare, and M. chimaera are differently virulent and a comparative genomic analysis is critically needed to identify diagnostic targets that reliably differentiate species of MAC. With treatment costs for Mycobacterium infections estimated to be >$1.8 B annually in the U.S., correct species identification will result in improved treatment selection, lower costs, and improved patient outcomes.
Analytical Verifications in Cryogenic Testing of NGST Advanced Mirror System Demonstrators
NASA Technical Reports Server (NTRS)
Cummings, Ramona; Levine, Marie; VanBuren, Dave; Kegley, Jeff; Green, Joseph; Hadaway, James; Presson, Joan; Cline, Todd; Stahl, H. Philip (Technical Monitor)
2002-01-01
Ground-based testing is a critical and costly part of component, assembly, and system verifications of large space telescopes. At such tests, however, with integral teamwork by planners, analysts, and test personnel, segments can be included to validate specific analytical parameters and algorithms at relatively low additional cost. This paper opens with the strategy for analytical verification segments added to vacuum cryogenic testing of Advanced Mirror System Demonstrator (AMSD) assemblies. These AMSD assemblies incorporate material and architecture concepts being considered in the Next Generation Space Telescope (NGST) design. The test segments for workmanship testing, cold survivability, and cold operation optical throughput are supplemented by segments for analytical verifications of specific structural, thermal, and optical parameters. Utilizing integrated modeling and separate materials testing, the paper continues with the support plan for analyses, data, and observation requirements during the AMSD testing, currently slated for late calendar year 2002 to mid calendar year 2003. The paper includes anomaly resolution as gleaned by the authors from similar analytical verification support of a previous large space telescope, then closes with a draft of plans for parameter extrapolations, to form a well-verified portion of the integrated modeling being done for NGST performance predictions.
2013-01-01
Background Although India has sequenced the water buffalo genome, its draft assembly is based on the cattle genome BTau 4.0; a de novo chromosome-wise assembly therefore remains a major pending issue for the global community. The existing radiation hybrid of buffalo and the STRs reported here can be used further in the final gap plugging and “finishing” expected in de novo genome assembly. QTL and gene mapping require mining of putative STRs from the buffalo genome at equal intervals on every chromosome. Such markers have a potential role in the improvement of desirable characteristics, such as high milk yield, resistance to diseases and high growth rate. STR mining from the whole genome and development of a user-friendly database have yet to be done to reap the benefit of the whole genome sequence. Description By in silico microsatellite mining of the whole genome, we have developed the first STR database of water buffalo, BuffSatDb (Buffalo MicroSatellite Database, http://cabindb.iasri.res.in/buffsatdb/), a web-based relational database of 910,529 microsatellite markers developed using PHP and MySQL. Microsatellite markers were generated using the MIcroSAtellite tool. The database offers a simple and systematic web-based search for customised retrieval of chromosome-wise and genome-wide microsatellites. Search is enabled by chromosome, motif type (mono- to hexanucleotide), repeat motif and repeat kind (simple and composite). The search may be customised by limiting the location of STRs on a chromosome as well as the number of markers in that range. This is a novel approach that has not been implemented in any existing marker database. The database has been further integrated with Primer3 for primer design of the selected markers, enabling researchers to select markers of choice at desired intervals over the chromosome. The unique add-on of degenerate bases further helps in resolving the presence of degenerate bases in the current buffalo assembly.
Conclusion As the first buffalo STR database in the world, BuffSatDb would not only pave the way to resolving current assembly problems but should also be of immense use to the global community in QTL/gene mapping, which is critically required to increase knowledge in the endeavour to raise buffalo productivity, especially in third-world countries where the rural economy is significantly dependent on buffalo productivity. PMID:23336431
Sarika; Arora, Vasu; Iquebal, Mir Asif; Rai, Anil; Kumar, Dinesh
2013-01-19
Although India has sequenced the water buffalo genome, its draft assembly is based on the cattle genome BTau 4.0; a de novo chromosome-wise assembly therefore remains a major pending issue for the global community. The existing radiation hybrid of buffalo and the STRs reported here can be used further in the final gap plugging and "finishing" expected in de novo genome assembly. QTL and gene mapping require mining of putative STRs from the buffalo genome at equal intervals on every chromosome. Such markers have a potential role in the improvement of desirable characteristics, such as high milk yield, resistance to diseases and high growth rate. STR mining from the whole genome and development of a user-friendly database have yet to be done to reap the benefit of the whole genome sequence. By in silico microsatellite mining of the whole genome, we have developed the first STR database of water buffalo, BuffSatDb (Buffalo MicroSatellite Database, http://cabindb.iasri.res.in/buffsatdb/), a web-based relational database of 910,529 microsatellite markers developed using PHP and MySQL. Microsatellite markers were generated using the MIcroSAtellite tool. The database offers a simple and systematic web-based search for customised retrieval of chromosome-wise and genome-wide microsatellites. Search is enabled by chromosome, motif type (mono- to hexanucleotide), repeat motif and repeat kind (simple and composite). The search may be customised by limiting the location of STRs on a chromosome as well as the number of markers in that range. This is a novel approach that has not been implemented in any existing marker database. The database has been further integrated with Primer3 for primer design of the selected markers, enabling researchers to select markers of choice at desired intervals over the chromosome. The unique add-on of degenerate bases further helps in resolving the presence of degenerate bases in the current buffalo assembly.
As the first buffalo STR database in the world, BuffSatDb would not only pave the way to resolving current assembly problems but should also be of immense use to the global community in QTL/gene mapping, which is critically required to increase knowledge in the endeavour to raise buffalo productivity, especially in third-world countries where the rural economy is significantly dependent on buffalo productivity.
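The kind of in silico microsatellite mining described in this abstract (simple repeats with mono- to hexanucleotide motifs) can be sketched with a regular expression. This is a minimal illustration and not the MIcroSAtellite tool itself: the function name `find_simple_ssrs` and the minimum repeat counts are assumptions, and compound (composite) repeats are not handled.

```python
import re

# Minimum repeat counts per motif length (mono- to hexanucleotide).
# These thresholds are illustrative assumptions, not values from the abstract.
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def find_simple_ssrs(sequence):
    """Return sorted (start, motif, repeat_count) tuples for perfect simple repeats."""
    sequence = sequence.upper()
    hits = []
    for motif_len, min_rep in MIN_REPEATS.items():
        # A motif of motif_len bases followed by at least (min_rep - 1) copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_rep - 1))
        for match in pattern.finditer(sequence):
            motif = match.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "AA", "ATAT"),
            # so a mononucleotide run is not also reported as a dinucleotide repeat.
            if any(motif == motif[:k] * (motif_len // k)
                   for k in range(1, motif_len) if motif_len % k == 0):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // motif_len))
    return sorted(hits)


if __name__ == "__main__":
    toy = "CGT" + "A" * 12 + "CGT" + "AG" * 7 + "CTC" + "ACG" * 5 + "TT"
    for start, motif, count in find_simple_ssrs(toy):
        print(start, motif, count)
```

A search interface like the one described would then filter such records by chromosome, motif length, motif sequence and position range.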
Pritchard, Leighton; Holden, Nicola J; Bielaszewska, Martina; Karch, Helge; Toth, Ian K
2012-01-01
An Escherichia coli O104:H4 outbreak in Germany in summer 2011 caused 53 deaths, over 4000 individual infections across Europe, and considerable economic, social and political impact. This outbreak was the first in a position to exploit rapid, benchtop high-throughput sequencing (HTS) technologies and crowdsourced data analysis early in its investigation, establishing a new paradigm for rapid response to disease threats. We describe a novel strategy for design of diagnostic PCR primers that exploited this rapid draft bacterial genome sequencing to distinguish between E. coli O104:H4 outbreak isolates and other pathogenic E. coli isolates, including the historical hæmolytic uræmic syndrome (HUSEC) E. coli HUSEC041 O104:H4 strain, which possesses the same serotype as the outbreak isolates. Primers were designed using a novel alignment-free strategy against eleven draft whole genome assemblies of E. coli O104:H4 German outbreak isolates from the E. coli O104:H4 Genome Analysis Crowd-Sourcing Consortium website, and a negative sequence set containing 69 E. coli chromosome and plasmid sequences from public databases. Validation in vitro against 21 'positive' E. coli O104:H4 outbreak and 32 'negative' non-outbreak EHEC isolates indicated that individual primer sets exhibited 100% sensitivity for outbreak isolates, with false positive rates of between 9% and 22%. A minimal combination of two primers discriminated between outbreak and non-outbreak E. coli isolates with 100% sensitivity and 100% specificity. Draft genomes of isolates of disease outbreak bacteria enable high throughput primer design and enhanced diagnostic performance in comparison to traditional molecular assays. Future outbreak investigations will be able to harness HTS rapidly to generate draft genome sequences and diagnostic primer sets, greatly facilitating epidemiology and clinical diagnostics. 
We expect that high throughput primer design strategies will enable faster, more precise responses to future disease outbreaks of bacterial origin, and help to mitigate their societal impact.
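The validation figures above (sensitivity, specificity, false positive rate) all follow from a 2x2 table of primer results. The sketch below shows the arithmetic only; the counts in the usage line are hypothetical, chosen to match the reported panel sizes of 21 positive and 32 negative isolates, and are not taken from the paper.

```python
def diagnostic_rates(tp, fn, fp, tn):
    """Return (sensitivity, specificity, false_positive_rate) from confusion counts."""
    sensitivity = tp / (tp + fn)   # fraction of true outbreak isolates detected
    specificity = tn / (tn + fp)   # fraction of non-outbreak isolates correctly rejected
    return sensitivity, specificity, 1.0 - specificity

# Hypothetical counts: all 21 positives detected, 3 of 32 negatives amplify.
sens, spec, fpr = diagnostic_rates(tp=21, fn=0, fp=3, tn=29)
print(f"sensitivity={sens:.2%} specificity={spec:.2%} false positive rate={fpr:.2%}")
```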
a New Animation of Subduction Processes for Undergraduates
NASA Astrophysics Data System (ADS)
Stern, R. J.; Lieu, W. K.; Mantey, A.; Ward, A.; Todd, F.; Farrar, E.; Sean, M.; Windler, J.
2015-12-01
The subduction of oceanic lithosphere beneath convergent plate margins is a fundamental plate tectonic concept and an important Earth process. It is responsible for some of Earth's most dangerous natural hazards, including earthquakes and volcanic eruptions, but has also produced the continental crust and important mineral deposits. A range of geoscientific efforts, including the NSF MARGINS and GeoPRISMS initiatives, have advanced our understanding of subduction zone processes. Despite the importance of subduction zones and our advancing understanding of how they function, there are few animations that clearly explain the subduction process to non-expert audiences. This deficiency reflects the disparate expertise of geoscientists, who know the science but have weak animation skills, and of digital artists and animators, who have strong skills in showing objects in motion but are not experts in natural processes like plate tectonics. This transdisciplinary gap can and should be bridged. With a small grant from NSF (DUE-1444954) we set out to generate a realistic subduction zone animation aimed at the university undergraduate audience by first working within our university to rough out a draft animation and then contracting a professional to use this to construct the final version. UTD Geosciences faculty (Stern) and graduate student (Lieu) teamed up with faculty from the UTD School of Arts, Technology, and Emerging Communication (ATEC) (Farrar, Fechter, and McComber) to identify and recruit talented ATEC undergraduate students (Mantey, Ward) to work on the project. Geoscientists assembled a storyboard and met weekly with ATEC undergraduates to generate a first draft of the animation, which guided development of an accompanying narrative. The draft animation with voice-over was then handed off to professional animator Windler (Archistration CG) to generate the final animation. We plan to show both the student-generated draft version and the final animation during our presentation. 
The final animation will be freely available via the internet and will also be used as a supplement for McGraw-Hill textbooks in oceanography, physical geology, Earth science, geography, historical geology, natural hazards, and natural resources.
Federal Register 2010, 2011, 2012, 2013, 2014
2013-03-18
... may be sent to: Ian Zelo, NOAA Oil Spill Coordinator, Assessment and Restoration Division, 7600 Sand... are: (1) Improve Helmet Creek, restore juvenile and adult fish passage, (2) Improve water quality, and...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Goordial, Jacqueline; Raymond-Bouchard, Isabelle; Riley, Robert; ...
2016-03-17
Here, we report the draft genome sequence of Rhodotorula sp. strain JG1b, a yeast that was isolated from ice-cemented permafrost in the upper-elevation McMurdo Dry Valleys, Antarctica. The sequenced genome size is 19.39 Mb, consisting of 156 scaffolds and containing a total of 5,625 predicted genes. This is the first known cold-adapted Rhodotorula sp. sequenced to date.
"To Improve upon Hints of Things": Illustrating Isaac Newton.
Schilt, Cornelis J
2016-01-01
When Isaac Newton died in 1727 he left a rich legacy in terms of draft manuscripts, encompassing a variety of topics: natural philosophy, mathematics, alchemy, theology, and chronology, as well as papers relating to his career at the Mint. One thing that immediately strikes us is the textuality of Newton's legacy: images are sparse. Regarding his scholarly endeavours we witness the same practice. Newton's extensive drafts on theology and chronology do not contain a single illustration or map. Today we have all of Newton's draft manuscripts as witnesses of his working methods, as well as access to a significant number of books from his own library. Drawing parallels between Newton's reading practices and his natural philosophical and scholarly work, this paper seeks to understand Newton's recondite writing and publishing politics.
Zhang, Jianwei; Kudrna, Dave; Mu, Ting; Li, Weiming; Copetti, Dario; Yu, Yeisoo; Goicoechea, Jose Luis; Lei, Yang; Wing, Rod A
2016-10-15
Next generation sequencing technologies have revolutionized our ability to rapidly and affordably generate vast quantities of sequence data. Once generated, raw sequences are assembled into contigs or scaffolds. However, these assemblies are mostly fragmented and inaccurate at the whole genome scale, largely due to the inability to integrate additional informative datasets (e.g. physical, optical and genetic maps). To address this problem, we developed a semi-automated software tool, Genome Puzzle Master (GPM), that enables the integration of additional genomic signposts to edit and build 'new-gen-assemblies' that result in high-quality 'annotation-ready' pseudomolecules. With GPM, loaded datasets can be connected to each other via their logical relationships which accomplishes tasks to 'group,' 'merge,' 'order and orient' sequences in a draft assembly. Manual editing can also be performed with a user-friendly graphical interface. Final pseudomolecules reflect a user's total data package and are available for long-term project management. GPM is a web-based pipeline and an important part of a Laboratory Information Management System (LIMS) which can be easily deployed on local servers for any genome research laboratory. The GPM (with LIMS) package is available at https://github.com/Jianwei-Zhang/LIMS. Contacts: jzhang@mail.hzau.edu.cn or rwing@mail.arizona.edu. Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Koutsovoulos, Georgios; Laetsch, Dominik R.; Stevens, Lewis; Daub, Jennifer; Conlon, Claire; Maroon, Habib; Thomas, Fran; Aboobaker, Aziz A.
2016-01-01
Tardigrades are meiofaunal ecdysozoans that are key to understanding the origins of Arthropoda. Many species of Tardigrada can survive extreme conditions through cryptobiosis. In a recent paper [Boothby TC, et al. (2015) Proc Natl Acad Sci USA 112(52):15976–15981], the authors concluded that the tardigrade Hypsibius dujardini had an unprecedented proportion (17%) of genes originating through functional horizontal gene transfer (fHGT) and speculated that fHGT was likely formative in the evolution of cryptobiosis. We independently sequenced the genome of H. dujardini. As expected from whole-organism DNA sampling, our raw data contained reads from nontarget genomes. Filtering using metagenomics approaches generated a draft H. dujardini genome assembly of 135 Mb with superior assembly metrics to the previously published assembly. Additional microbial contamination likely remains. We found no support for extensive fHGT. Among 23,021 gene predictions we identified 0.2% strong candidates for fHGT from bacteria and 0.2% strong candidates for fHGT from nonmetazoan eukaryotes. Cross-comparison of assemblies showed that the overwhelming majority of HGT candidates in the Boothby et al. genome derived from contaminants. We conclude that fHGT into H. dujardini accounts for at most 1–2% of genes and that the proposal that one-sixth of tardigrade genes originate from functional HGT events is an artifact of undetected contamination. PMID:27035985
Koutsovoulos, Georgios; Kumar, Sujai; Laetsch, Dominik R; Stevens, Lewis; Daub, Jennifer; Conlon, Claire; Maroon, Habib; Thomas, Fran; Aboobaker, Aziz A; Blaxter, Mark
2016-05-03
Tardigrades are meiofaunal ecdysozoans that are key to understanding the origins of Arthropoda. Many species of Tardigrada can survive extreme conditions through cryptobiosis. In a recent paper [Boothby TC, et al. (2015) Proc Natl Acad Sci USA 112(52):15976-15981], the authors concluded that the tardigrade Hypsibius dujardini had an unprecedented proportion (17%) of genes originating through functional horizontal gene transfer (fHGT) and speculated that fHGT was likely formative in the evolution of cryptobiosis. We independently sequenced the genome of H. dujardini. As expected from whole-organism DNA sampling, our raw data contained reads from nontarget genomes. Filtering using metagenomics approaches generated a draft H. dujardini genome assembly of 135 Mb with superior assembly metrics to the previously published assembly. Additional microbial contamination likely remains. We found no support for extensive fHGT. Among 23,021 gene predictions we identified 0.2% strong candidates for fHGT from bacteria and 0.2% strong candidates for fHGT from nonmetazoan eukaryotes. Cross-comparison of assemblies showed that the overwhelming majority of HGT candidates in the Boothby et al. genome derived from contaminants. We conclude that fHGT into H. dujardini accounts for at most 1-2% of genes and that the proposal that one-sixth of tardigrade genes originate from functional HGT events is an artifact of undetected contamination.
Mori, Brenda; Brooks, Dina; Norman, Kathleen E; Herold, Jodi; Beaton, Dorcas E
2015-08-01
To develop the first draft of a Canadian tool to assess physiotherapy (PT) students' performance in clinical education (CE). Phase 1: to gain consensus on the items within the new tool, the number and placement of the comment boxes, and the rating scale; Phase 2: to explore the face and content validity of the draft tool. Phase 1 used the Delphi method; Phase 2 used cognitive interviewing methods with recent graduates and clinical instructors (CIs) and detailed interviews with clinical education and measurement experts. Consensus was reached on the first draft of the new tool by round 3 of the Delphi process, which was completed by 21 participants. Interviews were completed with 13 CIs, 6 recent graduates, and 7 experts. Recent graduates and CIs were able to interpret the tool accurately, felt they could apply it to a recent CE experience, and provided suggestions to improve the draft. Experts provided salient advice. The first draft of a new tool to assess PT students in CE, the Canadian Physiotherapy Assessment of Clinical Performance (ACP), was developed and will undergo further development and testing, including national consultation with stakeholders. Data from Phase 2 will contribute to developing an online education module for CIs and students.
630 A MARITIME NUCLEAR STEAM GENERATOR. Progress Report No. 2
DOE Office of Scientific and Technical Information (OSTI.GOV)
None
1962-09-28
A layout of a reduced-height 630A assembly (34 to 23 ft) was prepared and is presently being evaluated for use in a merchant vessel. While shielding studies indicate the need for some rearrangement of the shield materials, the desired radiation constraint can be obtained without an increase in shielding weight. A preliminary stress analysis of the pressure vessel, flow path analysis, and insulation evaluation was completed and showed no major problems. Evaluation of the total containment design indicates a design pressure of 45 psig. The Critical Experiment (CE) mockup is about 80% complete. The CE tank and dolly is about 50% complete. The CE hazards report was reviewed and approved. The draft of the test program and procedures document is 75% complete. The LPT control room modifications were made, and the draft of the standard operating procedures completed. The CE fuel was inspected and a significant portion was found to be of no use; about 60% requires recoating. Creep and oxidation test time on some of the fuel sheet has exceeded 3000 hr with no significant oxidation or elongation on any of the samples. The nickel-chromium alloy sheet high-temperature (1750 F) stress and oxidation testing has exceeded 5000 hr with elongations below 0.8% except for one sample of 2.3%. Experimental fuel sheet samples were prepared and comparative property studies were initiated. Fabrication of the ring test assembly, 3-F-1, for test in the MTR is essentially complete. The design of the test ring for seal evaluations was initiated. A detailed schedule for the work in FY 63 was prepared and issued for comments and concurrence. (auth)
Perry, George H; Reeves, Darryl; Melsted, Páll; Ratan, Aakrosh; Miller, Webb; Michelini, Katelyn; Louis, Edward E; Pritchard, Jonathan K; Mason, Christopher E; Gilad, Yoav
2012-01-01
We present a high-coverage draft genome assembly of the aye-aye (Daubentonia madagascariensis), a highly unusual nocturnal primate from Madagascar. Our assembly totals ~3.0 billion bp (3.0 Gb), roughly the size of the human genome, comprised of ~2.6 million scaffolds (N50 scaffold size = 13,597 bp) based on short paired-end sequencing reads. We compared the aye-aye genome sequence data with four other published primate genomes (human, chimpanzee, orangutan, and rhesus macaque) as well as with the mouse and dog genomes as nonprimate outgroups. Unexpectedly, we observed strong evidence for a relatively slow substitution rate in the aye-aye lineage compared with these and other primates. In fact, the aye-aye branch length is estimated to be ~10% shorter than that of the human lineage, which is known for its low substitution rate. This finding may be explained, in part, by the protracted aye-aye life-history pattern, including late weaning and age of first reproduction relative to other lemurs. Additionally, the availability of this draft lemur genome sequence allowed us to polarize nucleotide and protein sequence changes to the ancestral primate lineage-a critical period in primate evolution, for which the relevant fossil record is sparse. Finally, we identified 293,800 high-confidence single nucleotide polymorphisms in the donor individual for our aye-aye genome sequence, a captive-born individual from two wild-born parents. The resulting heterozygosity estimate of 0.051% is the lowest of any primate studied to date, which is understandable considering the aye-aye's extensive home-range size and relatively low population densities. Yet this level of genetic diversity also suggests that conservation efforts benefiting this unusual species should be prioritized, especially in the face of the accelerating degradation and fragmentation of Madagascar's forests.
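The N50 scaffold size quoted above is a standard contiguity statistic: the length of the shortest scaffold in the smallest set of longest scaffolds that together cover at least half the assembly. A minimal sketch of the calculation follows; the function name and the toy lengths are illustrative, not data from the aye-aye assembly.

```python
def n50(lengths):
    """Length L such that sequences of length >= L cover at least half the total."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:   # reached half of the assembly size
            return length
    raise ValueError("lengths must be non-empty")

print(n50([2, 2, 2, 3, 3, 4, 8, 8]))
```

A low N50 relative to typical gene length, as in a highly fragmented draft, means many genes will span more than one scaffold.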
NASA Astrophysics Data System (ADS)
Hernsdorf, A. W.; Amano, Y.; Suzuki, Y.; Ise, K.; Thomas, B. C.; Banfield, J. F.
2015-12-01
Terrestrial sediments are an important global reservoir for methane. Microorganisms in the deep subsurface play a critical role in the methane cycle, yet much remains to be learned about their diversity and metabolisms. To provide more comprehensive insight into the microbiology of the methane cycle in the deep subsurface, we conducted a genome-resolved study of samples collected from the Horonobe Underground Research Laboratory (HURL), Japan. Groundwater samples were obtained from three boreholes from a depth range of between 140 m and 250 m in two consecutive years. Groundwater was filtered and metagenomic DNA extracted and sequenced, and the sequence data assembled. Based on the sequences of phylogenetically informative genes on the assembled fragments, we detected a high degree of overlap in community composition across a vertical transect within one borehole at the two sampling times. However, there was comparatively little similarity observed among communities across boreholes. Spatial and temporal abundance patterns were used in combination with tetranucleotide signatures of assembled genome fragments to bin the data and reconstruct over 200 unique draft genomes, of which 137 are considered to be of high quality (>90% complete). The deepest samples from one borehole were highly dominated by an archaeon identified as ANME-2D; this organism was also present at lower abundance in all other samples from that borehole. Also abundant in these microbial communities were novel members of the Gammaproteobacteria, Saccharibacteria (TM7) and Tenericutes phyla. Notably, a ~2 Mbp draft genome for the ANME-2D archaeon was reconstructed. As expected, the genome encodes all of the genes predicted to be involved in the reverse methanogenesis pathway. In contrast with the previously reported ANME-2D genome, the HURL ANME-2D genome lacks the capacity to reduce nitrate. 
However, we identified many multiheme cytochromes with closest similarity to those of the known Fe-reducing/oxidizing archaeon Ferroglobus placidus. Thus, we suggest that ANME-2D may couple methane oxidation to reduction of ferric iron minerals in the sediment and may be generally important as a link between the iron and methane cycles in deep subsurface environments. Such information has important implications for modeling the global carbon cycle.
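The tetranucleotide signatures used for binning above are, at their simplest, the relative frequencies of all 256 possible 4-mers in each assembled fragment. The sketch below computes such a profile under simplifying assumptions: it does not merge reverse complements, which binning tools commonly do, and the function name is hypothetical rather than taken from the study.

```python
from collections import Counter
from itertools import product

def tetranucleotide_profile(seq):
    """Return 256 tetranucleotide frequencies in fixed AAAA..TTTT order."""
    seq = seq.upper()
    # Count every overlapping window of length 4.
    counts = Counter(seq[i:i + 4] for i in range(len(seq) - 3))
    kmers = ["".join(p) for p in product("ACGT", repeat=4)]
    # Windows containing N or other symbols are excluded from the total.
    total = sum(counts[k] for k in kmers)
    return [counts[k] / total if total else 0.0 for k in kmers]

profile = tetranucleotide_profile("ACGTACGT")
print(len(profile), round(sum(profile), 6))
```

Fragments from the same genome tend to have similar profiles, so these vectors can be clustered together with the abundance patterns mentioned in the abstract.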
Geib, Scott M; Hall, Brian; Derego, Theodore; Bremer, Forest T; Cannoles, Kyle; Sim, Sheina B
2018-04-01
One of the most overlooked, yet critical, components of a whole genome sequencing (WGS) project is the submission and curation of the data to a genomic repository, most commonly the National Center for Biotechnology Information (NCBI). While large genome centers or genome groups have developed software tools for post-annotation assembly filtering, annotation, and conversion into the NCBI's annotation table format, these tools typically require back-end setup and connection to a Structured Query Language (SQL) database and/or some knowledge of programming (Perl, Python) to implement. With WGS becoming commonplace, genome sequencing projects are moving away from the genome centers and into the ecology or biology lab, where fewer resources are present to support the process of genome assembly curation. To fill this gap, we developed software to assess, filter, and transfer annotation and convert a draft genome assembly and annotation set into the NCBI annotation table (.tbl) format, facilitating submission to the NCBI Genome Assembly database. This software has no dependencies, is compatible across platforms, and utilizes a simple command to perform a variety of simple and complex post-analysis, pre-NCBI submission WGS project tasks. The Genome Annotation Generator is a consistent and user-friendly bioinformatics tool that can be used to generate a .tbl file that is consistent with the NCBI submission pipeline. The Genome Annotation Generator achieves the goal of providing a publicly available tool that will facilitate the submission of annotated genome assemblies to the NCBI. It is useful for any individual researcher or research group that wishes to submit a genome assembly of their study system to the NCBI.
Hall, Brian; Derego, Theodore; Bremer, Forest T; Cannoles, Kyle
2018-01-01
Background: One of the most overlooked, yet critical, components of a whole genome sequencing (WGS) project is the submission and curation of the data to a genomic repository, most commonly the National Center for Biotechnology Information (NCBI). While large genome centers or genome groups have developed software tools for post-annotation assembly filtering, annotation, and conversion into the NCBI’s annotation table format, these tools typically require back-end setup and connection to a Structured Query Language (SQL) database and/or some knowledge of programming (Perl, Python) to implement. With WGS becoming commonplace, genome sequencing projects are moving away from the genome centers and into the ecology or biology lab, where fewer resources are present to support the process of genome assembly curation. To fill this gap, we developed software to assess, filter, and transfer annotation and convert a draft genome assembly and annotation set into the NCBI annotation table (.tbl) format, facilitating submission to the NCBI Genome Assembly database. This software has no dependencies, is compatible across platforms, and utilizes a simple command to perform a variety of simple and complex post-analysis, pre-NCBI submission WGS project tasks. Findings: The Genome Annotation Generator is a consistent and user-friendly bioinformatics tool that can be used to generate a .tbl file that is consistent with the NCBI submission pipeline. Conclusions: The Genome Annotation Generator achieves the goal of providing a publicly available tool that will facilitate the submission of annotated genome assemblies to the NCBI. It is useful for any individual researcher or research group that wishes to submit a genome assembly of their study system to the NCBI. PMID:29635297
DOT National Transportation Integrated Search
2016-08-01
There is optimism that Automated Vehicles (AVs) can improve the safety of the transportation system, : reduce congestion, increase reliability, offer improved mobility solutions to all segments of the population : including the transportation-disadva...
Oppert, Brenda; Perkin, Lindsey; Martynov, Alexander G; Elpidina, Elena N
2018-04-01
The gut is one of the primary interfaces between an insect and its environment. Understanding gene expression profiles in the insect gut can provide insight into interactions with the environment as well as identify potential control methods for pests. We compared the expression profiles of transcripts from the gut of larval stages of two coleopteran insects, Tenebrio molitor and Tribolium castaneum. These tenebrionids have different life cycles, varying in the duration and number of larval instars. T. castaneum has a sequenced genome and has been a model for coleopterans, and we recently obtained a draft genome for T. molitor. We assembled gut transcriptome reads from each insect to their respective genomes and filtered mapped reads to RPKM>1, yielding 11,521 and 17,871 genes in the T. castaneum and T. molitor datasets, respectively. There were identical GO terms in each dataset, and enrichment analyses also identified shared GO terms. From these datasets, we compiled an ortholog list of 6907 genes; 45% of the total assembled reads from T. castaneum were found in the top 25 orthologs, but only 27% of assembled reads were found in the top 25 T. molitor orthologs. There were 2281 genes unique to T. castaneum, and 2088 predicted genes unique to T. molitor, although improvements to the T. molitor genome will likely reduce these numbers as more orthologs are identified. We highlight a few unique genes in T. castaneum or T. molitor that may relate to distinct biological functions. A large number of putative genes expressed in the larval gut with uncharacterized functions (36 and 68% from T. castaneum and T. molitor, respectively) support the need for further research. These data are the first step in building a comprehensive understanding of the physiology of the gut in tenebrionid insects, illustrating commonalities and differences that may be related to speciation and environmental adaptation. Published by Elsevier Ltd.
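The RPKM>1 filter mentioned above normalizes each gene's read count by gene length (in kilobases) and by sequencing depth (in millions of mapped reads). The sketch below shows that arithmetic; the gene names, counts and lengths are invented for illustration, and the total mapped reads is approximated by the sum of the listed counts, which is an assumption rather than the authors' procedure.

```python
def rpkm(read_count, gene_length_bp, total_mapped_reads):
    """Reads per kilobase of transcript per million mapped reads."""
    return read_count * 1e9 / (gene_length_bp * total_mapped_reads)

def expressed_genes(counts, lengths, threshold=1.0):
    """Return the set of gene IDs whose RPKM exceeds the threshold."""
    total = sum(counts.values())  # assumption: all mapped reads are in `counts`
    return {g for g, c in counts.items() if rpkm(c, lengths[g], total) > threshold}

# Hypothetical toy data (not from the study).
counts = {"g1": 500, "g2": 1, "g3": 999_499}
lengths = {"g1": 2000, "g2": 5000, "g3": 1000}
print(sorted(expressed_genes(counts, lengths)))
```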
Evolution of modern approaches to express uncertainty in measurement
NASA Astrophysics Data System (ADS)
Kacker, Raghu; Sommer, Klaus-Dieter; Kessel, Rüdiger
2007-12-01
An object of this paper is to discuss the logical development of the concept of uncertainty in measurement and the methods for its quantification from the classical error analysis to the modern approaches based on the Guide to the Expression of Uncertainty in Measurement (GUM). We review authoritative literature on error analysis and then discuss its limitations which motivated the experts from the International Committee for Weights and Measures (CIPM), the International Bureau of Weights and Measures (BIPM) and various national metrology institutes to develop specific recommendations which form the basis of the GUM. We discuss the new concepts introduced by the GUM and their merits and limitations. The limitations of the GUM led the BIPM Joint Committee on Guides in Metrology to develop an alternative approach—the draft Supplement 1 to the GUM (draft GUM-S1). We discuss the draft GUM-S1 and its merits and limitations. We hope this discussion will lead to a more effective use of the GUM and the draft GUM-S1 and stimulate investigations leading to further improvements in the methods to quantify uncertainty in measurement.
Asadi, Leila; Beigi, Marjan; Valiani, Mahbube; Mardani, Fardin
2017-01-01
Medical errors are a main concern in health systems; given their rising rate in recent years, especially in the field of midwifery, they have caused a medical crisis. Considering the importance of evidence-based health services as a way to improve health systems, the aim of this study was to suggest a guideline for preventing malpractice in midwifery services. In this cross-sectional study that was conducted in 2013, we investigated 206 cases that were referred to the Isfahan Legal Medicine Organization and Medical Council of Forensic Medicine from 2006-2011. Data were collected by a checklist and were analyzed using SPSS-16 software. Descriptive statistical tests (mean, maximum, minimum, standard deviation, frequency, and percentage agreement) were used to describe the data. Then, we used the Delphi technique with the participation of 17 experts in midwifery, gynecology, and legal medicine to provide an evidence-based draft guideline for prevention of midwifery errors. A total of 206 cases were reviewed. In 66 cases (32%) the verdict for malpractice in midwifery services was approved. A practical draft guideline for preventing clinical errors for midwifery in the fields of pregnancy, delivery, and postpartum period was developed. This evidence-based draft guideline can improve the attention of all the healthcare providers, especially midwives and physicians, to prevent urgent problems and offer effective health services for mothers and infants.
Asadi, Leila; Beigi, Marjan; Valiani, Mahbube; Mardani, Fardin
2017-01-01
Background: Medical errors are a main concern in health systems; given their rising rate in recent years, especially in the field of midwifery, they have caused a medical crisis. Considering the importance of evidence-based health services as a way to improve health systems, the aim of this study was to suggest a guideline for preventing malpractice in midwifery services. Materials and Methods: In this cross-sectional study that was conducted in 2013, we investigated 206 cases that were referred to the Isfahan Legal Medicine Organization and Medical Council of Forensic Medicine from 2006–2011. Data were collected by a checklist and were analyzed using SPSS-16 software. Descriptive statistical tests (mean, maximum, minimum, standard deviation, frequency, and percentage agreement) were used to describe the data. Then, we used the Delphi technique with the participation of 17 experts in midwifery, gynecology, and legal medicine to provide an evidence-based draft guideline for prevention of midwifery errors. Results: A total of 206 cases were reviewed. In 66 cases (32%) the verdict for malpractice in midwifery services was approved. A practical draft guideline for preventing clinical errors for midwifery in the fields of pregnancy, delivery, and postpartum period was developed. Conclusions: This evidence-based draft guideline can improve the attention of all the healthcare providers, especially midwives and physicians, to prevent urgent problems and offer effective health services for mothers and infants. PMID:28904546
Hoy, Marjorie A.; Waterhouse, Robert M.; Wu, Ke; Estep, Alden S.; Ioannidis, Panagiotis; Palmer, William J.; Pomerantz, Aaron F.; Simão, Felipe A.; Thomas, Jainy; Jiggins, Francis M.; Murphy, Terence D.; Pritham, Ellen J.; Robertson, Hugh M.; Zdobnov, Evgeny M.; Gibbs, Richard A.; Richards, Stephen
2016-01-01
Metaseiulus occidentalis is an eyeless phytoseiid predatory mite employed for the biological control of agricultural pests including spider mites. Despite appearances, these predator and prey mites are separated by some 400 Myr of evolution and radically different lifestyles. We present a 152-Mb draft assembly of the M. occidentalis genome: Larger than that of its favored prey, Tetranychus urticae, but considerably smaller than those of many other chelicerates, enabling an extremely contiguous and complete assembly to be built—the best arachnid to date. Aided by transcriptome data, genome annotation cataloged 18,338 protein-coding genes and identified large numbers of Helitron transposable elements. Comparisons with other arthropods revealed a particularly dynamic and turbulent genomic evolutionary history. Its genes exhibit elevated molecular evolution, with strikingly high numbers of intron gains and losses, in stark contrast to the deer tick Ixodes scapularis. Uniquely among examined arthropods, this predatory mite’s Hox genes are completely atomized, dispersed across the genome, and it encodes five copies of the normally single-copy RNA processing Dicer-2 gene. Examining gene families linked to characteristic biological traits of this tiny predator provides initial insights into processes of sex determination, development, immune defense, and how it detects, disables, and digests its prey. As the first reference genome for the Phytoseiidae, and for any species with the rare sex determination system of parahaploidy, the genome of the western orchard predatory mite improves genomic sampling of chelicerates and provides invaluable new resources for functional genomic analyses of this family of agriculturally important mites. PMID:26951779
Naor, Michael; Heyman, Samuel N; Bader, Tarif; Merin, Ofer
2017-01-01
The Israeli Defense Force (IDF) Medical Corps developed a model of an airborne field hospital. The model was structured for disaster settings, which demand self-sufficiency, innovation and a flexible operative mode amid large margins of uncertainty about the disaster environment. The current study aims to critically analyze the experience gathered in ten such missions worldwide. Interviews with physicians who actively participated in the missions from 1988 until 2015 as chief medical officers were combined with a literature review of principal medical and auxiliary publications in order to assess and integrate information about the assembly of these missions. A body of knowledge was accumulated over the years by the IDF Medical Corps from deploying numerous relief missions to both natural (earthquake, typhoon, and tsunami) and man-made disasters occurring in nine countries (Armenia, Rwanda, Kosovo, Turkey, India, Haiti, Japan, Philippines, and Nepal). This study shows an evolutionary pattern, with improvements implemented from one mission to the next and special adaptations (creativity and improvisation) to accommodate logistical barriers. The principles and operative function for deploying a medical relief system, proposed over 20 years ago, were challenged and validated in the subsequent IDF missions outlined in the current study. These principles, together with the advantage of the military infrastructure and the expertise of drafted civilian medical professionals, enable the rapid assembly and allocation of highly competent medical facilities in disaster settings. This structural model is to a large extent self-sufficient, with substantial operative flexibility that permits early deployment upon request while the disaster assessment and definition of needs are still preliminary.
Kanost, Michael R.; Arrese, Estela L.; Cao, Xiaolong; Chen, Yun-Ru; Chellapilla, Sanjay; Goldsmith, Marian R; Grosse-Wilde, Ewald; Heckel, David G.; Herndon, Nicolae; Jiang, Haobo; Papanicolaou, Alexie; Qu, Jiaxin; Soulages, Jose L.; Vogel, Heiko; Walters, James; Waterhouse, Robert M.; Ahn, Seung-Joon; Almeida, Francisca C.; An, Chunju; Aqrawi, Peshtewani; Bretschneider, Anne; Bryant, William B.; Bucks, Sascha; Chao, Hsu; Chevignon, Germain; Christen, Jayne M.; Clarke, David F.; Dittmer, Neal T.; Ferguson, Laura C.F.; Garavelou, Spyridoula; Gordon, Karl H.J.; Gunaratna, Ramesh T.; Han, Yi; Hauser, Frank; He, Yan; Heidel-Fischer, Hanna; Hirsh, Ariana; Hu, Yingxia; Jiang, Hongbo; Kalra, Divya; Klinner, Christian; König, Christopher; Kovar, Christie; Kroll, Ashley R.; Kuwar, Suyog S.; Lee, Sandy L.; Lehman, Rüdiger; Li, Kai; Li, Zhaofei; Liang, Hanquan; Lovelace, Shanna; Lu, Zhiqiang; Mansfield, Jennifer H.; McCulloch, Kyle J.; Mathew, Tittu; Morton, Brian; Muzny, Donna M.; Neunemann, David; Ongeri, Fiona; Pauchet, Yannick; Pu, Ling-Ling; Pyrousis, Ioannis; Rao, Xiang-Jun; Redding, Amanda; Roesel, Charles; Sanchez-Gracia, Alejandro; Schaack, Sarah; Shukla, Aditi; Tetreau, Guillaume; Wang, Yang; Xiong, Guang-Hua; Traut, Walther; Walsh, Tom K.; Worley, Kim C.; Wu, Di; Wu, Wenbi; Wu, Yuan-Qing; Zhang, Xiufeng; Zou, Zhen; Zucker, Hannah; Briscoe, Adriana D.; Burmester, Thorsten; Clem, Rollie J.; Feyereisen, René; Grimmelikhuijzen, Cornelis J.P; Hamodrakas, Stavros J.; Hansson, Bill S.; Huguet, Elisabeth; Jermiin, Lars S.; Lan, Que; Lehman, Herman K.; Lorenzen, Marce; Merzendorfer, Hans; Michalopoulos, Ioannis; Morton, David B.; Muthukrishnan, Subbaratnam; Oakeshott, John G.; Palmer, Will; Park, Yoonseong; Passarelli, A. Lorena; Rozas, Julio; Schwartz, Lawrence M.; Smith, Wendy; Southgate, Agnes; Vilcinskas, Andreas; Vogt, Richard; Wang, Ping; Werren, John; Yu, Xiao-Qiang; Zhou, Jing-Jiang; Brown, Susan J.; Scherer, Steven E.; Richards, Stephen; Blissard, Gary W.
2016-01-01
Manduca sexta, known as the tobacco hornworm or Carolina sphinx moth, is a lepidopteran insect that is used extensively as a model system for research in insect biochemistry, physiology, neurobiology, development, and immunity. One important benefit of this species as an experimental model is its extremely large size, reaching more than 10 g in the larval stage. M. sexta larvae feed on solanaceous plants and thus must tolerate a substantial challenge from plant allelochemicals, including nicotine. We report the sequence and annotation of the M. sexta genome, and a survey of gene expression in various tissues and developmental stages. The Msex_1.0 genome assembly resulted in a total genome size of 419.4 Mbp. Repetitive sequences accounted for 25.8% of the assembled genome. The official gene set is comprised of 15,451 protein-coding genes, of which 2498 were manually curated. Extensive RNA-seq data from many tissues and developmental stages were used to improve gene models and for insights into gene expression patterns. Genome wide synteny analysis indicated a high level of macrosynteny in the Lepidoptera. Annotation and analyses were carried out for gene families involved in a wide spectrum of biological processes, including apoptosis, vacuole sorting, growth and development, structures of exoskeleton, egg shells, and muscle, vision, chemosensation, ion channels, signal transduction, neuropeptide signaling, neurotransmitter synthesis and transport, nicotine tolerance, lipid metabolism, and immunity. This genome sequence, annotation, and analysis provide an important new resource from a well-studied model insect species and will facilitate further biochemical and mechanistic experimental studies of many biological systems in insects. PMID:27522922
Shedding genomic light on Aristotle's lantern.
Sodergren, Erica; Shen, Yufeng; Song, Xingzhi; Zhang, Lan; Gibbs, Richard A; Weinstock, George M
2006-12-01
Sea urchins have proved fascinating to biologists since the time of Aristotle who compared the appearance of their bony mouth structure to a lantern in The History of Animals. Throughout modern times it has been a model system for research in developmental biology. Now, the genome of the sea urchin Strongylocentrotus purpuratus is the first echinoderm genome to be sequenced. A high quality draft sequence assembly was produced using the Atlas assembler to combine whole genome shotgun sequences with sequences from a collection of BACs selected to form a minimal tiling path along the genome. A formidable challenge was presented by the high degree of heterozygosity between the two haplotypes of the selected male representative of this marine organism. This was overcome by use of the BAC tiling path backbone, in which each BAC represents a single haplotype, as well as by improvements in the Atlas software. Another innovation introduced in this project was the sequencing of pools of tiling path BACs rather than individual BAC sequencing. The Clone-Array Pooled Shotgun Strategy greatly reduced the cost and time devoted to preparing shotgun libraries from BAC clones. The genome sequence was analyzed with several gene prediction methods to produce a comprehensive gene list that was then manually refined and annotated by a volunteer team of sea urchin experts. This latter annotation community edited over 9000 gene models and uncovered many unexpected aspects of the sea urchin genetic content impacting transcriptional regulation, immunology, sensory perception, and an organism's development. Analysis of the basic deuterostome genetic complement supports the sea urchin's role as a model system for deuterostome and, by extension, chordate development.
A draft genome assembly of the army worm, Spodoptera frugiperda.
Kakumani, Pavan Kumar; Malhotra, Pawan; Mukherjee, Sunil K; Bhatnagar, Raj K
2014-08-01
Spodoptera is an agriculturally important pest insect, and studies of its biology have been limited by the unavailability of its genome. In the present study, the genomic DNA was sequenced and assembled into 37,243 scaffolds totalling 358 Mb, with an N50 of 53.7 kb. Based on degree of identity, we could anchor 305 Mb of the genome onto all 28 chromosomes of Bombyx mori. Repeat elements were identified and account for 20.28% of the total genome. Further, we predicted 11,595 genes, with an average intron length of 726 bp. The genes were annotated, and domain analysis revealed that Sf genes share significant homology and expression patterns with B. mori, despite differences in KOG gene categories and in the representation of certain protein families. The present study of the Sf genome should help in characterizing cellular pathways to understand its biology, and in comparative evolutionary studies among lepidopteran family members to help annotate their genomes. Copyright © 2014 Elsevier Inc. All rights reserved.
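The N50 quoted in this abstract is a standard contiguity statistic: the length L such that scaffolds of length L or longer together contain at least half of the assembled bases. A minimal Python sketch of that definition follows; the function name `n50` and the toy scaffold lengths are illustrative assumptions and are not taken from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length of the shortest sequence in the smallest set of
    longest sequences that together cover at least half of all bases.
    """
    if not lengths:
        raise ValueError("need at least one length")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence downwards, accumulating bases.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Hypothetical toy assembly (300 bases in total), for illustration only.
toy_scaffolds = [100, 80, 50, 30, 20, 10, 10]
print(n50(toy_scaffolds))
```

Because N50 depends only on the length distribution, it says nothing about whether the scaffolds are correctly assembled, which is why it is usually reported alongside other quality measures.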
Qi, Weihong; Vaughan, Lloyd; Katharios, Pantelis; Schlapbach, Ralph; Seth-Smith, Helena M.B.
2016-01-01
Advances in single-cell and mini-metagenome sequencing have enabled important investigations into uncultured bacteria. In this study, we applied the mini-metagenome sequencing method to assemble genome drafts of the uncultured causative agents of epitheliocystis, an emerging infectious disease in the Mediterranean aquaculture species gilthead seabream. We sequenced multiple cyst samples and constructed 11 genome drafts from a novel beta-proteobacterial lineage, Candidatus Ichthyocystis. The draft genomes demonstrate features typical of pathogenic bacteria with an obligate intracellular lifestyle: a reduced genome of up to 2.6 Mb, reduced G + C content, and reduced metabolic capacity. Reconstruction of metabolic pathways reveals that Ca. Ichthyocystis genomes lack all amino acid synthesis pathways, compelling them to scavenge from the fish host. All genomes encode type II, III, and IV secretion systems, a large repertoire of predicted effectors, and a type IV pilus. These are all considered to be virulence factors, required for adherence, invasion, and host manipulation. However, no evidence of lipopolysaccharide synthesis could be found. Beyond the core functions shared within the genus, alignments showed distinction into different species, characterized by alternative large gene families. These comprise up to a third of each genome, appear to have arisen through duplication and diversification, encode many effector proteins, and are seemingly critical for virulence. Thus, Ca. Ichthyocystis represents a novel obligatory intracellular pathogenic beta-proteobacterial lineage. The methods used: mini-metagenome analysis and manual annotation, have generated important insights into the lifestyle and evolution of the novel, uncultured pathogens, elucidating many putative virulence factors including an unprecedented array of novel gene families. PMID:27190004
The draft genome of Ruellia speciosa (Beautiful Wild Petunia: Acanthaceae).
Zhuang, Yongbin; Tripp, Erin A
2017-04-01
The genus Ruellia (Wild Petunias; Acanthaceae) is characterized by an enormous diversity of floral shapes and colours manifested among closely related species. Using the Illumina platform, we reconstructed the draft genome of Ruellia speciosa, with a scaffold size of 1,021 Mb (or ∼1.02 Gb) and an N50 size of 17,908 bp, spanning ∼93% of the estimated genome (∼1.1 Gb). The draft assembly predicted 40,124 gene models and phylogenetic analyses of four key enzymes involved in anthocyanin colour production [flavanone 3-hydroxylase (F3H), flavonoid 3'-hydroxylase (F3'H), flavonoid 3',5'-hydroxylase (F3'5'H), and dihydroflavonol 4-reductase (DFR)] found that most angiosperms here sampled harboured at least one copy of F3H, F3'H, and DFR. In contrast, fewer than one-half (but including R. speciosa) harboured a copy of F3'5'H, supporting observations that blue flowers and/or fruits, which this enzyme is required for, are less common among flowering plants. Ka/Ks analyses of duplicated copies of F3'H and DFR in R. speciosa suggested purifying selection in the former but detected evidence of positive selection in the latter. The genome sequence and annotation of R. speciosa represents one of only four families sequenced in the large and important Asterid clade of flowering plants and, as such, will facilitate extensive future research on this diverse group, particularly with respect to floral evolution. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
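Ka/Ks compares the rate of nonsynonymous substitutions (Ka) with the rate of synonymous substitutions (Ks). By convention, a ratio well below 1 is read as purifying selection, a ratio near 1 as roughly neutral evolution, and a ratio above 1 as a signal of positive selection. The sketch below only illustrates that conventional reading; it is not the authors' analysis, and the function name, the tolerance `tol`, and the input values are all hypothetical.

```python
def classify_ka_ks(ka, ks, tol=0.05):
    """Classify a Ka/Ks ratio using the conventional thresholds.

    `tol` is an arbitrary, assumed band around 1 that is treated as
    "neutral"; real analyses rely on statistical tests instead.
    """
    if ks <= 0:
        raise ValueError("Ks must be positive")
    ratio = ka / ks
    if ratio < 1 - tol:
        return ratio, "purifying"
    if ratio > 1 + tol:
        return ratio, "positive"
    return ratio, "neutral"


# Hypothetical substitution rates, for illustration only.
print(classify_ka_ks(0.02, 0.10))
print(classify_ka_ks(0.30, 0.10))
```

In practice a gene-wide ratio above 1 is rare, so evidence of positive selection usually comes from likelihood-based tests on individual sites or branches and not from a single threshold.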
The draft genome of Ruellia speciosa (Beautiful Wild Petunia: Acanthaceae)
Zhuang, Yongbin
2017-01-01
The genus Ruellia (Wild Petunias; Acanthaceae) is characterized by an enormous diversity of floral shapes and colours manifested among closely related species. Using the Illumina platform, we reconstructed the draft genome of Ruellia speciosa, with a scaffold size of 1,021 Mb (or ∼1.02 Gb) and an N50 size of 17,908 bp, spanning ∼93% of the estimated genome (∼1.1 Gb). The draft assembly predicted 40,124 gene models and phylogenetic analyses of four key enzymes involved in anthocyanin colour production [flavanone 3-hydroxylase (F3H), flavonoid 3′-hydroxylase (F3′H), flavonoid 3′,5′-hydroxylase (F3′5′H), and dihydroflavonol 4-reductase (DFR)] found that most angiosperms here sampled harboured at least one copy of F3H, F3′H, and DFR. In contrast, fewer than one-half (but including R. speciosa) harboured a copy of F3′5′H, supporting observations that blue flowers and/or fruits, which this enzyme is required for, are less common among flowering plants. Ka/Ks analyses of duplicated copies of F3′H and DFR in R. speciosa suggested purifying selection in the former but detected evidence of positive selection in the latter. The genome sequence and annotation of R. speciosa represents one of only four families sequenced in the large and important Asterid clade of flowering plants and, as such, will facilitate extensive future research on this diverse group, particularly with respect to floral evolution. PMID:28431014
76 FR 78252 - Environmental Impacts Statements; Notice of Availability
Federal Register 2010, 2011, 2012, 2013, 2014
2011-12-16
..., Rubicon Trail Easement and Resource Improvement Project, Construction and Operation, Right-of-Way Grant...: Andrea Catanzaro (409) 766-6346 EIS No. 20110421, Draft EIS, USFS, CA, Greys Mountain Ecological...--La Crosse Transmission System Improvement Project, Proposed Construction and Operation of a 345...
Federal Register 2010, 2011, 2012, 2013, 2014
2012-06-27
... modernize the design at the I-405, SR-91 and a portion of the I-5 interchanges, modernize and reconfigure... Intelligent Transportation Systems (ITS) improvements; improvements to 42 local arterial intersections within the I-710 Corridor; aesthetic enhancements; and, drainage and water quality improvement design...
Federal Register 2010, 2011, 2012, 2013, 2014
2010-06-24
... for Improvements to the Calexico West Port of Entry, Calexico, CA AGENCY: Public Buildings Service... Impact Statement (EIS) for Improvements to the Calexico West Port of Entry, Calexico, California, for... and soils) or topic area (e.g., traffic, environmental justice). After the public comment period...
Federal Register 2010, 2011, 2012, 2013, 2014
2011-11-10
... and parking apron improvements; construction of additional bachelor enlisted quarters (BEQs... MCB Hawaii Kaneohe Bay, types of basing facilities that are required, improvements at training areas... construction, replacement, and renovation of facilities at MCB Hawaii Kaneohe Bay, and include improvements to...
Genome and transcriptome of the porcine whipworm Trichuris suis
Jex, Aaron R.; Nejsum, Peter; Schwarz, Erich M.; Hu, Li; Young, Neil D.; Hall, Ross S.; Korhonen, Pasi K.; Liao, Shengguang; Thamsborg, Stig; Xia, Jinquan; Xu, Pengwei; Wang, Shaowei; Scheerlinck, Jean-Pierre Y.; Hofmann, Andreas; Sternberg, Paul W.; Wang, Jun; Gasser, Robin B.
2014-01-01
Trichuris (whipworm) infects 1 billion people worldwide, and causes a disease (trichuriasis) that results in major socioeconomic losses in both humans and pigs. Trichuriasis relates to an inflammation of the large intestine manifested in bloody diarrhoea, and chronic disease can cause malnourishment and stunting in children. Paradoxically, Trichuris of pigs has shown substantial promise as a treatment for human autoimmune disorders, including inflammatory bowel disease (IBD) and multiple sclerosis (MS). Here, we report ~80 megabase (Mb) draft assemblies of the genomes of adult male and female T. suis, and explore stage-, sex- and tissue-specific transcription of messenger and small non-coding RNAs. PMID:24929829
Genome and transcriptome of the porcine whipworm Trichuris suis.
Jex, Aaron R; Nejsum, Peter; Schwarz, Erich M; Hu, Li; Young, Neil D; Hall, Ross S; Korhonen, Pasi K; Liao, Shengguang; Thamsborg, Stig; Xia, Jinquan; Xu, Pengwei; Wang, Shaowei; Scheerlinck, Jean-Pierre Y; Hofmann, Andreas; Sternberg, Paul W; Wang, Jun; Gasser, Robin B
2014-07-01
Trichuris (whipworm) infects 1 billion people worldwide and causes a disease (trichuriasis) that results in major socioeconomic losses in both humans and pigs. Trichuriasis relates to an inflammation of the large intestine manifested in bloody diarrhea, and chronic disease can cause malnourishment and stunting in children. Paradoxically, Trichuris of pigs has shown substantial promise as a treatment for human autoimmune disorders, including inflammatory bowel disease (IBD) and multiple sclerosis. Here we report whole-genome sequencing at ∼140-fold coverage of adult male and female T. suis and ∼80-Mb draft assemblies. We explore stage-, sex- and tissue-specific transcription of mRNAs and small noncoding RNAs.
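Fold coverage is commonly defined as the total number of sequenced bases divided by the genome size. Under that assumption, and using the ∼80-Mb assembly size as a stand-in for genome size, the ∼140-fold figure implies roughly 11 Gb of sequence per genome. The Python sketch below works through that arithmetic; the helper names are hypothetical, and the implied base count is an estimate, not a number reported in the abstract.

```python
def fold_coverage(total_bases, genome_size):
    """Average depth: sequenced bases divided by genome size."""
    if genome_size <= 0:
        raise ValueError("genome size must be positive")
    return total_bases / genome_size


def implied_bases(coverage, genome_size):
    """Invert the definition: bases needed to reach a given depth."""
    return coverage * genome_size


assembly_size = 80_000_000  # ~80 Mb, used here as an assumed genome size
bases = implied_bases(140, assembly_size)
print(bases)                               # 11200000000
print(fold_coverage(bases, assembly_size)) # 140.0
```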
A synteny-based draft genome sequence of the forage grass Lolium perenne.
Byrne, Stephen L; Nagy, Istvan; Pfeifer, Matthias; Armstead, Ian; Swain, Suresh; Studer, Bruno; Mayer, Klaus; Campbell, Jacqueline D; Czaban, Adrian; Hentrup, Stephan; Panitz, Frank; Bendixen, Christian; Hedegaard, Jakob; Caccamo, Mario; Asp, Torben
2015-11-01
Here we report the draft genome sequence of perennial ryegrass (Lolium perenne), an economically important forage and turf grass species that is widely cultivated in temperate regions worldwide. It is classified along with wheat, barley, oats and Brachypodium distachyon in the Pooideae sub-family of the grass family (Poaceae). Transcriptome data was used to identify 28,455 gene models, and we utilized macro-co-linearity between perennial ryegrass and barley, and synteny within the grass family, to establish a synteny-based linear gene order. The gametophytic self-incompatibility mechanism enables the pistil of a plant to reject self-pollen and therefore promote out-crossing. We have used the sequence assembly to characterize transcriptional changes in the stigma during pollination with both compatible and incompatible pollen. Characterization of the pollen transcriptome identified homologs to pollen allergens from a range of species, many of which were expressed to very high levels in mature pollen grains, and are potentially involved in the self-incompatibility mechanism. The genome sequence provides a valuable resource for future breeding efforts based on genomic prediction, and will accelerate the development of new varieties for more productive grasslands. © 2015 The Authors The Plant Journal © 2015 John Wiley & Sons Ltd.
Speth, Daan R; Lagkouvardos, Ilias; Wang, Yong; Qian, Pei-Yuan; Dutilh, Bas E; Jetten, Mike S M
2017-07-01
Several recent studies have indicated that members of the phylum Planctomycetes are abundantly present at the brine-seawater interface (BSI) above multiple brine pools in the Red Sea. Planctomycetes include bacteria capable of anaerobic ammonium oxidation (anammox). Here, we investigated the possibility of anammox at BSI sites using metagenomic shotgun sequencing of DNA obtained from the BSI above the Discovery Deep brine pool. Analysis of sequencing reads matching the 16S rRNA and hzsA genes confirmed presence of anammox bacteria of the genus Scalindua. Phylogenetic analysis of the 16S rRNA gene indicated that this Scalindua sp. belongs to a distinct group, separate from the anammox bacteria in the seawater column, that contains mostly sequences retrieved from high-salt environments. Using coverage- and composition-based binning, we extracted and assembled the draft genome of the dominant anammox bacterium. Comparative genomic analysis indicated that this Scalindua species uses compatible solutes for osmoadaptation, in contrast to other marine anammox bacteria that likely use a salt-in strategy. We propose the name Candidatus Scalindua rubra for this novel species, alluding to its discovery in the Red Sea.
Pfeiffer, Friedhelm; Zamora-Lagos, Maria-Antonia; Blettinger, Martin; Yeroslaviz, Assa; Dahl, Andreas; Gruber, Stephan; Habermann, Bianca H
2018-01-05
Due to the predominant usage of short-read sequencing to date, most bacterial genome sequences reported in the last years remain at the draft level. This precludes certain types of analyses, such as the in-depth analysis of genome plasticity. Here we report the finalized genome sequence of the environmental strain Aeromonas salmonicida subsp. pectinolytica 34mel, for which only a draft genome with 253 contigs is currently available. Successful completion of the transposon-rich genome critically depended on the PacBio long read sequencing technology. Using finalized genome sequences of A. salmonicida subsp. pectinolytica and other Aeromonads, we report the detailed analysis of the transposon composition of these bacterial species. Mobilome evolution is exemplified by a complex transposon, which has shifted from pathogenicity-related to environmental-related gene content in A. salmonicida subsp. pectinolytica 34mel. Obtaining the complete, circular genome of A. salmonicida subsp. pectinolytica allowed us to perform an in-depth analysis of its mobilome. We demonstrate the mobilome-dependent evolution of this strain's genetic profile from pathogenic to environmental.
Poirier, Simon; Coeuret, Gwendoline; Champomier-Vergès, Marie-Christine; Chaillou, Stéphane
2018-06-14
In this study, we present the draft genome sequences of nine strains from various psychrotrophic species identified in meat products and being recognized as important emerging food spoilers. Many of these species have only one or few strains being sequenced, and this work will contribute to the improvement of the overall genomic knowledge about them. Copyright © 2018 Poirier et al.
Effect of practice and training in spatial skills on embedded figures scores of males and females.
Johnson, S; Flinn, J M; Tyer, Z E
1979-06-01
The effect of practice and training in spatial skills on scores obtained by male and female students on the Embedded Figures Test was examined. Forms A and B were administered 6 wk. apart to three groups of subjects (ns = 28, 27, 27) enrolled in drafting, mathematics, and liberal arts courses. During the pretest-posttest period the drafting students received training while the other two groups served as controls. Analysis indicated (1) no initial sex difference in test scores; (2) liberal arts students differed significantly from drafting and mathematics students, but there was no significant difference between the last two groups; (3) all groups improved with practice; (4) women receiving training improved more than women who did not; (5) there was a trend toward women receiving spatial training scoring more poorly than males receiving training on the pretest, but there was no significant difference on the posttest. These results suggest that sex differences in embedded-figures scores found by many previous experimenters may have been associated with differences in prior experience in spatial skills and by a confounding of sex with area of academic study.
Theoretical considerations and measurements for phoropters
NASA Astrophysics Data System (ADS)
Zhang, Jiyan; Liu, Wenli; Sun, Jie
2008-10-01
A phoropter is one of the most popular ophthalmic instruments used in current optometry practice. The quality and verification of the instrument are of the utmost importance. In 1997, the International Organization for Standardization published the first ISO standard for requirements of phoropters. In China, however, few standards and test methods have been proposed for phoropters. Research on test methods for phoropters was begun in early 2004 by the China National Institute of Metrology. In this paper, the structure of phoropters is first described. Then, theoretical considerations for their optical design are analyzed. Next, a newly developed instrument is introduced and measurements are taken. After calibration, the indication error of the instrument does not exceed 0.05 m⁻¹. Finally, measurement results show that the quality of phoropters is not as good as expected because of production and assembly errors. Optical design should be improved, especially for combinations of spherical and cylindrical lenses with higher power. In addition, the optical requirements specified in the ISO standard are found to be somewhat strict and hard to meet. A proposal for revision of this international standard was drafted and discussed at the 2007 ISO meeting held in Tokyo.
Davies, Louise; Donnelly, Kyla Z; Goodman, Daisy J; Ogrinc, Greg
2016-04-01
The Standards for Quality Improvement Reporting Excellence (SQUIRE) Guideline was published in 2008 (SQUIRE 1.0) and was the first publication guideline specifically designed to advance the science of healthcare improvement. Advances in the discipline of improvement prompted us to revise it. We adopted a novel approach to the revision by asking end-users to 'road test' a draft version of SQUIRE 2.0. The aim was to determine whether they understood and implemented the guidelines as intended by the developers. Forty-four participants were assigned a manuscript section (ie, introduction, methods, results, discussion) and asked to use the draft Guidelines to guide their writing process. They indicated the text that corresponded to each SQUIRE item used and submitted it along with a confidential survey. The survey examined usability of the Guidelines using Likert-scaled questions and participants' interpretation of key concepts in SQUIRE using open-ended questions. On the submitted text, we evaluated concordance between participants' item usage/interpretation and the developers' intended application. For the survey, the Likert-scaled responses were summarised using descriptive statistics and the open-ended questions were analysed by content analysis. Consistent with the SQUIRE Guidelines' recommendation that not every item be included, less than one-third (n=14) of participants applied every item in their section in full. Of the 85 instances when an item was partially used or was omitted, only 7 (8.2%) of these instances were due to participants not understanding the item. 
Usage of Guideline items was highest for items most similar to standard scientific reporting (ie, 'Specific aim of the improvement' (introduction), 'Description of the improvement' (methods) and 'Implications for further studies' (discussion)) and lowest (<20% of the time) for those unique to healthcare improvement (ie, 'Assessment methods for context factors that contributed to success or failure' and 'Costs and strategic trade-offs'). Items unique to healthcare improvement, specifically 'Evolution of the improvement', 'Context elements that influenced the improvement', 'The logic on which the improvement was based', 'Process and outcome measures', demonstrated poor concordance between participants' interpretation and developers' intended application. User testing of a draft version of SQUIRE 2.0 revealed which items have poor concordance between developer intent and author usage, which will inform final editing of the Guideline and development of supporting supplementary materials. It also identified the items that require special attention when teaching about scholarly writing in healthcare improvement. Published by the BMJ Publishing Group Limited.
Improving the energy efficiency of sparse linear system solvers on multicore and manycore systems.
Anzt, H; Quintana-Ortí, E S
2014-06-28
While most recent breakthroughs in scientific research rely on complex simulations carried out in large-scale supercomputers, the power draft and energy spent for this purpose is increasingly becoming a limiting factor to this trend. In this paper, we provide an overview of the current status in energy-efficient scientific computing by reviewing different technologies used to monitor power draft as well as power- and energy-saving mechanisms available in commodity hardware. For the particular domain of sparse linear algebra, we analyse the energy efficiency of a broad collection of hardware architectures and investigate how algorithmic and implementation modifications can improve the energy performance of sparse linear system solvers, without negatively impacting their performance. © 2014 The Author(s) Published by the Royal Society. All rights reserved.
Draft genome sequence of chickpea (Cicer arietinum) provides a resource for trait improvement.
Varshney, Rajeev K; Song, Chi; Saxena, Rachit K; Azam, Sarwar; Yu, Sheng; Sharpe, Andrew G; Cannon, Steven; Baek, Jongmin; Rosen, Benjamin D; Tar'an, Bunyamin; Millan, Teresa; Zhang, Xudong; Ramsay, Larissa D; Iwata, Aiko; Wang, Ying; Nelson, William; Farmer, Andrew D; Gaur, Pooran M; Soderlund, Carol; Penmetsa, R Varma; Xu, Chunyan; Bharti, Arvind K; He, Weiming; Winter, Peter; Zhao, Shancen; Hane, James K; Carrasquilla-Garcia, Noelia; Condie, Janet A; Upadhyaya, Hari D; Luo, Ming-Cheng; Thudi, Mahendar; Gowda, C L L; Singh, Narendra P; Lichtenzveig, Judith; Gali, Krishna K; Rubio, Josefa; Nadarajan, N; Dolezel, Jaroslav; Bansal, Kailash C; Xu, Xun; Edwards, David; Zhang, Gengyun; Kahl, Guenter; Gil, Juan; Singh, Karam B; Datta, Swapan K; Jackson, Scott A; Wang, Jun; Cook, Douglas R
2013-03-01
Chickpea (Cicer arietinum) is the second most widely grown legume crop after soybean, accounting for a substantial proportion of human dietary nitrogen intake and playing a crucial role in food security in developing countries. We report the ∼738-Mb draft whole genome shotgun sequence of CDC Frontier, a kabuli chickpea variety, which contains an estimated 28,269 genes. Resequencing and analysis of 90 cultivated and wild genotypes from ten countries identifies targets of both breeding-associated genetic sweeps and breeding-associated balancing selection. Candidate genes for disease resistance and agronomic traits are highlighted, including traits that distinguish the two main market classes of cultivated chickpea--desi and kabuli. These data comprise a resource for chickpea improvement through molecular breeding and provide insights into both genome diversity and domestication.
Unique dome design for the SOAR telescope project
NASA Astrophysics Data System (ADS)
Teran, Jose U.; Porter, David S.; Hileman, Edward A.; Neff, Daniel H.
2000-08-01
The SOAR telescope dome is a 20 meter diameter 5/8 spherical structure built on a rotating steel frame with an over-the-top nesting shutter and covered with a fiberglass panel system. The insulated fiberglass panel system can be self-supporting and is typically used for radomes on ground-based tracking systems. The enclosed observing area is ventilated using a down draft ventilation system. The rotating steel frame is comprised of a ring beam and dual arch girders to provide support to the panel system sections and guide the shutter. The dual door shutter incorporates a unique differential drive system that reduces the complexity of the control system. The dome, shutter and windscreen 'track' the telescope for maximum wind protection. The dome rotates on sixteen fixed compliant bogie assemblies. The dome is designed for assembly in sections off the facility and lifted into place for minimal impact on assembly of other telescope systems. The expected cost of the complete dome, including structure, drives, and controls, is under 1.7 million. The details covered in this paper are the initial trade-offs and rationale required by SOAR to define the dome, the detailed design performed by M3 Engineering and Technology, and the choices made during the design.
Du, Lianming; Li, Wujiao; Fan, Zhenxin; Shen, Fujun; Yang, Mingyu; Wang, Zili; Jian, Zuoyi; Hou, Rong; Yue, Bisong; Zhang, Xiuyue
2015-07-01
The giant panda (Ailuropoda melanoleuca) is one of the most famous flagship species for conservation, and its draft genome has recently been assembled. However, the transcriptome is not yet available. In this study, the blood transcriptomes of three pandas were characterized and about 160 million sequencing reads were generated using Illumina HiSeq 2000 paired-end sequencing technology. The assembly yielded 92 598 transcripts with an average length of 1626 bp and N50 length of 2842 bp. Based on a sequence similarity search against nonredundant (nr) protein database, a total of 38 522 (41.6%) transcripts were annotated. Of these annotated transcripts, 25 142 and 8272 transcripts were assigned to gene ontology terms and clusters of orthologous group, respectively. A search against the Kyoto Encyclopedia of Genes and Genomes Pathway database (KEGG) indicated that 9098 (9.83%) transcripts mapped to 324 KEGG pathways, and the best represented functional categories of pathways were signal transduction and immune system. We have also identified 23 460 microsatellites, 43 560 SNPs as well as 21 456 alternative splicing events in the assembly. Additionally, a total of 24 341 complete open reading frames (ORFs) were detected from the assembly where 1492 ORFs were found to be novel gene loci as these have not been annotated so far in any public database. © 2014 John Wiley & Sons Ltd.
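The N50 length quoted in the abstract above can be illustrated with a minimal sketch. It assumes the common definition of N50 (the length L such that sequences of length L or longer together cover at least half of the total assembled length); the function name `n50` and the toy lengths are illustrative assumptions and are not taken from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Assumed definition: sort lengths from longest to shortest and
    return the length at which the running total first reaches at
    least half of the summed length of all sequences.
    """
    if not lengths:
        raise ValueError("lengths must be non-empty")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Toy data (hypothetical, not from the panda transcriptome):
toy_lengths = [2, 3, 4, 5, 6, 7, 8, 9, 10]
print(n50(toy_lengths))
```

Because N50 depends only on the length distribution, it says nothing about whether the assembled sequences are correct, which is why it is usually reported alongside annotation statistics such as those in the abstract.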
Improved motors for utility applications: Volume 6, Squirrel-cage rotor analysis: Final report
DOE Office of Scientific and Technical Information (OSTI.GOV)
Griffith, J.W.; McCoy, R.M.
1986-11-01
An analysis of squirrel cage induction motor rotors was undertaken in response to an Industry Assessment Study finding 10% of motor failures to be rotor related. The analysis focuses on evaluating rotor design life. The evaluation combines state-of-the-art electromagnetic, thermal, and structural solution techniques into an integrated analysis and presents a simple summary. Finite element techniques are central tools in the analysis. The analysis is applied to a specific forced draft fan drive design. Fans as a category of application have a higher failure rate than other categories of power station auxiliary motor applications. Forced-draft fan drives are one of the major fan drives which accelerate a relatively high value of rotor load inertia. Various starting and operating conditions are studied for this forced-draft fan drive motor including a representative application duty cycle.
The impact of peer review on paediatric forensic reports.
Kariyawasam, Uditha
2016-10-01
To retrospectively evaluate the common grammar and spelling errors of the medico-legal reports written by the doctors at the Victorian Forensic Paediatric Medical Service (VFPMS) in both Royal Children's Hospital (RCH) and Monash Medical Centre. The reports were evaluated at two points in time; before and after peer review. The aim of the study was to ascertain whether peer review improved the grammar and spelling in VFPMS medico-legal reports. Draft VFPMS reports are sent to the VFPMS medical director for peer review. The current study sampled 50 reports that were sent consecutively to Dr. Anne Smith from 1st of May 2015. The 50 corresponding final reports were then retrieved from the VFPMS database. The 50 pairs of draft and final reports were scored using a 50-point scoring system. The scores of the draft reports were compared to the scores of the final report to assess if there was a change in quality as measured using an explicit criteria audit of report structure, simple grammar, jargon use and spelling. The audit did not include evaluation of the validity of forensic opinions. The overall scores were statistically analysed using descriptive statistics and a paired T-test. The scores of the reports improved by 2.24% when the final reports were compared to the draft reports (p < 0.001). The peer-review process resulted in a significantly higher quality of medico-legal reports. The report writing and peer-review process could be assisted by an abbreviated version of the checklist used for the audit. Crown Copyright © 2016. Published by Elsevier Ltd. All rights reserved.
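The paired comparison of draft and final report scores described above can be sketched as follows. This is a minimal illustration of a paired t statistic under the usual textbook formula, t = mean(d) / (sd(d) / sqrt(n)), where d is the per-pair difference; the function name and the four score pairs are hypothetical and are not the study's data.

```python
import math
import statistics


def paired_t_statistic(before, after):
    """Return the paired t statistic for two equal-length score lists.

    Uses the per-pair differences (after - before), their sample
    standard deviation, and the number of pairs.
    """
    if len(before) != len(after):
        raise ValueError("both lists must have the same length")
    if len(before) < 2:
        raise ValueError("at least two pairs are required")
    diffs = [a - b for b, a in zip(before, after)]
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)  # sample standard deviation
    return mean_diff / (sd_diff / math.sqrt(len(diffs)))


# Hypothetical scores on a 50-point scale (not from the audit):
draft_scores = [40, 42, 38, 45]
final_scores = [41, 44, 39, 46]
print(paired_t_statistic(draft_scores, final_scores))
```

The statistic is compared against a t distribution with n - 1 degrees of freedom to obtain a p-value; that step is omitted here to keep the sketch within the standard library.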
Improved nuclear fuel assembly grid spacer
Marshall, John; Kaplan, Samuel
1977-01-01
An improved fuel assembly grid spacer and method of retaining the basic fuel rod support elements in position within the fuel assembly containment channel. The improvement involves attachment of the grids to the hexagonal channel and of forming the basic fuel rod support element into a grid structure, which provides a design which is insensitive to potential channel distortion (ballooning) at high fluence levels. In addition the improved method eliminates problems associated with component fabrication and assembly.
Federal Register 2010, 2011, 2012, 2013, 2014
2011-05-20
... because they partially overlap in their study areas, purpose, potential improvements, potential effects... that an EIS/EIR will be prepared to describe alternatives, potential environmental effects, and... funding toward flood management improvements. These funds may be matched with those from the Early...
Peer Review Methods for ESL Writing Improvement
ERIC Educational Resources Information Center
Soares, Colleen J.
2004-01-01
This teacher research shows how peer reviews change draft papers. In the majority of cases, final papers improved in content. The study analyzes data collected from 40 intermediate/advanced nonnative speakers of English enrolled in freshman composition for international students at a large private university. It also examines student reflections…
Pozzi, Lara; Knechtle, Beat; Knechtle, Patrizia; Rosemann, Thomas; Lepers, Romuald; Rüst, Christoph Alexander
2014-01-01
The purpose of this study was to examine the sex and age-related differences in performance in a draft-legal ultra-cycling event. Age-related changes in performance across years were investigated in the 24-hour draft-legal cycling event held in Schötz, Switzerland, between 2000 and 2011 using multi-level regression analyses including age, repeated participation and environmental temperatures as co-variables. For all finishers, the age of peak cycling performance decreased significantly (β = -0.273, p = 0.036) from 38 ± 10 to 35 ± 6 years in females but remained unchanged (β = -0.035, p = 0.906) at 41.0 ± 10.3 years in males. For the annual fastest females and males, the age of peak cycling performance remained unchanged at 37.3 ± 8.5 and 38.3 ± 5.4 years, respectively. For all female and male finishers, males improved significantly (β = 7.010, p = 0.006) the cycling distance from 497.8 ± 219.6 km to 546.7 ± 205.0 km whereas females (β = -0.085, p = 0.987) showed an unchanged performance of 593.7 ± 132.3 km. The mean cycling distance achieved by the male winners of 960.5 ± 51.9 km was significantly (p < 0.001) greater than the distance covered by the female winners with 769.7 ± 65.7 km but was not different between the sexes (p > 0.05). The sex difference in performance for the annual winners of 19.7 ± 7.8% remained unchanged across years (p > 0.05). The achieved cycling distance decreased in a curvilinear manner with advancing age. There was a significant age effect (F = 28.4, p < 0.0001) for cycling performance where the fastest cyclists were in age group 35-39 years. In this 24-h cycling draft-legal event, performance in females remained unchanged while their age of peak cycling performance decreased and performance in males improved while their age of peak cycling performance remained unchanged. The annual fastest females and males were 37.3 ± 8.5 and 38.3 ± 5.4 years old, respectively. The sex difference for the fastest finishers was ~20%. 
It seems that women were not able to profit from drafting to improve their ultra-cycling performance.
Ma, Liping; Xia, Yu; Li, Bing; Yang, Ying; Li, Li-Guan; Tiedje, James M; Zhang, Tong
2016-01-05
The risk associated with antibiotic resistance disseminating from animal and human feces is an urgent public issue. In the present study, we sought to establish a pipeline for annotating antibiotic resistance genes (ARGs) based on metagenomic assembly to investigate ARGs and their co-occurrence with associated genetic elements. Genetic elements found on the assembled genomic fragments include mobile genetic elements (MGEs) and metal resistance genes (MRGs). We then explored the hosts of these resistance genes and the shared resistome of pig, chicken and human fecal samples. High levels of tetracycline, multidrug, erythromycin, and aminoglycoside resistance genes were discovered in these fecal samples. In particular, a significantly high level of ARGs (7762 ×/Gb) was detected in adult chicken feces, indicating a higher ARG contamination level than in other fecal samples. Many ARG arrangements (e.g., macA-macB and tetA-tetR) were found to be shared by chicken, pig and human feces. In addition, MGEs such as the aadA5-dfrA17-carrying class 1 integron were identified on an assembled scaffold of chicken feces, and are carried by human pathogens. Differential coverage binning analysis revealed significant ARG enrichment in adult chicken feces. A draft genome, annotated as multidrug resistant Escherichia coli, was retrieved from chicken feces metagenomes and was determined to carry diverse ARGs (multidrug, acriflavine, and macrolide). The present study demonstrates the determination of ARG hosts and the shared resistome from metagenomic data sets and successfully establishes the relationship between ARGs, hosts, and environments. This ARG annotation pipeline based on metagenomic assembly will help to bridge the knowledge gaps regarding ARG-associated genes and ARG hosts with metagenomic data sets. Moreover, this pipeline will facilitate the evaluation of environmental risks in the genetic context of ARGs.
Issa, Mohammad Nouh; Ashhab, Yaqoub
2016-09-22
Brucella melitensis Rev.1 is an avirulent strain that is widely used as a live vaccine to control brucellosis in small ruminants. Although an assembled draft version of Rev.1 genome has been available since 2009, this genome has not been investigated to characterize this important vaccine. In the present work, we used the draft genome of Rev.1 to perform a thorough genomic comparison and sequence analysis to identify and characterize the panel of its unique genetic markers. The draft genome of Rev.1 was compared with genome sequences of 36 different Brucella melitensis strains from the Brucella project of the Broad Institute of MIT and Harvard. The comparative analyses revealed 32 genetic alterations (30 SNPs, 1 single-bp insertion and 1 single-bp deletion) that are exclusively present in the Rev.1 genome. In silico analyses showed that 9 out of the 17 non-synonymous mutations are deleterious. Three ABC transporters are among the disrupted genes that can be linked to virulence attenuation. Out of the 32 mutations, 11 Rev.1 specific markers were selected to test their potential to discriminate Rev.1 using a bi-directional allele-specific PCR assay. Six markers were able to distinguish between Rev.1 and a set of control strains. We succeeded in identifying a panel of 32 genome-specific markers of the B. melitensis Rev.1 vaccine strain. Extensive in silico analysis showed that a considerable number of these mutations could severely affect the function of the associated genes. In addition, some of the discovered markers were able to discriminate Rev.1 strain from a group of control strains using practical PCR tests that can be applied in resource-limited settings. Copyright © 2016 Elsevier Ltd. All rights reserved.
Draft genome of the Antarctic dragonfish, Parachaenichthys charcoti.
Ahn, Do-Hwan; Shin, Seung Chul; Kim, Bo-Mi; Kang, Seunghyun; Kim, Jin-Hyoung; Ahn, Inhye; Park, Joonho; Park, Hyun
2017-08-01
The Antarctic bathydraconid dragonfish, Parachaenichthys charcoti, is an Antarctic notothenioid teleost endemic to the Southern Ocean. The Southern Ocean has cooled to -1.8°C over the past 30 million years, and the seawater has retained this cold temperature and isolated oceanic environment because of the Antarctic Circumpolar Current. Notothenioids dominate Antarctic fish, making up 90% of the biomass, and all notothenioids have undergone molecular and ecological diversification to survive in this cold environment. Therefore, they are considered an attractive Antarctic fish model for evolutionary and ancestral genomic studies. Bathydraconidae is a speciose family of the Notothenioidei, the dominant taxonomic component of Antarctic teleosts. To understand the process of evolution of Antarctic fish, we selected a typical Antarctic bathydraconid dragonfish, P. charcoti. Here, we have sequenced, de novo assembled, and annotated a comprehensive genome from P. charcoti. The draft genome of P. charcoti is 709 Mb in size. The N50 contig length is 6145 bp, and its N50 scaffold length is 178 362 kb. The genome of P. charcoti is predicted to contain 32 712 genes, 18 455 of which have been assigned preliminary functions. A total of 8951 orthologous groups common to 7 species of fish were identified, while 333 genes were identified in P. charcoti only; 2519 orthologous groups were also identified in both P. charcoti and N. coriiceps, another Antarctic fish. Four gene ontology terms were statistically overrepresented among the 333 genes unique to P. charcoti, according to gene ontology enrichment analysis. The draft P. charcoti genome will broaden our understanding of the evolution of Antarctic fish in their extreme environment. It will provide a basis for further investigating the unusual characteristics of Antarctic fishes. © The Author 2017. Published by Oxford University Press.
Metagenomic Analysis of Therapeutic PYO Phage Cocktails from 1997 to 2014
Larsen, Mette Voldby
2017-01-01
Phage therapy has regained interest in recent years due to the alarming spread of antibiotic resistance. Whilst phage cocktails are commonly sold in pharmacies in countries such as Georgia and Russia, this is not the case in western countries due to western regulatory agencies requiring a thorough characterization of the drug. Here, DNA sequencing of constituent biological entities constitutes a first step. The pyophage (PYO) cocktail is one of the main commercial products of the Georgian Eliava Institute of Bacteriophage, Microbiology and Virology and is used to cure skin infections. Since its first production in the 1930s, the composition of the cocktail has been periodically modified to add phages effective against emerging pathogenic strains. In this paper, we compared the composition of three PYO cocktails from 1997 (PYO97), 2000 (PYO2000) and 2014 (PYO2014). Based on next generation sequencing, de novo assembly and binning of contigs into draft genomes based on tetranucleotide distance, thirty and twenty-nine phage draft genomes were predicted in PYO97 and PYO2014, respectively. Of these, thirteen and fifteen shared high similarity to known phages. Eleven draft genomes were found to be common in the two cocktails. One of these showed no similarity to publicly available phage genomes. Representatives of phages targeting E. faecalis, E. faecium, E. coli, Proteus, P. aeruginosa and S. aureus were found in both cocktails. Finally, we estimated larger overlap of the PYO2000 cocktail to PYO97 compared to PYO2014. Using next generation sequencing and metagenomics analysis, we were able to characterize and compare the content of PYO cocktails separated by 17 years in time. Even though the cocktail composition is upgraded every six months, we found it to remain relatively stable over the years. PMID:29099783
Quality scores for 32,000 genomes
Land, Miriam L.; Hyatt, Doug; Jun, Se-Ran; ...
2014-12-08
More than 80% of the microbial genomes in GenBank are of ‘draft’ quality (12,553 draft vs. 2,679 finished, as of October, 2013). In this study, we have examined all the microbial DNA sequences available for complete, draft, and Sequence Read Archive genomes in GenBank as well as three other major public databases, and assigned quality scores for more than 30,000 prokaryotic genome sequences. Scores were assigned using four categories: the completeness of the assembly, the presence of full-length rRNA genes, tRNA composition and the presence of a set of 102 conserved genes in prokaryotes. Most (~88%) of the genomes had quality scores of 0.8 or better and can be safely used for standard comparative genomics analysis. We compared genomes across factors that may influence the score. We found that although sequencing depth coverage of over 100x did not ensure a better score, sequencing read length was a better indicator of sequencing quality. With few exceptions, most of the 30,000 genomes have nearly all the 102 essential genes. The score can be used to set thresholds for screening data when analyzing “all published genomes” and reference data is either not available or not applicable. The scores highlighted organisms for which commonly used tools do not perform well. This information can be used to improve tools and to serve a broad group of users as more diverse organisms are sequenced. Finally and unexpectedly, the comparison of predicted tRNAs across 15,000 high quality genomes showed that anticodons beginning with an ‘A’ (codons ending with a ‘U’) are almost non-existent, with the exception of one arginine codon (CGU); this has been noted previously in the literature for a few genomes, but not with the depth found here.
Davies, Louise; Donnelly, Kyla Z; Goodman, Daisy J; Ogrinc, Greg
2016-01-01
Background The Standards for Quality Improvement Reporting Excellence (SQUIRE) Guideline was published in 2008 (SQUIRE 1.0) and was the first publication guideline specifically designed to advance the science of healthcare improvement. Advances in the discipline of improvement prompted us to revise it. We adopted a novel approach to the revision by asking end-users to ‘road test’ a draft version of SQUIRE 2.0. The aim was to determine whether they understood and implemented the guidelines as intended by the developers. Methods Forty-four participants were assigned a manuscript section (ie, introduction, methods, results, discussion) and asked to use the draft Guidelines to guide their writing process. They indicated the text that corresponded to each SQUIRE item used and submitted it along with a confidential survey. The survey examined usability of the Guidelines using Likert-scaled questions and participants’ interpretation of key concepts in SQUIRE using open-ended questions. On the submitted text, we evaluated concordance between participants’ item usage/interpretation and the developers’ intended application. For the survey, the Likert-scaled responses were summarised using descriptive statistics and the open-ended questions were analysed by content analysis. Results Consistent with the SQUIRE Guidelines’ recommendation that not every item be included, less than one-third (n=14) of participants applied every item in their section in full. Of the 85 instances when an item was partially used or was omitted, only 7 (8.2%) of these instances were due to participants not understanding the item. 
Usage of Guideline items was highest for items most similar to standard scientific reporting (ie, ‘Specific aim of the improvement’ (introduction), ‘Description of the improvement’ (methods) and ‘Implications for further studies’ (discussion)) and lowest (<20% of the time) for those unique to healthcare improvement (ie, ‘Assessment methods for context factors that contributed to success or failure’ and ‘Costs and strategic trade-offs’). Items unique to healthcare improvement, specifically ‘Evolution of the improvement’, ‘Context elements that influenced the improvement’, ‘The logic on which the improvement was based’, ‘Process and outcome measures’, demonstrated poor concordance between participants’ interpretation and developers’ intended application. Conclusions User testing of a draft version of SQUIRE 2.0 revealed which items have poor concordance between developer intent and author usage, which will inform final editing of the Guideline and development of supporting supplementary materials. It also identified the items that require special attention when teaching about scholarly writing in healthcare improvement. PMID:26263916
78 FR 47003 - Draft National Spatial Data Infrastructure Strategic Plan; Comment Request
Federal Register 2010, 2011, 2012, 2013, 2014
2013-08-02
... NSDI.'' Executive Order 12906 describes the NSDI as ``the technology, policies, standards, and human resources necessary to acquire, process, store, distribute, and improve utilization of geospatial data...
Bielaszewska, Martina; Karch, Helge; Toth, Ian K.
2012-01-01
Background An Escherichia coli O104:H4 outbreak in Germany in summer 2011 caused 53 deaths, over 4000 individual infections across Europe, and considerable economic, social and political impact. This outbreak was the first in a position to exploit rapid, benchtop high-throughput sequencing (HTS) technologies and crowdsourced data analysis early in its investigation, establishing a new paradigm for rapid response to disease threats. We describe a novel strategy for design of diagnostic PCR primers that exploited this rapid draft bacterial genome sequencing to distinguish between E. coli O104:H4 outbreak isolates and other pathogenic E. coli isolates, including the historical hæmolytic uræmic syndrome (HUSEC) E. coli HUSEC041 O104:H4 strain, which possesses the same serotype as the outbreak isolates. Methodology/Principal Findings Primers were designed using a novel alignment-free strategy against eleven draft whole genome assemblies of E. coli O104:H4 German outbreak isolates from the E. coli O104:H4 Genome Analysis Crowd-Sourcing Consortium website, and a negative sequence set containing 69 E. coli chromosome and plasmid sequences from public databases. Validation in vitro against 21 ‘positive’ E. coli O104:H4 outbreak and 32 ‘negative’ non-outbreak EHEC isolates indicated that individual primer sets exhibited 100% sensitivity for outbreak isolates, with false positive rates of between 9% and 22%. A minimal combination of two primers discriminated between outbreak and non-outbreak E. coli isolates with 100% sensitivity and 100% specificity. Conclusions/Significance Draft genomes of isolates of disease outbreak bacteria enable high throughput primer design and enhanced diagnostic performance in comparison to traditional molecular assays. Future outbreak investigations will be able to harness HTS rapidly to generate draft genome sequences and diagnostic primer sets, greatly facilitating epidemiology and clinical diagnostics. 
We expect that high throughput primer design strategies will enable faster, more precise responses to future disease outbreaks of bacterial origin, and help to mitigate their societal impact. PMID:22496820
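The validation figures reported above (sensitivity, specificity, false positive rate) follow from simple counts. The sketch below shows the arithmetic under the standard definitions; the panel sizes of 21 outbreak and 32 non-outbreak isolates come from the abstract, but the individual true/false counts are hypothetical values chosen only to land near the reported range, and the function names are illustrative.

```python
def sensitivity(true_pos, false_neg):
    """Fraction of true outbreak isolates that the primers detect."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg, false_pos):
    """Fraction of non-outbreak isolates correctly reported negative."""
    return true_neg / (true_neg + false_pos)


def false_positive_rate(true_neg, false_pos):
    """Fraction of non-outbreak isolates wrongly reported positive."""
    return false_pos / (true_neg + false_pos)


# Hypothetical single primer set: all 21 outbreak isolates detected,
# 3 of the 32 non-outbreak isolates amplified in error (assumed count).
print(sensitivity(true_pos=21, false_neg=0))
print(false_positive_rate(true_neg=29, false_pos=3))

# Hypothetical two-primer combination with no false positives.
print(specificity(true_neg=32, false_pos=0))
```

Specificity and false positive rate are complements (they sum to 1), which is why a combination reported at 100% specificity has a false positive rate of zero on the tested panel.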
Free acquisition and dissemination of data through remote sensing. [Landsat program legal aspects]
NASA Technical Reports Server (NTRS)
Hosenball, S. N.
1976-01-01
Free acquisition and dissemination of data through remote sensing is discussed with reference to the Landsat program. The Scientific and Technical Subcommittee of the U.N. General Assembly's Committee on the Peaceful Uses of Outer Space has made recommendations on the expansion of existing ground stations and on the establishment of an experimental center for training in remote sensing. The working group for the legal subcommittee of the same U.N. committee indicates that there are common elements in the three drafts on remote sensing submitted to it: a call for international cooperation and the belief that remote sensing should be conducted for the benefit of all mankind.
Water resources data, New Mexico, water year 1991
1992-01-01
managing our Nation's land and water resources. Hydrologic data for New Mexico are contained in this volume. This report is the culmination of a concerted effort by dedicated personnel of the U.S. Geological Survey who collected, compiled, analyzed, verified, and organized the data, and who typed, edited, and assembled the report. The authors had primary responsibility for assuring that the information contained herein is accurate, complete, and adheres to Geological Survey policy and established guidelines. The following individuals contributed significantly to the completion of the report: Deanne E. Kimball and Cynthia J. Shattuck. K.M. Lange, M.F. Ortiz, and K.L. Hamilton processed the text of the report, and B. J. Henson drafted the illustrations.