Sample records for sequence tag analysis

  1. Multiplex analysis of DNA

    DOEpatents

    Church, George M.; Kieffer-Higgins, Stephen

    1992-01-01

    This invention features vectors and a method for sequencing DNA. The method includes the steps of: a) ligating the DNA into a vector comprising a tag sequence, the tag sequence includes at least 15 bases, wherein the tag sequence will not hybridize to the DNA under stringent hybridization conditions and is unique in the vector, to form a hybrid vector, b) treating the hybrid vector in a plurality of vessels to produce fragments comprising the tag sequence, wherein the fragments differ in length and terminate at a fixed known base or bases, wherein the fixed known base or bases differs in each vessel, c) separating the fragments from each vessel according to their size, d) hybridizing the fragments with an oligonucleotide able to hybridize specifically with the tag sequence, and e) detecting the pattern of hybridization of the tag sequence, wherein the pattern reflects the nucleotide sequence of the DNA.

  2. Serial analysis of gene expression (SAGE) in normal human trabecular meshwork.

    PubMed

    Liu, Yutao; Munro, Drew; Layfield, David; Dellinger, Andrew; Walter, Jeffrey; Peterson, Katherine; Rickman, Catherine Bowes; Allingham, R Rand; Hauser, Michael A

    2011-04-08

    To identify the genes expressed in normal human trabecular meshwork tissue, a tissue critical to the pathogenesis of glaucoma. Total RNA was extracted from human trabecular meshwork (HTM) harvested from 3 different donors. Extracted RNA was used to synthesize individual SAGE (serial analysis of gene expression) libraries using the I-SAGE Long kit from Invitrogen. Libraries were analyzed using SAGE 2000 software to extract the 17 base pair sequence tags. The extracted sequence tags were mapped to the genome using SAGE Genie map. A total of 298,834 SAGE tags were identified from all HTM libraries (96,842, 88,126, and 113,866 tags, respectively). Collectively, there were 107,325 unique tags. There were 10,329 unique tags with a minimum of 2 counts from a single library. These tags were mapped to known unique Unigene clusters. Approximately 29% of the tags (orphan tags) did not map to a known Unigene cluster. Thirteen percent of the tags mapped to at least 2 Unigene clusters. Sequence tags from many glaucoma-related genes, including myocilin, optineurin, and WD repeat domain 36, were identified. This is the first time SAGE analysis has been used to characterize the gene expression profile in normal HTM. SAGE analysis provides an unbiased sampling of gene expression of the target tissue. These data will provide new and valuable information to improve understanding of the biology of human aqueous outflow.

  3. An efficient annotation and gene-expression derivation tool for Illumina Solexa datasets.

    PubMed

    Hosseini, Parsa; Tremblay, Arianne; Matthews, Benjamin F; Alkharouf, Nadim W

    2010-07-02

    The data produced by an Illumina flow cell with all eight lanes occupied, produces well over a terabyte worth of images with gigabytes of reads following sequence alignment. The ability to translate such reads into meaningful annotation is therefore of great concern and importance. Very easily, one can get flooded with such a great volume of textual, unannotated data irrespective of read quality or size. CASAVA, a optional analysis tool for Illumina sequencing experiments, enables the ability to understand INDEL detection, SNP information, and allele calling. To not only extract from such analysis, a measure of gene expression in the form of tag-counts, but furthermore to annotate such reads is therefore of significant value. We developed TASE (Tag counting and Analysis of Solexa Experiments), a rapid tag-counting and annotation software tool specifically designed for Illumina CASAVA sequencing datasets. Developed in Java and deployed using jTDS JDBC driver and a SQL Server backend, TASE provides an extremely fast means of calculating gene expression through tag-counts while annotating sequenced reads with the gene's presumed function, from any given CASAVA-build. Such a build is generated for both DNA and RNA sequencing. Analysis is broken into two distinct components: DNA sequence or read concatenation, followed by tag-counting and annotation. The end result produces output containing the homology-based functional annotation and respective gene expression measure signifying how many times sequenced reads were found within the genomic ranges of functional annotations. TASE is a powerful tool to facilitate the process of annotating a given Illumina Solexa sequencing dataset. Our results indicate that both homology-based annotation and tag-count analysis are achieved in very efficient times, providing researchers to delve deep in a given CASAVA-build and maximize information extraction from a sequencing dataset. TASE is specially designed to translate sequence data in a CASAVA-build into functional annotations while producing corresponding gene expression measurements. Achieving such analysis is executed in an ultrafast and highly efficient manner, whether the analysis be a single-read or paired-end sequencing experiment. TASE is a user-friendly and freely available application, allowing rapid analysis and annotation of any given Illumina Solexa sequencing dataset with ease.

  4. TagDust2: a generic method to extract reads from sequencing data.

    PubMed

    Lassmann, Timo

    2015-01-28

    Arguably the most basic step in the analysis of next generation sequencing data (NGS) involves the extraction of mappable reads from the raw reads produced by sequencing instruments. The presence of barcodes, adaptors and artifacts subject to sequencing errors makes this step non-trivial. Here I present TagDust2, a generic approach utilizing a library of hidden Markov models (HMM) to accurately extract reads from a wide array of possible read architectures. TagDust2 extracts more reads of higher quality compared to other approaches. Processing of multiplexed single, paired end and libraries containing unique molecular identifiers is fully supported. Two additional post processing steps are included to exclude known contaminants and filter out low complexity sequences. Finally, TagDust2 can automatically detect the library type of sequenced data from a predefined selection. Taken together TagDust2 is a feature rich, flexible and adaptive solution to go from raw to mappable NGS reads in a single step. The ability to recognize and record the contents of raw reads will help to automate and demystify the initial, and often poorly documented, steps in NGS data analysis pipelines. TagDust2 is freely available at: http://tagdust.sourceforge.net .

  5. An efficient annotation and gene-expression derivation tool for Illumina Solexa datasets

    PubMed Central

    2010-01-01

    Background The data produced by an Illumina flow cell with all eight lanes occupied, produces well over a terabyte worth of images with gigabytes of reads following sequence alignment. The ability to translate such reads into meaningful annotation is therefore of great concern and importance. Very easily, one can get flooded with such a great volume of textual, unannotated data irrespective of read quality or size. CASAVA, a optional analysis tool for Illumina sequencing experiments, enables the ability to understand INDEL detection, SNP information, and allele calling. To not only extract from such analysis, a measure of gene expression in the form of tag-counts, but furthermore to annotate such reads is therefore of significant value. Findings We developed TASE (Tag counting and Analysis of Solexa Experiments), a rapid tag-counting and annotation software tool specifically designed for Illumina CASAVA sequencing datasets. Developed in Java and deployed using jTDS JDBC driver and a SQL Server backend, TASE provides an extremely fast means of calculating gene expression through tag-counts while annotating sequenced reads with the gene's presumed function, from any given CASAVA-build. Such a build is generated for both DNA and RNA sequencing. Analysis is broken into two distinct components: DNA sequence or read concatenation, followed by tag-counting and annotation. The end result produces output containing the homology-based functional annotation and respective gene expression measure signifying how many times sequenced reads were found within the genomic ranges of functional annotations. Conclusions TASE is a powerful tool to facilitate the process of annotating a given Illumina Solexa sequencing dataset. Our results indicate that both homology-based annotation and tag-count analysis are achieved in very efficient times, providing researchers to delve deep in a given CASAVA-build and maximize information extraction from a sequencing dataset. TASE is specially designed to translate sequence data in a CASAVA-build into functional annotations while producing corresponding gene expression measurements. Achieving such analysis is executed in an ultrafast and highly efficient manner, whether the analysis be a single-read or paired-end sequencing experiment. TASE is a user-friendly and freely available application, allowing rapid analysis and annotation of any given Illumina Solexa sequencing dataset with ease. PMID:20598141

  6. Digital transcriptome analysis of putative sex-determination genes in papaya (Carica papaya).

    PubMed

    Urasaki, Naoya; Tarora, Kazuhiko; Shudo, Ayano; Ueno, Hiroki; Tamaki, Moritoshi; Miyagi, Norimichi; Adaniya, Shinichi; Matsumura, Hideo

    2012-01-01

    Papaya (Carica papaya) is a trioecious plant species that has male, female and hermaphrodite flowers on different plants. The primitive sex chromosomes genetically determine the sex of the papaya. Although draft sequences of the papaya genome are already available, the genes for sex determination have not been identified, likely due to the complicated structure of its sex-chromosome sequences. To identify the candidate genes for sex determination, we conducted a transcriptome analysis of flower samples from male, female and hermaphrodite plants using high-throughput SuperSAGE for digital gene expression analysis. Among the short sequence tags obtained from the transcripts, 312 unique tags were specifically mapped to the primitive sex chromosome (X or Y(h)) sequences. An annotation analysis revealed that retroelements are the most abundant sequences observed in the genes corresponding to these tags. The majority of tags on the sex chromosomes were located on the X chromosome, and only 30 tags were commonly mapped to both the X and Y(h) chromosome, implying a loss of many genes on the Y(h) chromosome. Nevertheless, candidate Y(h) chromosome-specific female determination genes, including a MADS-box gene, were identified. Information on these sex chromosome-specific expressed genes will help elucidating sex determination in the papaya.

  7. Digital Transcriptome Analysis of Putative Sex-Determination Genes in Papaya (Carica papaya)

    PubMed Central

    Urasaki, Naoya; Tarora, Kazuhiko; Shudo, Ayano; Ueno, Hiroki; Tamaki, Moritoshi; Miyagi, Norimichi; Adaniya, Shinichi; Matsumura, Hideo

    2012-01-01

    Papaya (Carica papaya) is a trioecious plant species that has male, female and hermaphrodite flowers on different plants. The primitive sex chromosomes genetically determine the sex of the papaya. Although draft sequences of the papaya genome are already available, the genes for sex determination have not been identified, likely due to the complicated structure of its sex-chromosome sequences. To identify the candidate genes for sex determination, we conducted a transcriptome analysis of flower samples from male, female and hermaphrodite plants using high-throughput SuperSAGE for digital gene expression analysis. Among the short sequence tags obtained from the transcripts, 312 unique tags were specifically mapped to the primitive sex chromosome (X or Yh) sequences. An annotation analysis revealed that retroelements are the most abundant sequences observed in the genes corresponding to these tags. The majority of tags on the sex chromosomes were located on the X chromosome, and only 30 tags were commonly mapped to both the X and Yh chromosome, implying a loss of many genes on the Yh chromosome. Nevertheless, candidate Yh chromosome-specific female determination genes, including a MADS-box gene, were identified. Information on these sex chromosome-specific expressed genes will help elucidating sex determination in the papaya. PMID:22815863

  8. DAMe: a toolkit for the initial processing of datasets with PCR replicates of double-tagged amplicons for DNA metabarcoding analyses.

    PubMed

    Zepeda-Mendoza, Marie Lisandra; Bohmann, Kristine; Carmona Baez, Aldo; Gilbert, M Thomas P

    2016-05-03

    DNA metabarcoding is an approach for identifying multiple taxa in an environmental sample using specific genetic loci and taxa-specific primers. When combined with high-throughput sequencing it enables the taxonomic characterization of large numbers of samples in a relatively time- and cost-efficient manner. One recent laboratory development is the addition of 5'-nucleotide tags to both primers producing double-tagged amplicons and the use of multiple PCR replicates to filter erroneous sequences. However, there is currently no available toolkit for the straightforward analysis of datasets produced in this way. We present DAMe, a toolkit for the processing of datasets generated by double-tagged amplicons from multiple PCR replicates derived from an unlimited number of samples. Specifically, DAMe can be used to (i) sort amplicons by tag combination, (ii) evaluate PCR replicates dissimilarity, and (iii) filter sequences derived from sequencing/PCR errors, chimeras, and contamination. This is attained by calculating the following parameters: (i) sequence content similarity between the PCR replicates from each sample, (ii) reproducibility of each unique sequence across the PCR replicates, and (iii) copy number of the unique sequences in each PCR replicate. We showcase the insights that can be obtained using DAMe prior to taxonomic assignment, by applying it to two real datasets that vary in their complexity regarding number of samples, sequencing libraries, PCR replicates, and used tag combinations. Finally, we use a third mock dataset to demonstrate the impact and importance of filtering the sequences with DAMe. DAMe allows the user-friendly manipulation of amplicons derived from multiple samples with PCR replicates built in a single or multiple sequencing libraries. It allows the user to: (i) collapse amplicons into unique sequences and sort them by tag combination while retaining the sample identifier and copy number information, (ii) identify sequences carrying unused tag combinations, (iii) evaluate the comparability of PCR replicates of the same sample, and (iv) filter tagged amplicons from a number of PCR replicates using parameters of minimum length, copy number, and reproducibility across the PCR replicates. This enables an efficient analysis of complex datasets, and ultimately increases the ease of handling datasets from large-scale studies.

  9. Single nucleotide polymorphisms from Theobroma cacao expressed sequence tags associated with witches' broom disease in cacao.

    PubMed

    Lima, L S; Gramacho, K P; Carels, N; Novais, R; Gaiotto, F A; Lopes, U V; Gesteira, A S; Zaidan, H A; Cascardo, J C M; Pires, J L; Micheli, F

    2009-07-14

    In order to increase the efficiency of cacao tree resistance to witches' broom disease, which is caused by Moniliophthora perniciosa (Tricholomataceae), we looked for molecular markers that could help in the selection of resistant cacao genotypes. Among the different markers useful for developing marker-assisted selection, single nucleotide polymorphisms (SNPs) constitute the most common type of sequence difference between alleles and can be easily detected by in silico analysis from expressed sequence tag libraries. We report the first detection and analysis of SNPs from cacao-M. perniciosa interaction expressed sequence tags, using bioinformatics. Selection based on analysis of these SNPs should be useful for developing cacao varieties resistant to this devastating disease.

  10. An integrated PCR colony hybridization approach to screen cDNA libraries for full-length coding sequences.

    PubMed

    Pollier, Jacob; González-Guzmán, Miguel; Ardiles-Diaz, Wilson; Geelen, Danny; Goossens, Alain

    2011-01-01

    cDNA-Amplified Fragment Length Polymorphism (cDNA-AFLP) is a commonly used technique for genome-wide expression analysis that does not require prior sequence knowledge. Typically, quantitative expression data and sequence information are obtained for a large number of differentially expressed gene tags. However, most of the gene tags do not correspond to full-length (FL) coding sequences, which is a prerequisite for subsequent functional analysis. A medium-throughput screening strategy, based on integration of polymerase chain reaction (PCR) and colony hybridization, was developed that allows in parallel screening of a cDNA library for FL clones corresponding to incomplete cDNAs. The method was applied to screen for the FL open reading frames of a selection of 163 cDNA-AFLP tags from three different medicinal plants, leading to the identification of 109 (67%) FL clones. Furthermore, the protocol allows for the use of multiple probes in a single hybridization event, thus significantly increasing the throughput when screening for rare transcripts. The presented strategy offers an efficient method for the conversion of incomplete expressed sequence tags (ESTs), such as cDNA-AFLP tags, to FL-coding sequences.

  11. Identification of differentially expressed genes in cucumber (Cucumis sativus L.) root under waterlogging stress by digital gene expression profile.

    PubMed

    Qi, Xiao-Hua; Xu, Xue-Wen; Lin, Xiao-Jian; Zhang, Wen-Jie; Chen, Xue-Hao

    2012-03-01

    High-throughput tag-sequencing (Tag-seq) analysis based on the Solexa Genome Analyzer platform was applied to analyze the gene expression profiling of cucumber plant at 5 time points over a 24h period of waterlogging treatment. Approximately 5.8 million total clean sequence tags per library were obtained with 143013 distinct clean tag sequences. Approximately 23.69%-29.61% of the distinct clean tags were mapped unambiguously to the unigene database, and 53.78%-60.66% of the distinct clean tags were mapped to the cucumber genome database. Analysis of the differentially expressed genes revealed that most of the genes were down-regulated in the waterlogging stages, and the differentially expressed genes mainly linked to carbon metabolism, photosynthesis, reactive oxygen species generation/scavenging, and hormone synthesis/signaling. Finally, quantitative real-time polymerase chain reaction using nine genes independently verified the tag-mapped results. This present study reveals the comprehensive mechanisms of waterlogging-responsive transcription in cucumber. Copyright © 2011 Elsevier Inc. All rights reserved.

  12. A combination of LongSAGE with Solexa sequencing is well suited to explore the depth and the complexity of transcriptome

    PubMed Central

    Hanriot, Lucie; Keime, Céline; Gay, Nadine; Faure, Claudine; Dossat, Carole; Wincker, Patrick; Scoté-Blachon, Céline; Peyron, Christelle; Gandrillon, Olivier

    2008-01-01

    Background "Open" transcriptome analysis methods allow to study gene expression without a priori knowledge of the transcript sequences. As of now, SAGE (Serial Analysis of Gene Expression), LongSAGE and MPSS (Massively Parallel Signature Sequencing) are the mostly used methods for "open" transcriptome analysis. Both LongSAGE and MPSS rely on the isolation of 21 pb tag sequences from each transcript. In contrast to LongSAGE, the high throughput sequencing method used in MPSS enables the rapid sequencing of very large libraries containing several millions of tags, allowing deep transcriptome analysis. However, a bias in the complexity of the transcriptome representation obtained by MPSS was recently uncovered. Results In order to make a deep analysis of mouse hypothalamus transcriptome avoiding the limitation introduced by MPSS, we combined LongSAGE with the Solexa sequencing technology and obtained a library of more than 11 millions of tags. We then compared it to a LongSAGE library of mouse hypothalamus sequenced with the Sanger method. Conclusion We found that Solexa sequencing technology combined with LongSAGE is perfectly suited for deep transcriptome analysis. In contrast to MPSS, it gives a complex representation of transcriptome as reliable as a LongSAGE library sequenced by the Sanger method. PMID:18796152

  13. Sequence analysis of diacylglycerol acyltransferases

    USDA-ARS?s Scientific Manuscript database

    Diacylglycerol acyltransferases (DGATs) catalyze the final step of triacylglycerol (TAG) biosynthesis in eukaryotes. DGATs esterify sn-1,2-diacylglycerol with a long-chain fatty acyl-CoA. Plants and animals deficient in DGATs accumulate less TAG and over-expression of DGATs increases TAG. DGAT knock...

  14. Analysis and Functional Annotation of an Expressed Sequence Tag Collection for Tropical Crop Sugarcane

    PubMed Central

    Vettore, André L.; da Silva, Felipe R.; Kemper, Edson L.; Souza, Glaucia M.; da Silva, Aline M.; Ferro, Maria Inês T.; Henrique-Silva, Flavio; Giglioti, Éder A.; Lemos, Manoel V.F.; Coutinho, Luiz L.; Nobrega, Marina P.; Carrer, Helaine; França, Suzelei C.; Bacci, Maurício; Goldman, Maria Helena S.; Gomes, Suely L.; Nunes, Luiz R.; Camargo, Luis E.A.; Siqueira, Walter J.; Van Sluys, Marie-Anne; Thiemann, Otavio H.; Kuramae, Eiko E.; Santelli, Roberto V.; Marino, Celso L.; Targon, Maria L.P.N.; Ferro, Jesus A.; Silveira, Henrique C.S.; Marini, Danyelle C.; Lemos, Eliana G.M.; Monteiro-Vitorello, Claudia B.; Tambor, José H.M.; Carraro, Dirce M.; Roberto, Patrícia G.; Martins, Vanderlei G.; Goldman, Gustavo H.; de Oliveira, Regina C.; Truffi, Daniela; Colombo, Carlos A.; Rossi, Magdalena; de Araujo, Paula G.; Sculaccio, Susana A.; Angella, Aline; Lima, Marleide M.A.; de Rosa, Vicente E.; Siviero, Fábio; Coscrato, Virginia E.; Machado, Marcos A.; Grivet, Laurent; Di Mauro, Sonia M.Z.; Nobrega, Francisco G.; Menck, Carlos F.M.; Braga, Marilia D.V.; Telles, Guilherme P.; Cara, Frank A.A.; Pedrosa, Guilherme; Meidanis, João; Arruda, Paulo

    2003-01-01

    To contribute to our understanding of the genome complexity of sugarcane, we undertook a large-scale expressed sequence tag (EST) program. More than 260,000 cDNA clones were partially sequenced from 26 standard cDNA libraries generated from different sugarcane tissues. After the processing of the sequences, 237,954 high-quality ESTs were identified. These ESTs were assembled into 43,141 putative transcripts. Of the assembled sequences, 35.6% presented no matches with existing sequences in public databases. A global analysis of the whole SUCEST data set indicated that 14,409 assembled sequences (33% of the total) contained at least one cDNA clone with a full-length insert. Annotation of the 43,141 assembled sequences associated almost 50% of the putative identified sugarcane genes with protein metabolism, cellular communication/signal transduction, bioenergetics, and stress responses. Inspection of the translated assembled sequences for conserved protein domains revealed 40,821 amino acid sequences with 1415 Pfam domains. Reassembling the consensus sequences of the 43,141 transcripts revealed a 22% redundancy in the first assembling. This indicated that possibly 33,620 unique genes had been identified and indicated that >90% of the sugarcane expressed genes were tagged. PMID:14613979

  15. Microbial Diversity in Deep-sea Methane Seep Sediments Presented by SSU rRNA Gene Tag Sequencing

    PubMed Central

    Nunoura, Takuro; Takaki, Yoshihiro; Kazama, Hiromi; Hirai, Miho; Ashi, Juichiro; Imachi, Hiroyuki; Takai, Ken

    2012-01-01

    Microbial community structures in methane seep sediments in the Nankai Trough were analyzed by tag-sequencing analysis for the small subunit (SSU) rRNA gene using a newly developed primer set. The dominant members of Archaea were Deep-sea Hydrothermal Vent Euryarchaeotic Group 6 (DHVEG 6), Marine Group I (MGI) and Deep Sea Archaeal Group (DSAG), and those in Bacteria were Alpha-, Gamma-, Delta- and Epsilonproteobacteria, Chloroflexi, Bacteroidetes, Planctomycetes and Acidobacteria. Diversity and richness were examined by 8,709 and 7,690 tag-sequences from sediments at 5 and 25 cm below the seafloor (cmbsf), respectively. The estimated diversity and richness in the methane seep sediment are as high as those in soil and deep-sea hydrothermal environments, although the tag-sequences obtained in this study were not sufficient to show whole microbial diversity in this analysis. We also compared the diversity and richness of each taxon/division between the sediments from the two depths, and found that the diversity and richness of some taxa/divisions varied significantly along with the depth. PMID:22510646

  16. OSIRIS-REx Touch-And-Go (TAG) Navigation Performance

    NASA Technical Reports Server (NTRS)

    Berry, Kevin; Antreasian, Peter; Moreau, Michael C.; May, Alex; Sutter, Brian

    2015-01-01

    The Origins Spectral Interpretation Resource identification Security Regolith Explorer (OSIRIS-REx) mission is a NASA New Frontiers mission launching in 2016 to rendezvous with the near-Earth asteroid (101955) Bennu in late 2018. Following an extensive campaign of proximity operations activities to characterize the properties of Bennu and select a suitable sample site, OSIRIES-REx will fly a Touch-And-Go (TAG) trajectory to the asteroid's surface to obtain a regolith sample. The paper summarizes the mission design of the TAG sequence, the propulsive required to achieve the trajectory, and the sequence of events leading up to the TAG event. The paper will summarize the Monte-Carlo simulation of the TAG sequence and present analysis results that demonstrate the ability to conduct the TAG within 25 meters of the selected sample site and +-2 cms of the targeted contact velocity. The paper will describe some of the challenges associated with conducting precision navigation operations and ultimately contacting a very small asteroid.

  17. OSIRI-REx Touch and Go (TAG) Navigation Performance

    NASA Technical Reports Server (NTRS)

    Berry, Kevin; Antreasian, Peter; Moreau, Michael C.; May, Alex; Sutter, Brian

    2015-01-01

    The Origins Spectral Interpretation Resource Identification Security Regolith Explorer (OSIRIS-REx) mission is a NASA New Frontiers mission launching in 2016 to rendezvous with the near-Earth asteroid (101955) Bennu in late 2018. Following an extensive campaign of proximity operations activities to characterize the properties of Bennu and select a suitable sample site, OSIRIS-REx will fly a Touch-And-Go (TAG) trajectory to the asteroid's surface to obtain a regolith sample. The paper summarizes the mission design of the TAG sequence, the propulsive maneuvers required to achieve the trajectory, and the sequence of events leading up to the TAG event. The paper also summarizes the Monte-Carlo simulation of the TAG sequence and presents analysis results that demonstrate the ability to conduct the TAG within 25 meters of the selected sample site and 2 cm/s of the targeted contact velocity. The paper describes some of the challenges associated with conducting precision navigation operations and ultimately contacting a very small asteroid.

  18. The use of coded PCR primers enables high-throughput sequencing of multiple homolog amplification products by 454 parallel sequencing.

    PubMed

    Binladen, Jonas; Gilbert, M Thomas P; Bollback, Jonathan P; Panitz, Frank; Bendixen, Christian; Nielsen, Rasmus; Willerslev, Eske

    2007-02-14

    The invention of the Genome Sequence 20 DNA Sequencing System (454 parallel sequencing platform) has enabled the rapid and high-volume production of sequence data. Until now, however, individual emulsion PCR (emPCR) reactions and subsequent sequencing runs have been unable to combine template DNA from multiple individuals, as homologous sequences cannot be subsequently assigned to their original sources. We use conventional PCR with 5'-nucleotide tagged primers to generate homologous DNA amplification products from multiple specimens, followed by sequencing through the high-throughput Genome Sequence 20 DNA Sequencing System (GS20, Roche/454 Life Sciences). Each DNA sequence is subsequently traced back to its individual source through 5'tag-analysis. We demonstrate that this new approach enables the assignment of virtually all the generated DNA sequences to the correct source once sequencing anomalies are accounted for (miss-assignment rate<0.4%). Therefore, the method enables accurate sequencing and assignment of homologous DNA sequences from multiple sources in single high-throughput GS20 run. We observe a bias in the distribution of the differently tagged primers that is dependent on the 5' nucleotide of the tag. In particular, primers 5' labelled with a cytosine are heavily overrepresented among the final sequences, while those 5' labelled with a thymine are strongly underrepresented. A weaker bias also exists with regards to the distribution of the sequences as sorted by the second nucleotide of the dinucleotide tags. As the results are based on a single GS20 run, the general applicability of the approach requires confirmation. However, our experiments demonstrate that 5'primer tagging is a useful method in which the sequencing power of the GS20 can be applied to PCR-based assays of multiple homologous PCR products. The new approach will be of value to a broad range of research areas, such as those of comparative genomics, complete mitochondrial analyses, population genetics, and phylogenetics.

  19. Methyl-CpG island-associated genome signature tags

    DOEpatents

    Dunn, John J

    2014-05-20

    Disclosed is a method for analyzing the organismic complexity of a sample through analysis of the nucleic acid in the sample. In the disclosed method, through a series of steps, including digestion with a type II restriction enzyme, ligation of capture adapters and linkers and digestion with a type IIS restriction enzyme, genome signature tags are produced. The sequences of a statistically significant number of the signature tags are determined and the sequences are used to identify and quantify the organisms in the sample. Various embodiments of the invention described herein include methods for using single point genome signature tags to analyze the related families present in a sample, methods for analyzing sequences associated with hyper- and hypo-methylated CpG islands, methods for visualizing organismic complexity change in a sampling location over time and methods for generating the genome signature tag profile of a sample of fragmented DNA.

  20. Haplotag: Software for Haplotype-Based Genotyping-by-Sequencing Analysis

    PubMed Central

    Tinker, Nicholas A.; Bekele, Wubishet A.; Hattori, Jiro

    2016-01-01

    Genotyping-by-sequencing (GBS), and related methods, are based on high-throughput short-read sequencing of genomic complexity reductions followed by discovery of single nucleotide polymorphisms (SNPs) within sequence tags. This provides a powerful and economical approach to whole-genome genotyping, facilitating applications in genomics, diversity analysis, and molecular breeding. However, due to the complexity of analyzing large data sets, applications of GBS may require substantial time, expertise, and computational resources. Haplotag, the novel GBS software described here, is freely available, and operates with minimal user-investment on widely available computer platforms. Haplotag is unique in fulfilling the following set of criteria: (1) operates without a reference genome; (2) can be used in a polyploid species; (3) provides a discovery mode, and a production mode; (4) discovers polymorphisms based on a model of tag-level haplotypes within sequenced tags; (5) reports SNPs as well as haplotype-based genotypes; and (6) provides an intuitive visual “passport” for each inferred locus. Haplotag is optimized for use in a self-pollinating plant species. PMID:26818073

  1. Parallel gene analysis with allele-specific padlock probes and tag microarrays

    PubMed Central

    Banér, Johan; Isaksson, Anders; Waldenström, Erik; Jarvius, Jonas; Landegren, Ulf; Nilsson, Mats

    2003-01-01

    Parallel, highly specific analysis methods are required to take advantage of the extensive information about DNA sequence variation and of expressed sequences. We present a scalable laboratory technique suitable to analyze numerous target sequences in multiplexed assays. Sets of padlock probes were applied to analyze single nucleotide variation directly in total genomic DNA or cDNA for parallel genotyping or gene expression analysis. All reacted probes were then co-amplified and identified by hybridization to a standard tag oligonucleotide array. The technique was illustrated by analyzing normal and pathogenic variation within the Wilson disease-related ATP7B gene, both at the level of DNA and RNA, using allele-specific padlock probes. PMID:12930977

  2. Genome-Wide Analysis of Oleosin Gene Family in 22 Tree Species: An Accelerator for Metabolic Engineering of BioFuel Crops and Agrigenomics Industrial Applications?

    PubMed Central

    2015-01-01

    Abstract Trees contribute to enormous plant oil reserves because many trees contain 50%–80% of oil (triacylglycerols, TAGs) in the fruits and kernels. TAGs accumulate in subcellular structures called oil bodies/droplets, in which TAGs are covered by low-molecular-mass hydrophobic proteins called oleosins (OLEs). The OLEs/TAGs ratio determines the size and shape of intracellular oil bodies. There is a lack of comprehensive sequence analysis and structural information of OLEs among diverse trees. The objectives of this study were to identify OLEs from 22 tree species (e.g., tung tree, tea-oil tree, castor bean), perform genome-wide analysis of OLEs, classify OLEs, identify conserved sequence motifs and amino acid residues, and predict secondary and three-dimensional structures in tree OLEs and OLE subfamilies. Data mining identified 65 OLEs with perfect conservation of the “proline knot” motif (PX5SPX3P) from 19 trees. These OLEs contained >40% hydrophobic amino acid residues. They displayed similar properties and amino acid composition. Genome-wide phylogenetic analysis and multiple sequence alignment demonstrated that these proteins could be classified into five OLE subfamilies. There were distinct patterns of sequence conservation among the OLE subfamilies and within individual tree species. Computational modeling indicated that OLEs were composed of at least three α-helixes connected with short coils without any β-strand and that they exhibited distinct 3D structures and ligand binding sites. These analyses provide fundamental information in the similarity and specificity of diverse OLE isoforms within the same subfamily and among the different species, which should facilitate studying the structure-function relationship and identify critical amino acid residues in OLEs for metabolic engineering of tree TAGs. PMID:26258573

  3. Genome-Wide Analysis of Oleosin Gene Family in 22 Tree Species: An Accelerator for Metabolic Engineering of BioFuel Crops and Agrigenomics Industrial Applications?

    PubMed

    Cao, Heping

    2015-09-01

    Trees contribute to enormous plant oil reserves because many trees contain 50%-80% of oil (triacylglycerols, TAGs) in the fruits and kernels. TAGs accumulate in subcellular structures called oil bodies/droplets, in which TAGs are covered by low-molecular-mass hydrophobic proteins called oleosins (OLEs). The OLEs/TAGs ratio determines the size and shape of intracellular oil bodies. There is a lack of comprehensive sequence analysis and structural information of OLEs among diverse trees. The objectives of this study were to identify OLEs from 22 tree species (e.g., tung tree, tea-oil tree, castor bean), perform genome-wide analysis of OLEs, classify OLEs, identify conserved sequence motifs and amino acid residues, and predict secondary and three-dimensional structures in tree OLEs and OLE subfamilies. Data mining identified 65 OLEs with perfect conservation of the "proline knot" motif (PX5SPX3P) from 19 trees. These OLEs contained >40% hydrophobic amino acid residues. They displayed similar properties and amino acid composition. Genome-wide phylogenetic analysis and multiple sequence alignment demonstrated that these proteins could be classified into five OLE subfamilies. There were distinct patterns of sequence conservation among the OLE subfamilies and within individual tree species. Computational modeling indicated that OLEs were composed of at least three α-helixes connected with short coils without any β-strand and that they exhibited distinct 3D structures and ligand binding sites. These analyses provide fundamental information in the similarity and specificity of diverse OLE isoforms within the same subfamily and among the different species, which should facilitate studying the structure-function relationship and identify critical amino acid residues in OLEs for metabolic engineering of tree TAGs.

  4. Peptides derivatized with bicyclic quaternary ammonium ionization tags. Sequencing via tandem mass spectrometry.

    PubMed

    Setner, Bartosz; Rudowska, Magdalena; Klem, Ewelina; Cebrat, Marek; Szewczuk, Zbigniew

    2014-10-01

    Improving the sensitivity of detection and fragmentation of peptides to provide reliable sequencing of peptides is an important goal of mass spectrometric analysis. Peptides derivatized by bicyclic quaternary ammonium ionization tags: 1-azabicyclo[2.2.2]octane (ABCO) or 1,4-diazabicyclo[2.2.2]octane (DABCO), are characterized by an increased detection sensitivity in electrospray ionization mass spectrometry (ESI-MS) and longer retention times on the reverse-phase (RP) chromatography columns. The improvement of the detection limit was observed even for peptides dissolved in 10 mM NaCl. Collision-induced dissociation tandem mass spectrometry of quaternary ammonium salts derivatives of peptides showed dominant a- and b-type ions, allowing facile sequencing of peptides. The bicyclic ionization tags are stable in collision-induced dissociation experiments, and the resulted fragmentation pattern is not significantly influenced by either acidic or basic amino acid residues in the peptide sequence. Obtained results indicate the general usefulness of the bicyclic quaternary ammonium ionization tags for ESI-MS/MS sequencing of peptides. Copyright © 2014 John Wiley & Sons, Ltd.

  5. Illuminator, a desktop program for mutation detection using short-read clonal sequencing.

    PubMed

    Carr, Ian M; Morgan, Joanne E; Diggle, Christine P; Sheridan, Eamonn; Markham, Alexander F; Logan, Clare V; Inglehearn, Chris F; Taylor, Graham R; Bonthron, David T

    2011-10-01

    Current methods for sequencing clonal populations of DNA molecules yield several gigabases of data per day, typically comprising reads of < 100 nt. Such datasets permit widespread genome resequencing and transcriptome analysis or other quantitative tasks. However, this huge capacity can also be harnessed for the resequencing of smaller (gene-sized) target regions, through the simultaneous parallel analysis of multiple subjects, using sample "tagging" or "indexing". These methods promise to have a huge impact on diagnostic mutation analysis and candidate gene testing. Here we describe a software package developed for such studies, offering the ability to resolve pooled samples carrying barcode tags and to align reads to a reference sequence using a mutation-tolerant process. The program, Illuminator, can identify rare sequence variants, including insertions and deletions, and permits interactive data analysis on standard desktop computers. It facilitates the effective analysis of targeted clonal sequencer data without dedicated computational infrastructure or specialized training. Copyright © 2011 Elsevier Inc. All rights reserved.

  6. Analysis of expressed sequence tags from a single wheat cultivar facilitates interpretation of tandem mass spectrometry data and discrimination of gamma gliadin proteins that may play different functional roles in flour

    USDA-ARS?s Scientific Manuscript database

    The complement of gamma gliadin genes expressed in the wheat cultivar Butte 86 was evaluated by analyzing publicly available expressed sequence tag (EST) data. Eleven contigs were assembled from 153 Butte 86 ESTs. Nine of the contigs encoded full-length proteins and four of the proteins contained an...

  7. Improved serial analysis of V1 ribosomal sequence tags (SARST-V1) provides a rapid, comprehensive, sequence-based characterization of bacterial diversity and community composition.

    PubMed

    Yu, Zhongtang; Yu, Marie; Morrison, Mark

    2006-04-01

    Serial analysis of ribosomal sequence tags (SARST) is a recently developed technology that can generate large 16S rRNA gene (rrs) sequence data sets from microbiomes, but there are numerous enzymatic and purification steps required to construct the ribosomal sequence tag (RST) clone libraries. We report here an improved SARST method, which still targets the V1 hypervariable region of rrs genes, but reduces the number of enzymes, oligonucleotides, reagents, and technical steps needed to produce the RST clone libraries. The new method, hereafter referred to as SARST-V1, was used to examine the eubacterial diversity present in community DNA recovered from the microbiome resident in the ovine rumen. The 190 sequenced clones contained 1055 RSTs and no less than 236 unique phylotypes (based on > or = 95% sequence identity) that were assigned to eight different eubacterial phyla. Rarefaction and monomolecular curve analyses predicted that the complete RST clone library contains 99% of the 353 unique phylotypes predicted to exist in this microbiome. When compared with ribosomal intergenic spacer analysis (RISA) of the same community DNA sample, as well as a compilation of nine previously published conventional rrs clone libraries prepared from the same type of samples, the RST clone library provided a more comprehensive characterization of the eubacterial diversity present in rumen microbiomes. As such, SARST-V1 should be a useful tool applicable to comprehensive examination of diversity and composition in microbiomes and offers an affordable, sequence-based method for diversity analysis.

  8. Improved microarray methods for profiling the yeast knockout strain collection

    PubMed Central

    Yuan, Daniel S.; Pan, Xuewen; Ooi, Siew Loon; Peyser, Brian D.; Spencer, Forrest A.; Irizarry, Rafael A.; Boeke, Jef D.

    2005-01-01

    A remarkable feature of the Yeast Knockout strain collection is the presence of two unique 20mer TAG sequences in almost every strain. In principle, the relative abundances of strains in a complex mixture can be profiled swiftly and quantitatively by amplifying these sequences and hybridizing them to microarrays, but TAG microarrays have not been widely used. Here, we introduce a TAG microarray design with sophisticated controls and describe a robust method for hybridizing high concentrations of dye-labeled TAGs in single-stranded form. We also highlight the importance of avoiding PCR contamination and provide procedures for detection and eradication. Validation experiments using these methods yielded false positive (FP) and false negative (FN) rates for individual TAG detection of 3–6% and 15–18%, respectively. Analysis demonstrated that cross-hybridization was the chief source of FPs, while TAG amplification defects were the main cause of FNs. The materials, protocols, data and associated software described here comprise a suite of experimental resources that should facilitate the use of TAG microarrays for a wide variety of genetic screens. PMID:15994458

  9. An expressed sequence tag (EST) data mining strategy succeeding in the discovery of new G-protein coupled receptors.

    PubMed

    Wittenberger, T; Schaller, H C; Hellebrand, S

    2001-03-30

    We have developed a comprehensive expressed sequence tag database search method and used it for the identification of new members of the G-protein coupled receptor superfamily. Our approach proved to be especially useful for the detection of expressed sequence tag sequences that do not encode conserved parts of a protein, making it an ideal tool for the identification of members of divergent protein families or of protein parts without conserved domain structures in the expressed sequence tag database. At least 14 of the expressed sequence tags found with this strategy are promising candidates for new putative G-protein coupled receptors. Here, we describe the sequence and expression analysis of five new members of this receptor superfamily, namely GPR84, GPR86, GPR87, GPR90 and GPR91. We also studied the genomic structure and chromosomal localization of the respective genes applying in silico methods. A cluster of six closely related G-protein coupled receptors was found on the human chromosome 3q24-3q25. It consists of four orphan receptors (GPR86, GPR87, GPR91, and H963), the purinergic receptor P2Y1, and the uridine 5'-diphosphoglucose receptor KIAA0001. It seems likely that these receptors evolved from a common ancestor and therefore might have related ligands. In conclusion, we describe a data mining procedure that proved to be useful for the identification and first characterization of new genes and is well applicable for other gene families. Copyright 2001 Academic Press.

  10. Application safety evaluation of the radio frequency identification tag under magnetic resonance imaging.

    PubMed

    Fei, Xiaolu; Li, Shanshan; Gao, Shan; Wei, Lan; Wang, Lihong

    2014-09-04

    Radio Frequency Identification(RFID) has been widely used in healthcare facilities, but it has been paid little attention whether RFID applications are safe enough under healthcare environment. The purpose of this study is to assess the effects of RFID tags on Magnetic Resonance (MR) imaging in a typical electromagnetic environment in hospitals, and to evaluate the safety of their applications. A Magphan phantom was used to simulate the imaging objects, while active RFID tags were placed at different distances (0, 4, 8, 10 cm) from the phantom border. The phantom was scanned by using three typical sequences including spin-echo (SE) sequence, gradient-echo (GRE) sequence and inversion-recovery (IR) sequence. The quality of the image was quantitatively evaluated by using signal-to-noise ratio (SNR), uniformity, high-contrast resolution, and geometric distortion. RFID tags were read by an RFID reader to calculate their usable rate. RFID tags can be read properly after being placed in high magnetic field for up to 30 minutes. SNR: There were no differences between the group with RFID tags and the group without RFID tags using SE and IR sequence, but it was lower when using GRE sequence.Uniformity: There was a significant difference between the group with RFID tags and the group without RFID tags using SE and GRE sequence. Geometric distortion and high-contrast resolution: There were no obvious differences found. Active RFID tags can affect MR imaging quality, especially using the GRE sequence. Increasing the distance from the RFID tags to the imaging objects can reduce that influence. When the distance was longer than 8 cm, MR imaging quality were almost unaffected. However, the Gradient Echo related sequence is not recommended when patients wear a RFID wristband.

  11. Expressed sequence tags from Atta laevigata and identification of candidate genes for the control of pest leaf-cutting ants.

    PubMed

    Rodovalho, Cynara M; Ferro, Milene; Fonseca, Fernando Pp; Antonio, Erik A; Guilherme, Ivan R; Henrique-Silva, Flávio; Bacci, Maurício

    2011-06-17

    Leafcutters are the highest evolved within Neotropical ants in the tribe Attini and model systems for studying caste formation, labor division and symbiosis with microorganisms. Some species of leafcutters are agricultural pests controlled by chemicals which affect other animals and accumulate in the environment. Aiming to provide genetic basis for the study of leafcutters and for the development of more specific and environmentally friendly methods for the control of pest leafcutters, we generated expressed sequence tag data from Atta laevigata, one of the pest ants with broad geographic distribution in South America. The analysis of the expressed sequence tags allowed us to characterize 2,006 unique sequences in Atta laevigata. Sixteen of these genes had a high number of transcripts and are likely positively selected for high level of gene expression, being responsible for three basic biological functions: energy conservation through redox reactions in mitochondria; cytoskeleton and muscle structuring; regulation of gene expression and metabolism. Based on leafcutters lifestyle and reports of genes involved in key processes of other social insects, we identified 146 sequences potential targets for controlling pest leafcutters. The targets are responsible for antixenobiosis, development and longevity, immunity, resistance to pathogens, pheromone function, cell signaling, behavior, polysaccharide metabolism and arginine kynase activity. The generation and analysis of expressed sequence tags from Atta laevigata have provided important genetic basis for future studies on the biology of leaf-cutting ants and may contribute to the development of a more specific and environmentally friendly method for the control of agricultural pest leafcutters.

  12. Expressed sequence tags from Atta laevigata and identification of candidate genes for the control of pest leaf-cutting ants

    PubMed Central

    2011-01-01

    Background Leafcutters are the highest evolved within Neotropical ants in the tribe Attini and model systems for studying caste formation, labor division and symbiosis with microorganisms. Some species of leafcutters are agricultural pests controlled by chemicals which affect other animals and accumulate in the environment. Aiming to provide genetic basis for the study of leafcutters and for the development of more specific and environmentally friendly methods for the control of pest leafcutters, we generated expressed sequence tag data from Atta laevigata, one of the pest ants with broad geographic distribution in South America. Results The analysis of the expressed sequence tags allowed us to characterize 2,006 unique sequences in Atta laevigata. Sixteen of these genes had a high number of transcripts and are likely positively selected for high level of gene expression, being responsible for three basic biological functions: energy conservation through redox reactions in mitochondria; cytoskeleton and muscle structuring; regulation of gene expression and metabolism. Based on leafcutters lifestyle and reports of genes involved in key processes of other social insects, we identified 146 sequences potential targets for controlling pest leafcutters. The targets are responsible for antixenobiosis, development and longevity, immunity, resistance to pathogens, pheromone function, cell signaling, behavior, polysaccharide metabolism and arginine kynase activity. Conclusion The generation and analysis of expressed sequence tags from Atta laevigata have provided important genetic basis for future studies on the biology of leaf-cutting ants and may contribute to the development of a more specific and environmentally friendly method for the control of agricultural pest leafcutters. PMID:21682882

  13. The structure of the SBP-Tag–streptavidin complex reveals a novel helical scaffold bridging binding pockets on separate subunits

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Barrette-Ng, Isabelle H.; Wu, Sau-Ching; Tjia, Wai-Mui

    2013-05-01

    The structure of the SBP-Tag–streptavidin complex reveals a novel mode of peptide recognition in which a single peptide binds simultaneously to biotin-binding pockets from adjacent subunits of streptavidin. The molecular details of peptide recognition suggest how the SBP-Tag can be further modified to become an even more useful tag for a wider range of biotechnological applications. The 38-residue SBP-Tag binds to streptavidin more tightly (K{sub d} ≃ 2.5–4.9 nM) than most if not all other known peptide sequences. Crystallographic analysis at 1.75 Å resolution shows that the SBP-Tag binds to streptavidin in an unprecedented manner by simultaneously interacting with biotin-bindingmore » pockets from two separate subunits. An N-terminal HVV peptide sequence (residues 12–14) and a C-terminal HPQ sequence (residues 31–33) form the bulk of the direct interactions between the SBP-Tag and the two biotin-binding pockets. Surprisingly, most of the peptide spanning these two sites (residues 17–28) adopts a regular α-helical structure that projects three leucine side chains into a groove formed at the interface between two streptavidin protomers. The crystal structure shows that residues 1–10 and 35–38 of the original SBP-Tag identified through in vitro selection and deletion analysis do not appear to contact streptavidin and thus may not be important for binding. A 25-residue peptide comprising residues 11–34 (SBP-Tag2) was synthesized and shown using surface plasmon resonance to bind streptavidin with very similar affinity and kinetics when compared with the SBP-Tag. The SBP-Tag2 was also added to the C-terminus of β-lactamase and was shown to be just as effective as the full-length SBP-Tag in affinity purification. These results validate the molecular structure of the SBP-Tag–streptavidin complex and establish a minimal bivalent streptavidin-binding tag from which further rational design and optimization can proceed.« less

  14. Structure-function analysis of diacylglycerol acyltransferase sequences for metabolic engineering and drug discovery

    USDA-ARS?s Scientific Manuscript database

    Diacylglycerol acyltransferase families (DGATs) catalyze the final and rate-limiting step of triacylglycerol (TAG) biosynthesis in eukaryotic organisms. DGAT knockout mice are resistant to diet-induced obesity and lack milk secretion. Over-expression of DGATs increases TAG in plants. Therefore, unde...

  15. Magnetic Resonance Arterial Spin Tagging for Non-Invasive Pharmacokinetic Analysis of Breast Cancer

    DTIC Science & Technology

    2000-10-01

    sequence software that we had developed for this project. In addition, we revised the pulse sequences to utilize the high performance gradients (40 mT/ m ...peak, 150 mT/ m /ms rise) of the system. We believe these revised sequences will provide better arterial spin tagged data for perfusion measurement. All...U.... ...... ... -- v p I _1 i-:F~ ----- ! - .Ag Jig. H aI .. M e fI6lo 3 ~ ~ 2 0’,~- A.11. I 1 1 9 - HP ~ ~ IM I 15 L 1 1 8 = NIAt I C J1 5

  16. A microsphere-based assay for mutation analysis of the biotinidase gene using dried blood spots

    PubMed Central

    Lindau-Shepard, Barbara; Janik, David K.; Pass, Kenneth A.

    2012-01-01

    Biotinidase deficiency is an autosomal recessive syndrome caused by defects in the biotinidase gene, the product of which affects biotin metabolism. Newborn screening (NBS) for biotinidase deficiency can identify affected infants prior to onset of symptoms; biotin supplementation can resolve or prevent the clinical features. In NBS, dry blood spots (DBS) are usually tested for biotinidase enzyme activity by colorimetric analysis. By taking advantage of the multiplexing capabilities of the Luminex platform, we have developed a microsphere-based array genotyping method for the simultaneous detection of six disease causing mutations in the biotinidase gene, thereby permitting a second tier of molecular analysis. Genomic DNA was extracted from 3.2 mm DBS. Biotinidase gene sequences, containing the mutations of interest, were amplified by multiplexed polymerase chain reaction, followed by multiplexed allele-specific primer extension using universally tagged genotyping primers. The products were then hybridized to anti-tag carrying xTAG microspheres and detected on the Luminex platform. Genotypes were verified by sequencing. Genotyping results of 22 known biotinidase deficient samples by our xTAG biotinidase assay was in concordance with the results obtained from DNA sequencing, for all 6 mutations used in our panel. These results indicate that genotyping by an xTAG microsphere-based array is accurate, flexible, and can be adapted for high-throughput. Since NBS for biotinidase deficiency is by enzymatic assay, less than optimal quality of the DBS itself can compromise enzyme activity, while the DNA from these samples mostly remains unaffected. This assay warrants evaluation as a viable complement to the biotinidase semi-quantitative colorimetric assay. PMID:27625817

  17. Sequence analysis reveals genomic factors affecting EST-SSR primer performance and polymorphism

    USDA-ARS?s Scientific Manuscript database

    Search for simple sequence repeat (SSR) motifs and design of flanking primers in expressed sequence tag (EST) sequences can be easily done at a large scale using bioinformatics programs. However, failed amplification and/or detection, along with lack of polymorphism, is often seen among randomly sel...

  18. Developmental staging of male murine embryonic gonad by SAGE analysis

    PubMed Central

    Lee, Tin-Lap; Li, Yunmin; Alba, Diana; Vong, Queenie P.; Wu, Shao-Ming; Baxendale, Vanessa; Rennert, Owen M.; Lau, Yun-Fai Chris; Chan, Wai-Yee

    2012-01-01

    Despite the identification of key genes such as Sry integral to embryonic gonadal development, the genomic classification and identification of chromosomal activation of this process is still poorly understood. To better understand the genetic regulation of gonadal development, we performed Serial Analysis of Gene Expression (SAGE) to profile the genes and novel transcripts, and an average of 152,000 tags from male embryonic gonads at E10.5 (embryonic day 10.5), E11.5, E12.5, E13.5, E15.5 and E17.5 were analyzed. A total of 275,583 non-singleton tags that do not map to any annotated sequence were identified in the six gonad libraries, and 47,255 tags were mapped to 24,975 annotated sequences, among which 987 sequences were uncharacterized. Utilizing an unsupervised pattern identification technique, we established molecular staging of male gonadal development. Rather than providing a static descriptive analysis, we developed algorithms to cluster the SAGE data and assign SAGE tags to a corresponding chromosomal position; these data are displayed in chromosome graphic format. A prominent increase in global genomic activity from E10.5 to E17.5 was observed. Important chromosomal regions related to the developmental processes were identified and validated based on established mouse models with developmental disorders. These regions may represent markers for early diagnosis for disorders of male gonad development as well as potential treatment targets. PMID:19376482

  19. TagDigger: user-friendly extraction of read counts from GBS and RAD-seq data.

    PubMed

    Clark, Lindsay V; Sacks, Erik J

    2016-01-01

    In genotyping-by-sequencing (GBS) and restriction site-associated DNA sequencing (RAD-seq), read depth is important for assessing the quality of genotype calls and estimating allele dosage in polyploids. However, existing pipelines for GBS and RAD-seq do not provide read counts in formats that are both accurate and easy to access. Additionally, although existing pipelines allow previously-mined SNPs to be genotyped on new samples, they do not allow the user to manually specify a subset of loci to examine. Pipelines that do not use a reference genome assign arbitrary names to SNPs, making meta-analysis across projects difficult. We created the software TagDigger, which includes three programs for analyzing GBS and RAD-seq data. The first script, tagdigger_interactive.py, rapidly extracts read counts and genotypes from FASTQ files using user-supplied sets of barcodes and tags. Input and output is in CSV format so that it can be opened by spreadsheet software. Tag sequences can also be imported from the Stacks, TASSEL-GBSv2, TASSEL-UNEAK, or pyRAD pipelines, and a separate file can be imported listing the names of markers to retain. A second script, tag_manager.py, consolidates marker names and sequences across multiple projects. A third script, barcode_splitter.py, assists with preparing FASTQ data for deposit in a public archive by splitting FASTQ files by barcode and generating MD5 checksums for the resulting files. TagDigger is open-source and freely available software written in Python 3. It uses a scalable, rapid search algorithm that can process over 100 million FASTQ reads per hour. TagDigger will run on a laptop with any operating system, does not consume hard drive space with intermediate files, and does not require programming skill to use.

  20. Analysis of SSR information in EST resources of sugarcane

    USDA-ARS?s Scientific Manuscript database

    Expressed sequence tags ( ESTs) offer the opportunity to exploit single, low -copy, conserved sequence motifs for the development of simple sequence repeats ( SSRs). The total of 262 113 ESTs of sugarcane (Saccharum officinarum) in the database of NCBI were downloaded and analyzed, which resulted in...

  1. Studies of a biochemical factory: tomato trichome deep expressed sequence tag sequencing and proteomics.

    PubMed

    Schilmiller, Anthony L; Miner, Dennis P; Larson, Matthew; McDowell, Eric; Gang, David R; Wilkerson, Curtis; Last, Robert L

    2010-07-01

    Shotgun proteomics analysis allows hundreds of proteins to be identified and quantified from a single sample at relatively low cost. Extensive DNA sequence information is a prerequisite for shotgun proteomics, and it is ideal to have sequence for the organism being studied rather than from related species or accessions. While this requirement has limited the set of organisms that are candidates for this approach, next generation sequencing technologies make it feasible to obtain deep DNA sequence coverage from any organism. As part of our studies of specialized (secondary) metabolism in tomato (Solanum lycopersicum) trichomes, 454 sequencing of cDNA was combined with shotgun proteomics analyses to obtain in-depth profiles of genes and proteins expressed in leaf and stem glandular trichomes of 3-week-old plants. The expressed sequence tag and proteomics data sets combined with metabolite analysis led to the discovery and characterization of a sesquiterpene synthase that produces beta-caryophyllene and alpha-humulene from E,E-farnesyl diphosphate in trichomes of leaf but not of stem. This analysis demonstrates the utility of combining high-throughput cDNA sequencing with proteomics experiments in a target tissue. These data can be used for dissection of other biochemical processes in these specialized epidermal cells.

  2. Studies of a Biochemical Factory: Tomato Trichome Deep Expressed Sequence Tag Sequencing and Proteomics1[W][OA

    PubMed Central

    Schilmiller, Anthony L.; Miner, Dennis P.; Larson, Matthew; McDowell, Eric; Gang, David R.; Wilkerson, Curtis; Last, Robert L.

    2010-01-01

    Shotgun proteomics analysis allows hundreds of proteins to be identified and quantified from a single sample at relatively low cost. Extensive DNA sequence information is a prerequisite for shotgun proteomics, and it is ideal to have sequence for the organism being studied rather than from related species or accessions. While this requirement has limited the set of organisms that are candidates for this approach, next generation sequencing technologies make it feasible to obtain deep DNA sequence coverage from any organism. As part of our studies of specialized (secondary) metabolism in tomato (Solanum lycopersicum) trichomes, 454 sequencing of cDNA was combined with shotgun proteomics analyses to obtain in-depth profiles of genes and proteins expressed in leaf and stem glandular trichomes of 3-week-old plants. The expressed sequence tag and proteomics data sets combined with metabolite analysis led to the discovery and characterization of a sesquiterpene synthase that produces β-caryophyllene and α-humulene from E,E-farnesyl diphosphate in trichomes of leaf but not of stem. This analysis demonstrates the utility of combining high-throughput cDNA sequencing with proteomics experiments in a target tissue. These data can be used for dissection of other biochemical processes in these specialized epidermal cells. PMID:20431087

  3. Application of the High Resolution Melting analysis for genetic mapping of Sequence Tagged Site markers in narrow-leafed lupin (Lupinus angustifolius L.).

    PubMed

    Kamel, Katarzyna A; Kroc, Magdalena; Święcicki, Wojciech

    2015-01-01

    Sequence tagged site (STS) markers are valuable tools for genetic and physical mapping that can be successfully used in comparative analyses among related species. Current challenges for molecular markers genotyping in plants include the lack of fast, sensitive and inexpensive methods suitable for sequence variant detection. In contrast, high resolution melting (HRM) is a simple and high-throughput assay, which has been widely applied in sequence polymorphism identification as well as in the studies of genetic variability and genotyping. The present study is the first attempt to use the HRM analysis to genotype STS markers in narrow-leafed lupin (Lupinus angustifolius L.). The sensitivity and utility of this method was confirmed by the sequence polymorphism detection based on melting curve profiles in the parental genotypes and progeny of the narrow-leafed lupin mapping population. Application of different approaches, including amplicon size and a simulated heterozygote analysis, has allowed for successful genetic mapping of 16 new STS markers in the narrow-leafed lupin genome.

  4. Real-time single-molecule electronic DNA sequencing by synthesis using polymer-tagged nucleotides on a nanopore array

    PubMed Central

    Fuller, Carl W.; Kumar, Shiv; Porel, Mintu; Chien, Minchen; Bibillo, Arek; Stranges, P. Benjamin; Dorwart, Michael; Tao, Chuanjuan; Li, Zengmin; Guo, Wenjing; Shi, Shundi; Korenblum, Daniel; Trans, Andrew; Aguirre, Anne; Liu, Edward; Harada, Eric T.; Pollard, James; Bhat, Ashwini; Cech, Cynthia; Yang, Alexander; Arnold, Cleoma; Palla, Mirkó; Hovis, Jennifer; Chen, Roger; Morozova, Irina; Kalachikov, Sergey; Russo, James J.; Kasianowicz, John J.; Davis, Randy; Roever, Stefan; Church, George M.; Ju, Jingyue

    2016-01-01

    DNA sequencing by synthesis (SBS) offers a robust platform to decipher nucleic acid sequences. Recently, we reported a single-molecule nanopore-based SBS strategy that accurately distinguishes four bases by electronically detecting and differentiating four different polymer tags attached to the 5′-phosphate of the nucleotides during their incorporation into a growing DNA strand catalyzed by DNA polymerase. Further developing this approach, we report here the use of nucleotides tagged at the terminal phosphate with oligonucleotide-based polymers to perform nanopore SBS on an α-hemolysin nanopore array platform. We designed and synthesized several polymer-tagged nucleotides using tags that produce different electrical current blockade levels and verified they are active substrates for DNA polymerase. A highly processive DNA polymerase was conjugated to the nanopore, and the conjugates were complexed with primer/template DNA and inserted into lipid bilayers over individually addressable electrodes of the nanopore chip. When an incoming complementary-tagged nucleotide forms a tight ternary complex with the primer/template and polymerase, the tag enters the pore, and the current blockade level is measured. The levels displayed by the four nucleotides tagged with four different polymers captured in the nanopore in such ternary complexes were clearly distinguishable and sequence-specific, enabling continuous sequence determination during the polymerase reaction. Thus, real-time single-molecule electronic DNA sequencing data with single-base resolution were obtained. The use of these polymer-tagged nucleotides, combined with polymerase tethering to nanopores and multiplexed nanopore sensors, should lead to new high-throughput sequencing methods. PMID:27091962

  5. A family of cellular proteins related to snake venom disintegrins.

    PubMed

    Weskamp, G; Blobel, C P

    1994-03-29

    Disintegrins are short soluble integrin ligands that were initially identified in snake venom. A previously recognized cellular protein with a disintegrin domain was the guinea pig sperm protein PH-30, a protein implicated in sperm-egg membrane binding and fusion. Here we present peptide sequences that are characteristic for several cellular disintegrin-domain proteins. These peptide sequences were deduced from cDNA sequence tags that were generated by polymerase chain reaction from various mouse tissue and a mouse muscle cell line. Northern blot analysis with four sequence tags revealed distinct mRNA expression patterns. Evidently, cellular proteins containing a disintegrin domain define a superfamily of potential integrin ligands that are likely to function in important cell-cell and cell-matrix interactions.

  6. Exploiting EST databases for the development and characterisation of 3425 gene-tagged CISP markers in biofuel crop sugarcane and their transferability in cereals and orphan tropical grasses.

    PubMed

    Chandra, Amaresh; Jain, Radha; Solomon, Sushil; Shrivastava, Shiksha; Roy, Ajoy K

    2013-02-04

    Sugarcane is an important cash crop, providing 70% of the global raw sugar as well as raw material for biofuel production. Genetic analysis is hindered in sugarcane because of its large and complex polyploid genome and lack of sufficiently informative gene-tagged markers. Modern genomics has produced large amount of ESTs, which can be exploited to develop molecular markers based on comparative analysis with EST datasets of related crops and whole rice genome sequence, and accentuate their cross-technical functionality in orphan crops like tropical grasses. Utilising 246,180 Saccharum officinarum EST sequences vis-à-vis its comparative analysis with ESTs of sorghum and barley and the whole rice genome sequence, we have developed 3425 novel gene-tagged markers - namely, conserved-intron scanning primers (CISP) - using the web program GeMprospector. Rice orthologue annotation results indicated homology of 1096 sequences with expressed proteins, 491 with hypothetical proteins. The remaining 1838 were miscellaneous in nature. A total of 367 primer-pairs were tested in diverse panel of samples. The data indicate amplification of 41% polymorphic bands leading to 0.52 PIC and 3.50 MI with a set of sugarcane varieties and Saccharum species. In addition, a moderate technical functionality of a set of such markers with orphan tropical grasses (22%) and fodder cum cereal oat (33%) is observed. Developed gene-tagged CISP markers exhibited considerable technical functionality with varieties of sugarcane and unexplored species of tropical grasses. These markers would thus be particularly useful in identifying the economical traits in sugarcane and developing conservation strategies for orphan tropical grasses.

  7. Generation and analysis of expressed sequence tags from a cDNA library of the fruiting body of Ganoderma lucidum

    PubMed Central

    2010-01-01

    Background Little genomic or trancriptomic information on Ganoderma lucidum (Lingzhi) is known. This study aims to discover the transcripts involved in secondary metabolite biosynthesis and developmental regulation of G. lucidum using an expressed sequence tag (EST) library. Methods A cDNA library was constructed from the G. lucidum fruiting body. Its high-quality ESTs were assembled into unique sequences with contigs and singletons. The unique sequences were annotated according to sequence similarities to genes or proteins available in public databases. The detection of simple sequence repeats (SSRs) was preformed by online analysis. Results A total of 1,023 clones were randomly selected from the G. lucidum library and sequenced, yielding 879 high-quality ESTs. These ESTs showed similarities to a diverse range of genes. The sequences encoding squalene epoxidase (SE) and farnesyl-diphosphate synthase (FPS) were identified in this EST collection. Several candidate genes, such as hydrophobin, MOB2, profilin and PHO84 were detected for the first time in G. lucidum. Thirteen (13) potential SSR-motif microsatellite loci were also identified. Conclusion The present study demonstrates a successful application of EST analysis in the discovery of transcripts involved in the secondary metabolite biosynthesis and the developmental regulation of G. lucidum. PMID:20230644

  8. Evaluating information content of SNPs for sample-tagging in re-sequencing projects.

    PubMed

    Hu, Hao; Liu, Xiang; Jin, Wenfei; Hilger Ropers, H; Wienker, Thomas F

    2015-05-15

    Sample-tagging is designed for identification of accidental sample mix-up, which is a major issue in re-sequencing studies. In this work, we develop a model to measure the information content of SNPs, so that we can optimize a panel of SNPs that approach the maximal information for discrimination. The analysis shows that as low as 60 optimized SNPs can differentiate the individuals in a population as large as the present world, and only 30 optimized SNPs are in practice sufficient in labeling up to 100 thousand individuals. In the simulated populations of 100 thousand individuals, the average Hamming distances, generated by the optimized set of 30 SNPs are larger than 18, and the duality frequency, is lower than 1 in 10 thousand. This strategy of sample discrimination is proved robust in large sample size and different datasets. The optimized sets of SNPs are designed for Whole Exome Sequencing, and a program is provided for SNP selection, allowing for customized SNP numbers and interested genes. The sample-tagging plan based on this framework will improve re-sequencing projects in terms of reliability and cost-effectiveness.

  9. Development of expressed sequence tag-simple sequence repeat markers for genetic characterization and population structure analysis of Praxelis clematidea (Asteraceae).

    PubMed

    Wang, Q Z; Huang, M; Downie, S R; Chen, Z X

    2016-05-23

    Invasive plants tend to spread aggressively in new habitats and an understanding of their genetic diversity and population structure is useful for their management. In this study, expressed sequence tag-simple sequence repeat (EST-SSR) markers were developed for the invasive plant species Praxelis clematidea (Asteraceae) from 5548 Stevia rebaudiana (Asteraceae) expressed sequence tags (ESTs). A total of 133 microsatellite-containing ESTs (2.4%) were identified, of which 56 (42.1%) were hexanucleotide repeat motifs and 50 (37.6%) were trinucleotide repeat motifs. Of the 24 primer pairs designed from these 133 ESTs, 7 (29.2%) resulted in significant polymorphisms. The number of alleles per locus ranged from 5 to 9. The relatively high genetic diversity (H = 0.2667, I = 0.4212, and P = 100%) of P. clematidea was related to high gene flow (Nm = 1.4996) among populations. The coefficient of population differentiation (GST = 0.2500) indicated that most genetic variation occurred within populations. A Mantel test suggested that there was significant correlation between genetic distance and geographical distribution (r = 0.3192, P = 0.012). These results further support the transferability of EST-SSR markers between closely related genera of the same family.

  10. Digital gene expression analysis with sample multiplexing and PCR duplicate detection: A straightforward protocol.

    PubMed

    Rozenberg, Andrey; Leese, Florian; Weiss, Linda C; Tollrian, Ralph

    2016-01-01

    Tag-Seq is a high-throughput approach used for discovering SNPs and characterizing gene expression. In comparison to RNA-Seq, Tag-Seq eases data processing and allows detection of rare mRNA species using only one tag per transcript molecule. However, reduced library complexity raises the issue of PCR duplicates, which distort gene expression levels. Here we present a novel Tag-Seq protocol that uses the least biased methods for RNA library preparation combined with a novel approach for joint PCR template and sample labeling. In our protocol, input RNA is fragmented by hydrolysis, and poly(A)-bearing RNAs are selected and directly ligated to mixed DNA-RNA P5 adapters. The P5 adapters contain i5 barcodes composed of sample-specific (moderately) degenerate base regions (mDBRs), which later allow detection of PCR duplicates. The P7 adapter is attached via reverse transcription with individual i7 barcodes added during the amplification step. The resulting libraries can be sequenced on an Illumina sequencer. After sample demultiplexing and PCR duplicate removal with a free software tool we designed, the data are ready for downstream analysis. Our protocol was tested on RNA samples from predator-induced and control Daphnia microcrustaceans.

  11. Transcriptome-wide analysis of WRKY transcription factors in wheat and their leaf rust responsive expression profiling.

    PubMed

    Satapathy, Lopamudra; Singh, Dharmendra; Ranjan, Prashant; Kumar, Dhananjay; Kumar, Manish; Prabhu, Kumble Vinod; Mukhopadhyay, Kunal

    2014-12-01

    WRKY, a plant-specific transcription factor family, has important roles in pathogen defense, abiotic cues and phytohormone signaling, yet little is known about their roles and molecular mechanism of function in response to rust diseases in wheat. We identified 100 TaWRKY sequences using wheat Expressed Sequence Tag database of which 22 WRKY sequences were novel. Identified proteins were characterized based on their zinc finger motifs and phylogenetic analysis clustered them into six clades consisting of class IIc and class III WRKY proteins. Functional annotation revealed major functions in metabolic and cellular processes in control plants; whereas response to stimuli, signaling and defense in pathogen inoculated plants, their major molecular function being binding to DNA. Tag-based expression analysis of the identified genes revealed differential expression between mock and Puccinia triticina inoculated wheat near isogenic lines. Gene expression was also performed with six rust-related microarray experiments at Gene Expression Omnibus database. TaWRKY10, 15, 17 and 56 were common in both tag-based and microarray-based differential expression analysis and could be representing rust specific WRKY genes. The obtained results will bestow insight into the functional characterization of WRKY transcription factors responsive to leaf rust pathogenesis that can be used as candidate genes in molecular breeding programs to improve biotic stress tolerance in wheat.

  12. OSIRIS-REx Touch-And-Go (TAG) Mission Design and Analysis

    NASA Technical Reports Server (NTRS)

    Berry, Kevin; Sutter, Brian; May, Alex; Williams, Ken; Barbee, Brent W.; Beckman, Mark; Williams, Bobby

    2013-01-01

    The Origins Spectral Interpretation Resource Identification Security Regolith Explorer (OSIRIS-REx) mission is a NASA New Frontiers mission launching in 2016 to rendezvous with the near-Earth asteroid (101955) 1999 RQ36 in late 2018. After several months in formation with and orbit about the asteroid, OSIRIS-REx will fly a Touch-And-Go (TAG) trajectory to the asteroid s surface to obtain a regolith sample. This paper describes the mission design of the TAG sequence and the propulsive maneuvers required to achieve the trajectory. This paper also shows preliminary results of orbit covariance analysis and Monte-Carlo analysis that demonstrate the ability to arrive at a targeted location on the surface of RQ36 within a 25 meter radius with 98.3% confidence.

  13. DSAP: deep-sequencing small RNA analysis pipeline.

    PubMed

    Huang, Po-Jung; Liu, Yi-Chung; Lee, Chi-Ching; Lin, Wei-Chen; Gan, Richie Ruei-Chi; Lyu, Ping-Chiang; Tang, Petrus

    2010-07-01

    DSAP is an automated multiple-task web service designed to provide a total solution to analyzing deep-sequencing small RNA datasets generated by next-generation sequencing technology. DSAP uses a tab-delimited file as an input format, which holds the unique sequence reads (tags) and their corresponding number of copies generated by the Solexa sequencing platform. The input data will go through four analysis steps in DSAP: (i) cleanup: removal of adaptors and poly-A/T/C/G/N nucleotides; (ii) clustering: grouping of cleaned sequence tags into unique sequence clusters; (iii) non-coding RNA (ncRNA) matching: sequence homology mapping against a transcribed sequence library from the ncRNA database Rfam (http://rfam.sanger.ac.uk/); and (iv) known miRNA matching: detection of known miRNAs in miRBase (http://www.mirbase.org/) based on sequence homology. The expression levels corresponding to matched ncRNAs and miRNAs are summarized in multi-color clickable bar charts linked to external databases. DSAP is also capable of displaying miRNA expression levels from different jobs using a log(2)-scaled color matrix. Furthermore, a cross-species comparative function is also provided to show the distribution of identified miRNAs in different species as deposited in miRBase. DSAP is available at http://dsap.cgu.edu.tw.

  14. A tag-based approach for high-throughput analysis of CCWGG methylation.

    PubMed

    Denisova, Oksana V; Chernov, Andrei V; Koledachkina, Tatyana Y; Matvienko, Nicholas I

    2007-10-15

    Non-CpG methylation occurring in the context of CNG sequences is found in plants at a large number of genomic loci. However, there is still little information available about non-CpG methylation in mammals. Efficient methods that would allow detection of scarcely localized methylated sites in small quantities of DNA are required to elucidate the biological role of non-CpG methylation in both plants and animals. In this study, we tested a new whole genome approach to identify sites of CCWGG methylation (W is A or T), a particular case of CNG methylation, in genomic DNA. This technique is based on digestion of DNAs with methylation-sensitive restriction endonucleases EcoRII-C and AjnI. Short DNAs flanking methylated CCWGG sites (tags) are selectively purified and assembled in tandem arrays of up to nine tags. This allows high-throughput sequencing of tags, identification of flanking regions, and their exact positions in the genome. In this study, we tested specificity and efficiency of the approach.

  15. The transcription factor GCN4 regulates PHM8 and alters triacylglycerol metabolism in Saccharomyces cerevisiae.

    PubMed

    Yadav, Kamlesh Kumar; Rajasekharan, Ram

    2016-11-01

    PHM8 is a very important enzyme in nonpolar lipid metabolism because of its role in triacylglycerol (TAG) biosynthesis under phosphate stress conditions. It is positively regulated by the PHO4 transcription factor under low phosphate conditions; however, its regulation has not been explored under normal physiological conditions. General control nonderepressible (GCN4), a basic leucine-zipper transcription factor activates the transcription of amino acids, purine biosynthesis genes and many stress response genes under various stress conditions. In this study, we demonstrate that the level of TAG is regulated by the transcription factor GCN4. GCN4 directly binds to its consensus recognition sequence (TGACTC) in the PHM8 promoter and controls its expression. The analysis of cells expressing the P PHM8 -lacZ reporter gene showed that mutations (TGACTC-GGGCCC) in the GCN4-binding sequence caused a significant increase in β-galactosidase activity. Mutation in the GCN4 binding sequence causes an increase in PHM8 expression, lysophosphatidic acid phosphatase activity and TAG level. PHM8, in conjunction with DGA1, a mono- and diacylglycerol transferase, controls the level of TAG. These results revealed that GCN4 negatively regulates PHM8 and that deletion of GCN4 causes de-repression of PHM8, which is responsible for the increased TAG content in gcn4∆ cells.

  16. High-resolution phylogenetic microbial community profiling

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Singer, Esther; Bushnell, Brian; Coleman-Derr, Devin

    Over the past decade, high-throughput short-read 16S rRNA gene amplicon sequencing has eclipsed clone-dependent long-read Sanger sequencing for microbial community profiling. The transition to new technologies has provided more quantitative information at the expense of taxonomic resolution with implications for inferring metabolic traits in various ecosystems. We applied single-molecule real-time sequencing for microbial community profiling, generating full-length 16S rRNA gene sequences at high throughput, which we propose to name PhyloTags. We benchmarked and validated this approach using a defined microbial community. When further applied to samples from the water column of meromictic Sakinaw Lake, we show that while community structuresmore » at the phylum level are comparable between PhyloTags and Illumina V4 16S rRNA gene sequences (iTags), variance increases with community complexity at greater water depths. PhyloTags moreover allowed less ambiguous classification. Last, a platform-independent comparison of PhyloTags and in silico generated partial 16S rRNA gene sequences demonstrated significant differences in community structure and phylogenetic resolution across multiple taxonomic levels, including a severe underestimation in the abundance of specific microbial genera involved in nitrogen and methane cycling across the Lake's water column. Thus, PhyloTags provide a reliable adjunct or alternative to cost-effective iTags, enabling more accurate phylogenetic resolution of microbial communities and predictions on their metabolic potential.« less

  17. High-resolution phylogenetic microbial community profiling

    DOE PAGES

    Singer, Esther; Bushnell, Brian; Coleman-Derr, Devin; ...

    2016-02-09

    Over the past decade, high-throughput short-read 16S rRNA gene amplicon sequencing has eclipsed clone-dependent long-read Sanger sequencing for microbial community profiling. The transition to new technologies has provided more quantitative information at the expense of taxonomic resolution with implications for inferring metabolic traits in various ecosystems. We applied single-molecule real-time sequencing for microbial community profiling, generating full-length 16S rRNA gene sequences at high throughput, which we propose to name PhyloTags. We benchmarked and validated this approach using a defined microbial community. When further applied to samples from the water column of meromictic Sakinaw Lake, we show that while community structuresmore » at the phylum level are comparable between PhyloTags and Illumina V4 16S rRNA gene sequences (iTags), variance increases with community complexity at greater water depths. PhyloTags moreover allowed less ambiguous classification. Last, a platform-independent comparison of PhyloTags and in silico generated partial 16S rRNA gene sequences demonstrated significant differences in community structure and phylogenetic resolution across multiple taxonomic levels, including a severe underestimation in the abundance of specific microbial genera involved in nitrogen and methane cycling across the Lake's water column. Thus, PhyloTags provide a reliable adjunct or alternative to cost-effective iTags, enabling more accurate phylogenetic resolution of microbial communities and predictions on their metabolic potential.« less

  18. Serial analysis of gene expression in the silkworm, Bombyx mori.

    PubMed

    Huang, Jianhua; Miao, Xuexia; Jin, Weirong; Couble, Pierre; Mita, Kasuei; Zhang, Yong; Liu, Wenbin; Zhuang, Leijun; Shen, Yan; Keime, Celine; Gandrillon, Olivier; Brouilly, Patrick; Briolay, Jerome; Zhao, Guoping; Huang, Yongping

    2005-08-01

    The silkworm Bombyx mori is one of the most economically important insects and serves as a model for Lepidoptera insects. We used serial analysis of gene expression (SAGE) to derive profiles of expressed genes during the developmental life cycle of the silkworm and to create a reference for understanding silkworm metamorphosis. We generated four SAGE libraries, one from each of the four developmental stages of the silkworm. In total we obtained 257,964 SAGE tags, of which 39,485 were unique tags. Sorted by copy number, 14.1% of the unique tags were detected at a median to high level (five or more copies), 24.2% at lower levels (two to four copies), and 61.7% as single copies. Using a basic local alignment search tool on the EST database, 35% of the tags matched known silkworm expressed sequence tags. SAGE demonstrated that a number of the genes were up- or down-regulated during the four developmental phases of the egg, larva, pupa, and adult. Furthermore, we found that the generation of longer cDNA fragments from SAGE tags constituted the most efficient method of gene identification, which facilitated the analysis of a large number of unknown genes.

  19. Exploring the Presence of microDNAs in Prostate Cancer Cell Lines, Tissue, and Sera of Prostate Cancer Patients and its Possible Application as Biomarker

    DTIC Science & Technology

    2016-04-01

    Sequence tags were mapped on the human reference genome using the Novoalign software. Only those...ends of the linear islands to create a novel junctional sequence that does not exist in the genome . Thus the PE- sequence of a fragment that breaks at... genome (Fig. 3b). Those PE-tags where one tag maps uniquely to an island and the other remains unmapped, but passes the sequence quality filter,

  20. Genome-Wide Mutagenesis in Borrelia burgdorferi.

    PubMed

    Lin, Tao; Gao, Lihui

    2018-01-01

    Signature-tagged mutagenesis (STM) is a functional genomics approach to identify bacterial virulence determinants and virulence factors by simultaneously screening multiple mutants in a single host animal, and has been utilized extensively for the study of bacterial pathogenesis, host-pathogen interactions, and spirochete and tick biology. The signature-tagged transposon mutagenesis has been developed to investigate virulence determinants and pathogenesis of Borrelia burgdorferi. Mutants in genes important in virulence are identified by negative selection in which the mutants fail to colonize or disseminate in the animal host and tick vector. STM procedure combined with Luminex Flex ® Map™ technology and next-generation sequencing (e.g., Tn-seq) are the powerful high-throughput tools for the determination of Borrelia burgdorferi virulence determinants. The assessment of multiple tissue sites and two DNA resources at two different time points using Luminex Flex ® Map™ technology provides a robust data set. B. burgdorferi transposon mutant screening indicates that a high proportion of genes are the novel virulence determinants that are required for mouse and tick infection. In this protocol, an effective signature-tagged Himar1-based transposon suicide vector was developed and used to generate a sequence-defined library of nearly 4800 mutants in the infectious B. burgdorferi B31 clone. In STM, signature-tagged suicide vectors are constructed by inserting unique DNA sequences (tags) into the transposable elements. The signature-tagged transposon mutants are generated when transposon suicide vectors are transformed into an infectious B. burgdorferi clone, and the transposable element is transposed into the 5'-TA-3' sequence in the B. burgdorferi genome with the signature tag. The transposon library is created and consists of many sub-libraries, each sub-library has several hundreds of mutants with same tags. A group of mice or ticks are infected with a mixed population of mutants with different tags, after recovered from different tissues of infected mice and ticks, mutants from output pool and input pool are detected using high-throughput, semi-quantitative Luminex ® FLEXMAP™ or next-generation sequencing (Tn-seq) technologies. Thus far, we have created a high-density, sequence-defined transposon library of over 6600 STM mutants for the efficient genome-wide investigation of genes and gene products required for wild-type pathogenesis, host-pathogen interactions, in vitro growth, in vivo survival, physiology, morphology, chemotaxis, motility, structure, metabolism, gene regulation, plasmid maintenance and replication, etc. The insertion sites of 4480 transposon mutants have been determined. About 800 predicted protein-encoding genes in the genome were disrupted in the STM transposon library. The infectivity and some functions of 800 mutants in 500 genes have been determined. Analysis of these transposon mutants has yielded valuable information regarding the genes and gene products important in the pathogenesis and biology of B. burgdorferi and its tick vectors.

  1. Application of Stochastic Labeling with Random-Sequence Barcodes for Simultaneous Quantification and Sequencing of Environmental 16S rRNA Genes.

    PubMed

    Hoshino, Tatsuhiko; Inagaki, Fumio

    2017-01-01

    Next-generation sequencing (NGS) is a powerful tool for analyzing environmental DNA and provides the comprehensive molecular view of microbial communities. For obtaining the copy number of particular sequences in the NGS library, however, additional quantitative analysis as quantitative PCR (qPCR) or digital PCR (dPCR) is required. Furthermore, number of sequences in a sequence library does not always reflect the original copy number of a target gene because of biases caused by PCR amplification, making it difficult to convert the proportion of particular sequences in the NGS library to the copy number using the mass of input DNA. To address this issue, we applied stochastic labeling approach with random-tag sequences and developed a NGS-based quantification protocol, which enables simultaneous sequencing and quantification of the targeted DNA. This quantitative sequencing (qSeq) is initiated from single-primer extension (SPE) using a primer with random tag adjacent to the 5' end of target-specific sequence. During SPE, each DNA molecule is stochastically labeled with the random tag. Subsequently, first-round PCR is conducted, specifically targeting the SPE product, followed by second-round PCR to index for NGS. The number of random tags is only determined during the SPE step and is therefore not affected by the two rounds of PCR that may introduce amplification biases. In the case of 16S rRNA genes, after NGS sequencing and taxonomic classification, the absolute number of target phylotypes 16S rRNA gene can be estimated by Poisson statistics by counting random tags incorporated at the end of sequence. To test the feasibility of this approach, the 16S rRNA gene of Sulfolobus tokodaii was subjected to qSeq, which resulted in accurate quantification of 5.0 × 103 to 5.0 × 104 copies of the 16S rRNA gene. Furthermore, qSeq was applied to mock microbial communities and environmental samples, and the results were comparable to those obtained using digital PCR and relative abundance based on a standard sequence library. We demonstrated that the qSeq protocol proposed here is advantageous for providing less-biased absolute copy numbers of each target DNA with NGS sequencing at one time. By this new experiment scheme in microbial ecology, microbial community compositions can be explored in more quantitative manner, thus expanding our knowledge of microbial ecosystems in natural environments.

  2. Transcriptome Analysis of Gene Expression during Chinese Water Chestnut Storage Organ Formation

    PubMed Central

    Chen, Sainan; Wang, Yan; Yu, Meizhen; Chen, Xuehao; Li, Liangjun; Yin, Jingjing

    2016-01-01

    The product organ (storage organ; corm) of the Chinese water chestnut has become a very popular food in Asian countries because of its unique nutritional value. Corm formation is a complex biological process, and extensive whole genome analysis of transcripts during corm development has not been carried out. In this study, four corm libraries at different developmental stages were constructed, and gene expression was identified using a high-throughput tag sequencing technique. Approximately 4.9 million tags were sequenced, and 4,371,386, 4,372,602, 4,782,494, and 5,276,540 clean tags, including 119,676, 110,701, 100,089, and 101,239 distinct tags, respectively, were obtained after removal of low-quality tags from each library. More than 39% of the distinct tags were unambiguous and could be mapped to reference genes, while 40% were unambiguous tag-mapped genes. After mapping their functions in existing databases, a total of 11,592, 10,949, 10,585, and 7,111 genes were annotated from the B1, B2, B3, and B4 libraries, respectively. Analysis of the differentially expressed genes (DEGs) in B1/B2, B2/B3, and B3/B4 libraries showed that most of the DEGs at the B1/B2 stages were involved in carbohydrate and hormone metabolism, while the majority of DEGs were involved in energy metabolism and carbohydrate metabolism at the B2/B3 and B3/B4 stages. All of the upregulated transcription factors and 9 important genes related to product organ formation in the above four stages were also identified. The expression changes of nine of the identified DEGs were validated using a quantitative PCR approach. This study provides a comprehensive understanding of gene expression during corm formation in the Chinese water chestnut. PMID:27716802

  3. Evaluation of anonymous and expressed sequence tag derived polymorphic microsatellite markers in the tobacco budworm Heliothis virescens (Lepidoptera: noctuidae)

    USDA-ARS?s Scientific Manuscript database

    Polymorphic genetic markers were identified and characterized using a partial genomic library of Heliothis virescens enriched for simple sequence repeats (SSR) and nucleotide sequences of expressed sequence tags (EST). Nucleotide sequences of 192 clones from the partial genomic library yielded 147 u...

  4. Characterization of microbial associations with methanotrophic archaea and sulfate-reducing bacteria through statistical comparison of nested Magneto-FISH enrichments

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Trembath-Reichert, Elizabeth; Case, David H.; Orphan, Victoria J.

    Methane seep systems along continental margins host diverse and dynamic microbial assemblages, sustained in large part through the microbially mediated process of sulfate-coupled Anaerobic Oxidation of Methane (AOM). This methanotrophic metabolism has been linked to consortia of anaerobic methane-oxidizing archaea (ANME) and sulfate-reducing bacteria (SRB). These two groups are the focus of numerous studies; however, less is known about the wide diversity of other seep associated microorganisms. We selected a hierarchical set of FISH probes targeting a range ofDeltaproteobacteriadiversity. Using the Magneto-FISH enrichment technique, we then magnetically captured CARD-FISH hybridized cells and their physically associated microorganisms from a methane seepmore » sediment incubation. DNA from nested Magneto-FISH experiments was analyzed using Illumina tag 16S rRNA gene sequencing (iTag). Enrichment success and potential bias with iTag was evaluated in the context of full-length 16S rRNA gene clone libraries, CARD-FISH, functional gene clone libraries, and iTag mock communities. We determined commonly used Earth Microbiome Project (EMP) iTAG primers introduced bias in some common methane seep microbial taxa that reduced the ability to directly compare OTU relative abundances within a sample, but comparison of relative abundances between samples (in nearly all cases) and whole community-based analyses were robust. The iTag dataset was subjected to statistical co-occurrence measures of the most abundant OTUs to determine which taxa in this dataset were most correlated across all samples. In addition, many non-canonical microbial partnerships were statistically significant in our co-occurrence network analysis, most of which were not recovered with conventional clone library sequencing, demonstrating the utility of combining Magneto-FISH and iTag sequencing methods for hypothesis generation of associations within complex microbial communities. Network analysis pointed to many co-occurrences containing putatively heterotrophic, candidate phyla such as OD1, Atribacteria, MBG-B, and Hyd24-12 and the potential for complex sulfur cycling involving Epsilon-, Delta-, and Gammaproteobacteria in methane seep ecosystems.« less

  5. Characterization of microbial associations with methanotrophic archaea and sulfate-reducing bacteria through statistical comparison of nested Magneto-FISH enrichments

    DOE PAGES

    Trembath-Reichert, Elizabeth; Case, David H.; Orphan, Victoria J.

    2016-04-18

    Methane seep systems along continental margins host diverse and dynamic microbial assemblages, sustained in large part through the microbially mediated process of sulfate-coupled Anaerobic Oxidation of Methane (AOM). This methanotrophic metabolism has been linked to consortia of anaerobic methane-oxidizing archaea (ANME) and sulfate-reducing bacteria (SRB). These two groups are the focus of numerous studies; however, less is known about the wide diversity of other seep associated microorganisms. We selected a hierarchical set of FISH probes targeting a range ofDeltaproteobacteriadiversity. Using the Magneto-FISH enrichment technique, we then magnetically captured CARD-FISH hybridized cells and their physically associated microorganisms from a methane seepmore » sediment incubation. DNA from nested Magneto-FISH experiments was analyzed using Illumina tag 16S rRNA gene sequencing (iTag). Enrichment success and potential bias with iTag was evaluated in the context of full-length 16S rRNA gene clone libraries, CARD-FISH, functional gene clone libraries, and iTag mock communities. We determined commonly used Earth Microbiome Project (EMP) iTAG primers introduced bias in some common methane seep microbial taxa that reduced the ability to directly compare OTU relative abundances within a sample, but comparison of relative abundances between samples (in nearly all cases) and whole community-based analyses were robust. The iTag dataset was subjected to statistical co-occurrence measures of the most abundant OTUs to determine which taxa in this dataset were most correlated across all samples. In addition, many non-canonical microbial partnerships were statistically significant in our co-occurrence network analysis, most of which were not recovered with conventional clone library sequencing, demonstrating the utility of combining Magneto-FISH and iTag sequencing methods for hypothesis generation of associations within complex microbial communities. Network analysis pointed to many co-occurrences containing putatively heterotrophic, candidate phyla such as OD1, Atribacteria, MBG-B, and Hyd24-12 and the potential for complex sulfur cycling involving Epsilon-, Delta-, and Gammaproteobacteria in methane seep ecosystems.« less

  6. Characterization of microbial associations with methanotrophic archaea and sulfate-reducing bacteria through statistical comparison of nested Magneto-FISH enrichments.

    PubMed

    Trembath-Reichert, Elizabeth; Case, David H; Orphan, Victoria J

    2016-01-01

    Methane seep systems along continental margins host diverse and dynamic microbial assemblages, sustained in large part through the microbially mediated process of sulfate-coupled Anaerobic Oxidation of Methane (AOM). This methanotrophic metabolism has been linked to consortia of anaerobic methane-oxidizing archaea (ANME) and sulfate-reducing bacteria (SRB). These two groups are the focus of numerous studies; however, less is known about the wide diversity of other seep associated microorganisms. We selected a hierarchical set of FISH probes targeting a range of Deltaproteobacteria diversity. Using the Magneto-FISH enrichment technique, we then magnetically captured CARD-FISH hybridized cells and their physically associated microorganisms from a methane seep sediment incubation. DNA from nested Magneto-FISH experiments was analyzed using Illumina tag 16S rRNA gene sequencing (iTag). Enrichment success and potential bias with iTag was evaluated in the context of full-length 16S rRNA gene clone libraries, CARD-FISH, functional gene clone libraries, and iTag mock communities. We determined commonly used Earth Microbiome Project (EMP) iTAG primers introduced bias in some common methane seep microbial taxa that reduced the ability to directly compare OTU relative abundances within a sample, but comparison of relative abundances between samples (in nearly all cases) and whole community-based analyses were robust. The iTag dataset was subjected to statistical co-occurrence measures of the most abundant OTUs to determine which taxa in this dataset were most correlated across all samples. Many non-canonical microbial partnerships were statistically significant in our co-occurrence network analysis, most of which were not recovered with conventional clone library sequencing, demonstrating the utility of combining Magneto-FISH and iTag sequencing methods for hypothesis generation of associations within complex microbial communities. Network analysis pointed to many co-occurrences containing putatively heterotrophic, candidate phyla such as OD1, Atribacteria, MBG-B, and Hyd24-12 and the potential for complex sulfur cycling involving Epsilon-, Delta-, and Gammaproteobacteria in methane seep ecosystems.

  7. Differential gene expression in the siphonophore Nanomia bijuga (Cnidaria) assessed with multiple next-generation sequencing workflows.

    PubMed

    Siebert, Stefan; Robinson, Mark D; Tintori, Sophia C; Goetz, Freya; Helm, Rebecca R; Smith, Stephen A; Shaner, Nathan; Haddock, Steven H D; Dunn, Casey W

    2011-01-01

    We investigated differential gene expression between functionally specialized feeding polyps and swimming medusae in the siphonophore Nanomia bijuga (Cnidaria) with a hybrid long-read/short-read sequencing strategy. We assembled a set of partial gene reference sequences from long-read data (Roche 454), and generated short-read sequences from replicated tissue samples that were mapped to the references to quantify expression. We collected and compared expression data with three short-read expression workflows that differ in sample preparation, sequencing technology, and mapping tools. These workflows were Illumina mRNA-Seq, which generates sequence reads from random locations along each transcript, and two tag-based approaches, SOLiD SAGE and Helicos DGE, which generate reads from particular tag sites. Differences in expression results across workflows were mostly due to the differential impact of missing data in the partial reference sequences. When all 454-derived gene reference sequences were considered, Illumina mRNA-Seq detected more than twice as many differentially expressed (DE) reference sequences as the tag-based workflows. This discrepancy was largely due to missing tag sites in the partial reference that led to false negatives in the tag-based workflows. When only the subset of reference sequences that unambiguously have tag sites was considered, we found broad congruence across workflows, and they all identified a similar set of DE sequences. Our results are promising in several regards for gene expression studies in non-model organisms. First, we demonstrate that a hybrid long-read/short-read sequencing strategy is an effective way to collect gene expression data when an annotated genome sequence is not available. Second, our replicated sampling indicates that expression profiles are highly consistent across field-collected animals in this case. Third, the impacts of partial reference sequences on the ability to detect DE can be mitigated through workflow choice and deeper reference sequencing.

  8. Differential Gene Expression in the Siphonophore Nanomia bijuga (Cnidaria) Assessed with Multiple Next-Generation Sequencing Workflows

    PubMed Central

    Siebert, Stefan; Robinson, Mark D.; Tintori, Sophia C.; Goetz, Freya; Helm, Rebecca R.; Smith, Stephen A.; Shaner, Nathan; Haddock, Steven H. D.; Dunn, Casey W.

    2011-01-01

    We investigated differential gene expression between functionally specialized feeding polyps and swimming medusae in the siphonophore Nanomia bijuga (Cnidaria) with a hybrid long-read/short-read sequencing strategy. We assembled a set of partial gene reference sequences from long-read data (Roche 454), and generated short-read sequences from replicated tissue samples that were mapped to the references to quantify expression. We collected and compared expression data with three short-read expression workflows that differ in sample preparation, sequencing technology, and mapping tools. These workflows were Illumina mRNA-Seq, which generates sequence reads from random locations along each transcript, and two tag-based approaches, SOLiD SAGE and Helicos DGE, which generate reads from particular tag sites. Differences in expression results across workflows were mostly due to the differential impact of missing data in the partial reference sequences. When all 454-derived gene reference sequences were considered, Illumina mRNA-Seq detected more than twice as many differentially expressed (DE) reference sequences as the tag-based workflows. This discrepancy was largely due to missing tag sites in the partial reference that led to false negatives in the tag-based workflows. When only the subset of reference sequences that unambiguously have tag sites was considered, we found broad congruence across workflows, and they all identified a similar set of DE sequences. Our results are promising in several regards for gene expression studies in non-model organisms. First, we demonstrate that a hybrid long-read/short-read sequencing strategy is an effective way to collect gene expression data when an annotated genome sequence is not available. Second, our replicated sampling indicates that expression profiles are highly consistent across field-collected animals in this case. Third, the impacts of partial reference sequences on the ability to detect DE can be mitigated through workflow choice and deeper reference sequencing. PMID:21829563

  9. High-resolution melt analysis to identify and map sequence-tagged site anchor points onto linkage maps: a white lupin (Lupinus albus) map as an exemplar.

    PubMed

    Croxford, Adam E; Rogers, Tom; Caligari, Peter D S; Wilkinson, Michael J

    2008-01-01

    * The provision of sequence-tagged site (STS) anchor points allows meaningful comparisons between mapping studies but can be a time-consuming process for nonmodel species or orphan crops. * Here, the first use of high-resolution melt analysis (HRM) to generate STS markers for use in linkage mapping is described. This strategy is rapid and low-cost, and circumvents the need for labelled primers or amplicon fractionation. * Using white lupin (Lupinus albus, x = 25) as a case study, HRM analysis was applied to identify 91 polymorphic markers from expressed sequence tag (EST)-derived and genomic libraries. Of these, 77 generated STS anchor points in the first fully resolved linkage map of the species. The map also included 230 amplified fragment length polymorphisms (AFLP) loci, spanned 1916 cM (84.2% coverage) and divided into the expected 25 linkage groups. * Quantitative trait loci (QTL) analyses performed on the population revealed genomic regions associated with several traits, including the agronomically important time to flowering (tf), alkaloid synthesis and stem height (Ph). Use of HRM-STS markers also allowed us to make direct comparisons between our map and that of the related crop, Lupinus angustifolius, based on the conversion of RFLP, microsatellite and single nucleotide polymorphism (SNP) markers into HRM markers.

  10. Analysis of expressed sequence tags from Prunus mume flower and fruit and development of simple sequence repeat markers

    PubMed Central

    2010-01-01

    Background Expressed Sequence Tag (EST) has been a cost-effective tool in molecular biology and represents an abundant valuable resource for genome annotation, gene expression, and comparative genomics in plants. Results In this study, we constructed a cDNA library of Prunus mume flower and fruit, sequenced 10,123 clones of the library, and obtained 8,656 expressed sequence tag (EST) sequences with high quality. The ESTs were assembled into 4,473 unigenes composed of 1,492 contigs and 2,981 singletons and that have been deposited in NCBI (accession IDs: GW868575 - GW873047), among which 1,294 unique ESTs were with known or putative functions. Furthermore, we found 1,233 putative simple sequence repeats (SSRs) in the P. mume unigene dataset. We randomly tested 42 pairs of PCR primers flanking potential SSRs, and 14 pairs were identified as true-to-type SSR loci and could amplify polymorphic bands from 20 individual plants of P. mume. We further used the 14 EST-SSR primer pairs to test the transferability on peach and plum. The result showed that nearly 89% of the primer pairs produced target PCR bands in the two species. A high level of marker polymorphism was observed in the plum species (65%) and low in the peach (46%), and the clustering analysis of the three species indicated that these SSR markers were useful in the evaluation of genetic relationships and diversity between and within the Prunus species. Conclusions We have constructed the first cDNA library of P. mume flower and fruit, and our data provide sets of molecular biology resources for P. mume and other Prunus species. These resources will be useful for further study such as genome annotation, new gene discovery, gene functional analysis, molecular breeding, evolution and comparative genomics between Prunus species. PMID:20626882

  11. Analysis of beta-carotene hydroxylase gene cDNA isolated from the American oil-palm (Elaeis oleifera) mesocarp tissue cDNA library

    PubMed Central

    Bhore, Subhash J; Kassim, Amelia; Loh, Chye Ying; Shah, Farida H

    2010-01-01

    It is well known that the nutritional quality of the American oil-palm (Elaeis oleifera) mesocarp oil is superior to that of African oil-palm (Elaeis guineensis Jacq. Tenera) mesocarp oil. Therefore, it is of important to identify the genetic features for its superior value. This could be achieved through the genome sequencing of the oil-palm. However, the genome sequence is not available in the public domain due to commercial secrecy. Hence, we constructed a cDNA library and generated expressed sequence tags (3,205) from the mesocarp tissue of the American oil-palm. We continued to annotate each of these cDNAs after submitting to GenBank/DDBJ/EMBL. A rough analysis turned our attention to the beta-carotene hydroxylase (Chyb) enzyme encoding cDNA. Then, we completed the full sequencing of cDNA clone for its both strands using M13 forward and reverse primers. The full nucleotide and protein sequence was further analyzed and annotated using various Bioinformatics tools. The analysis results showed the presence of fatty acid hydroxylase superfamily domain in the protein sequence. The multiple sequence alignment of selected Chyb amino acid sequences from other plant species and algal members with E. oleifera Chyb using ClustalW and its phylogenetic analysis suggest that Chyb from monocotyledonous plant species, Lilium hubrid, Crocus sativus and Zea mays are the most evolutionary related with E. oleifera Chyb. This study reports the annotation of E. oleifera Chyb. Abbreviations ESTs - expressed sequence tags, EoChyb - Elaeis oleifera beta-carotene hydroxylase, MC - main cluster PMID:21364789

  12. High-resolution mapping of the 11q13 amplicon and identification of a gene, TAOS1, that is amplified and overexpressed in oral cancer cells

    PubMed Central

    Huang, Xin; Gollin, Susanne M.; Raja, Siva; Godfrey, Tony E.

    2002-01-01

    Amplification of chromosomal band 11q13 is a common event in human cancer. It has been reported in about 45% of head and neck carcinomas and in other cancers including esophageal, breast, liver, lung, and bladder cancer. To understand the mechanism of 11q13 amplification and to identify the potential oncogene(s) driving it, we have fine-mapped the structure of the amplicon in oral squamous cell carcinoma cell lines and localized the proximal and distal breakpoints. A 5-Mb physical map of the region has been prepared from which sequence is available. We quantified copy number of sequence-tagged site markers at 42–550 kb intervals along the length of the amplicon and defined the amplicon core and breakpoints by using TaqMan-based quantitative microsatellite analysis. The core of the amplicon maps to a 1.5-Mb region. The proximal breakpoint localizes to two intervals between sequence-tagged site markers, 550 kb and 160 kb in size, and the distal breakpoint maps to a 250 kb interval. The cyclin D1 gene maps to the amplicon core, as do two new expressed sequence tag clusters. We have analyzed one of these expressed sequence tag clusters and now report that it contains a previously uncharacterized gene, TAOS1 (tumor amplified and overexpressed sequence 1), which is both amplified and overexpressed in oral cancer cells. The data suggest that TAOS1 may be an amplification-dependent candidate oncogene with a role in the development and/or progression of human tumors, including oral squamous cell carcinomas. The approach described here should be useful for characterizing amplified genomic regions in a wide variety of tumors. PMID:12172009

  13. A normalization strategy for comparing tag count data

    PubMed Central

    2012-01-01

    Background High-throughput sequencing, such as ribonucleic acid sequencing (RNA-seq) and chromatin immunoprecipitation sequencing (ChIP-seq) analyses, enables various features of organisms to be compared through tag counts. Recent studies have demonstrated that the normalization step for RNA-seq data is critical for a more accurate subsequent analysis of differential gene expression. Development of a more robust normalization method is desirable for identifying the true difference in tag count data. Results We describe a strategy for normalizing tag count data, focusing on RNA-seq. The key concept is to remove data assigned as potential differentially expressed genes (DEGs) before calculating the normalization factor. Several R packages for identifying DEGs are currently available, and each package uses its own normalization method and gene ranking algorithm. We compared a total of eight package combinations: four R packages (edgeR, DESeq, baySeq, and NBPSeq) with their default normalization settings and with our normalization strategy. Many synthetic datasets under various scenarios were evaluated on the basis of the area under the curve (AUC) as a measure for both sensitivity and specificity. We found that packages using our strategy in the data normalization step overall performed well. This result was also observed for a real experimental dataset. Conclusion Our results showed that the elimination of potential DEGs is essential for more accurate normalization of RNA-seq data. The concept of this normalization strategy can widely be applied to other types of tag count data and to microarray data. PMID:22475125

  14. Application of an E. coli signal sequence as a versatile inclusion body tag.

    PubMed

    Jong, Wouter S P; Vikström, David; Houben, Diane; van den Berg van Saparoea, H Bart; de Gier, Jan-Willem; Luirink, Joen

    2017-03-21

    Heterologous protein production in Escherichia coli often suffers from bottlenecks such as proteolytic degradation, complex purification procedures and toxicity towards the expression host. Production of proteins in an insoluble form in inclusion bodies (IBs) can alleviate these problems. Unfortunately, the propensity of heterologous proteins to form IBs is variable and difficult to predict. Hence, fusing the target protein to an aggregation prone polypeptide or IB-tag is a useful strategy to produce difficult-to-express proteins in an insoluble form. When screening for signal sequences that mediate optimal targeting of heterologous proteins to the periplasmic space of E. coli, we observed that fusion to the 39 amino acid signal sequence of E. coli TorA (ssTorA) did not promote targeting but rather directed high-level expression of the human proteins hEGF, Pla2 and IL-3 in IBs. Further analysis revealed that ssTorA even mediated IB formation of the highly soluble endogenous E. coli proteins TrxA and MBP. The ssTorA also induced aggregation when fused to the C-terminus of target proteins and appeared functional as IB-tag in E. coli K-12 as well as B strains. An additive effect on IB-formation was observed upon fusion of multiple ssTorA sequences in tandem, provoking almost complete aggregation of TrxA and MBP. The ssTorA-moiety was successfully used to produce the intrinsically unstable hEGF and the toxic fusion partner SymE, demonstrating its applicability as an IB-tag for difficult-to-express and toxic proteins. We present proof-of-concept for the use of ssTorA as a small, versatile tag for robust E. coli-based expression of heterologous proteins in IBs.

  15. Rapid in silico cloning of genes using expressed sequence tags (ESTs).

    PubMed

    Gill, R W; Sanseau, P

    2000-01-01

    Expressed sequence tags (ESTs) are short single-pass DNA sequences obtained from either end of cDNA clones. These ESTs are derived from a vast number of cDNA libraries obtained from different species. Human ESTs are the bulk of the data and have been widely used to identify new members of gene families, as markers on the human chromosomes, to discover polymorphism sites and to compare expression patterns in different tissues or pathologies states. Information strategies have been devised to query EST databases. Since most of the analysis is performed with a computer, the term "in silico" strategy has been coined. In this chapter we will review the current status of EST databases, the pros and cons of EST-type data and describe possible strategies to retrieve meaningful information.

  16. Inventory of high-abundance mRNAs in skeletal muscle of normal men.

    PubMed

    Welle, S; Bhatt, K; Thornton, C A

    1999-05-01

    G42875rial analysis of gene expression (SAGE) method was used to generate a catalog of 53,875 short (14 base) expressed sequence tags from polyadenylated RNA obtained from vastus lateralis muscle of healthy young men. Over 12,000 unique tags were detected. The frequency of occurrence of each tag reflects the relative abundance of the corresponding mRNA. The mRNA species that were detected 10 or more times, each comprising >/=0.02% of the mRNA population, accounted for 64% of the mRNA mass but <10% of the total number of mRNA species detected. Almost all of the abundant tags matched mRNA or EST sequences cataloged in GenBank. Mitochondrial transcripts accounted for approximately 20% of the polyadenylated RNA. Transcripts encoding proteins of the myofibrils were the most abundant nuclear-encoded mRNAs. Transcripts encoding ribosomal proteins, and those encoding proteins involved in energy metabolism, also were very abundant. The database can be used as a reference for investigations of alterations in gene expression associated with conditions that influence muscle function, such as muscular dystrophies, aging, and exercise.

  17. New features of triacylglycerol biosynthetic pathways of peanut seeds in early developmental stages.

    PubMed

    Yu, Mingli; Liu, Fengzhen; Zhu, Weiwei; Sun, Meihong; Liu, Jiang; Li, Xinzheng

    2015-11-01

    The peanut (Arachis hypogaea L.) is one of the three most important oil crops in the world due to its high average oil content (50 %). To reveal the biosynthetic pathways of seed oil in the early developmental stages of peanut pods with the goal of improving the oil quality, we presented a method combining deep sequencing analysis of the peanut pod transcriptome and quantitative real-time PCR (RT-PCR) verification of seed oil-related genes. From the sequencing data, approximately 1500 lipid metabolism-associated Unigenes were identified. The RT-PCR results quantified the different expression patterns of these triacylglycerol (TAG) synthesis-related genes in the early developmental stages of peanut pods. Based on these results and analysis, we proposed a novel construct of the metabolic pathways involved in the biosynthesis of TAG, including the Kennedy pathway, acyl-CoA-independent pathway and proposed monoacylglycerol pathway. It showed that the biosynthetic pathways of TAG in the early developmental stages of peanut pods were much more complicated than a simple, unidirectional, linear pathway.

  18. An Ambystoma mexicanum EST sequencing project: analysis of 17,352 expressed sequence tags from embryonic and regenerating blastema cDNA libraries

    PubMed Central

    Habermann, Bianca; Bebin, Anne-Gaelle; Herklotz, Stephan; Volkmer, Michael; Eckelt, Kay; Pehlke, Kerstin; Epperlein, Hans Henning; Schackert, Hans Konrad; Wiebe, Glenis; Tanaka, Elly M

    2004-01-01

    Background The ambystomatid salamander, Ambystoma mexicanum (axolotl), is an important model organism in evolutionary and regeneration research but relatively little sequence information has so far been available. This is a major limitation for molecular studies on caudate development, regeneration and evolution. To address this lack of sequence information we have generated an expressed sequence tag (EST) database for A. mexicanum. Results Two cDNA libraries, one made from stage 18-22 embryos and the other from day-6 regenerating tail blastemas, generated 17,352 sequences. From the sequenced ESTs, 6,377 contigs were assembled that probably represent 25% of the expressed genes in this organism. Sequence comparison revealed significant homology to entries in the NCBI non-redundant database. Further examination of this gene set revealed the presence of genes involved in important cell and developmental processes, including cell proliferation, cell differentiation and cell-cell communication. On the basis of these data, we have performed phylogenetic analysis of key cell-cycle regulators. Interestingly, while cell-cycle proteins such as the cyclin B family display expected evolutionary relationships, the cyclin-dependent kinase inhibitor 1 gene family shows an unusual evolutionary behavior among the amphibians. Conclusions Our analysis reveals the importance of a comprehensive sequence set from a representative of the Caudata and illustrates that the EST sequence database is a rich source of molecular, developmental and regeneration studies. To aid in data mining, the ESTs have been organized into an easily searchable database that is freely available online. PMID:15345051

  19. Defining the transcriptome assembly and its use for genome dynamics and transcriptome profiling studies in pigeonpea (Cajanus cajan L.).

    PubMed

    Dubey, Anuja; Farmer, Andrew; Schlueter, Jessica; Cannon, Steven B; Abernathy, Brian; Tuteja, Reetu; Woodward, Jimmy; Shah, Trushar; Mulasmanovic, Benjamin; Kudapa, Himabindu; Raju, Nikku L; Gothalwal, Ragini; Pande, Suresh; Xiao, Yongli; Town, Chris D; Singh, Nagendra K; May, Gregory D; Jackson, Scott; Varshney, Rajeev K

    2011-06-01

    This study reports generation of large-scale genomic resources for pigeonpea, a so-called 'orphan crop species' of the semi-arid tropic regions. FLX/454 sequencing carried out on a normalized cDNA pool prepared from 31 tissues produced 494 353 short transcript reads (STRs). Cluster analysis of these STRs, together with 10 817 Sanger ESTs, resulted in a pigeonpea trancriptome assembly (CcTA) comprising of 127 754 tentative unique sequences (TUSs). Functional analysis of these TUSs highlights several active pathways and processes in the sampled tissues. Comparison of the CcTA with the soybean genome showed similarity to 10 857 and 16 367 soybean gene models (depending on alignment methods). Additionally, Illumina 1G sequencing was performed on Fusarium wilt (FW)- and sterility mosaic disease (SMD)-challenged root tissues of 10 resistant and susceptible genotypes. More than 160 million sequence tags were used to identify FW- and SMD-responsive genes. Sequence analysis of CcTA and the Illumina tags identified a large new set of markers for use in genetics and breeding, including 8137 simple sequence repeats, 12 141 single-nucleotide polymorphisms and 5845 intron-spanning regions. Genomic resources developed in this study should be useful for basic and applied research, not only for pigeonpea improvement but also for other related, agronomically important legumes.

  20. Tab2, a novel recombinant polypeptide tag offering sensitive and specific protein detection and reliable affinity purification.

    PubMed

    Crusius, Kerstin; Finster, Silke; McClary, John; Xia, Wei; Larsen, Brent; Schneider, Douglas; Lu, Hong-Tao; Biancalana, Sara; Xuan, Jian-Ai; Newton, Alicia; Allen, Debbie; Bringmann, Peter; Cobb, Ronald R

    2006-10-01

    The detection and purification of proteins are often time-consuming and frequently involve complicated protocols. The addition of a peptide tag to recombinant proteins can make this process more efficient. Many of the commonly used tags, such as Flagtrade mark, Myc, HA and V5 are recognized by specific monoclonal antibodies and therefore, allow immunoaffinity-based purification. Enhancing the current scope of flexibility in using diverse peptide tags, we report here the development of a novel, short polypeptide tag (Tab2) for detection and purification of recombinant proteins. The Tab2 epitope corresponds to the NH2-terminal seven amino acid residues of human TGFalpha. A monoclonal anti-Tab2 antibody was raised and characterized. To investigate the potential of this peptide sequence as a novel tag for recombinant proteins, we expressed several different recombinant proteins containing this tag in E. coli, baculovirus, and mammalian cells. The data presented demonstrates the Tab2 tag-anti-Tab2 antibody combination is a reliable tool enabling specific Western blot detection, FACS analysis, and immunoprecipitation as well as non-denaturing protein affinity purification.

  1. Analysis and functional annotation of expressed sequence tags from the fall armyworm Spodoptera frugiperda

    PubMed Central

    Deng, Youping; Dong, Yinghua; Thodima, Venkata; Clem, Rollie J; Passarelli, A Lorena

    2006-01-01

    Background Little is known about the genome sequences of lepidopteran insects, although this group of insects has been studied extensively in the fields of endocrinology, development, immunity, and pathogen-host interactions. In addition, cell lines derived from Spodoptera frugiperda and other lepidopteran insects are routinely used for baculovirus foreign gene expression. This study reports the results of an expressed sequence tag (EST) sequencing project in cells from the lepidopteran insect S. frugiperda, the fall armyworm. Results We have constructed an EST database using two cDNA libraries from the S. frugiperda-derived cell line, SF-21. The database consists of 2,367 ESTs which were assembled into 244 contigs and 951 singlets for a total of 1,195 unique sequences. Conclusion S. frugiperda is an agriculturally important pest insect and genomic information will be instrumental for establishing initial transcriptional profiling and gene function studies, and for obtaining information about genes manipulated during infections by insect pathogens such as baculoviruses. PMID:17052344

  2. Characteristics of the Lotus japonicus gene repertoire deduced from large-scale expressed sequence tag (EST) analysis.

    PubMed

    Asamizu, Erika; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi

    2004-02-01

    To perform a comprehensive analysis of genes expressed in a model legume, Lotus japonicus, a total of 74472 3'-end expressed sequence tags (EST) were generated from cDNA libraries produced from six different organs. Clustering of sequences was performed with an identity criterion of 95% for 50 bases, and a total of 20457 non-redundant sequences, 8503 contigs and 11954 singletons were generated. EST sequence coverage was analyzed by using the annotated L. japonicus genomic sequence and 1093 of the 1889 predicted protein-encoding genes (57.9%) were hit by the EST sequence(s). Gene content was compared to several plant species. Among the 8503 contigs, 471 were identified as sequences conserved only in leguminous species and these included several disease resistance-related genes. This suggested that in legumes, these genes may have evolved specifically to resist pathogen attack. The rate of gene sequence divergence was assessed by comparing similarity level and functional category based on the Gene Ontology (GO) annotation of Arabidopsis genes. This revealed that genes encoding ribosomal proteins, as well as those related to translation, photosynthesis, and cellular structure were more abundantly represented in the highly conserved class, and that genes encoding transcription factors and receptor protein kinases were abundantly represented in the less conserved class. To make the sequence information and the cDNA clones available to the research community, a Web database with useful services was created at http://www.kazusa.or.jp/en/plant/lotus/EST/.

  3. Rapid Covalent Fluorescence Labeling of Membrane Proteins on Live Cells via Coiled-Coil Templated Acyl Transfer.

    PubMed

    Reinhardt, Ulrike; Lotze, Jonathan; Mörl, Karin; Beck-Sickinger, Annette G; Seitz, Oliver

    2015-10-21

    Fluorescently labeled proteins enable the microscopic imaging of protein localization and function in live cells. In labeling reactions targeted against specific tag sequences, the size of the fluorophore-tag is of major concern. The tag should be small to prevent interference with protein function. Furthermore, rapid and covalent labeling methods are desired to enable the analysis of fast biological processes. Herein, we describe the development of a method in which the formation of a parallel coiled coil triggers the transfer of a fluorescence dye from a thioester-linked coil peptide conjugate onto a cysteine-modified coil peptide. This labeling method requires only small tag sequences (max 23 aa) and occurs with high tag specificity. We show that size matching of the coil peptides and a suitable thioester reactivity allow the acyl transfer reaction to proceed within minutes (rather than hours). We demonstrate the versatility of this method by applying it to the labeling of different G-protein coupled membrane receptors including the human neuropeptide Y receptors 1, 2, 4, 5, the neuropeptide FF receptors 1 and 2, and the dopamine receptor 1. The labeled receptors are fully functional and able to bind the respective ligand with high affinity. Activity is not impaired as demonstrated by activation, internalization, and recycling experiments.

  4. Cloning, analysis and functional annotation of expressed sequence tags from the Earthworm Eisenia fetida

    PubMed Central

    Pirooznia, Mehdi; Gong, Ping; Guan, Xin; Inouye, Laura S; Yang, Kuan; Perkins, Edward J; Deng, Youping

    2007-01-01

    Background Eisenia fetida, commonly known as red wiggler or compost worm, belongs to the Lumbricidae family of the Annelida phylum. Little is known about its genome sequence although it has been extensively used as a test organism in terrestrial ecotoxicology. In order to understand its gene expression response to environmental contaminants, we cloned 4032 cDNAs or expressed sequence tags (ESTs) from two E. fetida libraries enriched with genes responsive to ten ordnance related compounds using suppressive subtractive hybridization-PCR. Results A total of 3144 good quality ESTs (GenBank dbEST accession number EH669363–EH672369 and EL515444–EL515580) were obtained from the raw clone sequences after cleaning. Clustering analysis yielded 2231 unique sequences including 448 contigs (from 1361 ESTs) and 1783 singletons. Comparative genomic analysis showed that 743 or 33% of the unique sequences shared high similarity with existing genes in the GenBank nr database. Provisional function annotation assigned 830 Gene Ontology terms to 517 unique sequences based on their homology with the annotated genomes of four model organisms Drosophila melanogaster, Mus musculus, Saccharomyces cerevisiae, and Caenorhabditis elegans. Seven percent of the unique sequences were further mapped to 99 Kyoto Encyclopedia of Genes and Genomes pathways based on their matching Enzyme Commission numbers. All the information is stored and retrievable at a highly performed, web-based and user-friendly relational database called EST model database or ESTMD version 2. Conclusion The ESTMD containing the sequence and annotation information of 4032 E. fetida ESTs is publicly accessible at . PMID:18047730

  5. High Throughput Biological Analysis Using Multi-bit Magnetic Digital Planar Tags

    NASA Astrophysics Data System (ADS)

    Hong, B.; Jeong, J.-R.; Llandro, J.; Hayward, T. J.; Ionescu, A.; Trypiniotis, T.; Mitrelias, T.; Kopper, K. P.; Steinmuller, S. J.; Bland, J. A. C.

    2008-06-01

    We report a new magnetic labelling technology for high-throughput biomolecular identification and DNA sequencing. Planar multi-bit magnetic tags have been designed and fabricated, which comprise a magnetic barcode formed by an ensemble of micron-sized thin film Ni80Fe20 bars encapsulated in SU8. We show that by using a globally applied magnetic field and magneto-optical Kerr microscopy the magnetic elements in the multi-bit magnetic tags can be addressed individually and encoded/decoded remotely. The critical steps needed to show the feasibility of this technology are demonstrated, including fabrication, flow transport, remote writing and reading, and successful functionalization of the tags as verified by fluorescence detection. This approach is ideal for encoding information on tags in microfluidic flow or suspension, for such applications as labelling of chemical precursors during drug synthesis and combinatorial library-based high-throughput multiplexed bioassays.

  6. Genome-wide characterization and selection of expressed sequence tag simple sequence repeat primers for optimized marker distribution and reliability in peach

    USDA-ARS?s Scientific Manuscript database

    Expressed sequence tag (EST) simple sequence repeats (SSRs) in Prunus were mined, and flanking primers designed and used for genome-wide characterization and selection of primers to optimize marker distribution and reliability. A total of 12,618 contigs were assembled from 84,727 ESTs, along with 34...

  7. Transcript Profile of the Response of Two Soybean Genotypes to Potassium Deficiency

    PubMed Central

    Hao, QingNan; Sha, AiHua; Shan, ZhiHui; Chen, LiMiao; Zhou, Rong; Zhi, HaiJian; Zhou, XinAn

    2012-01-01

    The macronutrient potassium (K) is essential to plant growth and development. Crop yield potential is often affected by lack of soluble K. The molecular regulation mechanism of physiological and biochemical responses to K starvation in soybean roots and shoots is not fully understood. In the present study, two soybean varieties were subjected to low-K stress conditions: a low-K-tolerant variety (You06-71) and a low-K-sensitive variety (HengChun04-11). Eight libraries were generated for analysis: 2 genotypes ×2 tissues (roots and shoots) ×2 time periods [short term (0.5 to 12 h) and long term (3 to 12 d)]. RNA derived from the roots and shoots of these two varieties across two periods (short term and long term) were sequenced and the transcriptomes were compared using high-throughput tag-sequencing. To this end, a large number of clean tags (tags used for analysis after removal of dirty tags) corresponding to distinct tags (all types of clean tags) were identified in eight libraries (L1, You06-71-root short term; L2, HengChun04-11-root short term; L3, You06-71-shoot short term; L4, HengChun04-11-shoot short term; L5, You06-71-root long term; L6, HengChun04-11-root long term; L7, You06-71-shoot long term; L8, HengChun04-11-shoot long term). All clean tags were mapped to the available soybean (Glycine max) transcript database (http://www.soybase.org). Many genes showed substantial differences in expression across the libraries. In total, 5,440 transcripts involved in 118 KEGG pathways were either up- or down-regulated. Fifteen genes were randomly selected and their expression levels were confirmed using quantitative RT-PCR. Our results provide preliminary information on the molecular mechanism of potassium absorption and transport under low-K stress conditions in different soybean tissues. PMID:22792192

  8. Development of an Expressed Sequence Tag (EST) Resource for Wheat (Triticum aestivum L.)

    PubMed Central

    Lazo, G. R.; Chao, S.; Hummel, D. D.; Edwards, H.; Crossman, C. C.; Lui, N.; Matthews, D. E.; Carollo, V. L.; Hane, D. L.; You, F. M.; Butler, G. E.; Miller, R. E.; Close, T. J.; Peng, J. H.; Lapitan, N. L. V.; Gustafson, J. P.; Qi, L. L.; Echalier, B.; Gill, B. S.; Dilbirligi, M.; Randhawa, H. S.; Gill, K. S.; Greene, R. A.; Sorrells, M. E.; Akhunov, E. D.; Dvořák, J.; Linkiewicz, A. M.; Dubcovsky, J.; Hossain, K. G.; Kalavacharla, V.; Kianian, S. F.; Mahmoud, A. A.; Miftahudin; Ma, X.-F.; Conley, E. J.; Anderson, J. A.; Pathan, M. S.; Nguyen, H. T.; McGuire, P. E.; Qualset, C. O.; Anderson, O. D.

    2004-01-01

    This report describes the rationale, approaches, organization, and resource development leading to a large-scale deletion bin map of the hexaploid (2n = 6x = 42) wheat genome (Triticum aestivum L.). Accompanying reports in this issue detail results from chromosome bin-mapping of expressed sequence tags (ESTs) representing genes onto the seven homoeologous chromosome groups and a global analysis of the entire mapped wheat EST data set. Among the resources developed were the first extensive public wheat EST collection (113,220 ESTs). Described are protocols for sequencing, sequence processing, EST nomenclature, and the assembly of ESTs into contigs. These contigs plus singletons (unassembled ESTs) were used for selection of distinct sequence motif unigenes. Selected ESTs were rearrayed, validated by 5′ and 3′ sequencing, and amplified for probing a series of wheat aneuploid and deletion stocks. Images and data for all Southern hybridizations were deposited in databases and were used by the coordinators for each of the seven homoeologous chromosome groups to validate the mapping results. Results from this project have established the foundation for future developments in wheat genomics. PMID:15514037

  9. Primer and platform effects on 16S rRNA tag sequencing

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tremblay, Julien; Singh, Kanwar; Fern, Alison

    Sequencing of 16S rRNA gene tags is a popular method for profiling and comparing microbial communities. The protocols and methods used, however, vary considerably with regard to amplification primers, sequencing primers, sequencing technologies; as well as quality filtering and clustering. How results are affected by these choices, and whether data produced with different protocols can be meaningfully compared, is often unknown. Here we compare results obtained using three different amplification primer sets (targeting V4, V6–V8, and V7–V8) and two sequencing technologies (454 pyrosequencing and Illumina MiSeq) using DNA from a mock community containing a known number of species as wellmore » as complex environmental samples whose PCR-independent profiles were estimated using shotgun sequencing. We find that paired-end MiSeq reads produce higher quality data and enabled the use of more aggressive quality control parameters over 454, resulting in a higher retention rate of high quality reads for downstream data analysis. While primer choice considerably influences quantitative abundance estimations, sequencing platform has relatively minor effects when matched primers are used. In conclusion, beta diversity metrics are surprisingly robust to both primer and sequencing platform biases.« less

  10. Primer and platform effects on 16S rRNA tag sequencing

    DOE PAGES

    Tremblay, Julien; Singh, Kanwar; Fern, Alison; ...

    2015-08-04

    Sequencing of 16S rRNA gene tags is a popular method for profiling and comparing microbial communities. The protocols and methods used, however, vary considerably with regard to amplification primers, sequencing primers, sequencing technologies; as well as quality filtering and clustering. How results are affected by these choices, and whether data produced with different protocols can be meaningfully compared, is often unknown. Here we compare results obtained using three different amplification primer sets (targeting V4, V6–V8, and V7–V8) and two sequencing technologies (454 pyrosequencing and Illumina MiSeq) using DNA from a mock community containing a known number of species as wellmore » as complex environmental samples whose PCR-independent profiles were estimated using shotgun sequencing. We find that paired-end MiSeq reads produce higher quality data and enabled the use of more aggressive quality control parameters over 454, resulting in a higher retention rate of high quality reads for downstream data analysis. While primer choice considerably influences quantitative abundance estimations, sequencing platform has relatively minor effects when matched primers are used. In conclusion, beta diversity metrics are surprisingly robust to both primer and sequencing platform biases.« less

  11. Comparative and Functional Genomics of Rhodococcus opacus PD630 for Biofuels Development

    PubMed Central

    Holder, Jason W.; Ulrich, Jil C.; DeBono, Anthony C.; Godfrey, Paul A.; Desjardins, Christopher A.; Zucker, Jeremy; Zeng, Qiandong; Leach, Alex L. B.; Ghiviriga, Ion; Dancel, Christine; Abeel, Thomas; Gevers, Dirk; Kodira, Chinnappa D.; Desany, Brian; Affourtit, Jason P.; Birren, Bruce W.; Sinskey, Anthony J.

    2011-01-01

    The Actinomycetales bacteria Rhodococcus opacus PD630 and Rhodococcus jostii RHA1 bioconvert a diverse range of organic substrates through lipid biosynthesis into large quantities of energy-rich triacylglycerols (TAGs). To describe the genetic basis of the Rhodococcus oleaginous metabolism, we sequenced and performed comparative analysis of the 9.27 Mb R. opacus PD630 genome. Metabolic-reconstruction assigned 2017 enzymatic reactions to the 8632 R. opacus PD630 genes we identified. Of these, 261 genes were implicated in the R. opacus PD630 TAGs cycle by metabolic reconstruction and gene family analysis. Rhodococcus synthesizes uncommon straight-chain odd-carbon fatty acids in high abundance and stores them as TAGs. We have identified these to be pentadecanoic, heptadecanoic, and cis-heptadecenoic acids. To identify bioconversion pathways, we screened R. opacus PD630, R. jostii RHA1, Ralstonia eutropha H16, and C. glutamicum 13032 for growth on 190 compounds. The results of the catabolic screen, phylogenetic analysis of the TAGs cycle enzymes, and metabolic product characterizations were integrated into a working model of prokaryotic oleaginy. PMID:21931557

  12. Sequencing degraded DNA from non-destructively sampled museum specimens for RAD-tagging and low-coverage shotgun phylogenetics.

    PubMed

    Tin, Mandy Man-Ying; Economo, Evan Philip; Mikheyev, Alexander Sergeyevich

    2014-01-01

    Ancient and archival DNA samples are valuable resources for the study of diverse historical processes. In particular, museum specimens provide access to biotas distant in time and space, and can provide insights into ecological and evolutionary changes over time. However, archival specimens are difficult to handle; they are often fragile and irreplaceable, and typically contain only short segments of denatured DNA. Here we present a set of tools for processing such samples for state-of-the-art genetic analysis. First, we report a protocol for minimally destructive DNA extraction of insect museum specimens, which produced sequenceable DNA from all of the samples assayed. The 11 specimens analyzed had fragmented DNA, rarely exceeding 100 bp in length, and could not be amplified by conventional PCR targeting the mitochondrial cytochrome oxidase I gene. Our approach made these samples amenable to analysis with commonly used next-generation sequencing-based molecular analytic tools, including RAD-tagging and shotgun genome re-sequencing. First, we used museum ant specimens from three species, each with its own reference genome, for RAD-tag mapping. Were able to use the degraded DNA sequences, which were sequenced in full, to identify duplicate reads and filter them prior to base calling. Second, we re-sequenced six Hawaiian Drosophila species, with millions of years of divergence, but with only a single available reference genome. Despite a shallow coverage of 0.37 ± 0.42 per base, we could recover a sufficient number of overlapping SNPs to fully resolve the species tree, which was consistent with earlier karyotypic studies, and previous molecular studies, at least in the regions of the tree that these studies could resolve. Although developed for use with degraded DNA, all of these techniques are readily applicable to more recent tissue, and are suitable for liquid handling automation.

  13. A universal TagModule collection for parallel genetic analysis of microorganisms

    PubMed Central

    Oh, Julia; Fung, Eula; Price, Morgan N.; Dehal, Paramvir S.; Davis, Ronald W.; Giaever, Guri; Nislow, Corey; Arkin, Adam P.; Deutschbauer, Adam

    2010-01-01

    Systems-level analyses of non-model microorganisms are limited by the existence of numerous uncharacterized genes and a corresponding over-reliance on automated computational annotations. One solution to this challenge is to disrupt gene function using DNA tag technology, which has been highly successful in parallelizing reverse genetics in Saccharomyces cerevisiae and has led to discoveries in gene function, genetic interactions and drug mechanism of action. To extend the yeast DNA tag methodology to a wide variety of microorganisms and applications, we have created a universal, sequence-verified TagModule collection. A hallmark of the 4280 TagModules is that they are cloned into a Gateway entry vector, thus facilitating rapid transfer to any compatible genetic system. Here, we describe the application of the TagModules to rapidly generate tagged mutants by transposon mutagenesis in the metal-reducing bacterium Shewanella oneidensis MR-1 and the pathogenic yeast Candida albicans. Our results demonstrate the optimal hybridization properties of the TagModule collection, the flexibility in applying the strategy to diverse microorganisms and the biological insights that can be gained from fitness profiling tagged mutant collections. The publicly available TagModule collection is a platform-independent resource for the functional genomics of a wide range of microbial systems in the post-genome era. PMID:20494978

  14. Differential expression of a novel gene during seed triacylglycerol accumulation in lupin species ( Lupinus angustifolius L. and L. mutabilis L.).

    PubMed

    Francki, Michael G; Whitaker, Peta; Smith, Penelope M; Atkins, Craig A

    2002-11-01

    Seed triacylglycerols (TAGs) are stored as energy reserves and extracted for various end-product uses. In lupins, seed oil content varies from 16% in Lupinus mutabilisto 8% in L. angustifolius. We have shown that TAGs rapidly accumulate during mid-stages of seed development in L. mutabilis compared to the lower seed oil species, L. angustifolius. In this study, we have targeted the key enzymes of the lipid biosynthetic pathway, acetyl-CoA carboxylase (ACCase) and diacylglycerol acyltransferase (DAGAT), to determine factors regulating TAG accumulation between two lupin species. A twofold increase in ACCase activity was observed in L. mutabilis relative to L. angustifolius and correlated with rapid TAG accumulation. No difference in DAGAT activity was detected. We have identified, cloned and partially characterised a novel gene differentially expressed during TAG accumulation between L. angustifolius and L. mutabilis. The gene has some identity to the glucose dehydrogenase family previously described in barley and bacteria and the significance of its expression levels during seed development in relation to TAG accumulation is discussed. DNA sequence analysis of the promoter in both L. angustifolius and L. mutabilis identified putative matrix attachment regions and recognition sequences for transcription binding sites similar to those found in the Adh1 gene from Arabidopsis. The identical promoter regions between species indicate that differential gene expression is controlled by alternative transcription factors, accessibility to binding sites or a combination of both.

  15. hSAGEing: an improved SAGE-based software for identification of human tissue-specific or common tumor markers and suppressors.

    PubMed

    Yang, Cheng-Hong; Chuang, Li-Yeh; Shih, Tsung-Mu; Chang, Hsueh-Wei

    2010-12-17

    SAGE (serial analysis of gene expression) is a powerful method of analyzing gene expression for the entire transcriptome. There are currently many well-developed SAGE tools. However, the cross-comparison of different tissues is seldom addressed, thus limiting the identification of common- and tissue-specific tumor markers. To improve the SAGE mining methods, we propose a novel function for cross-tissue comparison of SAGE data by combining the mathematical set theory and logic with a unique "multi-pool method" that analyzes multiple pools of pair-wise case controls individually. When all the settings are in "inclusion", the common SAGE tag sequences are mined. When one tissue type is in "inclusion" and the other types of tissues are not in "inclusion", the selected tissue-specific SAGE tag sequences are generated. They are displayed in tags-per-million (TPM) and fold values, as well as visually displayed in four kinds of scales in a color gradient pattern. In the fold visualization display, the top scores of the SAGE tag sequences are provided, along with cluster plots. A user-defined matrix file is designed for cross-tissue comparison by selecting libraries from publically available databases or user-defined libraries. The hSAGEing tool provides a combination of friendly cross-tissue analysis and an interface for comparing SAGE libraries for the first time. Some up- or down-regulated genes with tissue-specific or common tumor markers and suppressors are identified computationally. The tool is useful and convenient for in silico cancer transcriptomic studies and is freely available at http://bio.kuas.edu.tw/hSAGEing.

  16. Generation and analysis of expressed sequence tags from the bone marrow of Chinese Sika deer.

    PubMed

    Yao, Baojin; Zhao, Yu; Zhang, Mei; Li, Juan

    2012-03-01

    Sika deer is one of the best-known and highly valued animals of China. Despite its economic, cultural, and biological importance, there has not been a large-scale sequencing project for Sika deer to date. With the ultimate goal of sequencing the complete genome of this organism, we first established a bone marrow cDNA library for Sika deer and generated a total of 2,025 reads. After processing the sequences, 2,017 high-quality expressed sequence tags (ESTs) were obtained. These ESTs were assembled into 1,157 unigenes, including 238 contigs and 919 singletons. Comparative analyses indicated that 888 (76.75%) of the unigenes had significant matches to sequences in the non-redundant protein database, In addition to highly expressed genes, such as stearoyl-CoA desaturase, cytochrome c oxidase, adipocyte-type fatty acid-binding protein, adiponectin and thymosin beta-4, we also obtained vascular endothelial growth factor-A and heparin-binding growth-associated molecule, both of which are of great importance for angiogenesis research. There were 244 (21.09%) unigenes with no significant match to any sequence in current protein or nucleotide databases, and these sequences may represent genes with unknown function in Sika deer. Open reading frame analysis of the sequences was performed using the getorf program. In addition, the sequences were functionally classified using the gene ontology hierarchy, clusters of orthologous groups of proteins and Kyoto encyclopedia of genes and genomes databases. Analysis of ESTs described in this paper provides an important resource for the transcriptome exploration of Sika deer, and will also facilitate further studies on functional genomics, gene discovery and genome annotation of Sika deer.

  17. Structure-Related Roles for the Conservation of the HIV-1 Fusion Peptide Sequence Revealed by Nuclear Magnetic Resonance.

    PubMed

    Serrano, Soraya; Huarte, Nerea; Rujas, Edurne; Andreu, David; Nieva, José L; Jiménez, María Angeles

    2017-10-17

    Despite extensive characterization of the human immunodeficiency virus type 1 (HIV-1) hydrophobic fusion peptide (FP), the structure-function relationships underlying its extraordinary degree of conservation remain poorly understood. Specifically, the fact that the tandem repeat of the FLGFLG tripeptide is absolutely conserved suggests that high hydrophobicity may not suffice to unleash FP function. Here, we have compared the nuclear magnetic resonance (NMR) structures adopted in nonpolar media by two FP surrogates, wtFP-tag and scrFP-tag, which had equal hydrophobicity but contained wild-type and scrambled core sequences LFLGFLG and FGLLGFL, respectively. In addition, these peptides were tagged at their C-termini with an epitope sequence that folded independently, thereby allowing Western blot detection without interfering with FP structure. We observed similar α-helical FP conformations for both specimens dissolved in the low-polarity medium 25% (v/v) 1,1,1,3,3,3-hexafluoro-2-propanol (HFIP), but important differences in contact with micelles of the membrane mimetic dodecylphosphocholine (DPC). Thus, whereas wtFP-tag preserved a helix displaying a Gly-rich ridge, the scrambled sequence lost in great part the helical structure upon being solubilized in DPC. Western blot analyses further revealed the capacity of wtFP-tag to assemble trimers in membranes, whereas membrane oligomers were not observed in the case of the scrFP-tag sequence. We conclude that, beyond hydrophobicity, preserving sequence order is an important feature for defining the secondary structures and oligomeric states adopted by the HIV FP in membranes.

  18. Genetically encoded fluorescent tags

    PubMed Central

    Thorn, Kurt

    2017-01-01

    Genetically encoded fluorescent tags are protein sequences that can be fused to a protein of interest to render it fluorescent. These tags have revolutionized cell biology by allowing nearly any protein to be imaged by light microscopy at submicrometer spatial resolution and subsecond time resolution in a live cell or organism. They can also be used to measure protein abundance in thousands to millions of cells using flow cytometry. Here I provide an introduction to the different genetic tags available, including both intrinsically fluorescent proteins and proteins that derive their fluorescence from binding of either endogenous or exogenous fluorophores. I discuss their optical and biological properties and guidelines for choosing appropriate tags for an experiment. Tools for tagging nucleic acid sequences and reporter molecules that detect the presence of different biomolecules are also briefly discussed. PMID:28360214

  19. Construction of a yeast artificial chromosome contig encompassing the chromosome 14 Alzheimer`s disease locus

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sharma, V.; Bonnycastle, L.; Poorkai, P.

    1994-09-01

    We have constructed a yeast artificial chromosome (YAC) contig of chromosome 14q24.3 which encompasses the chromosome 14 Alzheimer`s disease locus (AD3). Determined by linkage analysis of early-onset Alzheimer`s disease kindreds, this interval is bounded by the genetic markers D14S61-D14S63 and spans approximately 15 centimorgans. The contig consists of 29 markers and 74 YACs of which 57 are defined by one or more sequence tagged sites (STSs). The STS markers comprise 5 genes, 16 short tandem repeat polymorphisms and 8 cDNA clones. An additional number of genes, expressed sequence tags and cDNA fragments have been identified and localized to the contigmore » by hybridization and sequence analysis of anonymous clones isolated by cDNA direct selection techniques. A minimal contig of about 15 YACs averaging 0.5-1.5 megabase in length will span this interval and is, at first approximation, in rough agreement with the genetic map. For two regions of the contig, our coverage has relied on L1/THE fingerprint and Alu-PCR hybridization data of YACs provided by CEPH/Genethon. We are currently developing sequence tagged sites from these to confirm the overlaps revealed by the fingerprint data. Among the genes which map to the contig are transforming growth factor beta 3, c-fos, and heat shock protein 2A (HSPA2). C-fos is not a candidate gene for AD3 based on the sequence analysis of affected and unaffected individuals. HSPA2 maps to the proximal edge of the contig and Calmodulin 1, a candidate gene from 4q24.3, maps outside of the region. The YAC contig is a framework physical map from which cosmid or P1 clone contigs can be constructed. As more genes and cDNAs are mapped, a highly resolved transcription map will emerge, a necessary step towards positionally cloning the AD3 gene.« less

  20. Transcriptome profile analysis of young floral buds of fertile and sterile plants from the self-pollinated offspring of the hybrid between novel restorer line NR1 and Nsa CMS line in Brassica napus

    PubMed Central

    2013-01-01

    Background The fertile and sterile plants were derived from the self-pollinated offspring of the F1 hybrid between the novel restorer line NR1 and the Nsa CMS line in Brassica napus. To elucidate gene expression and regulation caused by the A and C subgenomes of B. napus, as well as the alien chromosome and cytoplasm from Sinapis arvensis during the development of young floral buds, we performed a genome-wide high-throughput transcriptomic sequencing for young floral buds of sterile and fertile plants. Results In this study, equal amounts of total RNAs taken from young floral buds of sterile and fertile plants were sequenced using the Illumina/Solexa platform. After filtered out low quality data, a total of 2,760,574 and 2,714,441 clean tags were remained in the two libraries, from which 242,163 (Ste) and 253,507 (Fer) distinct tags were obtained. All distinct sequencing tags were annotated using all possible CATG+17-nt sequences of the genome and transcriptome of Brassica rapa and those of Brassica oleracea as the reference sequences, respectively. In total, 3231 genes of B. rapa and 3371 genes of B. oleracea were detected with significant differential expression levels. GO and pathway-based analyses were performed to determine and further to understand the biological functions of those differentially expressed genes (DEGs). In addition, there were 1089 specially expressed unknown tags in Fer, which were neither mapped to B. oleracea nor to B. rapa, and these unique tags were presumed to arise basically from the added alien chromosome of S. arvensis. Fifteen genes were randomly selected and their expression levels were confirmed by quantitative RT-PCR, and fourteen of them showed consistent expression patterns with the digital gene expression (DGE) data. Conclusions A number of genes were differentially expressed between the young floral buds of sterile and fertile plants. Some of these genes may be candidates for future research on CMS in Nsa line, fertility restoration and improved agronomic traits in NR1 line. Further study of the unknown tags which were specifically expressed in Fer will help to explore desirable agronomic traits from wild species. PMID:23324545

  1. Transcriptome profile analysis of young floral buds of fertile and sterile plants from the self-pollinated offspring of the hybrid between novel restorer line NR1 and Nsa CMS line in Brassica napus.

    PubMed

    Yan, Xiaohong; Dong, Caihua; Yu, Jingyin; Liu, Wanghui; Jiang, Chenghong; Liu, Jia; Hu, Qiong; Fang, Xiaoping; Wei, Wenhui

    2013-01-16

    The fertile and sterile plants were derived from the self-pollinated offspring of the F1 hybrid between the novel restorer line NR1 and the Nsa CMS line in Brassica napus. To elucidate gene expression and regulation caused by the A and C subgenomes of B. napus, as well as the alien chromosome and cytoplasm from Sinapis arvensis during the development of young floral buds, we performed a genome-wide high-throughput transcriptomic sequencing for young floral buds of sterile and fertile plants. In this study, equal amounts of total RNAs taken from young floral buds of sterile and fertile plants were sequenced using the Illumina/Solexa platform. After filtered out low quality data, a total of 2,760,574 and 2,714,441 clean tags were remained in the two libraries, from which 242,163 (Ste) and 253,507 (Fer) distinct tags were obtained. All distinct sequencing tags were annotated using all possible CATG+17-nt sequences of the genome and transcriptome of Brassica rapa and those of Brassica oleracea as the reference sequences, respectively. In total, 3231 genes of B. rapa and 3371 genes of B. oleracea were detected with significant differential expression levels. GO and pathway-based analyses were performed to determine and further to understand the biological functions of those differentially expressed genes (DEGs). In addition, there were 1089 specially expressed unknown tags in Fer, which were neither mapped to B. oleracea nor to B. rapa, and these unique tags were presumed to arise basically from the added alien chromosome of S. arvensis. Fifteen genes were randomly selected and their expression levels were confirmed by quantitative RT-PCR, and fourteen of them showed consistent expression patterns with the digital gene expression (DGE) data. A number of genes were differentially expressed between the young floral buds of sterile and fertile plants. Some of these genes may be candidates for future research on CMS in Nsa line, fertility restoration and improved agronomic traits in NR1 line. Further study of the unknown tags which were specifically expressed in Fer will help to explore desirable agronomic traits from wild species.

  2. A large scale analysis of cDNA in Arabidopsis thaliana: generation of 12,028 non-redundant expressed sequence tags from normalized and size-selected cDNA libraries.

    PubMed

    Asamizu, E; Nakamura, Y; Sato, S; Tabata, S

    2000-06-30

    For comprehensive analysis of genes expressed in the model dicotyledonous plant, Arabidopsis thaliana, expressed sequence tags (ESTs) were accumulated. Normalized and size-selected cDNA libraries were constructed from aboveground organs, flower buds, roots, green siliques and liquid-cultured seedlings, respectively, and a total of 14,026 5'-end ESTs and 39,207 3'-end ESTs were obtained. The 3'-end ESTs could be clustered into 12,028 non-redundant groups. Similarity search of the non-redundant ESTs against the public non-redundant protein database indicated that 4816 groups show similarity to genes of known function, 1864 to hypothetical genes, and the remaining 5348 are novel sequences. Gene coverage by the non-redundant ESTs was analyzed using the annotated genomic sequences of approximately 10 Mb on chromosomes 3 and 5. A total of 923 regions were hit by at least one EST, among which only 499 regions were hit by the ESTs deposited in the public database. The result indicates that the EST source generated in this project complements the EST data in the public database and facilitates new gene discovery.

  3. Development, characterization and cross species amplification of polymorphic microsatellite markers from expressed sequence tags of turmeric (Curcuma longa L.).

    PubMed

    Siju, S; Dhanya, K; Syamkumar, S; Sasikumar, B; Sheeja, T E; Bhat, A I; Parthasarathy, V A

    2010-02-01

    Expressed sequence tags (ESTs) from turmeric (Curcuma longa L.) were used for the screening of type and frequency of Class I (hypervariable) simple sequence repeats (SSRs). A total of 231 microsatellite repeats were detected from 12,593 EST sequences of turmeric after redundancy elimination. The average density of Class I SSRs accounts to one SSR per 17.96 kb of EST. Mononucleotides were the most abundant class of microsatellite repeat in turmeric ESTs followed by trinucleotides. A robust set of 17 polymorphic EST-SSRs were developed and used for evaluating 20 turmeric accessions. The number of alleles detected ranged from 3 to 8 per loci. The developed markers were also evaluated in 13 related species of C. longa confirming high rate (100%) of cross species transferability. The polymorphic microsatellite markers generated from this study could be used for genetic diversity analysis and resolving the taxonomic confusion prevailing in the genus.

  4. Generation of non-genomic oligonucleotide tag sequences for RNA template-specific PCR

    PubMed Central

    Pinto, Fernando Lopes; Svensson, Håkan; Lindblad, Peter

    2006-01-01

    Background In order to overcome genomic DNA contamination in transcriptional studies, reverse template-specific polymerase chain reaction, a modification of reverse transcriptase polymerase chain reaction, is used. The possibility of using tags whose sequences are not found in the genome further improves reverse specific polymerase chain reaction experiments. Given the absence of software available to produce genome suitable tags, a simple tool to fulfill such need was developed. Results The program was developed in Perl, with separate use of the basic local alignment search tool, making the tool platform independent (known to run on Windows XP and Linux). In order to test the performance of the generated tags, several molecular experiments were performed. The results show that Tagenerator is capable of generating tags with good priming properties, which will deliberately not result in PCR amplification of genomic DNA. Conclusion The program Tagenerator is capable of generating tag sequences that combine genome absence with good priming properties for RT-PCR based experiments, circumventing the effects of genomic DNA contamination in an RNA sample. PMID:16820068

  5. ScanRanker: Quality Assessment of Tandem Mass Spectra via Sequence Tagging

    PubMed Central

    Ma, Ze-Qiang; Chambers, Matthew C.; Ham, Amy-Joan L.; Cheek, Kristin L.; Whitwell, Corbin W.; Aerni, Hans-Rudolf; Schilling, Birgit; Miller, Aaron W.; Caprioli, Richard M.; Tabb, David L.

    2011-01-01

    In shotgun proteomics, protein identification by tandem mass spectrometry relies on bioinformatics tools. Despite recent improvements in identification algorithms, a significant number of high quality spectra remain unidentified for various reasons. Here we present ScanRanker, an open-source tool that evaluates the quality of tandem mass spectra via sequence tagging with reliable performance in data from different instruments. The superior performance of ScanRanker enables it not only to find unassigned high quality spectra that evade identification through database search, but also to select spectra for de novo sequencing and cross-linking analysis. In addition, we demonstrate that the distribution of ScanRanker scores predicts the richness of identifiable spectra among multiple LC-MS/MS runs in an experiment, and ScanRanker scores assist the process of peptide assignment validation to increase confident spectrum identifications. The source code and executable versions of ScanRanker are available from http://fenchurch.mc.vanderbilt.edu. PMID:21520941

  6. Expression, purification, and functional analysis of the C-terminal domain of Herbaspirillum seropedicae NifA protein.

    PubMed

    Monteiro, Rose A; Souza, Emanuel M; Geoffrey Yates, M; Steffens, M Berenice R; Pedrosa, Fábio O; Chubatsu, Leda S

    2003-02-01

    The Herbaspirillum seropedicae NifA protein is responsible for nif gene expression. The C-terminal domain of the H. seropedicae NifA protein, fused to a His-Tag sequence (His-Tag-C-terminal), was over-expressed and purified by metal-affinity chromatography to yield a highly purified and active protein. Band-shift assays showed that the NifA His-Tag-C-terminal bound specifically to the H. seropedicae nifB promoter region in vitro. In vivo analysis showed that this protein inhibited the Central + C-terminal domains of NifA protein from activating the nifH promoter of K. pneumoniae in Escherichia coli, indicating that the protein must be bound to the NifA-binding site (UAS site) at the nifH promoter region to activate transcription. Copyright 2002 Elsevier Science (USA)

  7. Cyanine-based probe\\tag-peptide pair fluorescence protein imaging and fluorescence protein imaging methods

    DOEpatents

    Mayer-Cumblidge, M. Uljana; Cao, Haishi

    2013-01-15

    A molecular probe comprises two arsenic atoms and at least one cyanine based moiety. A method of producing a molecular probe includes providing a molecule having a first formula, treating the molecule with HgOAc, and subsequently transmetallizing with AsCl.sub.3. The As is liganded to ethanedithiol to produce a probe having a second formula. A method of labeling a peptide includes providing a peptide comprising a tag sequence and contacting the peptide with a biarsenical molecular probe. A complex is formed comprising the tag sequence and the molecular probe. A method of studying a peptide includes providing a mixture containing a peptide comprising a peptide tag sequence, adding a biarsenical probe to the mixture, and monitoring the fluorescence of the mixture.

  8. Cyanine-based probe\\tag-peptide pair for fluorescence protein imaging and fluorescence protein imaging methods

    DOEpatents

    Mayer-Cumblidge, M Uljana [Richland, WA; Cao, Haishi [Richland, WA

    2010-08-17

    A molecular probe comprises two arsenic atoms and at least one cyanine based moiety. A method of producing a molecular probe includes providing a molecule having a first formula, treating the molecule with HgOAc, and subsequently transmetallizing with AsCl.sub.3. The As is liganded to ethanedithiol to produce a probe having a second formula. A method of labeling a peptide includes providing a peptide comprising a tag sequence and contacting the peptide with a biarsenical molecular probe. A complex is formed comprising the tag sequence and the molecular probe. A method of studying a peptide includes providing a mixture containing a peptide comprising a peptide tag sequence, adding a biarsenical probe to the mixture, and monitoring the fluorescence of the mixture.

  9. Generation and analysis of a barcode-tagged insertion mutant library in the fission yeast Schizosaccharomyces pombe.

    PubMed

    Chen, Bo-Ruei; Hale, Devin C; Ciolek, Peter J; Runge, Kurt W

    2012-05-03

    Barcodes are unique DNA sequence tags that can be used to specifically label individual mutants. The barcode-tagged open reading frame (ORF) haploid deletion mutant collections in the budding yeast Saccharomyces cerevisiae and the fission yeast Schizosaccharomyces pombe allow for high-throughput mutant phenotyping because the relative growth of mutants in a population can be determined by monitoring the proportions of their associated barcodes. While these mutant collections have greatly facilitated genome-wide studies, mutations in essential genes are not present, and the roles of these genes are not as easily studied. To further support genome-scale research in S. pombe, we generated a barcode-tagged fission yeast insertion mutant library that has the potential of generating viable mutations in both essential and non-essential genes and can be easily analyzed using standard molecular biological techniques. An insertion vector containing a selectable ura4+ marker and a random barcode was used to generate a collection of 10,000 fission yeast insertion mutants stored individually in 384-well plates and as six pools of mixed mutants. Individual barcodes are flanked by Sfi I recognition sites and can be oligomerized in a unique orientation to facilitate barcode sequencing. Independent genetic screens on a subset of mutants suggest that this library contains a diverse collection of single insertion mutations. We present several approaches to determine insertion sites. This collection of S. pombe barcode-tagged insertion mutants is well-suited for genome-wide studies. Because insertion mutations may eliminate, reduce or alter the function of essential and non-essential genes, this library will contain strains with a wide range of phenotypes that can be assayed by their associated barcodes. The design of the barcodes in this library allows for barcode sequencing using next generation or standard benchtop cloning approaches.

  10. GSyellow, a Multifaceted Tag for Functional Protein Analysis in Monocot and Dicot Plants.

    PubMed

    Besbrugge, Nienke; Van Leene, Jelle; Eeckhout, Dominique; Cannoot, Bernard; Kulkarni, Shubhada R; De Winne, Nancy; Persiau, Geert; Van De Slijke, Eveline; Bontinck, Michiel; Aesaert, Stijn; Impens, Francis; Gevaert, Kris; Van Damme, Daniel; Van Lijsebettens, Mieke; Inzé, Dirk; Vandepoele, Klaas; Nelissen, Hilde; De Jaeger, Geert

    2018-06-01

    The ability to tag proteins has boosted the emergence of generic molecular methods for protein functional analysis. Fluorescent protein tags are used to visualize protein localization, and affinity tags enable the mapping of molecular interactions by, for example, tandem affinity purification or chromatin immunoprecipitation. To apply these widely used molecular techniques on a single transgenic plant line, we developed a multifunctional tandem affinity purification tag, named GS yellow , which combines the streptavidin-binding peptide tag with citrine yellow fluorescent protein. We demonstrated the versatility of the GS yellow tag in the dicot Arabidopsis ( Arabidopsis thaliana ) using a set of benchmark proteins. For proof of concept in monocots, we assessed the localization and dynamic interaction profile of the leaf growth regulator ANGUSTIFOLIA3 (AN3), fused to the GS yellow tag, along the growth zone of the maize ( Zea mays ) leaf. To further explore the function of ZmAN3, we mapped its DNA-binding landscape in the growth zone of the maize leaf through chromatin immunoprecipitation sequencing. Comparison with AN3 target genes mapped in the developing maize tassel or in Arabidopsis cell cultures revealed strong conservation of AN3 target genes between different maize tissues and across monocots and dicots, respectively. In conclusion, the GS yellow tag offers a powerful molecular tool for distinct types of protein functional analyses in dicots and monocots. As this approach involves transforming a single construct, it is likely to accelerate both basic and translational plant research. © 2018 American Society of Plant Biologists. All rights reserved.

  11. Transcriptome sequencing and de novo analysis of the copepod Calanus sinicus using 454 GS FLX.

    PubMed

    Ning, Juan; Wang, Minxiao; Li, Chaolun; Sun, Song

    2013-01-01

    Despite their species abundance and primary economic importance, genomic information about copepods is still limited. In particular, genomic resources are lacking for the copepod Calanus sinicus, which is a dominant species in the coastal waters of East Asia. In this study, we performed de novo transcriptome sequencing to produce a large number of expressed sequence tags for the copepod C. sinicus. Copepodid larvae and adults were used as the basic material for transcriptome sequencing. Using 454 pyrosequencing, a total of 1,470,799 reads were obtained, which were assembled into 56,809 high quality expressed sequence tags. Based on their sequence similarity to known proteins, about 14,000 different genes were identified, including members of all major conserved signaling pathways. Transcripts that were putatively involved with growth, lipid metabolism, molting, and diapause were also identified among these genes. Differentially expressed genes related to several processes were found in C. sinicus copepodid larvae and adults. We detected 284,154 single nucleotide polymorphisms (SNPs) that provide a resource for gene function studies. Our data provide the most comprehensive transcriptome resource available for C. sinicus. This resource allowed us to identify genes associated with primary physiological processes and SNPs in coding regions, which facilitated the quantitative analysis of differential gene expression. These data should provide foundation for future genetic and genomic studies of this and related species.

  12. JC Virus Mediates Invasion and Migration in Colorectal Metastasis

    PubMed Central

    Link, Alexander; Shin, Sung Kwan; Nagasaka, Takeshi; Balaguer, Francesc; Koi, Minoru; Jung, Barbara; Boland, C. Richard; Goel, Ajay

    2009-01-01

    Introduction JC Virus (JCV), a human polyomavirus, is frequently present in colorectal cancers (CRCs). JCV large T-Ag (T-Ag) expressed in approximately half of all CRC's, however, its functional role in CRC is poorly understood. We hypothesized that JCV T-Ag may mediate metastasis in CRC cells through increased migration and invasion. Material and Methods CRC cell lines (HCT116 and SW837) were stably transfected with JCV early transcript sequences cloned into pCR3 or empty vectors. Migration and invasion assays were performed using Boyden chambers. Global gene expression analysis was performed to identify genetic targets and pathways altered by T-Ag expression. Microarray results were validated by qRT-PCR, protein expression analyses and immunohistochemistry. Matching primary CRCs and liver metastases from 33 patients were analyzed for T-Ag expression by immunohistochemistry. Results T-Ag expressing cell lines showed 2 to 3-fold increase in migration and invasion compared to controls. JCV T-Ag expression resulted in differential expression of several genetic targets, including genes that mediate cell migration and invasion. Pathway analysis suggested a significant involvement of these genes with AKT and MAPK signaling. Treatment with selective PI3K/AKT and MAPK pathway inhibitors resulted in reduced migration and invasion. In support of our in-vitro results, immunohistochemical staining of the advanced stage tumors revealed frequent JCV T-Ag expression in metastatic primary tumors (92%) as well as in their matching liver metastasis (73%). Conclusion These data suggest that JCV T-Ag expression in CRC associates with a metastatic phenotype, which may partly be mediated through the AKT/MAPK signaling pathway. Frequent expression of JCV T-Ag in CRC liver metastasis provides further clues supporting a mechanistic role for JCV as a possible mediator of cellular motility and invasion in CRC. PMID:19997600

  13. Expressed sequence tag analysis of human RPE/choroid for the NEIBank Project: over 6000 non-redundant transcripts, novel genes and splice variants.

    PubMed

    Wistow, Graeme; Bernstein, Steven L; Wyatt, M Keith; Fariss, Robert N; Behal, Amita; Touchman, Jeffrey W; Bouffard, Gerald; Smith, Don; Peterson, Katherine

    2002-06-15

    The retinal pigment epithelium (RPE) and choroid comprise a functional unit of the eye that is essential to normal retinal health and function. Here we describe expressed sequence tag (EST) analysis of human RPE/choroid as part of a project for ocular bioinformatics. A cDNA library (cs) was made from human RPE/choroid and sequenced. Data were analyzed and assembled using the program GRIST (GRouping and Identification of Sequence Tags). Complete sequencing, Northern and Western blots, RH mapping, peptide antibody synthesis and immunofluorescence (IF) have been used to examine expression patterns and genome location for selected transcripts and proteins. Ten thousand individual sequence reads yield over 6300 unique gene clusters of which almost half have no matches with named genes. One of the most abundant transcripts is from a gene (named "alpha") that maps to the BBS1 region of chromosome 11. A number of tissue preferred transcripts are common to both RPE/choroid and iris. These include oculoglycan/opticin, for which an alternative splice form is detected in RPE/choroid, and "oculospanin" (Ocsp), a novel tetraspanin that maps to chromosome 17q. Antiserum to Ocsp detects expression in RPE, iris, ciliary body, and retinal ganglion cells by IF. A newly identified gene for a zinc-finger protein (TIRC) maps to 19q13.4. Variant transcripts of several genes were also detected. Most notably, the predominant form of Bestrophin represented in cs contains a longer open reading frame as a result of splice junction skipping. The unamplified cs library gives a view of the transcriptional repertoire of the adult RPE/choroid. A large number of potentially novel genes and splice forms and candidates for genetic diseases are revealed. Clones from this collection are being included in a large, nonredundant set for cDNA microarray construction.

  14. Application of Cydia pomonella expressed sequence tags: identification and expression of three general odorant binding proteins in codling moth

    USDA-ARS?s Scientific Manuscript database

    The codling moth, Cydia pomonella, is one of the most important pests of pome fruits in the world, yet the molecular genetics and physiology of this insect remains poorly understood. A combined assembly of 8340 expressed sequence tags (ESTs) was generated from Roche 454 GS-FLX sequencing of 8 tissu...

  15. M13-Tailed Simple Sequence Repeat (SSR) Markers in Studies of Genetic Diversity and Population Structure of Common Oat Germplasm.

    PubMed

    Onyśk, Agnieszka; Boczkowska, Maja

    2017-01-01

    Simple Sequence Repeat (SSR) markers are one of the most frequently used molecular markers in studies of crop diversity and population structure. This is due to their uniform distribution in the genome, the high polymorphism, reproducibility, and codominant character. Additional advantages are the possibility of automatic analysis and simple interpretation of the results. The M13 tagged PCR reaction significantly reduces the costs of analysis by the automatic genetic analyzers. Here, we also disclose a short protocol of SSR data analysis.

  16. Preparation of next-generation sequencing libraries using Nextera™ technology: simultaneous DNA fragmentation and adaptor tagging by in vitro transposition.

    PubMed

    Caruccio, Nicholas

    2011-01-01

    DNA library preparation is a common entry point and bottleneck for next-generation sequencing. Current methods generally consist of distinct steps that often involve significant sample loss and hands-on time: DNA fragmentation, end-polishing, and adaptor-ligation. In vitro transposition with Nextera™ Transposomes simultaneously fragments and covalently tags the target DNA, thereby combining these three distinct steps into a single reaction. Platform-specific sequencing adaptors can be added, and the sample can be enriched and bar-coded using limited-cycle PCR to prepare di-tagged DNA fragment libraries. Nextera technology offers a streamlined, efficient, and high-throughput method for generating bar-coded libraries compatible with multiple next-generation sequencing platforms.

  17. SPlinted Ligation Adapter Tagging (SPLAT), a novel library preparation method for whole genome bisulphite sequencing

    PubMed Central

    Manlig, Erika; Wahlberg, Per

    2017-01-01

    Abstract Sodium bisulphite treatment of DNA combined with next generation sequencing (NGS) is a powerful combination for the interrogation of genome-wide DNA methylation profiles. Library preparation for whole genome bisulphite sequencing (WGBS) is challenging due to side effects of the bisulphite treatment, which leads to extensive DNA damage. Recently, a new generation of methods for bisulphite sequencing library preparation have been devised. They are based on initial bisulphite treatment of the DNA, followed by adaptor tagging of single stranded DNA fragments, and enable WGBS using low quantities of input DNA. In this study, we present a novel approach for quick and cost effective WGBS library preparation that is based on splinted adaptor tagging (SPLAT) of bisulphite-converted single-stranded DNA. Moreover, we validate SPLAT against three commercially available WGBS library preparation techniques, two of which are based on bisulphite treatment prior to adaptor tagging and one is a conventional WGBS method. PMID:27899585

  18. Identification of human chromosome 22 transcribed sequences with ORF expressed sequence tags

    PubMed Central

    de Souza, Sandro J.; Camargo, Anamaria A.; Briones, Marcelo R. S.; Costa, Fernando F.; Nagai, Maria Aparecida; Verjovski-Almeida, Sergio; Zago, Marco A.; Andrade, Luis Eduardo C.; Carrer, Helaine; El-Dorry, Hamza F. A.; Espreafico, Enilza M.; Habr-Gama, Angelita; Giannella-Neto, Daniel; Goldman, Gustavo H.; Gruber, Arthur; Hackel, Christine; Kimura, Edna T.; Maciel, Rui M. B.; Marie, Suely K. N.; Martins, Elizabeth A. L.; Nóbrega, Marina P.; Paçó-Larson, Maria Luisa; Pardini, Maria Inês M. C.; Pereira, Gonçalo G.; Pesquero, João Bosco; Rodrigues, Vanderlei; Rogatto, Silvia R.; da Silva, Ismael D. C. G.; Sogayar, Mari C.; de Fátima Sonati, Maria; Tajara, Eloiza H.; Valentini, Sandro R.; Acencio, Marcio; Alberto, Fernando L.; Amaral, Maria Elisabete J.; Aneas, Ivy; Bengtson, Mário Henrique; Carraro, Dirce M.; Carvalho, Alex F.; Carvalho, Lúcia Helena; Cerutti, Janete M.; Corrêa, Maria Lucia C.; Costa, Maria Cristina R.; Curcio, Cyntia; Gushiken, Tsieko; Ho, Paulo L.; Kimura, Elza; Leite, Luciana C. C.; Maia, Gustavo; Majumder, Paromita; Marins, Mozart; Matsukuma, Adriana; Melo, Analy S. A.; Mestriner, Carlos Alberto; Miracca, Elisabete C.; Miranda, Daniela C.; Nascimento, Ana Lucia T. O.; Nóbrega, Francisco G.; Ojopi, Élida P. B.; Pandolfi, José Rodrigo C.; Pessoa, Luciana Gilbert; Rahal, Paula; Rainho, Claudia A.; da Ro's, Nancy; de Sá, Renata G.; Sales, Magaly M.; da Silva, Neusa P.; Silva, Tereza C.; da Silva, Wilson; Simão, Daniel F.; Sousa, Josane F.; Stecconi, Daniella; Tsukumo, Fernando; Valente, Valéria; Zalcberg, Heloisa; Brentani, Ricardo R.; Reis, Luis F. L.; Dias-Neto, Emmanuel; Simpson, Andrew J. G.

    2000-01-01

    Transcribed sequences in the human genome can be identified with confidence only by alignment with sequences derived from cDNAs synthesized from naturally occurring mRNAs. We constructed a set of 250,000 cDNAs that represent partial expressed gene sequences and that are biased toward the central coding regions of the resulting transcripts. They are termed ORF expressed sequence tags (ORESTES). The 250,000 ORESTES were assembled into 81,429 contigs. Of these, 1,181 (1.45%) were found to match sequences in chromosome 22 with at least one ORESTES contig for 162 (65.6%) of the 247 known genes, for 67 (44.6%) of the 150 related genes, and for 45 of the 148 (30.4%) EST-predicted genes on this chromosome. Using a set of stringent criteria to validate our sequences, we identified a further 219 previously unannotated transcribed sequences on chromosome 22. Of these, 171 were in fact also defined by EST or full length cDNA sequences available in GenBank but not utilized in the initial annotation of the first human chromosome sequence. Thus despite representing less than 15% of all expressed human sequences in the public databases at the time of the present analysis, ORESTES sequences defined 48 transcribed sequences on chromosome 22 not defined by other sequences. All of the transcribed sequences defined by ORESTES coincided with DNA regions predicted as encoding exons by genscan. (http://genes.mit.edu/GENSCAN.html). PMID:11070084

  19. openSputnik--a database to ESTablish comparative plant genomics using unsaturated sequence collections.

    PubMed

    Rudd, Stephen

    2005-01-01

    The public expressed sequence tag collections are continually being enriched with high-quality sequences that represent an ever-expanding range of taxonomically diverse plant species. While these sequence collections provide biased insight into the populations of expressed genes available within individual species and their associated tissues, the information is conceivably of wider relevance in a comparative context. When we consider the available expressed sequence tag (EST) collections of summer 2004, most of the major plant taxonomic clades are at least superficially represented. Investigation of the five million available plant ESTs provides a wealth of information that has applications in modelling the routes of plant genome evolution and the identification of lineage-specific genes and gene families. Over four million ESTs from over 50 distinct plant species have been collated within an EST analysis pipeline called openSputnik. The ESTs were resolved down into approximately one million unigene sequences. These have been annotated using orthology-based annotation transfer from reference plant genomes and using a variety of contemporary bioinformatics methods to assign peptide, structural and functional attributes. The openSputnik database is available at http://sputnik.btk.fi.

  20. Transcriptome analysis in Concholepas concholepas (Gastropoda, Muricidae): mining and characterization of new genomic and molecular markers.

    PubMed

    Cárdenas, Leyla; Sánchez, Roland; Gomez, Daniela; Fuenzalida, Gonzalo; Gallardo-Escárate, Cristián; Tanguy, Arnaud

    2011-09-01

    The marine gastropod Concholepas concholepas, locally known as the "loco", is the main target species of the benthonic Chilean fisheries. Genetic and genomic tools are necessary to study the genome of this species in order to understand the molecular basis of its development, growth, and other key traits to improve the management strategies and to identify local adaptation to prevent loss of biodiversity. Here, we use pyrosequencing technologies to generate the first transcriptomic database from adult specimens of the loco. After trimming, a total of 140,756 Expressed Sequence Tag sequences were achieved. Clustering and assembly analysis identified 19,219 contigs and 105,435 singleton sequences. BlastN analysis showed a significant identity with Expressed Sequence Tags of different gastropod species available in public databases. Similarly, BlastX results showed that only 895 out of the total 124,654 had significant hits and may represent novel genes for marine gastropods. From this database, simple sequence repeat motifs were also identified and a total of 38 primer pairs were designed and tested to assess their potential as informative markers and to investigate their cross-species amplification in different related gastropod species. This dataset represents the first publicly available 454 data for a marine gastropod endemic to the southeastern Pacific coast, providing a valuable transcriptomic resource for future efforts of gene discovery and development of functional markers in other marine gastropods. Copyright © 2011 Elsevier B.V. All rights reserved.

  1. Sequence tagging reveals unexpected modifications in toxicoproteomics

    PubMed Central

    Dasari, Surendra; Chambers, Matthew C.; Codreanu, Simona G.; Liebler, Daniel C.; Collins, Ben C.; Pennington, Stephen R.; Gallagher, William M.; Tabb, David L.

    2010-01-01

    Toxicoproteomic samples are rich in posttranslational modifications (PTMs) of proteins. Identifying these modifications via standard database searching can incur significant performance penalties. Here we describe the latest developments in TagRecon, an algorithm that leverages inferred sequence tags to identify modified peptides in toxicoproteomic data sets. TagRecon identifies known modifications more effectively than the MyriMatch database search engine. TagRecon outperformed state of the art software in recognizing unanticipated modifications from LTQ, Orbitrap, and QTOF data sets. We developed user-friendly software for detecting persistent mass shifts from samples. We follow a three-step strategy for detecting unanticipated PTMs in samples. First, we identify the proteins present in the sample with a standard database search. Next, identified proteins are interrogated for unexpected PTMs with a sequence tag-based search. Finally, additional evidence is gathered for the detected mass shifts with a refinement search. Application of this technology on toxicoproteomic data sets revealed unintended cross-reactions between proteins and sample processing reagents. Twenty five proteins in rat liver showed signs of oxidative stress when exposed to potentially toxic drugs. These results demonstrate the value of mining toxicoproteomic data sets for modifications. PMID:21214251

  2. Characterizing the Grape Transcriptome. Analysis of Expressed Sequence Tags from Multiple Vitis Species and Development of a Compendium of Gene Expression during Berry Development1[w

    PubMed Central

    Silva, Francisco Goes da; Iandolino, Alberto; Al-Kayal, Fadi; Bohlmann, Marlene C.; Cushman, Mary Ann; Lim, Hyunju; Ergul, Ali; Figueroa, Rubi; Kabuloglu, Elif K.; Osborne, Craig; Rowe, Joan; Tattersall, Elizabeth; Leslie, Anna; Xu, Jane; Baek, JongMin; Cramer, Grant R.; Cushman, John C.; Cook, Douglas R.

    2005-01-01

    We report the analysis and annotation of 146,075 expressed sequence tags from Vitis species. The majority of these sequences were derived from different cultivars of Vitis vinifera, comprising an estimated 25,746 unique contig and singleton sequences that survey transcription in various tissues and developmental stages and during biotic and abiotic stress. Putatively homologous proteins were identified for over 17,752 of the transcripts, with 1,962 transcripts further subdivided into one or more Gene Ontology categories. A simple structured vocabulary, with modules for plant genotype, plant development, and stress, was developed to describe the relationship between individual expressed sequence tags and cDNA libraries; the resulting vocabulary provides query terms to facilitate data mining within the context of a relational database. As a measure of the extent to which characterized metabolic pathways were encompassed by the data set, we searched for homologs of the enzymes leading from glycolysis, through the oxidative/nonoxidative pentose phosphate pathway, and into the general phenylpropanoid pathway. Homologs were identified for 65 of these 77 enzymes, with 86% of enzymatic steps represented by paralogous genes. Differentially expressed transcripts were identified by means of a stringent believability index cutoff of ≥98.4%. Correlation analysis and two-dimensional hierarchical clustering grouped these transcripts according to similarity of expression. In the broadest analysis, 665 differentially expressed transcripts were identified across 29 cDNA libraries, representing a range of developmental and stress conditions. The groupings revealed expected associations between plant developmental stages and tissue types, with the notable exception of abiotic stress treatments. A more focused analysis of flower and berry development identified 87 differentially expressed transcripts and provides the basis for a compendium that relates gene expression and annotation to previously characterized aspects of berry development and physiology. Comparison with published results for select genes, as well as correlation analysis between independent data sets, suggests that the inferred in silico patterns of expression are likely to be an accurate representation of transcript abundance for the conditions surveyed. Thus, the combined data set reveals the in silico expression patterns for hundreds of genes in V. vinifera, the majority of which have not been previously studied within this species. PMID:16219919

  3. Genome-Wide Analysis of Differentially Expressed Genes Relevant to Rhizome Formation in Lotus Root (Nelumbo nucifera Gaertn)

    PubMed Central

    Yin, Jingjing; Li, Liangjun; Chen, Xuehao

    2013-01-01

    Lotus root is a popular wetland vegetable which produces edible rhizome. At the molecular level, the regulation of rhizome formation is very complex, which has not been sufficiently addressed in research. In this study, to identify differentially expressed genes (DEGs) in lotus root, four libraries (L1 library: stolon stage, L2 library: initial swelling stage, L3 library: middle swelling stage, L4: later swelling stage) were constructed from the rhizome development stages. High-throughput tag-sequencing technique was used which is based on Solexa Genome Analyzer Platform. Approximately 5.0 million tags were sequenced, and 4542104, 4474755, 4777919, and 4750348 clean tags including 151282, 137476, 215872, and 166005 distinct tags were obtained after removal of low quality tags from each library respectively. More than 43% distinct tags were unambiguous tags mapping to the reference genes, and 40% were unambiguous tag-mapped genes. From L1, L2, L3, and L4, total 20471, 18785, 23448, and 21778 genes were annotated, after mapping their functions in existing databases. Profiling of gene expression in L1/L2, L2/L3, and L3/L4 libraries were different among most of the selected 20 DEGs. Most of the DEGs in L1/L2 libraries were relevant to fiber development and stress response, while in L2/L3 and L3/L4 libraries, major of the DEGs were involved in metabolism of energy and storage. All up-regulated transcriptional factors in four libraries and 14 important rhizome formation-related genes in four libraries were also identified. In addition, the expression of 9 genes from identified DEGs was performed by qRT-PCR method. In a summary, this study provides a comprehensive understanding of gene expression during the rhizome formation in lotus root. PMID:23840598

  4. Database-independent Protein Sequencing (DiPS) Enables Full-length de Novo Protein and Antibody Sequence Determination.

    PubMed

    Savidor, Alon; Barzilay, Rotem; Elinger, Dalia; Yarden, Yosef; Lindzen, Moshit; Gabashvili, Alexandra; Adiv Tal, Ophir; Levin, Yishai

    2017-06-01

    Traditional "bottom-up" proteomic approaches use proteolytic digestion, LC-MS/MS, and database searching to elucidate peptide identities and their parent proteins. Protein sequences absent from the database cannot be identified, and even if present in the database, complete sequence coverage is rarely achieved even for the most abundant proteins in the sample. Thus, sequencing of unknown proteins such as antibodies or constituents of metaproteomes remains a challenging problem. To date, there is no available method for full-length protein sequencing, independent of a reference database, in high throughput. Here, we present Database-independent Protein Sequencing, a method for unambiguous, rapid, database-independent, full-length protein sequencing. The method is a novel combination of non-enzymatic, semi-random cleavage of the protein, LC-MS/MS analysis, peptide de novo sequencing, extraction of peptide tags, and their assembly into a consensus sequence using an algorithm named "Peptide Tag Assembler." As proof-of-concept, the method was applied to samples of three known proteins representing three size classes and to a previously un-sequenced, clinically relevant monoclonal antibody. Excluding leucine/isoleucine and glutamic acid/deamidated glutamine ambiguities, end-to-end full-length de novo sequencing was achieved with 99-100% accuracy for all benchmarking proteins and the antibody light chain. Accuracy of the sequenced antibody heavy chain, including the entire variable region, was also 100%, but there was a 23-residue gap in the constant region sequence. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

  5. Linear reduction method for predictive and informative tag SNP selection.

    PubMed

    He, Jingwu; Westbrooks, Kelly; Zelikovsky, Alexander

    2005-01-01

    Constructing a complete human haplotype map is helpful when associating complex diseases with their related SNPs. Unfortunately, the number of SNPs is very large and it is costly to sequence many individuals. Therefore, it is desirable to reduce the number of SNPs that should be sequenced to a small number of informative representatives called tag SNPs. In this paper, we propose a new linear algebra-based method for selecting and using tag SNPs. We measure the quality of our tag SNP selection algorithm by comparing actual SNPs with SNPs predicted from selected linearly independent tag SNPs. Our experiments show that for sufficiently long haplotypes, knowing only 0.4% of all SNPs the proposed linear reduction method predicts an unknown haplotype with the error rate below 2% based on 10% of the population.

  6. Analysis of ER Resident Proteins in S. cerevisiae: Implementation of H/KDEL Retrieval Sequences

    PubMed Central

    Young, Carissa L.; Raden, David L.; Robinson, Anne S.

    2013-01-01

    An elaborate quality control system regulates ER homeostasis by ensuring the fidelity of protein synthesis and maturation. In budding yeast, genomic analyses and high-throughput proteomic studies have identified ER resident proteins that restore homeostasis following local perturbations. Yet, how these folding factors modulate stress has been largely unexplored. In this study, we designed a series of PCR-based modules including codon-optimized epitopes and FP variants complete with C-terminal H/KDEL retrieval motifs. These conserved sequences are inherent to most soluble ER resident proteins. To monitor multiple proteins simultaneously, H/KDEL cassettes are available with six different selection markers, providing optimal flexibility for live-cell imaging and multicolor labeling in vivo. A single pair of PCR primers can be used for the amplification of these 26 modules, enabling numerous combinations of tags and selection markers. The versatility of pCY H/KDEL cassettes was demonstrated by labeling BiP/Kar2p, Pdi1p, and Scj1p with all novel tags, thus providing a direct comparison among FP variants. Furthermore, to advance in vitro studies of yeast ER proteins, Strep-tag II was engineered with a C-terminal retrieval sequence. Here, an efficient purification strategy was established for BiP under physiological conditions. PMID:23324027

  7. Construction and Cloning of Reporter-Tagged Replicon cDNA for an In Vitro Replication Study of Murine Norovirus-1 (MNV-1).

    PubMed

    Ahmad, Muhammad Khairi; Tabana, Yasser M; Ahmed, Mowaffaq Adam; Sandai, Doblin Anak; Mohamed, Rafeezul; Ismail, Ida Shazrina; Zulkiflie, Nurulisa; Yunus, Muhammad Amir

    2017-12-01

    A norovirus maintains its viability, infectivity and virulence by its ability to replicate. However, the biological mechanisms of the process remain to be explored. In this work, the NanoLuc™ Luciferase gene was used to develop a reporter-tagged replicon system to study norovirus replication. The NanoLuc™ Luciferase reporter protein was engineered to be expressed as a fusion protein for MNV-1 minor capsid protein, VP2. The foot-and-mouth disease virus 2A (FMDV2A) sequence was inserted between the 3'end of the reporter gene and the VP2 start sequence to allow co-translational 'cleavage' of fusion proteins during intracellular transcript expression. Amplification of the fusion gene was performed using a series of standard and overlapping polymerase chain reactions. The resulting amplicon was then cloned into three readily available backbones of MNV-1 cDNA clones. Restriction enzyme analysis indicated that the NanoLucTM Luciferase gene was successfully inserted into the parental MNV-1 cDNA clone. The insertion was further confirmed by using DNA sequencing. NanoLuc™ Luciferase-tagged MNV-1 cDNA clones were successfully engineered. Such clones can be exploited to develop robust experimental assays for in vitro assessments of viral RNA replication.

  8. The Transcriptome of Compatible and Incompatible Interactions of Potato (Solanum tuberosum) with Phytophthora infestans Revealed by DeepSAGE Analysis

    PubMed Central

    Gyetvai, Gabor; Sønderkær, Mads; Göbel, Ulrike; Basekow, Rico; Ballvora, Agim; Imhoff, Maren; Kersten, Birgit; Nielsen, Kåre-Lehman; Gebhardt, Christiane

    2012-01-01

    Late blight, caused by the oomycete Phytophthora infestans, is the most important disease of potato (Solanum tuberosum). Understanding the molecular basis of resistance and susceptibility to late blight is therefore highly relevant for developing resistant cultivars, either by marker-assissted selection or by transgenic approaches. Specific P. infestans races having the Avr1 effector gene trigger a hypersensitive resistance response in potato plants carrying the R1 resistance gene (incompatible interaction) and cause disease in plants lacking R1 (compatible interaction). The transcriptomes of the compatible and incompatible interaction were captured by DeepSAGE analysis of 44 biological samples comprising five genotypes, differing only by the presence or absence of the R1 transgene, three infection time points and three biological replicates. 30.859 unique 21 base pair sequence tags were obtained, one third of which did not match any known potato transcript sequence. Two third of the tags were expressed at low frequency (<10 tag counts/million). 20.470 unitags matched to approximately twelve thousand potato transcribed genes. Tag frequencies were compared between compatible and incompatible interactions over the infection time course and between compatible and incompatible genotypes. Transcriptional changes were more numerous in compatible than in incompatible interactions. In contrast to incompatible interactions, transcriptional changes in the compatible interaction were observed predominantly for multigene families encoding defense response genes and genes functional in photosynthesis and CO2 fixation. Numerous transcriptional differences were also observed between near isogenic genotypes prior to infection with P. infestans. Our DeepSAGE transcriptome analysis uncovered novel candidate genes for plant host pathogen interactions, examples of which are discussed with respect to possible function. PMID:22328937

  9. Structure-function analysis of diacylglycerol acyltransferase sequences from 70 organisms

    USDA-ARS?s Scientific Manuscript database

    Diacylglycerol acyltransferases (DGATs) catalyze the final and rate-limiting step of triacylglycerol (TAG) biosynthesis in eukaryotic organisms. Understanding the roles of DGATs will help to create transgenic plants with value-added properties and provide clues for therapeutic intervention for obes...

  10. Molecular phenotype of zebrafish ovarian follicle by serial analysis of gene expression and proteomic profiling, and comparison with the transcriptomes of other animals

    PubMed Central

    Knoll-Gellida, Anja; André, Michèle; Gattegno, Tamar; Forgue, Jean; Admon, Arie; Babin, Patrick J

    2006-01-01

    Background The ability of an oocyte to develop into a viable embryo depends on the accumulation of specific maternal information and molecules, such as RNAs and proteins. A serial analysis of gene expression (SAGE) was carried out in parallel with proteomic analysis on fully-grown ovarian follicles from zebrafish (Danio rerio). The data obtained were compared with ovary/follicle/egg molecular phenotypes of other animals, published or available in public sequence databases. Results Sequencing of 27,486 SAGE tags identified 11,399 different ones, including 3,329 tags with an occurrence superior to one. Fifty-eight genes were expressed at over 0.15% of the total population and represented 17.34% of the mRNA population identified. The three most expressed transcripts were a rhamnose-binding lectin, beta-actin 2, and a transcribed locus similar to the H2B histone family. Comparison with the large-scale expressed sequence tags sequencing approach revealed highly expressed transcripts that were not previously known to be expressed at high levels in fish ovaries, like the short-sized polarized metallothionein 2 transcript. A higher sensitivity for the detection of transcripts with a characterized maternal genetic contribution was also demonstrated compared to large-scale sequencing of cDNA libraries. Ferritin heavy polypeptide 1, heat shock protein 90-beta, lactate dehydrogenase B4, beta-actin isoforms, tubulin beta 2, ATP synthase subunit 9, together with 40 S ribosomal protein S27a, were common highly-expressed transcripts of vertebrate ovary/unfertilized egg. Comparison of transcriptome and proteome data revealed that transcript levels provide little predictive value with respect to the extent of protein abundance. All the proteins identified by proteomic analysis of fully-grown zebrafish follicles had at least one transcript counterpart, with two exceptions: eosinophil chemotactic cytokine and nothepsin. Conclusion This study provides a complete sequence data set of maternal mRNA stored in zebrafish germ cells at the end of oogenesis. This catalogue contains highly-expressed transcripts that are part of a vertebrate ovarian expressed gene signature. Comparison of transcriptome and proteome data identified downregulated transcripts or proteins potentially incorporated in the oocyte by endocytosis. The molecular phenotype described provides groundwork for future experimental approaches aimed at identifying functionally important stored maternal transcripts and proteins involved in oogenesis and early stages of embryo development. PMID:16526958

  11. Exploring the Presence of microDNAs in Prostate Cancer Cell Lines, Tissue, and Sera of Prostate Cancer Patients and its Possible Application as Biomarker

    DTIC Science & Technology

    2015-08-01

    Sequence tags were mapped on the human reference genome using the Novoalign software. Only those tags... the linear islands to create a novel junctional sequence that does not exist in the genome . Thus the PE- sequence of a fragment that breaks at or...identified in cancer cell lines. (b) Median percent GC content of microDNAs and the genomic sequences up- or downstream of the source loci are

  12. Genomic analysis of expressed sequence tags in American black bear Ursus americanus

    PubMed Central

    2010-01-01

    Background Species of the bear family (Ursidae) are important organisms for research in molecular evolution, comparative physiology and conservation biology, but relatively little genetic sequence information is available for this group. Here we report the development and analyses of the first large scale Expressed Sequence Tag (EST) resource for the American black bear (Ursus americanus). Results Comprehensive analyses of molecular functions, alternative splicing, and tissue-specific expression of 38,757 black bear EST sequences were conducted using the dog genome as a reference. We identified 18 genes, involved in functions such as lipid catabolism, cell cycle, and vesicle-mediated transport, that are showing rapid evolution in the bear lineage Three genes, Phospholamban (PLN), cysteine glycine-rich protein 3 (CSRP3) and Troponin I type 3 (TNNI3), are related to heart contraction, and defects in these genes in humans lead to heart disease. Two genes, biphenyl hydrolase-like (BPHL) and CSRP3, contain positively selected sites in bear. Global analysis of evolution rates of hibernation-related genes in bear showed that they are largely conserved and slowly evolving genes, rather than novel and fast-evolving genes. Conclusion We provide a genomic resource for an important mammalian organism and our study sheds new light on the possible functions and evolution of bear genes. PMID:20338065

  13. Genomic analysis of expressed sequence tags in American black bear Ursus americanus.

    PubMed

    Zhao, Sen; Shao, Chunxuan; Goropashnaya, Anna V; Stewart, Nathan C; Xu, Yichi; Tøien, Øivind; Barnes, Brian M; Fedorov, Vadim B; Yan, Jun

    2010-03-26

    Species of the bear family (Ursidae) are important organisms for research in molecular evolution, comparative physiology and conservation biology, but relatively little genetic sequence information is available for this group. Here we report the development and analyses of the first large scale Expressed Sequence Tag (EST) resource for the American black bear (Ursus americanus). Comprehensive analyses of molecular functions, alternative splicing, and tissue-specific expression of 38,757 black bear EST sequences were conducted using the dog genome as a reference. We identified 18 genes, involved in functions such as lipid catabolism, cell cycle, and vesicle-mediated transport, that are showing rapid evolution in the bear lineage Three genes, Phospholamban (PLN), cysteine glycine-rich protein 3 (CSRP3) and Troponin I type 3 (TNNI3), are related to heart contraction, and defects in these genes in humans lead to heart disease. Two genes, biphenyl hydrolase-like (BPHL) and CSRP3, contain positively selected sites in bear. Global analysis of evolution rates of hibernation-related genes in bear showed that they are largely conserved and slowly evolving genes, rather than novel and fast-evolving genes. We provide a genomic resource for an important mammalian organism and our study sheds new light on the possible functions and evolution of bear genes.

  14. Expressed sequence tag (EST) analysis of two subspecies of Metarhizium anisopliae reveals a plethora of secreted proteins with potential activity in insect hosts.

    PubMed

    Freimoser, Florian M; Screen, Steven; Bagga, Savita; Hu, Gang; St Leger, Raymond J

    2003-01-01

    Expressed sequence tag (EST) libraries for Metarhizium anisopliae, the causative agent of green muscardine disease, were developed from the broad host-range pathogen Metarhizium anisopliae sf. anisopliae and the specific grasshopper pathogen, M. anisopliae sf. acridum. Approximately 1,700 5' end sequences from each subspecies were generated from cDNA libraries representing fungi grown under conditions that maximize secretion of cuticle-degrading enzymes. Both subspecies had ESTs for virtually all pathogenicity-related genes cloned to date from M. anisopliae, but many novel genes encoding potential virulence factors were also tagged. Enzymes with potential targets in the insect host included proteases, chitinases, phospholipases, lipases, esterases, phosphatases and enzymes producing toxic secondary metabolites. A diverse array of proteases composed 36 % of all M. anisopliae sf. anisopliae ESTs. Eighty percent of the ESTs that could be clustered into functional groups had significant matches (E<10(-5)) in other ascomycete fungi. These included genes reported to have specific roles in pathogens with plant or vertebrate hosts. Many of the remaining ESTs had their best BLAST match among animal, plant and bacterial sequences. These include genes with plant and microbial counterparts that produce potent antimicrobials. The abundance of transcripts discovered for different functional groups varied between the two subspecies of M. anisopliae in a manner consistent with ecological adaptations of the two pathogens. By hastening gene discovery this project has enhanced development of improved mycoinsecticides. In addition, the M. anisopliae ESTs represent a significant contribution to the extensive database of sequences from ascomycetes that are saprophytes or plant and vertebrate pathogens. Comparative analyses of these sequences is providing important information about the biology and evolutionary history of this clade.

  15. Design and characterization of a nanopore-coupled polymerase for single-molecule DNA sequencing by synthesis on an electrode array

    PubMed Central

    Stranges, P. Benjamin; Palla, Mirkó; Kalachikov, Sergey; Nivala, Jeff; Dorwart, Michael; Trans, Andrew; Kumar, Shiv; Porel, Mintu; Chien, Minchen; Tao, Chuanjuan; Morozova, Irina; Li, Zengmin; Shi, Shundi; Aberra, Aman; Arnold, Cleoma; Yang, Alexander; Aguirre, Anne; Harada, Eric T.; Korenblum, Daniel; Pollard, James; Bhat, Ashwini; Gremyachinskiy, Dmitriy; Bibillo, Arek; Chen, Roger; Davis, Randy; Russo, James J.; Fuller, Carl W.; Roever, Stefan; Ju, Jingyue; Church, George M.

    2016-01-01

    Scalable, high-throughput DNA sequencing is a prerequisite for precision medicine and biomedical research. Recently, we presented a nanopore-based sequencing-by-synthesis (Nanopore-SBS) approach, which used a set of nucleotides with polymer tags that allow discrimination of the nucleotides in a biological nanopore. Here, we designed and covalently coupled a DNA polymerase to an α-hemolysin (αHL) heptamer using the SpyCatcher/SpyTag conjugation approach. These porin–polymerase conjugates were inserted into lipid bilayers on a complementary metal oxide semiconductor (CMOS)-based electrode array for high-throughput electrical recording of DNA synthesis. The designed nanopore construct successfully detected the capture of tagged nucleotides complementary to a DNA base on a provided template. We measured over 200 tagged-nucleotide signals for each of the four bases and developed a classification method to uniquely distinguish them from each other and background signals. The probability of falsely identifying a background event as a true capture event was less than 1.2%. In the presence of all four tagged nucleotides, we observed sequential additions in real time during polymerase-catalyzed DNA synthesis. Single-polymerase coupling to a nanopore, in combination with the Nanopore-SBS approach, can provide the foundation for a low-cost, single-molecule, electronic DNA-sequencing platform. PMID:27729524

  16. Cell-free translational screening of an expression sequence tag library of Clonorchis sinensis for novel antigen discovery.

    PubMed

    Kasi, Devi; Catherine, Christy; Lee, Seung-Won; Lee, Kyung-Ho; Kim, Yu Jung; Ro Lee, Myeong; Ju, Jung Won; Kim, Dong-Myung

    2017-05-01

    The rapidly evolving cloning and sequencing technologies have enabled understanding of genomic structure of parasite genomes, opening up new ways of combatting parasite-related diseases. To make the most of the exponentially accumulating genomic data, however, it is crucial to analyze the proteins encoded by these genomic sequences. In this study, we adopted an engineered cell-free protein synthesis system for large-scale expression screening of an expression sequence tag (EST) library of Clonorchis sinensis to identify potential antigens that can be used for diagnosis and treatment of clonorchiasis. To allow high-throughput expression and identification of individual genes comprising the library, a cell-free synthesis reaction was designed such that both the template DNA and the expressed proteins were co-immobilized on the same microbeads, leading to microbead-based linkage of the genotype and phenotype. This reaction configuration allowed streamlined expression, recovery, and analysis of proteins. This approach enabled us to identify 21 antigenic proteins. © 2017 American Institute of Chemical Engineers Biotechnol. Prog., 33:832-837, 2017. © 2017 American Institute of Chemical Engineers.

  17. In silico Analysis of 3′-End-Processing Signals in Aspergillus oryzae Using Expressed Sequence Tags and Genomic Sequencing Data

    PubMed Central

    Tanaka, Mizuki; Sakai, Yoshifumi; Yamada, Osamu; Shintani, Takahiro; Gomi, Katsuya

    2011-01-01

    To investigate 3′-end-processing signals in Aspergillus oryzae, we created a nucleotide sequence data set of the 3′-untranslated region (3′ UTR) plus 100 nucleotides (nt) sequence downstream of the poly(A) site using A. oryzae expressed sequence tags and genomic sequencing data. This data set comprised 1065 sequences derived from 1042 unique genes. The average 3′ UTR length in A. oryzae was 241 nt, which is greater than that in yeast but similar to that in plants. The 3′ UTR and 100 nt sequence downstream of the poly(A) site is notably U-rich, while the region located 15–30 nt upstream of the poly(A) site is markedly A-rich. The most frequently found hexanucleotide in this A-rich region is AAUGAA, although this sequence accounts for only 6% of all transcripts. These data suggested that A. oryzae has no highly conserved sequence element equivalent to AAUAAA, a mammalian polyadenylation signal. We identified that putative 3′-end-processing signals in A. oryzae, while less well conserved than those in mammals, comprised four sequence elements: the furthest upstream U-rich element, A-rich sequence, cleavage site, and downstream U-rich element flanking the cleavage site. Although these putative 3′-end-processing signals are similar to those in yeast and plants, some notable differences exist between them. PMID:21586533

  18. Expressed sequence tag analysis of adult human lens for the NEIBank Project: over 2000 non-redundant transcripts, novel genes and splice variants.

    PubMed

    Wistow, Graeme; Bernstein, Steven L; Wyatt, M Keith; Behal, Amita; Touchman, Jeffrey W; Bouffard, Gerald; Smith, Don; Peterson, Katherine

    2002-06-15

    To explore the expression profile of the human lens and to provide a resource for microarray studies, expressed sequence tag (EST) analysis has been performed on cDNA libraries from adult lenses. A cDNA library was constructed from two adult (40 year old) human lenses. Over two thousand clones were sequenced from the unamplified, un-normalized library. The library was then normalized and a further 2200 sequences were obtained. All the data were analyzed using GRIST (GRouping and Identification of Sequence Tags), a procedure for gene identification and clustering. The lens library (by) contains a low percentage of non-mRNA contaminants and a high fraction (over 75%) of apparently full length cDNA clones. Approximately 2000 reads from the unamplified library yields 810 clusters, potentially representing individual genes expressed in the lens. After normalization, the content of crystallins and other abundant cDNAs is markedly reduced and a similar number of reads from this library (fs) yields 1455 unique groups of which only two thirds correspond to named genes in GenBank. Among the most abundant cDNAs is one for a novel gene related to glutamine synthetase, which was designated "lengsin" (LGS). Analyses of ESTs also reveal examples of alternative transcripts, including a major alternative splice form for the lens specific membrane protein MP19. Variant forms for other transcripts, including those encoding the apoptosis inhibitor Livin and the armadillo repeat protein ARVCF, are also described. The lens cDNA libraries are a resource for gene discovery, full length cDNAs for functional studies and microarrays. The discovery of an abundant, novel transcript, lengsin, and a major novel splice form of MP19 reflect the utility of unamplified libraries constructed from dissected tissue. Many novel transcripts and splice forms are represented, some of which may be candidates for genetic diseases.

  19. Micropreparative capillary gel electrophoresis of DNA: rapid expressed sequence tag library construction.

    PubMed

    Shi, Liang; Khandurina, Julia; Ronai, Zsolt; Li, Bi-Yu; Kwan, Wai King; Wang, Xun; Guttman, András

    2003-01-01

    A capillary gel electrophoresis based automated DNA fraction collection technique was developed to support a novel DNA fragment-pooling strategy for expressed sequence tag (EST) library construction. The cDNA population is first cleaved by BsaJ I and EcoR I restriction enzymes, and then subpooled by selective ligation with specific adapters followed by polymerase chain reaction (PCR) amplification and labeling. Combination of this cDNA fingerprinting method with high-resolution capillary gel electrophoresis separation and precise fractionation of individual cDNA transcript representatives avoids redundant fragment selection and concomitant repetitive sequencing of abundant transcripts. Using a computer-controlled capillary electrophoresis device the transcript representatives were separated by their size and fractions were automatically collected in every 30 s into 96-well plates. The high resolving power of the sieving matrix ensured sequencing grade separation of the DNA fragments (i.e., single-base resolution) and successful fraction collection. Performance and precision of the fraction collection procedure was validated by PCR amplification of the collected DNA fragments followed by capillary electrophoresis analysis for size and purity verification. The collected and PCR-amplified transcript representatives, ranging up to several hundred base pairs, were then sequenced to create an EST library.

  20. Decision Tree Algorithm-Generated Single-Nucleotide Polymorphism Barcodes of rbcL Genes for 38 Brassicaceae Species Tagging.

    PubMed

    Yang, Cheng-Hong; Wu, Kuo-Chuan; Chuang, Li-Yeh; Chang, Hsueh-Wei

    2018-01-01

    DNA barcode sequences are accumulating in large data sets. A barcode is generally a sequence larger than 1000 base pairs and generates a computational burden. Although the DNA barcode was originally envisioned as straightforward species tags, the identification usage of barcode sequences is rarely emphasized currently. Single-nucleotide polymorphism (SNP) association studies provide us an idea that the SNPs may be the ideal target of feature selection to discriminate between different species. We hypothesize that SNP-based barcodes may be more effective than the full length of DNA barcode sequences for species discrimination. To address this issue, we tested a r ibulose diphosphate carboxylase ( rbcL ) S NP b arcoding (RSB) strategy using a decision tree algorithm. After alignment and trimming, 31 SNPs were discovered in the rbcL sequences from 38 Brassicaceae plant species. In the decision tree construction, these SNPs were computed to set up the decision rule to assign the sequences into 2 groups level by level. After algorithm processing, 37 nodes and 31 loci were required for discriminating 38 species. Finally, the sequence tags consisting of 31 rbcL SNP barcodes were identified for discriminating 38 Brassicaceae species based on the decision tree-selected SNP pattern using RSB method. Taken together, this study provides the rational that the SNP aspect of DNA barcode for rbcL gene is a useful and effective sequence for tagging 38 Brassicaceae species.

  1. Characterization by Suppression Subtractive Hybridization of Transcripts That Are Differentially Expressed in Leaves of Anthracnose-Resistant Ramie Cultivar.

    PubMed

    Xuxia, Wang; Jie, Chen; Bo, Wang; Lijun, Liu; Hui, Jiang; Diluo, Tang; Dingxiang, Peng

    2012-01-01

    For the purpose of screening putative anthracnose resistance-related genes of ramie ( Boehmeria nivea L. Gaud), a cDNA library was constructed by suppression subtractive hybridization using anthracnose-resistant cultivar Huazhu no. 4. The cDNAs from Huazhu no. 4, which were infected with Colletotrichum gloeosporioides , were used as the tester and cDNAs from uninfected Huazhu no. 4 as the driver. Sequencing analysis and homology searching showed that these clones represented 132 single genes, which were assigned to functional categories, including 14 putative cellular functions, according to categories established for Arabidopsis . These 132 genes included 35 disease resistance and stress tolerance-related genes including putative heat-shock protein 90, metallothionein, PR-1.2 protein, catalase gene, WRKY family genes, and proteinase inhibitor-like protein. Partial disease-related genes were further analyzed by reverse transcription PCR and RNA gel blot. These expressed sequence tags are the first anthracnose resistance-related expressed sequence tags reported in ramie.

  2. Construction of a Lotus japonicus late nodulin expressed sequence tag library and identification of novel nodule-specific genes.

    PubMed Central

    Szczyglowski, K; Hamburger, D; Kapranov, P; de Bruijn, F J

    1997-01-01

    A range of novel expressed sequence tags (ESTs) associated with late developmental events during nodule organogenesis in the legume Lotus japonicus were identified using mRNA differential display; 110 differentially displayed polymerase chain reaction products were cloned and analyzed. Of 88 unique cDNAs obtained, 22 shared significant homology to DNA/protein sequences in the respective databases. This group comprises, among others, a nodule-specific homolog of protein phosphatase 2C, a peptide transporter protein, and a nodule-specific form of cytochrome P450. RNA gel-blot analysis of 16 differentially displayed ESTs confirmed their nodule-specific expression pattern. The kinetics of mRNA accumulation of the majority of the ESTs analyzed were found to resemble the expression pattern observed for the L. japonicus leghemoglobin gene. These results indicate that the newly isolated molecular markers correspond to genes induced during late developmental stages of L. japonicus nodule organogenesis and provide important, novel tools for the study of nodulation. PMID:9276951

  3. Genome-wide identification, classification, and expression analysis of the arabinogalactan protein gene family in rice (Oryza sativa L.)

    PubMed Central

    Zhao, Jie

    2010-01-01

    Arabinogalactan proteins (AGPs) comprise a family of hydroxyproline-rich glycoproteins that are implicated in plant growth and development. In this study, 69 AGPs are identified from the rice genome, including 13 classical AGPs, 15 arabinogalactan (AG) peptides, three non-classical AGPs, three early nodulin-like AGPs (eNod-like AGPs), eight non-specific lipid transfer protein-like AGPs (nsLTP-like AGPs), and 27 fasciclin-like AGPs (FLAs). The results from expressed sequence tags, microarrays, and massively parallel signature sequencing tags are used to analyse the expression of AGP-encoding genes, which is confirmed by real-time PCR. The results reveal that several rice AGP-encoding genes are predominantly expressed in anthers and display differential expression patterns in response to abscisic acid, gibberellic acid, and abiotic stresses. Based on the results obtained from this analysis, an attempt has been made to link the protein structures and expression patterns of rice AGP-encoding genes to their functions. Taken together, the genome-wide identification and expression analysis of the rice AGP gene family might facilitate further functional studies of rice AGPs. PMID:20423940

  4. Analysis of expressed sequence tags for Frankliniella occidentalis, the western flower thrips.

    PubMed

    Rotenberg, D; Whitfield, A E

    2010-08-01

    Thrips are members of the insect order Thysanoptera and Frankliniella occidentalis (the western flower thrips) is the most economically important pest within this order. F. occidentalis is both a direct pest of crops and an efficient vector of plant viruses, including Tomato spotted wilt virus (TSWV). Despite the world-wide importance of thrips in agriculture, there is little knowledge of the F. occidentalis genome or gene functions at this time. A normalized cDNA library was constructed from first instar thrips and 13 839 expressed sequence tags (ESTs) were obtained. Our EST data assembled into 894 contigs and 11 806 singletons (12 700 nonredundant sequences). We found that 31% of these sequences had significant similarity (E< or = 10(-10)) to protein sequences in the National Center for Biotechnology Information nonredundant (nr) protein database, and 25% were functionally annotated using Blast 2GO. We identified 74 sequences with putative homology to proteins associated with insect innate immunity. Sixteen sequences had significant similarity to proteins associated with small RNA-mediated gene silencing pathways (RNA interference; RNAi), including the antiviral pathway (short interfering RNA-mediated pathway). Our EST collection provides new sequence resources for characterizing gene functions in F. occidentalis and other thrips species with regards to vital biological processes, studying the mechanism of interactions with the viruses harboured and transmitted by the vector, and identifying new insect gene-centred targets for plant disease and insect control.

  5. Characterization of expressed sequence tag-derived simple sequence repeat markers for Aspergillus flavus: emphasis on variability of isolates from the southern United States.

    PubMed

    Wang, Xinwang; Wadl, Phillip A; Wood-Jones, Alicia; Windham, Gary; Trigiano, Robert N; Scruggs, Mary; Pilgrim, Candace; Baird, Richard

    2012-12-01

    Simple sequence repeat (SSR) markers were developed from Aspergillus flavus expressed sequence tag (EST) database to conduct an analysis of genetic relationships of Aspergillus isolates from numerous host species and geographical regions, but primarily from the United States. Twenty-nine primers were designed from 362 tri-nucleotide EST-SSR sequences. Eighteen polymorphic loci were used to genotype 96 Aspergillus species isolates. The number of alleles detected per locus ranged from 2 to 24 with a mean of 8.2 alleles. Haploid diversity ranged from 0.28 to 0.91. Genetic distance matrix was used to perform principal coordinates analysis (PCA) and to generate dendrograms using unweighted pair group method with arithmetic mean (UPGMA). Two principal coordinates explained more than 75 % of the total variation among the isolates. One clade was identified for A. flavus isolates (n = 87) with the other Aspergillus species (n = 7) using PCA, but five distinct clusters were present when the others taxa were excluded from the analysis. Six groups were noted when the EST-SSR data were compared using UPGMA. However, the latter PCA or UPGMA comparison resulted in no direct associations with host species, geographical region or aflatoxin production. Furthermore, there was no direct correlation to visible morphological features such as sclerotial types. The isolates from Mississippi Delta region, which contained the largest percentage of isolates, did not show any unusual clustering except for isolates K32, K55, and 199. Further studies of these three isolates are warranted to evaluate their pathogenicity, aflatoxin production potential, additional gene sequences (e.g., RPB2), and morphological comparisons.

  6. SuperSAGE analysis of the Nicotiana attenuata transcriptome after fatty acid-amino acid elicitation (FAC): identification of early mediators of insect responses

    PubMed Central

    2010-01-01

    Background Plants trigger and tailor defense responses after perception of the oral secretions (OS) of attacking specialist lepidopteran larvae. Fatty acid-amino acid conjugates (FACs) in the OS of the Manduca sexta larvae are necessary and sufficient to elicit the herbivory-specific responses in Nicotiana attenuata, an annual wild tobacco species. How FACs are perceived and activate signal transduction mechanisms is unknown. Results We used SuperSAGE combined with 454 sequencing to quantify the early transcriptional changes elicited by the FAC N-linolenoyl-glutamic acid (18:3-Glu) and virus induced gene silencing (VIGS) to examine the function of candidate genes in the M. sexta-N. attenuata interaction. The analysis targeted mRNAs encoding regulatory components: rare transcripts with very rapid FAC-elicited kinetics (increases within 60 and declines within 120 min). From 12,744 unique Tag sequences identified (UniTags), 430 and 117 were significantly up- and down-regulated ≥ 2.5-fold, respectively, after 18:3-Glu elicitation compared to wounding. Based on gene ontology classification, more than 25% of the annotated UniTags corresponded to putative regulatory components, including 30 transcriptional regulators and 22 protein kinases. Quantitative PCR analysis was used to analyze the FAC-dependent regulation of a subset of 27 of these UniTags and for most of them a rapid and transient induction was confirmed. Six FAC-regulated genes were functionally characterized by VIGS and two, a putative lipid phosphate phosphatase (LPP) and a protein of unknown function, were identified as important mediators of the M. sexta-N. attenuata interaction. Conclusions The analysis of the early changes in the transcriptome of N. attenuata after FAC elicitation using SuperSAGE/454 has identified regulatory genes involved in insect-specific mediated responses in plants. Moreover, it has provided a foundation for the identification of additional novel regulators associated with this process. PMID:20398280

  7. SuperSAGE analysis of the Nicotiana attenuata transcriptome after fatty acid-amino acid elicitation (FAC): identification of early mediators of insect responses.

    PubMed

    Gilardoni, Paola A; Schuck, Stefan; Jüngling, Ruth; Rotter, Björn; Baldwin, Ian T; Bonaventure, Gustavo

    2010-04-14

    Plants trigger and tailor defense responses after perception of the oral secretions (OS) of attacking specialist lepidopteran larvae. Fatty acid-amino acid conjugates (FACs) in the OS of the Manduca sexta larvae are necessary and sufficient to elicit the herbivory-specific responses in Nicotiana attenuata, an annual wild tobacco species. How FACs are perceived and activate signal transduction mechanisms is unknown. We used SuperSAGE combined with 454 sequencing to quantify the early transcriptional changes elicited by the FAC N-linolenoyl-glutamic acid (18:3-Glu) and virus induced gene silencing (VIGS) to examine the function of candidate genes in the M. sexta-N. attenuata interaction. The analysis targeted mRNAs encoding regulatory components: rare transcripts with very rapid FAC-elicited kinetics (increases within 60 and declines within 120 min). From 12,744 unique Tag sequences identified (UniTags), 430 and 117 were significantly up- and down-regulated >or= 2.5-fold, respectively, after 18:3-Glu elicitation compared to wounding. Based on gene ontology classification, more than 25% of the annotated UniTags corresponded to putative regulatory components, including 30 transcriptional regulators and 22 protein kinases. Quantitative PCR analysis was used to analyze the FAC-dependent regulation of a subset of 27 of these UniTags and for most of them a rapid and transient induction was confirmed. Six FAC-regulated genes were functionally characterized by VIGS and two, a putative lipid phosphate phosphatase (LPP) and a protein of unknown function, were identified as important mediators of the M. sexta-N. attenuata interaction. The analysis of the early changes in the transcriptome of N. attenuata after FAC elicitation using SuperSAGE/454 has identified regulatory genes involved in insect-specific mediated responses in plants. Moreover, it has provided a foundation for the identification of additional novel regulators associated with this process.

  8. Generation and analysis of a barcode-tagged insertion mutant library in the fission yeast Schizosaccharomyces pombe

    PubMed Central

    2012-01-01

    Background Barcodes are unique DNA sequence tags that can be used to specifically label individual mutants. The barcode-tagged open reading frame (ORF) haploid deletion mutant collections in the budding yeast Saccharomyces cerevisiae and the fission yeast Schizosaccharomyces pombe allow for high-throughput mutant phenotyping because the relative growth of mutants in a population can be determined by monitoring the proportions of their associated barcodes. While these mutant collections have greatly facilitated genome-wide studies, mutations in essential genes are not present, and the roles of these genes are not as easily studied. To further support genome-scale research in S. pombe, we generated a barcode-tagged fission yeast insertion mutant library that has the potential of generating viable mutations in both essential and non-essential genes and can be easily analyzed using standard molecular biological techniques. Results An insertion vector containing a selectable ura4+ marker and a random barcode was used to generate a collection of 10,000 fission yeast insertion mutants stored individually in 384-well plates and as six pools of mixed mutants. Individual barcodes are flanked by Sfi I recognition sites and can be oligomerized in a unique orientation to facilitate barcode sequencing. Independent genetic screens on a subset of mutants suggest that this library contains a diverse collection of single insertion mutations. We present several approaches to determine insertion sites. Conclusions This collection of S. pombe barcode-tagged insertion mutants is well-suited for genome-wide studies. Because insertion mutations may eliminate, reduce or alter the function of essential and non-essential genes, this library will contain strains with a wide range of phenotypes that can be assayed by their associated barcodes. The design of the barcodes in this library allows for barcode sequencing using next generation or standard benchtop cloning approaches. PMID:22554201

  9. Profiling cellular protein complexes by proximity ligation with dual tag microarray readout.

    PubMed

    Hammond, Maria; Nong, Rachel Yuan; Ericsson, Olle; Pardali, Katerina; Landegren, Ulf

    2012-01-01

    Patterns of protein interactions provide important insights in basic biology, and their analysis plays an increasing role in drug development and diagnostics of disease. We have established a scalable technique to compare two biological samples for the levels of all pairwise interactions among a set of targeted protein molecules. The technique is a combination of the proximity ligation assay with readout via dual tag microarrays. In the proximity ligation assay protein identities are encoded as DNA sequences by attaching DNA oligonucleotides to antibodies directed against the proteins of interest. Upon binding by pairs of antibodies to proteins present in the same molecular complexes, ligation reactions give rise to reporter DNA molecules that contain the combined sequence information from the two DNA strands. The ligation reactions also serve to incorporate a sample barcode in the reporter molecules to allow for direct comparison between pairs of samples. The samples are evaluated using a dual tag microarray where information is decoded, revealing which pairs of tags that have become joined. As a proof-of-concept we demonstrate that this approach can be used to detect a set of five proteins and their pairwise interactions both in cellular lysates and in fixed tissue culture cells. This paper provides a general strategy to analyze the extent of any pairwise interactions in large sets of molecules by decoding reporter DNA strands that identify the interacting molecules.

  10. Construction and Cloning of Reporter-Tagged Replicon cDNA for an In Vitro Replication Study of Murine Norovirus-1 (MNV-1)

    PubMed Central

    Ahmad, Muhammad Khairi; Tabana, Yasser M; Ahmed, Mowaffaq Adam; Sandai, Doblin Anak; Mohamed, Rafeezul; Ismail, Ida Shazrina; Zulkiflie, Nurulisa; Yunus, Muhammad Amir

    2017-01-01

    Background A norovirus maintains its viability, infectivity and virulence by its ability to replicate. However, the biological mechanisms of the process remain to be explored. In this work, the NanoLuc™ Luciferase gene was used to develop a reporter-tagged replicon system to study norovirus replication. Methods The NanoLuc™ Luciferase reporter protein was engineered to be expressed as a fusion protein for MNV-1 minor capsid protein, VP2. The foot-and-mouth disease virus 2A (FMDV2A) sequence was inserted between the 3′end of the reporter gene and the VP2 start sequence to allow co-translational ‘cleavage’ of fusion proteins during intracellular transcript expression. Amplification of the fusion gene was performed using a series of standard and overlapping polymerase chain reactions. The resulting amplicon was then cloned into three readily available backbones of MNV-1 cDNA clones. Results Restriction enzyme analysis indicated that the NanoLucTM Luciferase gene was successfully inserted into the parental MNV-1 cDNA clone. The insertion was further confirmed by using DNA sequencing. Conclusion NanoLuc™ Luciferase-tagged MNV-1 cDNA clones were successfully engineered. Such clones can be exploited to develop robust experimental assays for in vitro assessments of viral RNA replication. PMID:29379384

  11. A high-resolution whole genome radiation hybrid map of human chromosome 17q22-q25.3 across the genes for GH and TK

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Foster, J.W.; Schafer, A.J.; Critcher, R.

    1996-04-15

    We have constructed a whole genome radiation hybrid (WG-RH) map across a region of human chromosome 17q, from growth hormone (GH) to thymidine kinase (TK). A panel of 128 WG-RH hybrid cell lines generated by X-irradiation and fusion has been tested for the retention of 39 sequence-tagged site (STS) markers by the polymerase chain reaction. This genome mapping technique has allowed the integration of existing VNTR and microsatellite markers with additional new markers and existing STS markers previously mapped to this region by other means. The WG-RH map includes eight expressed sequence tag (EST) and three anonymous markers developed formore » this study, together with 23 anonymous microsatellites and five existing ESTs. Analysis of these data resulted in a high-density comprehensive map across this region of the genome. A subset of these markers has been used to produce a framework map consisting of 20 loci ordered with odds greater than 1000:1. The markers are of sufficient density to build a YAC contig across this region based on marker content. We have developed sequence tags for both ends of a 2.1-Mb YAC and mapped these using the WG-RH panel, allowing a direct comparison of cRay{sub 6000} to physical distance. 31 refs., 3 figs., 2 tabs.« less

  12. Structure-function analysis of diacylglycerol acyltransferase sequences from tung tree and 82 other Organisms

    USDA-ARS?s Scientific Manuscript database

    Diacylglycerol acyltransferase family (DGATs) catalyzes the final and rate-limiting step of triacylglycerol (TAG) biosynthesis in eukaryotic organisms. DGATs esterify sn-1,2-diacylglycerol with a long-chain fatty acyl-CoA. Understanding the roles of DGATs will help to create transgenic plants with v...

  13. Variant amino acid residues alter the enzyme activity of peanut type 2 Diacylglycerol Acyltransferases

    USDA-ARS?s Scientific Manuscript database

    Diacylglycerol acyltransferase (DGAT) catalyzes the final, rate-limiting step in triacylglycerol (TAG) biosynthesis via the acyl-CoA-dependent acylation of diacylglycerol. In this study, type-2 DGAT2 genes were cloned from eleven peanut cultivars. Sequence analysis revealed at least eight peanut D...

  14. Gene expression profiling of the plant pathogenic basidiomycetous fungus Rhizoctonia solani AG 4 reveals putative virulence factors

    USDA-ARS?s Scientific Manuscript database

    Rhizoctonia solani is a ubiquitous basidiomycetous soilborne fungal pathogen causing damping off of seedlings, aerial blights and postharvest diseases. To gain insight into the molecular mechanisms of pathogenesis a global approach based on analysis of expressed sequence tags (ESTs) was undertaken. ...

  15. SAGE Analysis of Transcriptome Responses in Arabidopsis Roots Exposed to 2,4,6-Trinitrotoluene1

    PubMed Central

    Ekman, Drew R.; Lorenz, W. Walter; Przybyla, Alan E.; Wolfe, N. Lee; Dean, Jeffrey F.D.

    2003-01-01

    Serial analysis of gene expression was used to profile transcript levels in Arabidopsis roots and assess their responses to 2,4,6-trinitrotoluene (TNT) exposure. SAGE libraries representing control and TNT-exposed seedling root transcripts were constructed, and each was sequenced to a depth of roughly 32,000 tags. More than 19,000 unique tags were identified overall. The second most highly induced tag (27-fold increase) represented a glutathione S-transferase. Cytochrome P450 enzymes, as well as an ABC transporter and a probable nitroreductase, were highly induced by TNT exposure. Analyses also revealed an oxidative stress response upon TNT exposure. Although some increases were anticipated in light of current models for xenobiotic metabolism in plants, evidence for unsuspected conjugation pathways was also noted. Identifying transcriptome-level responses to TNT exposure will better define the metabolic pathways plants use to detoxify this xenobiotic compound, which should help improve phytoremediation strategies directed at TNT and other nitroaromatic compounds. PMID:14551330

  16. Differences in Brain Transcriptomes of Closely Related Baikal Coregonid Species

    PubMed Central

    Bychenko, Oksana S.; Sukhanova, Lyubov V.; Azhikina, Tatyana L.; Skvortsov, Timofey A.; Belomestnykh, Tuyana V.; Sverdlov, Eugene D.

    2014-01-01

    The aim of this work was to get deeper insight into genetic factors involved in the adaptive divergence of closely related species, specifically two representatives of Baikal coregonids—Baikal whitefish (Coregonus baicalensis Dybowski) and Baikal omul (Coregonus migratorius Georgi)—that diverged from a common ancestor as recently as 10–20 thousand years ago. Using the Serial Analysis of Gene Expression method, we obtained libraries of short representative cDNA sequences (tags) from the brains of Baikal whitefish and omul. A comparative analysis of the libraries revealed quantitative differences among ~4% tags of the fishes under study. Based on the similarity of these tags with cDNA of known organisms, we identified candidate genes taking part in adaptive divergence. The most important candidate genes related to the adaptation of Baikal whitefish and Baikal omul, identified in this work, belong to the genes of cell metabolism, nervous and immune systems, protein synthesis, and regulatory genes as well as to DTSsa4 Tc1-like transposons which are widespread among fishes. PMID:24719892

  17. Sonication-based isolation and enrichment of Chlorella protothecoides chloroplasts for illumina genome sequencing

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Angelova, Angelina; Park, Sang-Hycuk; Kyndt, John

    2013-09-01

    With the increasing world demand for biofuel, a number of oleaginous algal species are being considered as renewable sources of oil. Chlorella protothecoides Krüger synthesizes triacylglycerols (TAGs) as storage compounds that can be converted into renewable fuel utilizing an anabolic pathway that is poorly understood. The paucity of algal chloroplast genome sequences has been an important constraint to chloroplast transformation and for studying gene expression in TAGs pathways. In this study, the intact chloroplasts were released from algal cells using sonication followed by sucrose gradient centrifugation, resulting in a 2.36-fold enrichment of chloroplasts from C. protothecoides, based on qPCR analysis.more » The C. protothecoides chloroplast genome (cpDNA) was determined using the Illumina HiSeq 2000 sequencing platform and found to be 84,576 Kb in size (8.57 Kb) in size, with a GC content of 30.8 %. This is the first report of an optimized protocol that uses a sonication step, followed by sucrose gradient centrifugation, to release and enrich intact chloroplasts from a microalga (C. prototheocoides) of sufficient quality to permit chloroplast genome sequencing with high coverage, while minimizing nuclear genome contamination. The approach is expected to guide chloroplast isolation from other oleaginous algal species for a variety of uses that benefit from enrichment of chloroplasts, ranging from biochemical analysis to genomics studies.« less

  18. Increasing ecological inference from high throughput sequencing of fungi in the environment through a tagging approach

    Treesearch

    D. Lee Taylor; Michael G. Booth; Jack W. McFarland; Ian C. Herriott; Niall J. Lennon; Chad Nusbaum; Thomas G. Marr

    2008-01-01

    High throughput sequencing methods are widely used in analyses of microbial diversity but are generally applied to small numbers of samples, which precludes charaterization of patterns of microbial diversity across space and time. We have designed a primer-tagging approach that allows pooling and subsequent sorting of numerous samples, which is directed to...

  19. SapTrap, a Toolkit for High-Throughput CRISPR/Cas9 Gene Modification in Caenorhabditis elegans.

    PubMed

    Schwartz, Matthew L; Jorgensen, Erik M

    2016-04-01

    In principle, clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9 allows genetic tags to be inserted at any locus. However, throughput is limited by the laborious construction of repair templates and guide RNA constructs and by the identification of modified strains. We have developed a reagent toolkit and plasmid assembly pipeline, called "SapTrap," that streamlines the production of targeting vectors for tag insertion, as well as the selection of modified Caenorhabditis elegans strains. SapTrap is a high-efficiency modular plasmid assembly pipeline that produces single plasmid targeting vectors, each of which encodes both a guide RNA transcript and a repair template for a particular tagging event. The plasmid is generated in a single tube by cutting modular components with the restriction enzyme SapI, which are then "trapped" in a fixed order by ligation to generate the targeting vector. A library of donor plasmids supplies a variety of protein tags, a selectable marker, and regulatory sequences that allow cell-specific tagging at either the N or the C termini. All site-specific sequences, such as guide RNA targeting sequences and homology arms, are supplied as annealed synthetic oligonucleotides, eliminating the need for PCR or molecular cloning during plasmid assembly. Each tag includes an embedded Cbr-unc-119 selectable marker that is positioned to allow concurrent expression of both the tag and the marker. We demonstrate that SapTrap targeting vectors direct insertion of 3- to 4-kb tags at six different loci in 10-37% of injected animals. Thus SapTrap vectors introduce the possibility for high-throughput generation of CRISPR/Cas9 genome modifications. Copyright © 2016 by the Genetics Society of America.

  20. De Novo Transcriptomic Analysis of an Oleaginous Microalga: Pathway Description and Gene Discovery for Production of Next-Generation Biofuels

    PubMed Central

    Wan, LingLin; Han, Juan; Sang, Min; Li, AiFen; Wu, Hong; Yin, ShunJi; Zhang, ChengWu

    2012-01-01

    Background Eustigmatos cf. polyphem is a yellow-green unicellular soil microalga belonging to the eustimatophyte with high biomass and considerable production of triacylglycerols (TAGs) for biofuels, which is thus referred to as an oleaginous microalga. The paucity of microalgae genome sequences, however, limits development of gene-based biofuel feedstock optimization studies. Here we describe the sequencing and de novo transcriptome assembly for a non-model microalgae species, E. cf. polyphem, and identify pathways and genes of importance related to biofuel production. Results We performed the de novo assembly of E. cf. polyphem transcriptome using Illumina paired-end sequencing technology. In a single run, we produced 29,199,432 sequencing reads corresponding to 2.33 Gb total nucleotides. These reads were assembled into 75,632 unigenes with a mean size of 503 bp and an N50 of 663 bp, ranging from 100 bp to >3,000 bp. Assembled unigenes were subjected to BLAST similarity searches and annotated with Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) orthology identifiers. These analyses identified the majority of carbohydrate, fatty acids, TAG and carotenoids biosynthesis and catabolism pathways in E. cf. polyphem. Conclusions Our data provides the construction of metabolic pathways involved in the biosynthesis and catabolism of carbohydrate, fatty acids, TAG and carotenoids in E. cf. polyphem and provides a foundation for the molecular genetics and functional genomics required to direct metabolic engineering efforts that seek to enhance the quantity and character of microalgae-based biofuel feedstock. PMID:22536352

  1. Re-Assembly and Analysis of an Ancient Variola Virus Genome.

    PubMed

    Smithson, Chad; Imbery, Jacob; Upton, Chris

    2017-09-08

    We report a major improvement to the assembly of published short read sequencing data from an ancient variola virus (VARV) genome by the removal of contig-capping sequencing tags and manual searches for gap-spanning reads. The new assembly, together with camelpox and taterapox genomes, permitted new dates to be calculated for the last common ancestor of all VARV genomes. The analysis of recently sequenced VARV-like cowpox virus genomes showed that single nucleotide polymorphisms (SNPs) and amino acid changes in the vaccinia virus (VACV)-Cop-O1L ortholog, predicted to be associated with VARV host specificity and virulence, were introduced into the lineage before the divergence of these viruses. A comparison of the ancient and modern VARV genome sequences also revealed a measurable drift towards adenine + thymine (A + T) richness.

  2. PET-Tool: a software suite for comprehensive processing and managing of Paired-End diTag (PET) sequence data.

    PubMed

    Chiu, Kuo Ping; Wong, Chee-Hong; Chen, Qiongyu; Ariyaratne, Pramila; Ooi, Hong Sain; Wei, Chia-Lin; Sung, Wing-Kin Ken; Ruan, Yijun

    2006-08-25

    We recently developed the Paired End diTag (PET) strategy for efficient characterization of mammalian transcriptomes and genomes. The paired end nature of short PET sequences derived from long DNA fragments raised a new set of bioinformatics challenges, including how to extract PETs from raw sequence reads, and correctly yet efficiently map PETs to reference genome sequences. To accommodate and streamline data analysis of the large volume PET sequences generated from each PET experiment, an automated PET data process pipeline is desirable. We designed an integrated computation program package, PET-Tool, to automatically process PET sequences and map them to the genome sequences. The Tool was implemented as a web-based application composed of four modules: the Extractor module for PET extraction; the Examiner module for analytic evaluation of PET sequence quality; the Mapper module for locating PET sequences in the genome sequences; and the Project Manager module for data organization. The performance of PET-Tool was evaluated through the analyses of 2.7 million PET sequences. It was demonstrated that PET-Tool is accurate and efficient in extracting PET sequences and removing artifacts from large volume dataset. Using optimized mapping criteria, over 70% of quality PET sequences were mapped specifically to the genome sequences. With a 2.4 GHz LINUX machine, it takes approximately six hours to process one million PETs from extraction to mapping. The speed, accuracy, and comprehensiveness have proved that PET-Tool is an important and useful component in PET experiments, and can be extended to accommodate other related analyses of paired-end sequences. The Tool also provides user-friendly functions for data quality check and system for multi-layer data management.

  3. Distinct profiles of expressed sequence tags during intestinal regeneration in the sea cucumber Holothuria glaberrima

    PubMed Central

    Rojas-Cartagena, Carmencita; Ortíz-Pineda, Pablo; Ramírez-Gómez, Francisco; Suárez-Castillo, Edna C.; Matos-Cruz, Vanessa; Rodríguez, Carlos; Ortíz-Zuazaga, Humberto; García-Arrarás, José E.

    2010-01-01

    Repair and regeneration are key processes for tissue maintenance, and their disruption may lead to disease states. Little is known about the molecular mechanisms that underline the repair and regeneration of the digestive tract. The sea cucumber Holothuria glaberrima represents an excellent model to dissect and characterize the molecular events during intestinal regeneration. To study the gene expression profile, cDNA libraries were constructed from normal, 3-day, and 7-day regenerating intestines of H. glaberrima. Clones were randomly sequenced and queried against the nonredundant protein database at the National Center for Biotechnology Information. RT-PCR analyses were made of several genes to determine their expression profile during intestinal regeneration. A total of 5,173 sequences from three cDNA libraries were obtained. About 46.2, 35.6, and 26.2% of the sequences for the normal, 3-days, and 7-days cDNA libraries, respectively, shared significant similarity with known sequences in the protein database of GenBank but only present 10% of similarity among them. Analysis of the libraries in terms of functional processes, protein domains, and most common sequences suggests that a differential expression profile is taking place during the regeneration process. Further examination of the expressed sequence tag dataset revealed that 12 putative genes are differentially expressed at significant level (R > 6). Experimental validation by RT-PCR analysis reveals that at least three genes (unknown C-4677-1, melanotransferrin, and centaurin) present a differential expression during regeneration. These findings strongly suggest that the gene expression profile varies among regeneration stages and provide evidence for the existence of differential gene expression. PMID:17579180

  4. Expressed Sequence Tag Analysis of the Human Pathogen Paracoccidioides brasiliensis Yeast Phase: Identification of Putative Homologues of Candida albicans Virulence and Pathogenicity Genes

    PubMed Central

    Goldman, Gustavo H.; dos Reis Marques, Everaldo; Custódio Duarte Ribeiro, Diógenes; Ângelo de Souza Bernardes, Luciano; Quiapin, Andréa Carla; Vitorelli, Patrícia Marostica; Savoldi, Marcela; Semighini, Camile P.; de Oliveira, Regina C.; Nunes, Luiz R.; Travassos, Luiz R.; Puccia, Rosana; Batista, Wagner L.; Ferreira, Leslie Ecker; Moreira, Júlio C.; Bogossian, Ana Paula; Tekaia, Fredj; Nobrega, Marina Pasetto; Nobrega, Francisco G.; Goldman, Maria Helena S.

    2003-01-01

    Paracoccidioides brasiliensis, a thermodimorphic fungus, is the causative agent of the prevalent systemic mycosis in Latin America, paracoccidioidomycosis. We present here a survey of expressed genes in the yeast pathogenic phase of P. brasiliensis. We obtained 13,490 expressed sequence tags from both 5′ and 3′ ends. Clustering analysis yielded the partial sequences of 4,692 expressed genes that were functionally classified by similarity to known genes. We have identified several Candida albicans virulence and pathogenicity homologues in P. brasiliensis. Furthermore, we have analyzed the expression of some of these genes during the dimorphic yeast-mycelium-yeast transition by real-time quantitative reverse transcription-PCR. Clustering analysis of the mycelium-yeast transition revealed three groups: (i) RBT, hydrophobin, and isocitrate lyase; (ii) malate dehydrogenase, contigs Pb1067 and Pb1145, GPI, and alternative oxidase; and (iii) ubiquitin, delta-9-desaturase, HSP70, HSP82, and HSP104. The first two groups displayed high mRNA expression in the mycelial phase, whereas the third group showed higher mRNA expression in the yeast phase. Our results suggest the possible conservation of pathogenicity and virulence mechanisms among fungi, expand considerably gene identification in P. brasiliensis, and provide a broader basis for further progress in understanding its biological peculiarities. PMID:12582121

  5. Metatranscriptomics of the marine sponge Geodia barretti: tackling phylogeny and function of its microbial community.

    PubMed

    Radax, Regina; Rattei, Thomas; Lanzen, Anders; Bayer, Christoph; Rapp, Hans Tore; Urich, Tim; Schleper, Christa

    2012-05-01

    Geodia barretti is a marine cold-water sponge harbouring high numbers of microorganisms. Significant rates of nitrification have been observed in this sponge, indicating a substantial contribution to nitrogen turnover in marine environments with high sponge cover. In order to get closer insights into the phylogeny and function of the active microbial community and the interaction with its host G. barretti, a metatranscriptomic approach was employed, using the simultaneous analysis of rRNA and mRNA. Of the 262 298 RNA-tags obtained by pyrosequencing, 92% were assigned to ribosomal RNA (ribo-tags). A total of 109 325 SSU rRNA ribo-tags revealed a detailed picture of the community, dominated by group SAR202 of Chloroflexi, candidate phylum Poribacteria and Acidobacteria, which was different in its composition from that obtained in clone libraries prepared form the same samples. Optimized assembly strategies allowed the reconstruction of full-length rRNA sequences from the short ribo-tags for more detailed phylogenetic studies of the dominant taxa. Cells of several phyla were visualized by FISH analyses for confirmation. Of the remaining 21 325 RNA-tags, 10 023 were assigned to mRNA-tags, based on similarities to genes in the databases. A wide range of putative functional gene transcripts from over 10 different phyla were identified among the bacterial mRNA-tags. The most abundant mRNAs were those encoding key metabolic enzymes of nitrification from ammonia-oxidizing archaea as well as candidate genes involved in related processes. Our analysis demonstrates the potential and limits of using a combined rRNA and mRNA approach to explore the microbial community profile, phylogenetic assignments and metabolic activities of a complex, but little explored microbial community. © 2012 Society for Applied Microbiology and Blackwell Publishing Ltd.

  6. Difficulties in Generating Specific Antibodies for Immunohistochemical Detection of Nitrosylated Tubulins

    PubMed Central

    Kamnev, Anton; Muhar, Matthias; Preinreich, Martina; Ammer, Hermann; Propst, Friedrich

    2013-01-01

    Protein S-nitrosylation, the covalent attachment of a nitroso moiety to thiol groups of specific cysteine residues, is one of the major pathways of nitric oxide signaling. Hundreds of proteins are subject to this transient post-translational modification and for some the functional consequences have been identified. Biochemical assays for the analysis of protein S-nitrosylation have been established and can be used to study if and under what conditions a given protein is S-nitrosylated. In contrast, the equally desirable subcellular localization of specific S-nitrosylated protein isoforms has not been achieved to date. In the current study we attempted to specifically localize S-nitrosylated α- and β-tubulin isoforms in primary neurons after fixation. The approach was based on in situ replacement of the labile cysteine nitroso modification with a stable tag and the subsequent use of antibodies which recognize the tag in the context of the tubulin polypeptide sequence flanking the cysteine residue of interest. We established a procedure for tagging S-nitrosylated proteins in cultured primary neurons and obtained polyclonal anti-tag antibodies capable of specifically detecting tagged proteins on immunoblots and in fixed cells. However, the antibodies were not specific for tubulin isoforms. We suggest that different tagging strategies or alternative methods such as fluorescence resonance energy transfer techniques might be more successful. PMID:23840827

  7. A scalable strategy for high-throughput GFP tagging of endogenous human proteins.

    PubMed

    Leonetti, Manuel D; Sekine, Sayaka; Kamiyama, Daichi; Weissman, Jonathan S; Huang, Bo

    2016-06-21

    A central challenge of the postgenomic era is to comprehensively characterize the cellular role of the ∼20,000 proteins encoded in the human genome. To systematically study protein function in a native cellular background, libraries of human cell lines expressing proteins tagged with a functional sequence at their endogenous loci would be very valuable. Here, using electroporation of Cas9 nuclease/single-guide RNA ribonucleoproteins and taking advantage of a split-GFP system, we describe a scalable method for the robust, scarless, and specific tagging of endogenous human genes with GFP. Our approach requires no molecular cloning and allows a large number of cell lines to be processed in parallel. We demonstrate the scalability of our method by targeting 48 human genes and show that the resulting GFP fluorescence correlates with protein expression levels. We next present how our protocols can be easily adapted for the tagging of a given target with GFP repeats, critically enabling the study of low-abundance proteins. Finally, we show that our GFP tagging approach allows the biochemical isolation of native protein complexes for proteomic studies. Taken together, our results pave the way for the large-scale generation of endogenously tagged human cell lines for the proteome-wide analysis of protein localization and interaction networks in a native cellular context.

  8. Sampling and pyrosequencing methods for characterizing bacterial communities in the human gut using 16S sequence tags.

    PubMed

    Wu, Gary D; Lewis, James D; Hoffmann, Christian; Chen, Ying-Yu; Knight, Rob; Bittinger, Kyle; Hwang, Jennifer; Chen, Jun; Berkowsky, Ronald; Nessel, Lisa; Li, Hongzhe; Bushman, Frederic D

    2010-07-30

    Intense interest centers on the role of the human gut microbiome in health and disease, but optimal methods for analysis are still under development. Here we present a study of methods for surveying bacterial communities in human feces using 454/Roche pyrosequencing of 16S rRNA gene tags. We analyzed fecal samples from 10 individuals and compared methods for storage, DNA purification and sequence acquisition. To assess reproducibility, we compared samples one cm apart on a single stool specimen for each individual. To analyze storage methods, we compared 1) immediate freezing at -80 degrees C, 2) storage on ice for 24 or 3) 48 hours. For DNA purification methods, we tested three commercial kits and bead beating in hot phenol. Variations due to the different methodologies were compared to variation among individuals using two approaches--one based on presence-absence information for bacterial taxa (unweighted UniFrac) and the other taking into account their relative abundance (weighted UniFrac). In the unweighted analysis relatively little variation was associated with the different analytical procedures, and variation between individuals predominated. In the weighted analysis considerable variation was associated with the purification methods. Particularly notable was improved recovery of Firmicutes sequences using the hot phenol method. We also carried out surveys of the effects of different 454 sequencing methods (FLX versus Titanium) and amplification of different 16S rRNA variable gene segments. Based on our findings we present recommendations for protocols to collect, process and sequence bacterial 16S rDNA from fecal samples--some major points are 1) if feasible, bead-beating in hot phenol or use of the PSP kit improves recovery; 2) storage methods can be adjusted based on experimental convenience; 3) unweighted (presence-absence) comparisons are less affected by lysis method.

  9. Linear reduction methods for tag SNP selection.

    PubMed

    He, Jingwu; Zelikovsky, Alex

    2004-01-01

    It is widely hoped that constructing a complete human haplotype map will help to associate complex diseases with certain SNP's. Unfortunately, the number of SNP's is huge and it is very costly to sequence many individuals. Therefore, it is desirable to reduce the number of SNP's that should be sequenced to considerably small number of informative representatives, so called tag SNP's. In this paper, we propose a new linear algebra based method for selecting and using tag SNP's. Our method is purely combinatorial and can be combined with linkage disequilibrium (LD) and block based methods. We measure the quality of our tag SNP selection algorithm by comparing actual SNP's with SNP's linearly predicted from linearly chosen tag SNP's. We obtain an extremely good compression and prediction rates. For example, for long haplotypes (>25000 SNP's), knowing only 0.4% of all SNP's we predict the entire unknown haplotype with 2% accuracy while the prediction method is based on a 10% sample of the population.

  10. Too much data, but little inter-changeability: a lesson learned from mining public data on tissue specificity of gene expression.

    PubMed

    Li, Shuyu; Li, Yiqun Helen; Wei, Tao; Su, Eric Wen; Duffin, Kevin; Liao, Birong

    2006-10-25

    The tissue expression pattern of a gene often provides an important clue to its potential role in a biological process. A vast amount of gene expression data have been and are being accumulated in public repository through different technology platforms. However, exploitations of these rich data sources remain limited in part due to issues of technology standardization. Our objective is to test the data comparability between SAGE and microarray technologies, through examining the expression pattern of genes under normal physiological states across variety of tissues. There are 42-54% of genes showing significant correlations in tissue expression patterns between SAGE and GeneChip, with 30-40% of genes whose expression patterns are positively correlated and 10-15% of genes whose expression patterns are negatively correlated at a statistically significant level (p = 0.05). Our analysis suggests that the discrepancy on the expression patterns derived from technology platforms is not likely from the heterogeneity of tissues used in these technologies, or other spurious correlations resulting from microarray probe design, abundance of genes, or gene function. The discrepancy can be partially explained by errors in the original assignment of SAGE tags to genes due to the evolution of sequence databases. In addition, sequence analysis has indicated that many SAGE tags and Affymetrix array probe sets are mapped to different splice variants or different sequence regions although they represent the same gene, which also contributes to the observed discrepancies between SAGE and array expression data. To our knowledge, this is the first report attempting to mine gene expression patterns across tissues using public data from different technology platforms. Unlike previous similar studies that only demonstrated the discrepancies between the two gene expression platforms, we carried out in-depth analysis to further investigate the cause for such discrepancies. Our study shows that the exploitation of rich public expression resource requires extensive knowledge about the technologies, and experiment. Informatic methodologies for better interoperability among platforms still remain a gap. One of the areas that can be improved practically is the accurate sequence mapping of SAGE tags and array probes to full-length genes.

  11. A SAGE based approach to human glomerular endothelium: defining the transcriptome, finding a novel molecule and highlighting endothelial diversity.

    PubMed

    Sengoelge, Guerkan; Winnicki, Wolfgang; Kupczok, Anne; von Haeseler, Arndt; Schuster, Michael; Pfaller, Walter; Jennings, Paul; Weltermann, Ansgar; Blake, Sophia; Sunder-Plassmann, Gere

    2014-08-27

    Large scale transcript analysis of human glomerular microvascular endothelial cells (HGMEC) has never been accomplished. We designed this study to define the transcriptome of HGMEC and facilitate a better characterization of these endothelial cells with unique features. Serial analysis of gene expression (SAGE) was used for its unbiased approach to quantitative acquisition of transcripts. We generated a HGMEC SAGE library consisting of 68,987 transcript tags. Then taking advantage of large public databases and advanced bioinformatics we compared the HGMEC SAGE library with a SAGE library of non-cultured ex vivo human glomeruli (44,334 tags) which contained endothelial cells. The 823 tags common to both which would have the potential to be expressed in vivo were subsequently checked against 822,008 tags from 16 non-glomerular endothelial SAGE libraries. This resulted in 268 transcript tags differentially overexpressed in HGMEC compared to non-glomerular endothelia. These tags were filtered using a set of criteria: never before shown in kidney or any type of endothelial cell, absent in all nephron regions except the glomerulus, more highly expressed than statistically expected in HGMEC. Neurogranin, a direct target of thyroid hormone action which had been thought to be brain specific and never shown in endothelial cells before, fulfilled these criteria. Its expression in glomerular endothelium in vitro and in vivo was then verified by real-time-PCR, sequencing and immunohistochemistry. Our results represent an extensive molecular characterization of HGMEC beyond a mere database, underline the endothelial heterogeneity, and propose neurogranin as a potential link in the kidney-thyroid axis.

  12. DeepSAGE Based Differential Gene Expression Analysis under Cold and Freeze Stress in Seabuckthorn (Hippophae rhamnoides L.)

    PubMed Central

    Chaudhary, Saurabh; Sharma, Prakash C.

    2015-01-01

    Seabuckthorn (Hippophae rhamnoides L.), an important plant species of Indian Himalayas, is well known for its immense medicinal and nutritional value. The plant has the ability to sustain growth in harsh environments of extreme temperatures, drought and salinity. We employed DeepSAGE, a tag based approach, to identify differentially expressed genes under cold and freeze stress in seabuckthorn. In total 36.2 million raw tags including 13.9 million distinct tags were generated using Illumina sequencing platform for three leaf tissue libraries including control (CON), cold stress (CS) and freeze stress (FS). After discarding low quality tags, 35.5 million clean tags including 7 million distinct clean tags were obtained. In all, 11922 differentially expressed genes (DEGs) including 6539 up regulated and 5383 down regulated genes were identified in three comparative setups i.e. CON vs CS, CON vs FS and CS vs FS. Gene ontology and KEGG pathway analysis were performed to assign gene ontology term to DEGs and ascertain their biological functions. DEGs were mapped back to our existing seabuckthorn transcriptome assembly comprising of 88,297 putative unigenes leading to the identification of 428 cold and freeze stress responsive genes. Expression of randomly selected 22 DEGs was validated using qRT-PCR that further supported our DeepSAGE results. The present study provided a comprehensive view of global gene expression profile of seabuckthorn under cold and freeze stresses. The DeepSAGE data could also serve as a valuable resource for further functional genomics studies aiming selection of candidate genes for development of abiotic stress tolerant transgenic plants. PMID:25803684

  13. DeepSAGE based differential gene expression analysis under cold and freeze stress in seabuckthorn (Hippophae rhamnoides L.).

    PubMed

    Chaudhary, Saurabh; Sharma, Prakash C

    2015-01-01

    Seabuckthorn (Hippophae rhamnoides L.), an important plant species of Indian Himalayas, is well known for its immense medicinal and nutritional value. The plant has the ability to sustain growth in harsh environments of extreme temperatures, drought and salinity. We employed DeepSAGE, a tag based approach, to identify differentially expressed genes under cold and freeze stress in seabuckthorn. In total 36.2 million raw tags including 13.9 million distinct tags were generated using Illumina sequencing platform for three leaf tissue libraries including control (CON), cold stress (CS) and freeze stress (FS). After discarding low quality tags, 35.5 million clean tags including 7 million distinct clean tags were obtained. In all, 11922 differentially expressed genes (DEGs) including 6539 up regulated and 5383 down regulated genes were identified in three comparative setups i.e. CON vs CS, CON vs FS and CS vs FS. Gene ontology and KEGG pathway analysis were performed to assign gene ontology term to DEGs and ascertain their biological functions. DEGs were mapped back to our existing seabuckthorn transcriptome assembly comprising of 88,297 putative unigenes leading to the identification of 428 cold and freeze stress responsive genes. Expression of randomly selected 22 DEGs was validated using qRT-PCR that further supported our DeepSAGE results. The present study provided a comprehensive view of global gene expression profile of seabuckthorn under cold and freeze stresses. The DeepSAGE data could also serve as a valuable resource for further functional genomics studies aiming selection of candidate genes for development of abiotic stress tolerant transgenic plants.

  14. Selection of mRNA 5'-untranslated region sequence with high translation efficiency through ribosome display

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mie, Masayasu; Shimizu, Shun; Takahashi, Fumio

    2008-08-15

    The 5'-untranslated region (5'-UTR) of mRNAs functions as a translation enhancer, promoting translation efficiency. Many in vitro translation systems exhibit a reduced efficiency in protein translation due to decreased translation initiation. The use of a 5'-UTR sequence with high translation efficiency greatly enhances protein production in these systems. In this study, we have developed an in vitro selection system that favors 5'-UTRs with high translation efficiency using a ribosome display technique. A 5'-UTR random library, comprised of 5'-UTRs tagged with a His-tag and Renilla luciferase (R-luc) fusion, were in vitro translated in rabbit reticulocytes. By limiting the translation period, onlymore » mRNAs with high translation efficiency were translated. During translation, mRNA, ribosome and translated R-luc with His-tag formed ternary complexes. They were collected with translated His-tag using Ni-particles. Extracted mRNA from ternary complex was amplified using RT-PCR and sequenced. Finally, 5'-UTR with high translation efficiency was obtained from random 5'-UTR library.« less

  15. Re-evaluating microglia expression profiles using RiboTag and cell isolation strategies.

    PubMed

    Haimon, Zhana; Volaski, Alon; Orthgiess, Johannes; Boura-Halfon, Sigalit; Varol, Diana; Shemer, Anat; Yona, Simon; Zuckerman, Binyamin; David, Eyal; Chappell-Maor, Louise; Bechmann, Ingo; Gericke, Martin; Ulitsky, Igor; Jung, Steffen

    2018-06-01

    Transcriptome profiling is widely used to infer functional states of specific cell types, as well as their responses to stimuli, to define contributions to physiology and pathophysiology. Focusing on microglia, the brain's macrophages, we report here a side-by-side comparison of classical cell-sorting-based transcriptome sequencing and the 'RiboTag' method, which avoids cell retrieval from tissue context and yields translatome sequencing information. Conventional whole-cell microglial transcriptomes were found to be significantly tainted by artifacts introduced by tissue dissociation, cargo contamination and transcripts sequestered from ribosomes. Conversely, our data highlight the added value of RiboTag profiling for assessing the lineage accuracy of Cre recombinase expression in transgenic mice. Collectively, this study indicates method-based biases, reveals observer effects and establishes RiboTag-based translatome profiling as a valuable complement to standard sorting-based profiling strategies.

  16. Mining candidate genes associated with powdery mildew resistance in cucumber via super-BSA by specific length amplified fragment (SLAF) sequencing.

    PubMed

    Zhang, Peng; Zhu, Yuqiang; Wang, Lili; Chen, Liping; Zhou, Shengjun

    2015-12-14

    Powdery mildew (PM) is the most common fungal disease of cucumber and other cucurbit crops, while breeding the PM-resistant materials is the effective way to defense this disease, and the recent development of modern genetics and genomics make us aware of that studying the resistance genes is the essential way to breed the PM high-resistance plant. With the ever increasing throughput of next-generation sequencing (NGS), the development of specific length amplified fragment sequencing (SLAF-seq) as a high-resolution strategy for large-scale de novo SNP discovery is gradually applied for functional gene mining. Here we combined the bulked segregant analysis (BSA) with SLAF-seq to identify candidate genes associated with PM resistance in cucumber. A segregating population comprising 251 F2 individuals was developed using H136 (female parent) as susceptible parent and BK2 (male parent) as resistance donor. After PMR test, total genomic DNA was prepared from each plant. Systemic genomic analysis of the GC content, repeat sequence, etc. was carried out by prediction software SLAF_Predict to establish condition to ensure the uniformity and density of the molecular markers. After samples were gel purified, SLAFs were generated at Biomarker Technologies Corporation in Beijing. Based on SLAF tags and the PMR test result, the hot region were annotated. A total of 73,100 high-quality SLAF tags with an average depth of 99.11× were sequenced. Among these, 5,355 polymorphic tags were identified with a polymorphism rate of 7.34 %, including 7.09 % SNPs and other polymorphism types. Finally, 140 associated SLAFs were identified, and two main Hot Regions were detected on chromosome 1 and 6, which contained five genes invovled in defense response, toxin metabolism, cell stress response, and injury response in cucumber. Associated markers identified by super-BSA in this study, could not only speed up the study of the PMR genes, but also provide a feasible solution for breeding the marker-assisted PMR cucumber. Moreover, this study could also be extended to any other species with reference genome.

  17. Interactive Block Games for Assessing Children's Cognitive Skills: Design and Preliminary Evaluation.

    PubMed

    Lee, Kiju; Jeong, Donghwa; Schindler, Rachael C; Hlavaty, Laura E; Gross, Susan I; Short, Elizabeth J

    2018-01-01

    Background: This paper presents design and results from preliminary evaluation of Tangible Geometric Games (TAG-Games) for cognitive assessment in young children. The TAG-Games technology employs a set of sensor-integrated cube blocks, called SIG-Blocks, and graphical user interfaces for test administration and real-time performance monitoring. TAG-Games were administered to children from 4 to 8 years of age for evaluating preliminary efficacy of this new technology-based approach. Methods: Five different sets of SIG-Blocks comprised of geometric shapes, segmented human faces, segmented animal faces, emoticons, and colors, were used for three types of TAG-Games, including Assembly, Shape Matching, and Sequence Memory. Computational task difficulty measures were defined for each game and used to generate items with varying difficulty. For preliminary evaluation, TAG-Games were tested on 40 children. To explore the clinical utility of the information assessed by TAG-Games, three subtests of the age-appropriate Wechsler tests (i.e., Block Design, Matrix Reasoning, and Picture Concept) were also administered. Results: Internal consistency of TAG-Games was evaluated by the split-half reliability test. Weak to moderate correlations between Assembly and Block Design, Shape Matching and Matrix Reasoning, and Sequence Memory and Picture Concept were found. The computational measure of task complexity for each TAG-Game showed a significant correlation with participants' performance. In addition, age-correlations on TAG-Game scores were found, implying its potential use for assessing children's cognitive skills autonomously.

  18. Interactive Block Games for Assessing Children's Cognitive Skills: Design and Preliminary Evaluation

    PubMed Central

    Lee, Kiju; Jeong, Donghwa; Schindler, Rachael C.; Hlavaty, Laura E.; Gross, Susan I.; Short, Elizabeth J.

    2018-01-01

    Background: This paper presents design and results from preliminary evaluation of Tangible Geometric Games (TAG-Games) for cognitive assessment in young children. The TAG-Games technology employs a set of sensor-integrated cube blocks, called SIG-Blocks, and graphical user interfaces for test administration and real-time performance monitoring. TAG-Games were administered to children from 4 to 8 years of age for evaluating preliminary efficacy of this new technology-based approach. Methods: Five different sets of SIG-Blocks comprised of geometric shapes, segmented human faces, segmented animal faces, emoticons, and colors, were used for three types of TAG-Games, including Assembly, Shape Matching, and Sequence Memory. Computational task difficulty measures were defined for each game and used to generate items with varying difficulty. For preliminary evaluation, TAG-Games were tested on 40 children. To explore the clinical utility of the information assessed by TAG-Games, three subtests of the age-appropriate Wechsler tests (i.e., Block Design, Matrix Reasoning, and Picture Concept) were also administered. Results: Internal consistency of TAG-Games was evaluated by the split-half reliability test. Weak to moderate correlations between Assembly and Block Design, Shape Matching and Matrix Reasoning, and Sequence Memory and Picture Concept were found. The computational measure of task complexity for each TAG-Game showed a significant correlation with participants' performance. In addition, age-correlations on TAG-Game scores were found, implying its potential use for assessing children's cognitive skills autonomously. PMID:29868520

  19. Optimal use of tandem biotin and V5 tags in ChIP assays

    PubMed Central

    Kolodziej, Katarzyna E; Pourfarzad, Farzin; de Boer, Ernie; Krpic, Sanja; Grosveld, Frank; Strouboulis, John

    2009-01-01

    Background Chromatin immunoprecipitation (ChIP) assays coupled to genome arrays (Chip-on-chip) or massive parallel sequencing (ChIP-seq) lead to the genome wide identification of binding sites of chromatin associated proteins. However, the highly variable quality of antibodies and the availability of epitopes in crosslinked chromatin can compromise genomic ChIP outcomes. Epitope tags have often been used as more reliable alternatives. In addition, we have employed protein in vivo biotinylation tagging as a very high affinity alternative to antibodies. In this paper we describe the optimization of biotinylation tagging for ChIP and its coupling to a known epitope tag in providing a reliable and efficient alternative to antibodies. Results Using the biotin tagged erythroid transcription factor GATA-1 as example, we describe several optimization steps for the application of the high affinity biotin streptavidin system in ChIP. We find that the omission of SDS during sonication, the use of fish skin gelatin as blocking agent and choice of streptavidin beads can lead to significantly improved ChIP enrichments and lower background compared to antibodies. We also show that the V5 epitope tag performs equally well under the conditions worked out for streptavidin ChIP and that it may suffer less from the effects of formaldehyde crosslinking. Conclusion The combined use of the very high affinity biotin tag with the less sensitive to crosslinking V5 tag provides for a flexible ChIP platform with potential implications in ChIP sequencing outcomes. PMID:19196479

  20. Large-scale transcriptome analysis in chickpea (Cicer arietinum L.), an orphan legume crop of the semi-arid tropics of Asia and Africa.

    PubMed

    Hiremath, Pavana J; Farmer, Andrew; Cannon, Steven B; Woodward, Jimmy; Kudapa, Himabindu; Tuteja, Reetu; Kumar, Ashish; Bhanuprakash, Amindala; Mulaosmanovic, Benjamin; Gujaria, Neha; Krishnamurthy, Laxmanan; Gaur, Pooran M; Kavikishor, Polavarapu B; Shah, Trushar; Srinivasan, Ramamurthy; Lohse, Marc; Xiao, Yongli; Town, Christopher D; Cook, Douglas R; May, Gregory D; Varshney, Rajeev K

    2011-10-01

    Chickpea (Cicer arietinum L.) is an important legume crop in the semi-arid regions of Asia and Africa. Gains in crop productivity have been low however, particularly because of biotic and abiotic stresses. To help enhance crop productivity using molecular breeding techniques, next generation sequencing technologies such as Roche/454 and Illumina/Solexa were used to determine the sequence of most gene transcripts and to identify drought-responsive genes and gene-based molecular markers. A total of 103,215 tentative unique sequences (TUSs) have been produced from 435,018 Roche/454 reads and 21,491 Sanger expressed sequence tags (ESTs). Putative functions were determined for 49,437 (47.8%) of the TUSs, and gene ontology assignments were determined for 20,634 (41.7%) of the TUSs. Comparison of the chickpea TUSs with the Medicago truncatula genome assembly (Mt 3.5.1 build) resulted in 42,141 aligned TUSs with putative gene structures (including 39,281 predicted intron/splice junctions). Alignment of ∼37 million Illumina/Solexa tags generated from drought-challenged root tissues of two chickpea genotypes against the TUSs identified 44,639 differentially expressed TUSs. The TUSs were also used to identify a diverse set of markers, including 728 simple sequence repeats (SSRs), 495 single nucleotide polymorphisms (SNPs), 387 conserved orthologous sequence (COS) markers, and 2088 intron-spanning region (ISR) markers. This resource will be useful for basic and applied research for genome analysis and crop improvement in chickpea. Plant Biotechnology Journal © 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd. No claim to original US government works.

  1. Generation and validation of homozygous fluorescent knock-in cells using CRISPR-Cas9 genome editing.

    PubMed

    Koch, Birgit; Nijmeijer, Bianca; Kueblbeck, Moritz; Cai, Yin; Walther, Nike; Ellenberg, Jan

    2018-06-01

    Gene tagging with fluorescent proteins is essential for investigations of the dynamic properties of cellular proteins. CRISPR-Cas9 technology is a powerful tool for inserting fluorescent markers into all alleles of the gene of interest (GOI) and allows functionality and physiological expression of the fusion protein. It is essential to evaluate such genome-edited cell lines carefully in order to preclude off-target effects caused by (i) incorrect insertion of the fluorescent protein, (ii) perturbation of the fusion protein by the fluorescent proteins or (iii) nonspecific genomic DNA damage by CRISPR-Cas9. In this protocol, we provide a step-by-step description of our systematic pipeline to generate and validate homozygous fluorescent knock-in cell lines.We have used the paired Cas9D10A nickase approach to efficiently insert tags into specific genomic loci via homology-directed repair (HDR) with minimal off-target effects. It is time-consuming and costly to perform whole-genome sequencing of each cell clone to check for spontaneous genetic variations occurring in mammalian cell lines. Therefore, we have developed an efficient validation pipeline of the generated cell lines consisting of junction PCR, Southern blotting analysis, Sanger sequencing, microscopy, western blotting analysis and live-cell imaging for cell-cycle dynamics. This protocol takes between 6 and 9 weeks. With this protocol, up to 70% of the targeted genes can be tagged homozygously with fluorescent proteins, thus resulting in physiological levels and phenotypically functional expression of the fusion proteins.

  2. Generation and Analysis of the Expressed Sequence Tags from the Mycelium of Ganoderma lucidum

    PubMed Central

    Huang, Yen-Hua; Wu, Hung-Yi; Wu, Keh-Ming; Liu, Tze-Tze; Liou, Ruey-Fen; Tsai, Shih-Feng; Shiao, Ming-Shi; Ho, Low-Tone; Tzean, Shean-Shong; Yang, Ueng-Cheng

    2013-01-01

    Ganoderma lucidum (G. lucidum) is a medicinal mushroom renowned in East Asia for its potential biological effects. To enable a systematic exploration of the genes associated with the various phenotypes of the fungus, the genome consortium of G. lucidum has carried out an expressed sequence tag (EST) sequencing project. Using a Sanger sequencing based approach, 47,285 ESTs were obtained from in vitro cultures of G. lucidum mycelium of various durations. These ESTs were further clustered and merged into 7,774 non-redundant expressed loci. The features of these expressed contigs were explored in terms of over-representation, alternative splicing, and natural antisense transcripts. Our results provide an invaluable information resource for exploring the G. lucidum transcriptome and its regulation. Many cases of the genes over-represented in fast-growing dikaryotic mycelium are closely related to growth, such as cell wall and bioactive compound synthesis. In addition, the EST-genome alignments containing putative cassette exons and retained introns were manually curated and then used to make inferences about the predominating splice-site recognition mechanism of G. lucidum. Moreover, a number of putative antisense transcripts have been pinpointed, from which we noticed that two cases are likely to reveal hitherto undiscovered biological pathways. To allow users to access the data and the initial analysis of the results of this project, a dedicated web site has been created at http://csb2.ym.edu.tw/est/. PMID:23658685

  3. Expressed sequence tag based identification and expression analysis of some cold inducible elements in seabuckthorn (Hippophae rhamnoides L.).

    PubMed

    Ghangal, Rajesh; Raghuvanshi, Saurabh; Sharma, Prakash C

    2012-02-01

    A cDNA library was constructed from the mature leaves of seabuckthorn (Hippophae rhamnoides). Expressed Sequence Tags (ESTs) were generated by single pass sequencing of 4500 cDNA clones. We submitted 3412 ESTs to dbEST of NCBI. Clustering of these ESTs yielded 1665 unigenes comprising of 345 contigs and 1320 singletons. Out of 1665 unigenes, 1278 unigenes were annotated by similarity search while the remaining 387 unannotated unigenes were considered as organism specific. Gene Ontology (GO) analysis of the unigene dataset showed 691 unigenes related to biological processes, 727 to molecular functions and 588 to cellular component category. On the basis of similarity search and GO annotation, 43 unigenes were found responsive to biotic and abiotic stresses. To validate this observation, 13 genes that are known to be associated with cold stress tolerance from previous studies in Arabidopsis and 3 novel transcripts were examined by Real time RT-PCR to understand the change in expression pattern under cold/freeze stress. In silico study of occurrence of microsatellites in these ESTs revealed the presence of 62 Simple Sequence Repeats (SSRs), some of which are being explored to assess genetic diversity among seabuckthorn collections. This is the first report of generation of transcriptome data providing information about genes involved in managing plant abiotic stress in seabuckthorn, a plant known for its enormous medicinal and ecological value. Copyright © 2011 Elsevier Masson SAS. All rights reserved.

  4. Expressed sequence tags (ESTs) and phylogenetic analysis of floral genes from a paleoherb species, Asarum caudigerum.

    PubMed

    Zhao, Yinhe; Wang, Guoying; Zhang, Jinpeng; Yang, Junbo; Peng, Shang; Gao, Lianming; Li, Chengyun; Hu, Jinyong; Li, Dezhu; Gao, Lizhi

    2006-07-01

    Asarum caudigerum (Aristolochiaceae) is an important species of paleoherb in relation to understanding the origin and evolution of angiosperm flowers, due to its basal position in the angiosperms. The aim of this study was to isolate floral-related genes from A. caudigerum, and to infer evolutionary relationships among florally expression-related genes, to further illustrate the origin and diversification of flowers in angiosperms. A subtracted floral cDNA library was constructed from floral buds using suppression subtractive hybridization (SSH). The cDNA of floral buds and leaves at the seedling stage were used as a tester and a driver, respectively. To further identify the function of putative MADS-box transcription factors, phylogenetic trees were reconstructed in order to infer evolutionary relationships within the MADS-box gene family. In the forward-subtracted floral cDNA library, 1920 clones were randomly sequenced, from which 567 unique expressed sequence tags (ESTs) were obtained. Among them, 127 genes failed to show significant similarity to any published sequences in GenBank and thus are putatively novel genes. Phylogenetic analysis indicated that a total of 29 MADS-box transcription factors were members of the APETALA3(AP3) subfamily, while nine others were putative MADS-box transcription factors that formed a cluster with MADS-box genes isolated from Amborella, the basal-most angiosperm, and those from the gymnosperms. This suggests that the origin of A. caudigerum is intermediate between the angiosperms and gymnosperms.

  5. Developing expressed sequence tag libraries and the discovery of simple sequence repeat markers for two species of raspberry (Rubus L.).

    PubMed

    Bushakra, Jill M; Lewers, Kim S; Staton, Margaret E; Zhebentyayeva, Tetyana; Saski, Christopher A

    2015-10-26

    Due to a relatively high level of codominant inheritance and transferability within and among taxonomic groups, simple sequence repeat (SSR) markers are important elements in comparative mapping and delineation of genomic regions associated with traits of economic importance. Expressed sequence tags (ESTs) are a source of SSRs that can be used to develop markers to facilitate plant breeding and for more basic research across genera and higher plant orders. Leaf and meristem tissue from 'Heritage' red raspberry (Rubus idaeus) and 'Bristol' black raspberry (R. occidentalis) were utilized for RNA extraction. After conversion to cDNA and library construction, ESTs were sequenced, quality verified, assembled and scanned for SSRs.  Primers flanking the SSRs were designed and a subset tested for amplification, polymorphism and transferability across species. ESTs containing SSRs were functionally annotated using the GenBank non-redundant (nr) database and further classified using the gene ontology database. To accelerate development of EST-SSRs in the genus Rubus (Rosaceae), 1149 and 2358 cDNA sequences were generated from red raspberry and black raspberry, respectively. The cDNA sequences were screened using rigorous filtering criteria which resulted in the identification of 121 and 257 SSR loci for red and black raspberry, respectively. Primers were designed from the surrounding sequences resulting in 131 and 288 primer pairs, respectively, as some sequences contained more than one SSR locus. Sequence analysis revealed that the SSR-containing genes span a diversity of functions and share more sequence identity with strawberry genes than with other Rosaceous species. This resource of Rubus-specific, gene-derived markers will facilitate the construction of linkage maps composed of transferable markers for studying and manipulating important traits in this economically important genus.

  6. Anchoring 9,371 Maize Expressed Sequence Tagged Unigenes to the Bacterial Artificial Chromosome Contig Map by Two-Dimensional Overgo Hybridization1

    PubMed Central

    Gardiner, Jack; Schroeder, Steven; Polacco, Mary L.; Sanchez-Villeda, Hector; Fang, Zhiwei; Morgante, Michele; Landewe, Tim; Fengler, Kevin; Useche, Francisco; Hanafey, Michael; Tingey, Scott; Chou, Hugh; Wing, Rod; Soderlund, Carol; Coe, Edward H.

    2004-01-01

    Our goal is to construct a robust physical map for maize (Zea mays) comprehensively integrated with the genetic map. We have used a two-dimensional 24 × 24 overgo pooling strategy to anchor maize expressed sequence tagged (EST) unigenes to 165,888 bacterial artificial chromosomes (BACs) on high-density filters. A set of 70,716 public maize ESTs seeded derivation of 10,723 EST unigene assemblies. From these assemblies, 10,642 overgo sequences of 40 bp were applied as hybridization probes. BAC addresses were obtained for 9,371 overgo probes, representing an 88% success rate. More than 96% of the successful overgo probes identified two or more BACs, while 5% identified more than 50 BACs. The majority of BACs identified (79%) were hybridized with one or two overgos. A small number of BACs hybridized with eight or more overgos, suggesting that these BACs must be gene rich. Approximately 5,670 overgos identified BACs assembled within one contig, indicating that these probes are highly locus specific. A total of 1,795 megabases (Mb; 87%) of the total 2,050 Mb in BAC contigs were associated with one or more overgos, which are serving as sequence-tagged sites for single nucleotide polymorphism development. Overgo density ranged from less than one overgo per megabase to greater than 20 overgos per megabase. The majority of contigs (52%) hit by overgos contained three to nine overgos per megabase. Analysis of approximately 1,022 Mb of genetically anchored BAC contigs indicates that 9,003 of the total 13,900 overgo-contig sites are genetically anchored. Our results indicate overgos are a powerful approach for generating gene-specific hybridization probes that are facilitating the assembly of an integrated genetic and physical map for maize. PMID:15020742

  7. Myocardial tagging by Cardiovascular Magnetic Resonance: evolution of techniques--pulse sequences, analysis algorithms, and applications

    PubMed Central

    2011-01-01

    Cardiovascular magnetic resonance (CMR) tagging has been established as an essential technique for measuring regional myocardial function. It allows quantification of local intramyocardial motion measures, e.g. strain and strain rate. The invention of CMR tagging came in the late eighties, where the technique allowed for the first time for visualizing transmural myocardial movement without having to implant physical markers. This new idea opened the door for a series of developments and improvements that continue up to the present time. Different tagging techniques are currently available that are more extensive, improved, and sophisticated than they were twenty years ago. Each of these techniques has different versions for improved resolution, signal-to-noise ratio (SNR), scan time, anatomical coverage, three-dimensional capability, and image quality. The tagging techniques covered in this article can be broadly divided into two main categories: 1) Basic techniques, which include magnetization saturation, spatial modulation of magnetization (SPAMM), delay alternating with nutations for tailored excitation (DANTE), and complementary SPAMM (CSPAMM); and 2) Advanced techniques, which include harmonic phase (HARP), displacement encoding with stimulated echoes (DENSE), and strain encoding (SENC). Although most of these techniques were developed by separate groups and evolved from different backgrounds, they are in fact closely related to each other, and they can be interpreted from more than one perspective. Some of these techniques even followed parallel paths of developments, as illustrated in the article. As each technique has its own advantages, some efforts have been made to combine different techniques together for improved image quality or composite information acquisition. In this review, different developments in pulse sequences and related image processing techniques are described along with the necessities that led to their invention, which makes this article easy to read and the covered techniques easy to follow. Major studies that applied CMR tagging for studying myocardial mechanics are also summarized. Finally, the current article includes a plethora of ideas and techniques with over 300 references that motivate the reader to think about the future of CMR tagging. PMID:21798021

  8. Separation efficiency of free-solution conjugated electrophoresis with drag-tags incorporating a synthetic amino acid.

    PubMed

    Seo, Kyung-Ho; Chu, Hun-Su; Yoo, Tae Hyeon; Lee, Sun-Gu; Won, Jong-In

    2016-03-01

    DNA sequencing or separation by conventional capillary electrophoresis with a polymer matrix has some inherent drawbacks, such as the expense of polymer matrix and limitations in sequencing read length. As DNA fragments have a linear charge-to-friction ratio in free solution, DNA fragments cannot be separated by size. However, size-based separation of DNA is possible in free-solution conjugate electrophoresis (FSCE) if a "drag-tag" is attached to DNA fragments because the tag breaks the linear charge-to-friction scaling. Although several previous studies have demonstrated the feasibility of DNA separation by free-solution conjugated electrophoresis, generation of a monodisperse drag-tag and identification of a strong, site-specific conjugation method between a DNA fragment and a drag-tag are challenges that still remain. In this study, we demonstrate an efficient FSCE method by conjugating a biologically synthesized elastin-like polypeptide (ELP) and green fluorescent protein (GFP) to DNA fragments. In addition, to produce strong and site-specific conjugation, a methionine residue in drag-tags is replaced with homopropargylglycine (Hpg), which can be conjugated specifically to a DNA fragment with an azide site. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  9. Molecular characterization of human ABHD2 as TAG lipase and ester hydrolase

    PubMed Central

    M., Naresh Kumar; V.B.S.C., Thunuguntla; G.K., Veeramachaneni; B., Chandra Sekhar; Guntupalli, Swapna; J.S., Bondili

    2016-01-01

    Alterations in lipid metabolism have been progressively documented as a characteristic property of cancer cells. Though, human ABHD2 gene was found to be highly expressed in breast and lung cancers, its biochemical functionality is yet uncharacterized. In the present study we report, human ABHD2 as triacylglycerol (TAG) lipase along with ester hydrolysing capacity. Sequence analysis of ABHD2 revealed the presence of conserved motifs G205XS207XG209 and H120XXXXD125. Phylogenetic analysis showed homology to known lipases, Drosophila melanogaster CG3488. To evaluate the biochemical role, recombinant ABHD2 was expressed in Saccharomyces cerevisiae using pYES2/CT vector and His-tag purified protein showed TAG lipase activity. Ester hydrolase activity was confirmed with pNP acetate, butyrate and palmitate substrates respectively. Further, the ABHD2 homology model was built and the modelled protein was analysed based on the RMSD and root mean square fluctuation (RMSF) of the 100 ns simulation trajectory. Docking the acetate, butyrate and palmitate ligands with the model confirmed covalent binding of ligands with the Ser207 of the GXSXG motif. The model was validated with a mutant ABHD2 developed with alanine in place of Ser207 and the docking studies revealed loss of interaction between selected ligands and the mutant protein active site. Based on the above results, human ABHD2 was identified as a novel TAG lipase and ester hydrolase. PMID:27247428

  10. Molecular characterization of human ABHD2 as TAG lipase and ester hydrolase.

    PubMed

    M, Naresh Kumar; V B S C, Thunuguntla; G K, Veeramachaneni; B, Chandra Sekhar; Guntupalli, Swapna; J S, Bondili

    2016-08-01

    Alterations in lipid metabolism have been progressively documented as a characteristic property of cancer cells. Though, human ABHD2 gene was found to be highly expressed in breast and lung cancers, its biochemical functionality is yet uncharacterized. In the present study we report, human ABHD2 as triacylglycerol (TAG) lipase along with ester hydrolysing capacity. Sequence analysis of ABHD2 revealed the presence of conserved motifs G(205)XS(207)XG(209) and H(120)XXXXD(125) Phylogenetic analysis showed homology to known lipases, Drosophila melanogaster CG3488. To evaluate the biochemical role, recombinant ABHD2 was expressed in Saccharomyces cerevisiae using pYES2/CT vector and His-tag purified protein showed TAG lipase activity. Ester hydrolase activity was confirmed with pNP acetate, butyrate and palmitate substrates respectively. Further, the ABHD2 homology model was built and the modelled protein was analysed based on the RMSD and root mean square fluctuation (RMSF) of the 100 ns simulation trajectory. Docking the acetate, butyrate and palmitate ligands with the model confirmed covalent binding of ligands with the Ser(207) of the GXSXG motif. The model was validated with a mutant ABHD2 developed with alanine in place of Ser(207) and the docking studies revealed loss of interaction between selected ligands and the mutant protein active site. Based on the above results, human ABHD2 was identified as a novel TAG lipase and ester hydrolase. © 2016 The Author(s).

  11. Electrostatic Potential Maps and Natural Bond Orbital Analysis: Visualization and Conceptualization of Reactivity in Sanger's Reagent

    ERIC Educational Resources Information Center

    Mottishaw, Jeffery D.; Erck, Adam R.; Kramer, Jordan H.; Sun, Haoran; Koppang, Miles

    2015-01-01

    Frederick Sanger's early work on protein sequencing through the use of colorimetric labeling combined with liquid chromatography involves an important nucleophilic aromatic substitution (S[subscript N]Ar) reaction in which the N-terminus of a protein is tagged with Sanger's reagent. Understanding the inherent differences between this S[subscript…

  12. Characterization of EST-based SSR loci in the spruce budworm, Choristoneura fumiferana (Lepidoptera: Tortricidae)

    Treesearch

    B.M.T. Brunet; D. Doucet; B.R. Sturtevant; F.A.H. Sperling

    2013-01-01

    After identifying 114 microsatellite loci from Choristoneura fumiferana expressed sequence tags, 87 loci were assayed in a panel of 11 wild-caught individuals, giving 29 polymorphic loci. Further analysis of 20 of these loci on 31 individuals collected from a single population in northern Minnesota identified 14 in Hardy-Weinberg equilibrium.

  13. Cloning and characterization of a novel oocyte-specific gene encoding an F-Box protein in rainbow trout (Oncorhynchus mykiss)

    USDA-ARS?s Scientific Manuscript database

    Oocyte-specific genes play critical roles in oogenesis, folliculogenesis and early embryonic development. Through analysis of expressed sequence tags (ESTs) from a rainbow trout oocyte cDNA library, we identified a novel transcript which is represented by multiple ESTs derived only from the oocyte c...

  14. Cloning and characterization of a novel oocyte-specific gene encoding an F-Box protein in rainbow trout (Oncorhynchus mykiss)

    USDA-ARS?s Scientific Manuscript database

    Oocyte-specific genes play critical roles in oogenesis, folliculogenesis and early embryonic development. Through analysis of expressed sequence tags (ESTs) from a rainbow trout oocyte cDNA library, we identified a novel transcript which is represented by ESTs only from the oocyte library. The novel...

  15. Transcriptome analysis and identification of genes associated with omega-3 fatty acid biosynthesis in Perilla frutescens (L.) var. frutescens

    USDA-ARS?s Scientific Manuscript database

    Background: Perilla (Perilla frutescens (L.) var frutescens) produces high levels of a-linolenic acid (ALA), an omega-3 fatty acid important to health and development. To uncover key genes involved in fatty acid (FA) and triacylglycerol (TAG) synthesis in perilla, we conducted deep sequencing of cD...

  16. Association analysis of bacterial leaf spot resistance and SNP markers derived from expressed sequence tags (ESTs) in lettuce (Lactuca sativa L.)

    USDA-ARS?s Scientific Manuscript database

    Bacterial leaf spot of lettuce, caused by Xanthomonas campestris pv. vitians, is a devastating disease of lettuce worldwide. Since there are no chemicals available for effective control of the disease, host-plant resistance is highly desirable to protect lettuce production. A total of 179 lettuce ge...

  17. Abseq: Ultrahigh-throughput single cell protein profiling with droplet microfluidic barcoding.

    PubMed

    Shahi, Payam; Kim, Samuel C; Haliburton, John R; Gartner, Zev J; Abate, Adam R

    2017-03-14

    Proteins are the primary effectors of cellular function, including cellular metabolism, structural dynamics, and information processing. However, quantitative characterization of proteins at the single-cell level is challenging due to the tiny amount of protein available. Here, we present Abseq, a method to detect and quantitate proteins in single cells at ultrahigh throughput. Like flow and mass cytometry, Abseq uses specific antibodies to detect epitopes of interest; however, unlike these methods, antibodies are labeled with sequence tags that can be read out with microfluidic barcoding and DNA sequencing. We demonstrate this novel approach by characterizing surface proteins of different cell types at the single-cell level and distinguishing between the cells by their protein expression profiles. DNA-tagged antibodies provide multiple advantages for profiling proteins in single cells, including the ability to amplify low-abundance tags to make them detectable with sequencing, to use molecular indices for quantitative results, and essentially limitless multiplexing.

  18. Abseq: Ultrahigh-throughput single cell protein profiling with droplet microfluidic barcoding

    NASA Astrophysics Data System (ADS)

    Shahi, Payam; Kim, Samuel C.; Haliburton, John R.; Gartner, Zev J.; Abate, Adam R.

    2017-03-01

    Proteins are the primary effectors of cellular function, including cellular metabolism, structural dynamics, and information processing. However, quantitative characterization of proteins at the single-cell level is challenging due to the tiny amount of protein available. Here, we present Abseq, a method to detect and quantitate proteins in single cells at ultrahigh throughput. Like flow and mass cytometry, Abseq uses specific antibodies to detect epitopes of interest; however, unlike these methods, antibodies are labeled with sequence tags that can be read out with microfluidic barcoding and DNA sequencing. We demonstrate this novel approach by characterizing surface proteins of different cell types at the single-cell level and distinguishing between the cells by their protein expression profiles. DNA-tagged antibodies provide multiple advantages for profiling proteins in single cells, including the ability to amplify low-abundance tags to make them detectable with sequencing, to use molecular indices for quantitative results, and essentially limitless multiplexing.

  19. Abseq: Ultrahigh-throughput single cell protein profiling with droplet microfluidic barcoding

    PubMed Central

    Shahi, Payam; Kim, Samuel C.; Haliburton, John R.; Gartner, Zev J.; Abate, Adam R.

    2017-01-01

    Proteins are the primary effectors of cellular function, including cellular metabolism, structural dynamics, and information processing. However, quantitative characterization of proteins at the single-cell level is challenging due to the tiny amount of protein available. Here, we present Abseq, a method to detect and quantitate proteins in single cells at ultrahigh throughput. Like flow and mass cytometry, Abseq uses specific antibodies to detect epitopes of interest; however, unlike these methods, antibodies are labeled with sequence tags that can be read out with microfluidic barcoding and DNA sequencing. We demonstrate this novel approach by characterizing surface proteins of different cell types at the single-cell level and distinguishing between the cells by their protein expression profiles. DNA-tagged antibodies provide multiple advantages for profiling proteins in single cells, including the ability to amplify low-abundance tags to make them detectable with sequencing, to use molecular indices for quantitative results, and essentially limitless multiplexing. PMID:28290550

  20. A novel expression system for intracellular production and purification of recombinant affinity-tagged proteins in Aspergillus niger.

    PubMed

    Roth, Andreas H F J; Dersch, Petra

    2010-03-01

    A set of different integrative expression vectors for the intracellular production of recombinant proteins with or without affinity tag in Aspergillus niger was developed. Target genes can be expressed under the control of the highly efficient, constitutive pkiA promoter or the novel sucrose-inducible promoter of the beta-fructofuranosidase (sucA) gene of A. niger in the presence or absence of alternative carbon sources. All expression plasmids contain an identical multiple cloning sequence that allows parallel construction of N- or C-terminally His6- and StrepII-tagged versions of the target proteins. Production of two heterologous model proteins, the green fluorescence protein and the Thermobifida fusca hydrolase, proved the functionality of the vector system. Efficient production and easy detection of the target proteins as well as their fast purification by a one-step affinity chromatography, using the His6- or StrepII-tag sequence, was demonstrated.

  1. A statistical method for assessing peptide identification confidence in accurate mass and time tag proteomics

    PubMed Central

    Stanley, Jeffrey R.; Adkins, Joshua N.; Slysz, Gordon W.; Monroe, Matthew E.; Purvine, Samuel O.; Karpievitch, Yuliya V.; Anderson, Gordon A.; Smith, Richard D.; Dabney, Alan R.

    2011-01-01

    Current algorithms for quantifying peptide identification confidence in the accurate mass and time (AMT) tag approach assume that the AMT tags themselves have been correctly identified. However, there is uncertainty in the identification of AMT tags, as this is based on matching LC-MS/MS fragmentation spectra to peptide sequences. In this paper, we incorporate confidence measures for the AMT tag identifications into the calculation of probabilities for correct matches to an AMT tag database, resulting in a more accurate overall measure of identification confidence for the AMT tag approach. The method is referred to as Statistical Tools for AMT tag Confidence (STAC). STAC additionally provides a Uniqueness Probability (UP) to help distinguish between multiple matches to an AMT tag and a method to calculate an overall false discovery rate (FDR). STAC is freely available for download as both a command line and a Windows graphical application. PMID:21692516

  2. Expressed sequence tags from heat-shocked seagrass Zostera noltii (Hornemann) from its southern distribution range.

    PubMed

    Massa, Sónia I; Pearson, Gareth A; Aires, Tânia; Kube, Michael; Olsen, Jeanine L; Reinhardt, Richard; Serrão, Ester A; Arnaud-Haond, Sophie

    2011-09-01

    Predicted global climate change threatens the distributional ranges of species worldwide. We identified genes expressed in the intertidal seagrass Zostera noltii during recovery from a simulated low tide heat-shock exposure. Five Expressed Sequence Tag (EST) libraries were compared, corresponding to four recovery times following sub-lethal temperature stress, and a non-stressed control. We sequenced and analyzed 7009 sequence reads from 30min, 2h, 4h and 24h after the beginning of the heat-shock (AHS), and 1585 from the control library, for a total of 8594 sequence reads. Among 51 Tentative UniGenes (TUGs) exhibiting significantly different expression between libraries, 19 (37.3%) were identified as 'molecular chaperones' and were over-expressed following heat-shock, while 12 (23.5%) were 'photosynthesis TUGs' generally under-expressed in heat-shocked plants. A time course analysis of expression showed a rapid increase in expression of the molecular chaperone class, most of which were heat-shock proteins; which increased from 2 sequence reads in the control library to almost 230 in the 30min AHS library, followed by a slow decrease during further recovery. In contrast, 'photosynthesis TUGs' were under-expressed 30min AHS compared with the control library, and declined progressively with recovery time in the stress libraries, with a total of 29 sequence reads 24h AHS, compared with 125 in the control. A total of 4734 TUGs were screened for EST-Single Sequence Repeats (EST-SSRs) and 86 microsatellites were identified. Copyright © 2011 Elsevier B.V. All rights reserved.

  3. In search of actionable targets for agrigenomics and microalgal biofuel production: sequence-structural diversity studies on algal and higher plants with a focus on GPAT protein.

    PubMed

    Misra, Namrata; Panda, Prasanna Kumar

    2013-04-01

    The triacylglycerol (TAG) pathway provides several targets for genetic engineering to optimize microalgal lipid productivity. GPAT (glycerol-3-phosphate acyltransferase) is a crucial enzyme that catalyzes the initial step of TAG biosynthesis. Despite many recent biochemical studies, a comprehensive sequence-structure analysis of GPAT across diverse lipid-yielding organisms is lacking. Hence, we performed a comparative genomic analysis of plastid-located GPAT proteins from 7 microalgae and 3 higher plants species. The close evolutionary relationship observed between red algae/diatoms and green algae/plant lineages in the phylogenetic tree were further corroborated by motif and gene structure analysis. The predicted molecular weight, amino acid composition, Instability Index, and hydropathicity profile gave an overall representation of the biochemical features of GPAT protein across the species under study. Furthermore, homology models of GPAT from Chlamydomonas reinhardtii, Arabidopsis thaliana, and Glycine max provided deep insights into the protein architecture and substrate binding sites. Despite low sequence identity found between algal and plant GPATs, the developed models exhibited strikingly conserved topology consisting of 14α helices and 9β sheets arranged in two domains. However, subtle variations in amino acids of fatty acyl binding site were identified that might influence the substrate selectivity of GPAT. Together, the results will provide useful resources to understand the functional and evolutionary relationship of GPAT and potentially benefit in development of engineered enzyme for augmenting algal biofuel production.

  4. Analysis of expressed sequence tags from Maize mosaic rhabdovirus-infected gut tissues of Peregrinus maidis reveals the presence of key components of insect innate immunity.

    PubMed

    Whitfield, A E; Rotenberg, D; Aritua, V; Hogenhout, S A

    2011-04-01

    The corn planthopper, Peregrinus maidis, causes direct feeding damage to plants and transmits Maize mosaic rhabdovirus (MMV) in a persistent-propagative manner. MMV must cross several insect tissue layers for successful transmission to occur, and the gut serves as an important barrier for rhabdovirus transmission. In order to facilitate the identification of proteins that may interact with MMV either by facilitating acquisition or responding to virus infection, we generated and analysed the gut transcriptome of P. maidis. From two normalized cDNA libraries, we generated a P. maidis gut transcriptome composed of 20,771 expressed sequence tags (ESTs). Assembly of the sequences yielded 1860 contigs and 14,032 singletons, and biological roles were assigned to 5793 (36%). Comparison of P. maidis ESTs with other insect amino acid sequences revealed that P. maidis shares greatest sequence similarity with another hemipteran, the brown planthopper Nilaparvata lugens. We identified 202 P. maidis transcripts with putative homology to proteins associated with insect innate immunity, including those implicated in the Toll, Imd, JAK/STAT, Jnk and the small-interfering RNA-mediated pathways. Sequence comparisons between our P. maidis gut EST collection and the currently available National Center for Biotechnology Information EST database collection for Ni. lugens revealed that a pathogen recognition receptor in the Imd pathway, peptidoglycan recognition protein-long class (PGRP-LC), is present in these two members of the family Delphacidae; however, these recognition receptors are lacking in the model hemipteran Acyrthosiphon pisum. In addition, we identified sequences in the P. maidis gut transcriptome that share significant amino acid sequence similarities with the rhabdovirus receptor molecule, acetylcholine receptor (AChR), found in other hosts. This EST analysis sheds new light on immune response pathways in hemipteran guts that will be useful for further dissecting innate defence response pathways to rhabdovirus infection. © 2011 The Authors. Insect Molecular Biology © 2011 The Royal Entomological Society.

  5. Probing the potential of CnaB-type domains for the design of tag/catcher systems

    PubMed Central

    Pröschel, Marlene; Kraner, Max E.; Horn, Anselm H. C.; Schäfer, Lena; Sonnewald, Uwe

    2017-01-01

    Building proteins into larger, post-translational assemblies in a defined and stable way is still a challenging task. A promising approach relies on so-called tag/catcher systems that are fused to the proteins of interest and allow a durable linkage via covalent intermolecular bonds. Tags and catchers are generated by splitting protein domains that contain intramolecular isopeptide or ester bonds that form autocatalytically under physiological conditions. There are already numerous biotechnological and medical applications that demonstrate the usefulness of covalent linkages mediated by these systems. Additional covalent tag/catcher systems would allow creating more complex and ultra-stable protein architectures and networks. Two of the presently available tag/catcher systems were derived from closely related CnaB-domains of Streptococcus pyogenes and Streptococcus dysgalactiae proteins. However, it is unclear whether domain splitting is generally tolerated within the CnaB-family or only by a small subset of these domains. To address this point, we have selected a set of four CnaB domains of low sequence similarity and characterized the resulting tag/catcher systems by computational and experimental methods. Experimental testing for intermolecular isopeptide bond formation demonstrated two of the four systems to be functional. For these two systems length and sequence variations of the peptide tags were investigated revealing only a relatively small effect on the efficiency of the reaction. Our study suggests that splitting into tag and catcher moieties is tolerated by a significant portion of the naturally occurring CnaB-domains, thus providing a large reservoir for the design of novel tag/catcher systems. PMID:28654665

  6. Transcriptome analysis of Schistosoma mansoni larval development using serial analysis of gene expression (SAGE).

    PubMed

    Taft, A S; Vermeire, J J; Bernier, J; Birkeland, S R; Cipriano, M J; Papa, A R; McArthur, A G; Yoshino, T P

    2009-04-01

    Infection of the snail, Biomphalaria glabrata, by the free-swimming miracidial stage of the human blood fluke, Schistosoma mansoni, and its subsequent development to the parasitic sporocyst stage is critical to establishment of viable infections and continued human transmission. We performed a genome-wide expression analysis of the S. mansoni miracidia and developing sporocyst using Long Serial Analysis of Gene Expression (LongSAGE). Five cDNA libraries were constructed from miracidia and in vitro cultured 6- and 20-day-old sporocysts maintained in sporocyst medium (SM) or in SM conditioned by previous cultivation with cells of the B. glabrata embryonic (Bge) cell line. We generated 21 440 SAGE tags and mapped 13 381 to the S. mansoni gene predictions (v4.0e) either by estimating theoretical 3' UTR lengths or using existing 3' EST sequence data. Overall, 432 transcripts were found to be differentially expressed amongst all 5 libraries. In total, 172 tags were differentially expressed between miracidia and 6-day conditioned sporocysts and 152 were differentially expressed between miracidia and 6-day unconditioned sporocysts. In addition, 53 and 45 tags, respectively, were differentially expressed in 6-day and 20-day cultured sporocysts, due to the effects of exposure to Bge cell-conditioned medium.

  7. Frequency tagging to track the neural processing of contrast in fast, continuous sound sequences.

    PubMed

    Nozaradan, Sylvie; Mouraux, André; Cousineau, Marion

    2017-07-01

    The human auditory system presents a remarkable ability to detect rapid changes in fast, continuous acoustic sequences, as best illustrated in speech and music. However, the neural processing of rapid auditory contrast remains largely unclear, probably due to the lack of methods to objectively dissociate the response components specifically related to the contrast from the other components in response to the sequence of fast continuous sounds. To overcome this issue, we tested a novel use of the frequency-tagging approach allowing contrast-specific neural responses to be tracked based on their expected frequencies. The EEG was recorded while participants listened to 40-s sequences of sounds presented at 8Hz. A tone or interaural time contrast was embedded every fifth sound (AAAAB), such that a response observed in the EEG at exactly 8 Hz/5 (1.6 Hz) or harmonics should be the signature of contrast processing by neural populations. Contrast-related responses were successfully identified, even in the case of very fine contrasts. Moreover, analysis of the time course of the responses revealed a stable amplitude over repetitions of the AAAAB patterns in the sequence, except for the response to perceptually salient contrasts that showed a buildup and decay across repetitions of the sounds. Overall, this new combination of frequency-tagging with an oddball design provides a valuable complement to the classic, transient, evoked potentials approach, especially in the context of rapid auditory information. Specifically, we provide objective evidence on the neural processing of contrast embedded in fast, continuous sound sequences. NEW & NOTEWORTHY Recent theories suggest that the basis of neurodevelopmental auditory disorders such as dyslexia might be an impaired processing of fast auditory changes, highlighting how the encoding of rapid acoustic information is critical for auditory communication. Here, we present a novel electrophysiological approach to capture in humans neural markers of contrasts in fast continuous tone sequences. Contrast-specific responses were successfully identified, even for very fine contrasts, providing direct insight on the encoding of rapid auditory information. Copyright © 2017 the American Physiological Society.

  8. Generation, Analysis and Functional Annotation of Expressed Sequence Tags from the Sheepshead Minnow (Cyprinodon variegatus)

    DTIC Science & Technology

    2010-01-01

    any other provision of law, no person shall be subject to a penalty for failing to comply with a collection of information if it does not display a...CA) were cloned using the pGEM-T Easy Vector System (Promega, Madison, WI) and Electromax DH10B T1 Phage Resistant Cells (Invitrogen, Carlsbad, CA...reactions were performed using 75ng of plasmid DNA template and a M13 (-40) forward primer according to the manufac- turer’s protocol for DNA sequencing of

  9. Gapped Spectral Dictionaries and Their Applications for Database Searches of Tandem Mass Spectra*

    PubMed Central

    Jeong, Kyowon; Kim, Sangtae; Bandeira, Nuno; Pevzner, Pavel A.

    2011-01-01

    Generating all plausible de novo interpretations of a peptide tandem mass (MS/MS) spectrum (Spectral Dictionary) and quickly matching them against the database represent a recently emerged alternative approach to peptide identification. However, the sizes of the Spectral Dictionaries quickly grow with the peptide length making their generation impractical for long peptides. We introduce Gapped Spectral Dictionaries (all plausible de novo interpretations with gaps) that can be easily generated for any peptide length thus addressing the limitation of the Spectral Dictionary approach. We show that Gapped Spectral Dictionaries are small thus opening a possibility of using them to speed-up MS/MS searches. Our MS-GappedDictionary algorithm (based on Gapped Spectral Dictionaries) enables proteogenomics applications (such as searches in the six-frame translation of the human genome) that are prohibitively time consuming with existing approaches. MS-GappedDictionary generates gapped peptides that occupy a niche between accurate but short peptide sequence tags and long but inaccurate full length peptide reconstructions. We show that, contrary to conventional wisdom, some high-quality spectra do not have good peptide sequence tags and introduce gapped tags that have advantages over the conventional peptide sequence tags in MS/MS database searches. PMID:21444829

  10. Expressed sequence tag analysis of functional genes associated with adventitious rooting in Liriodendron hybrids.

    PubMed

    Zhong, Y D; Sun, X Y; Liu, E Y; Li, Y Q; Gao, Z; Yu, F X

    2016-06-24

    Liriodendron hybrids (Liriodendron chinense x L. tulipifera) are important landscaping and afforestation hardwood trees. To date, little genomic research on adventitious rooting has been reported in these hybrids, as well as in the genus Liriodendron. In the present study, we used adventitious roots to construct the first cDNA library for Liriodendron hybrids. A total of 5176 expressed sequence tags (ESTs) were generated and clustered into 2921 unigenes. Among these unigenes, 2547 had significant homology to the non-redundant protein database representing a wide variety of putative functions. Homologs of these genes regulated many aspects of adventitious rooting, including those for auxin signal transduction and root hair development. Results of quantitative real-time polymerase chain reaction showed that AUX1, IRE, and FB1 were highly expressed in adventitious roots and the expression of AUX1, ARF1, NAC1, RHD1, and IRE increased during the development of adventitious roots. Additionally, 181 simple sequence repeats were identified from 166 ESTs and more than 91.16% of these were dinucleotide and trinucleotide repeats. To the best of our knowledge, the present study reports the identification of the genes associated with adventitious rooting in the genus Liriodendron for the first time and provides a valuable resource for future genomic studies. Expression analysis of selected genes could allow us to identify regulatory genes that may be essential for adventitious rooting.

  11. Isolation of centromeric-tandem repetitive DNA sequences by chromatin affinity purification using a HaloTag7-fused centromere-specific histone H3 in tobacco.

    PubMed

    Nagaki, Kiyotaka; Shibata, Fukashi; Kanatani, Asaka; Kashihara, Kazunari; Murata, Minoru

    2012-04-01

    The centromere is a multi-functional complex comprising centromeric DNA and a number of proteins. To isolate unidentified centromeric DNA sequences, centromere-specific histone H3 variants (CENH3) and chromatin immunoprecipitation (ChIP) have been utilized in some plant species. However, anti-CENH3 antibody for ChIP must be raised in each species because of its species specificity. Production of the antibodies is time-consuming and costly, and it is not easy to produce ChIP-grade antibodies. In this study, we applied a HaloTag7-based chromatin affinity purification system to isolate centromeric DNA sequences in tobacco. This system required no specific antibody, and made it possible to apply a highly stringent wash to remove contaminated DNA. As a result, we succeeded in isolating five tandem repetitive DNA sequences in addition to the centromeric retrotransposons that were previously identified by ChIP. Three of the tandem repeats were centromere-specific sequences located on different chromosomes. These results confirm the validity of the HaloTag7-based chromatin affinity purification system as an alternative method to ChIP for isolating unknown centromeric DNA sequences. The discovery of more than two chromosome-specific centromeric DNA sequences indicates the mosaic structure of tobacco centromeres. © Springer-Verlag 2011

  12. The salt-responsive transcriptome of chickpea roots and nodules via deepSuperSAGE

    PubMed Central

    2011-01-01

    Background The combination of high-throughput transcript profiling and next-generation sequencing technologies is a prerequisite for genome-wide comprehensive transcriptome analysis. Our recent innovation of deepSuperSAGE is based on an advanced SuperSAGE protocol and its combination with massively parallel pyrosequencing on Roche's 454 sequencing platform. As a demonstration of the power of this combination, we have chosen the salt stress transcriptomes of roots and nodules of the third most important legume crop chickpea (Cicer arietinum L.). While our report is more technology-oriented, it nevertheless addresses a major world-wide problem for crops generally: high salinity. Together with low temperatures and water stress, high salinity is responsible for crop losses of millions of tons of various legume (and other) crops. Continuously deteriorating environmental conditions will combine with salinity stress to further compromise crop yields. As a good example for such stress-exposed crop plants, we started to characterize salt stress responses of chickpeas on the transcriptome level. Results We used deepSuperSAGE to detect early global transcriptome changes in salt-stressed chickpea. The salt stress responses of 86,919 transcripts representing 17,918 unique 26 bp deepSuperSAGE tags (UniTags) from roots of the salt-tolerant variety INRAT-93 two hours after treatment with 25 mM NaCl were characterized. Additionally, the expression of 57,281 transcripts representing 13,115 UniTags was monitored in nodules of the same plants. From a total of 144,200 analyzed 26 bp tags in roots and nodules together, 21,401 unique transcripts were identified. Of these, only 363 and 106 specific transcripts, respectively, were commonly up- or down-regulated (>3.0-fold) under salt stress in both organs, witnessing a differential organ-specific response to stress. Profiting from recent pioneer works on massive cDNA sequencing in chickpea, more than 9,400 UniTags were able to be linked to UniProt entries. Additionally, gene ontology (GO) categories over-representation analysis enabled to filter out enriched biological processes among the differentially expressed UniTags. Subsequently, the gathered information was further cross-checked with stress-related pathways. From several filtered pathways, here we focus exemplarily on transcripts associated with the generation and scavenging of reactive oxygen species (ROS), as well as on transcripts involved in Na+ homeostasis. Although both processes are already very well characterized in other plants, the information generated in the present work is of high value. Information on expression profiles and sequence similarity for several hundreds of transcripts of potential interest is now available. Conclusions This report demonstrates, that the combination of the high-throughput transcriptome profiling technology SuperSAGE with one of the next-generation sequencing platforms allows deep insights into the first molecular reactions of a plant exposed to salinity. Cross validation with recent reports enriched the information about the salt stress dynamics of more than 9,000 chickpea ESTs, and enlarged their pool of alternative transcripts isoforms. As an example for the high resolution of the employed technology that we coin deepSuperSAGE, we demonstrate that ROS-scavenging and -generating pathways undergo strong global transcriptome changes in chickpea roots and nodules already 2 hours after onset of moderate salt stress (25 mM NaCl). Additionally, a set of more than 15 candidate transcripts are proposed to be potential components of the salt overly sensitive (SOS) pathway in chickpea. Newly identified transcript isoforms are potential targets for breeding novel cultivars with high salinity tolerance. We demonstrate that these targets can be integrated into breeding schemes by micro-arrays and RT-PCR assays downstream of the generation of 26 bp tags by SuperSAGE. PMID:21320317

  13. The salt-responsive transcriptome of chickpea roots and nodules via deepSuperSAGE.

    PubMed

    Molina, Carlos; Zaman-Allah, Mainassara; Khan, Faheema; Fatnassi, Nadia; Horres, Ralf; Rotter, Björn; Steinhauer, Diana; Amenc, Laurie; Drevon, Jean-Jacques; Winter, Peter; Kahl, Günter

    2011-02-14

    The combination of high-throughput transcript profiling and next-generation sequencing technologies is a prerequisite for genome-wide comprehensive transcriptome analysis. Our recent innovation of deepSuperSAGE is based on an advanced SuperSAGE protocol and its combination with massively parallel pyrosequencing on Roche's 454 sequencing platform. As a demonstration of the power of this combination, we have chosen the salt stress transcriptomes of roots and nodules of the third most important legume crop chickpea (Cicer arietinum L.). While our report is more technology-oriented, it nevertheless addresses a major world-wide problem for crops generally: high salinity. Together with low temperatures and water stress, high salinity is responsible for crop losses of millions of tons of various legume (and other) crops. Continuously deteriorating environmental conditions will combine with salinity stress to further compromise crop yields. As a good example for such stress-exposed crop plants, we started to characterize salt stress responses of chickpeas on the transcriptome level. We used deepSuperSAGE to detect early global transcriptome changes in salt-stressed chickpea. The salt stress responses of 86,919 transcripts representing 17,918 unique 26 bp deepSuperSAGE tags (UniTags) from roots of the salt-tolerant variety INRAT-93 two hours after treatment with 25 mM NaCl were characterized. Additionally, the expression of 57,281 transcripts representing 13,115 UniTags was monitored in nodules of the same plants. From a total of 144,200 analyzed 26 bp tags in roots and nodules together, 21,401 unique transcripts were identified. Of these, only 363 and 106 specific transcripts, respectively, were commonly up- or down-regulated (>3.0-fold) under salt stress in both organs, witnessing a differential organ-specific response to stress.Profiting from recent pioneer works on massive cDNA sequencing in chickpea, more than 9,400 UniTags were able to be linked to UniProt entries. Additionally, gene ontology (GO) categories over-representation analysis enabled to filter out enriched biological processes among the differentially expressed UniTags. Subsequently, the gathered information was further cross-checked with stress-related pathways. From several filtered pathways, here we focus exemplarily on transcripts associated with the generation and scavenging of reactive oxygen species (ROS), as well as on transcripts involved in Na+ homeostasis. Although both processes are already very well characterized in other plants, the information generated in the present work is of high value. Information on expression profiles and sequence similarity for several hundreds of transcripts of potential interest is now available. This report demonstrates, that the combination of the high-throughput transcriptome profiling technology SuperSAGE with one of the next-generation sequencing platforms allows deep insights into the first molecular reactions of a plant exposed to salinity. Cross validation with recent reports enriched the information about the salt stress dynamics of more than 9,000 chickpea ESTs, and enlarged their pool of alternative transcripts isoforms. As an example for the high resolution of the employed technology that we coin deepSuperSAGE, we demonstrate that ROS-scavenging and -generating pathways undergo strong global transcriptome changes in chickpea roots and nodules already 2 hours after onset of moderate salt stress (25 mM NaCl). Additionally, a set of more than 15 candidate transcripts are proposed to be potential components of the salt overly sensitive (SOS) pathway in chickpea. Newly identified transcript isoforms are potential targets for breeding novel cultivars with high salinity tolerance. We demonstrate that these targets can be integrated into breeding schemes by micro-arrays and RT-PCR assays downstream of the generation of 26 bp tags by SuperSAGE.

  14. Immuno-affinity Capture Followed by TMPP N-Terminus Tagging to Study Catabolism of Therapeutic Proteins.

    PubMed

    Kullolli, Majlinda; Rock, Dan A; Ma, Ji

    2017-02-03

    Characterization of in vitro and in vivo catabolism of therapeutic proteins has increasingly become an integral part of discovery and development process for novel proteins. Unambiguous and efficient identification of catabolites can not only facilitate accurate understanding of pharmacokinetic profiles of drug candidates, but also enables follow up protein engineering to generate more catabolically stable molecules with improved properties (pharmacokinetics and pharmacodynamics). Immunoaffinity capture (IC) followed by top-down intact protein analysis using either matrix-assisted laser desorption/ionization or electrospray ionization mass spectrometry analysis have been the primary methods of choice for catabolite identification. However, the sensitivity and efficiency of these methods is not always sufficient for characterization of novel proteins from complex biomatrices such as plasma or serum. In this study a novel bottom-up targeted protein workflow was optimized for analysis of proteolytic degradation of therapeutic proteins. Selective and sensitive tagging of the alpha-amine at the N-terminus of proteins of interest was performed by immunoaffinity capture of therapeutic protein and its catabolites followed by on-bead succinimidyloxycarbonylmethyl tri-(2,4,6-trimethoxyphenyl N-terminus (TMPP-NTT) tagging. The positively charged hydrophobic TMPP tag facilitates unambiguous sequence identification of all N-terminus peptides from complex tryptic digestion samples via data dependent liquid chromatgraphy-tandem mass spectroscopy. Utility of the workflow was illustrated by definitive analysis of in vitro catabolic profile of neurotensin human Fc (NTs-huFc) protein in mouse serum. The results from this study demonstrated that the IC-TMPP-NTT workflow is a simple and efficient method for catabolite formation in therapeutic proteins.

  15. De Novo Transcriptome Sequencing Reveals Important Molecular Networks and Metabolic Pathways of the Plant, Chlorophytum borivilianum

    PubMed Central

    Kalra, Shikha; Puniya, Bhanwar Lal; Kulshreshtha, Deepika; Kumar, Sunil; Kaur, Jagdeep; Ramachandran, Srinivasan; Singh, Kashmir

    2013-01-01

    Chlorophytum borivilianum, an endangered medicinal plant species is highly recognized for its aphrodisiac properties provided by saponins present in the plant. The transcriptome information of this species is limited and only few hundred expressed sequence tags (ESTs) are available in the public databases. To gain molecular insight of this plant, high throughput transcriptome sequencing of leaf RNA was carried out using Illumina's HiSeq 2000 sequencing platform. A total of 22,161,444 single end reads were retrieved after quality filtering. Available (e.g., De-Bruijn/Eulerian graph) and in-house developed bioinformatics tools were used for assembly and annotation of transcriptome. A total of 101,141 assembled transcripts were obtained, with coverage size of 22.42 Mb and average length of 221 bp. Guanine-cytosine (GC) content was found to be 44%. Bioinformatics analysis, using non-redundant proteins, gene ontology (GO), enzyme commission (EC) and kyoto encyclopedia of genes and genomes (KEGG) databases, extracted all the known enzymes involved in saponin and flavonoid biosynthesis. Few genes of the alkaloid biosynthesis, along with anticancer and plant defense genes, were also discovered. Additionally, several cytochrome P450 (CYP450) and glycosyltransferase unique sequences were also found. We identified simple sequence repeat motifs in transcripts with an abundance of di-nucleotide simple sequence repeat (SSR; 43.1%) markers. Large scale expression profiling through Reads per Kilobase per Million mapped reads (RPKM) showed major genes involved in different metabolic pathways of the plant. Genes, expressed sequence tags (ESTs) and unique sequences from this study provide an important resource for the scientific community, interested in the molecular genetics and functional genomics of C. borivilianum. PMID:24376689

  16. De Novo transcriptome sequencing reveals important molecular networks and metabolic pathways of the plant, Chlorophytum borivilianum.

    PubMed

    Kalra, Shikha; Puniya, Bhanwar Lal; Kulshreshtha, Deepika; Kumar, Sunil; Kaur, Jagdeep; Ramachandran, Srinivasan; Singh, Kashmir

    2013-01-01

    Chlorophytum borivilianum, an endangered medicinal plant species is highly recognized for its aphrodisiac properties provided by saponins present in the plant. The transcriptome information of this species is limited and only few hundred expressed sequence tags (ESTs) are available in the public databases. To gain molecular insight of this plant, high throughput transcriptome sequencing of leaf RNA was carried out using Illumina's HiSeq 2000 sequencing platform. A total of 22,161,444 single end reads were retrieved after quality filtering. Available (e.g., De-Bruijn/Eulerian graph) and in-house developed bioinformatics tools were used for assembly and annotation of transcriptome. A total of 101,141 assembled transcripts were obtained, with coverage size of 22.42 Mb and average length of 221 bp. Guanine-cytosine (GC) content was found to be 44%. Bioinformatics analysis, using non-redundant proteins, gene ontology (GO), enzyme commission (EC) and kyoto encyclopedia of genes and genomes (KEGG) databases, extracted all the known enzymes involved in saponin and flavonoid biosynthesis. Few genes of the alkaloid biosynthesis, along with anticancer and plant defense genes, were also discovered. Additionally, several cytochrome P450 (CYP450) and glycosyltransferase unique sequences were also found. We identified simple sequence repeat motifs in transcripts with an abundance of di-nucleotide simple sequence repeat (SSR; 43.1%) markers. Large scale expression profiling through Reads per Kilobase per Million mapped reads (RPKM) showed major genes involved in different metabolic pathways of the plant. Genes, expressed sequence tags (ESTs) and unique sequences from this study provide an important resource for the scientific community, interested in the molecular genetics and functional genomics of C. borivilianum.

  17. Sequence Determinants of Compaction in Intrinsically Disordered Proteins

    PubMed Central

    Marsh, Joseph A.; Forman-Kay, Julie D.

    2010-01-01

    Abstract Intrinsically disordered proteins (IDPs), which lack folded structure and are disordered under nondenaturing conditions, have been shown to perform important functions in a large number of cellular processes. These proteins have interesting structural properties that deviate from the random-coil-like behavior exhibited by chemically denatured proteins. In particular, IDPs are often observed to exhibit significant compaction. In this study, we have analyzed the hydrodynamic radii of a number of IDPs to investigate the sequence determinants of this compaction. Net charge and proline content are observed to be strongly correlated with increased hydrodynamic radii, suggesting that these are the dominant contributors to compaction. Hydrophobicity and secondary structure, on the other hand, appear to have negligible effects on compaction, which implies that the determinants of structure in folded and intrinsically disordered proteins are profoundly different. Finally, we observe that polyhistidine tags seem to increase IDP compaction, which suggests that these tags have significant perturbing effects and thus should be removed before any structural characterizations of IDPs. Using the relationships observed in this analysis, we have developed a sequence-based predictor of hydrodynamic radius for IDPs that shows substantial improvement over a simple model based upon chain length alone. PMID:20483348

  18. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Vyatkina, Kira; Wu, Si; Dekker, Lennard J. M.

    De novo sequencing of proteins and peptides is one of the most important problems in mass spectrometry-driven proteomics. A variety of methods have been developed to accomplish this task from a set of bottom-up tandem (MS/MS) mass spectra. However, a more recently emerged top-down technology, now gaining more and more popularity, opens new perspectives for protein analysis and characterization, implying a need in efficient algorithms for processing this kind of MS/MS data. Here we describe a method that allows to retrieve from a set of top-down MS/MS spectra long and accurate sequence fragments of the proteins contained in a sample.more » To this end, we outline a strategy for generating high-quality sequence tags from top-down spectra, and introduce the concept of a T-Bruijn graph by adapting to the case of tags the notion of an A-Bruijn graph widely used in genomics. The output of the proposed approach represents the set of amino acid strings spelled out by optimal paths in the connected components of a T-Bruijn graph. We illustrate its performance on top-down datasets acquired from carbonic anhydrase 2 (CAH2) and the Fab region of alemtuzumab.« less

  19. A note on the efficiencies of sampling strategies in two-stage Bayesian regional fine mapping of a quantitative trait.

    PubMed

    Chen, Zhijian; Craiu, Radu V; Bull, Shelley B

    2014-11-01

    In focused studies designed to follow up associations detected in a genome-wide association study (GWAS), investigators can proceed to fine-map a genomic region by targeted sequencing or dense genotyping of all variants in the region, aiming to identify a functional sequence variant. For the analysis of a quantitative trait, we consider a Bayesian approach to fine-mapping study design that incorporates stratification according to a promising GWAS tag SNP in the same region. Improved cost-efficiency can be achieved when the fine-mapping phase incorporates a two-stage design, with identification of a smaller set of more promising variants in a subsample taken in stage 1, followed by their evaluation in an independent stage 2 subsample. To avoid the potential negative impact of genetic model misspecification on inference we incorporate genetic model selection based on posterior probabilities for each competing model. Our simulation study shows that, compared to simple random sampling that ignores genetic information from GWAS, tag-SNP-based stratified sample allocation methods reduce the number of variants continuing to stage 2 and are more likely to promote the functional sequence variant into confirmation studies. © 2014 WILEY PERIODICALS, INC.

  20. Gene discovery in Eimeria tenella by immunoscreening cDNA expression libraries of sporozoites and schizonts with chicken intestinal antibodies.

    PubMed

    Réfega, Susana; Girard-Misguich, Fabienne; Bourdieu, Christiane; Péry, Pierre; Labbé, Marie

    2003-04-02

    Specific antibodies were produced ex vivo from intestinal culture of Eimeria tenella infected chickens. The specificity of these intestinal antibodies was tested against different parasite stages. These antibodies were used to immunoscreen first generation schizont and sporozoite cDNA libraries permitting the identification of new E. tenella antigens. We obtained a total of 119 cDNA clones which were subjected to sequence analysis. The sequences coding for the proteins inducing local immune responses were compared with nucleotide or protein databases and with expressed sequence tags (ESTs) databases. We identified new Eimeria genes coding for heat shock proteins, a ribosomal protein, a pyruvate kinase and a pyridoxine kinase. Specific features of other sequences are discussed.

  1. Cloning and characterization of a basic Cysteine-like protease (Cathespsin L1) expressed in the gut of larval Diaprepes abbreviatus L. (Coleoptera: Curculionidae)

    USDA-ARS?s Scientific Manuscript database

    Diaprepes abbreviatus is an important pest that causes extensive damage to citrus in the USA. Analysis of an expressed sequence tag (EST) library from the digestive tract of larvae and adult D. abbreviatus identified cathepsins as major putative digestive enzymes. One class, sharing amino acid seque...

  2. Identification and functional characterization of effectors in expressed sequence tags from various life cycle stages of the potato cyst nematode Globodera pallida.

    PubMed

    Jones, John T; Kumar, Amar; Pylypenko, Liliya A; Thirugnanasambandam, Amarnath; Castelli, Lydia; Chapman, Sean; Cock, Peter J A; Grenier, Eric; Lilley, Catherine J; Phillips, Mark S; Blok, Vivian C

    2009-11-01

    In this article, we describe the analysis of over 9000 expressed sequence tags (ESTs) from cDNA libraries obtained from various life cycle stages of Globodera pallida. We have identified over 50 G. pallida effectors from this dataset using bioinformatics analysis, by screening clones in order to identify secreted proteins up-regulated after the onset of parasitism and using in situ hybridization to confirm the expression in pharyngeal gland cells. A substantial gene family encoding G. pallida SPRYSEC proteins has been identified. The expression of these genes is restricted to the dorsal pharyngeal gland cell. Different members of the SPRYSEC family of proteins from G. pallida show different subcellular localization patterns in plants, with some localized to the cytoplasm and others to the nucleus and nucleolus. Differences in subcellular localization may reflect diverse functional roles for each individual protein or, more likely, variety in the compartmentalization of plant proteins targeted by the nematode. Our data are therefore consistent with the suggestion that the SPRYSEC proteins suppress host defences, as suggested previously, and that they achieve this through interaction with a range of host targets.

  3. Analysis and functional annotation of expressed sequence tags from in vitro cell lines of elasmobranchs: spiny dogfish shark (Squalus acanthias) and little skate (Leucoraja erinacea)

    PubMed Central

    Parton, Angela; Bayne, Christopher J.; Barnes, David W.

    2010-01-01

    Elasmobranchs are the most commonly used experimental models among the jawed, cartilaginous fish (Chondrichthyes). Previously we developed cell lines from embryos of two elasmobranchs, Squalus acanthias the spiny dogfish shark (SAE line), and Leucoraja erinacea the little skate (LEE-1 line). From these lines cDNA libraries were derived and expressed sequence tags (ESTs) generated. From the SAE cell line 4303 unique transcripts were identified, with 1848 of these representing unknown sequences (showing no BLASTX identification). From the LEE-1 cell line, 3660 unique transcripts were identified, and unknown, unique sequences totaled 1333. Gene Ontology (GO) annotation showed that GO assignments for the two cell lines were in general similar. These results suggest that the procedures used to derive the cell lines led to isolation of cell types of the same general embryonic origin from both species. The LEE-1 transcripts included GO categories “envelope” and “oxidoreductase activity” but the SAE transcripts did not. GO analysis of SAE transcripts identified the category “anatomical structure formation” that was not present in LEE-1 cells. Increased organelle compartments may exist within LEE-1 cells compared to SAE cells, and the higher oxidoreductase activity in LEE-1 cells may indicate a role for these cells in responses associated with innate immunity or in steroidogenesis. These EST libraries from elasmobranch cell lines provide information for assembly of genomic sequences and are useful in revealing gene diversity, new genes and molecular markers, as well as in providing means for elucidation of full-length cDNAs and probes for gene array analyses. This is the first study of this type with members of the Chondrichthyes. PMID:20471924

  4. Analysis and functional annotation of expressed sequence tags from in vitro cell lines of elasmobranchs: Spiny dogfish shark (Squalus acanthias) and little skate (Leucoraja erinacea).

    PubMed

    Parton, Angela; Bayne, Christopher J; Barnes, David W

    2010-09-01

    Elasmobranchs are the most commonly used experimental models among the jawed, cartilaginous fish (Chondrichthyes). Previously we developed cell lines from embryos of two elasmobranchs, Squalus acanthias the spiny dogfish shark (SAE line), and Leucoraja erinacea the little skate (LEE-1 line). From these lines cDNA libraries were derived and expressed sequence tags (ESTs) generated. From the SAE cell line 4303 unique transcripts were identified, with 1848 of these representing unknown sequences (showing no BLASTX identification). From the LEE-1 cell line, 3660 unique transcripts were identified, and unknown, unique sequences totaled 1333. Gene Ontology (GO) annotation showed that GO assignments for the two cell lines were in general similar. These results suggest that the procedures used to derive the cell lines led to isolation of cell types of the same general embryonic origin from both species. The LEE-1 transcripts included GO categories "envelope" and "oxidoreductase activity" but the SAE transcripts did not. GO analysis of SAE transcripts identified the category "anatomical structure formation" that was not present in LEE-1 cells. Increased organelle compartments may exist within LEE-1 cells compared to SAE cells, and the higher oxidoreductase activity in LEE-1 cells may indicate a role for these cells in responses associated with innate immunity or in steroidogenesis. These EST libraries from elasmobranch cell lines provide information for assembly of genomic sequences and are useful in revealing gene diversity, new genes and molecular markers, as well as in providing means for elucidation of full-length cDNAs and probes for gene array analyses. This is the first study of this type with members of the Chondrichthyes. Copyright 2010 Elsevier Inc. All rights reserved.

  5. Identification of Delta5-fatty acid desaturase from the cellular slime mold dictyostelium discoideum.

    PubMed

    Saito, T; Ochiai, H

    1999-10-01

    cDNA fragments putatively encoding amino acid sequences characteristic of the fatty acid desaturase were obtained using expressed sequence tag (EST) information of the Dictyostelium cDNA project. Using this sequence, we have determined the cDNA sequence and genomic sequence of a desaturase. The cloned cDNA is 1489 nucleotides long and the deduced amino acid sequence comprised 464 amino acid residues containing an N-terminal cytochrome b5 domain. The whole sequence was 38.6% identical to the initially identified Delta5-desaturase of Mortierella alpina. We have confirmed its function as Delta5-desaturase by over expression mutation in D. discoideum and also the gain of function mutation in the yeast Saccharomyces cerevisiae. Analysis of the lipids from transformed D. discoideum and yeast demonstrated the accumulation of Delta5-desaturated products. This is the first report concering fatty acid desaturase in cellular slime molds.

  6. Sequencing, Analysis, and Annotation of Expressed Sequence Tags for Camelus dromedarius

    PubMed Central

    Al-Swailem, Abdulaziz M.; Shehata, Maher M.; Abu-Duhier, Faisel M.; Al-Yamani, Essam J.; Al-Busadah, Khalid A.; Al-Arawi, Mohammed S.; Al-Khider, Ali Y.; Al-Muhaimeed, Abdullah N.; Al-Qahtani, Fahad H.; Manee, Manee M.; Al-Shomrani, Badr M.; Al-Qhtani, Saad M.; Al-Harthi, Amer S.; Akdemir, Kadir C.; Otu, Hasan H.

    2010-01-01

    Despite its economical, cultural, and biological importance, there has not been a large scale sequencing project to date for Camelus dromedarius. With the goal of sequencing complete DNA of the organism, we first established and sequenced camel EST libraries, generating 70,272 reads. Following trimming, chimera check, repeat masking, cluster and assembly, we obtained 23,602 putative gene sequences, out of which over 4,500 potentially novel or fast evolving gene sequences do not carry any homology to other available genomes. Functional annotation of sequences with similarities in nucleotide and protein databases has been obtained using Gene Ontology classification. Comparison to available full length cDNA sequences and Open Reading Frame (ORF) analysis of camel sequences that exhibit homology to known genes show more than 80% of the contigs with an ORF>300 bp and ∼40% hits extending to the start codons of full length cDNAs suggesting successful characterization of camel genes. Similarity analyses are done separately for different organisms including human, mouse, bovine, and rat. Accompanying web portal, CAGBASE (http://camel.kacst.edu.sa/), hosts a relational database containing annotated EST sequences and analysis tools with possibility to add sequences from public domain. We anticipate our results to provide a home base for genomic studies of camel and other comparative studies enabling a starting point for whole genome sequencing of the organism. PMID:20502665

  7. Characterization of the two intracellular lipases of Y. lipolytica encoded by TGL3 and TGL4 genes: new insights into the role of intracellular lipases and lipid body organisation.

    PubMed

    Dulermo, Thierry; Tréton, Brigitte; Beopoulos, Athanasios; Kabran Gnankon, Affoué Philomène; Haddouche, Ramdane; Nicaud, Jean-Marc

    2013-09-01

    Eukaryotes store lipids in a specialised organelle, the lipid body (LB), mainly as triglycerides (TAGs). Both the rates of synthesis and degradation contribute to the control of the accumulation of TAGs. The synthesis of TAGs in yeasts has been well documented, especially in the model yeast Saccharomyces cerevisiae and in the oleaginous yeast Yarrowia lipolytica. However, descriptions of the processes involved in TAG degradation are more scarce and mostly for S. cerevisiae. Here, we report the characterisation of two Y. lipolytica genes, YlTGL3 and YlTGL4, encoding intracellular lipases involved in TAG degradation. The two proteins are localised in lipid bodies, and YlTgl4 was mainly found at the interface between LBs. Surprisingly, the spatial organisation of YlTgl3 and YlTgl4 depends on the culture medium and on the physiological phase of the cell. Inactivation of one or both genes doubles the lipid accumulation capacity of Y. lipolytica, increasing the cell's capacity to accumulate TAGs. The amino acid sequence of YlTgl4 contains the consensus sequence motif (G/A)XSXG, typical of serine hydrolases, whereas YlTgl3 does not. Single and double mutants are unable to degrade TAGs, and higher expression of YlTgl4 correlates with TAG degradation. Therefore, we propose that YlTgl4 is the main lipase responsible for TAG degradation and that YlTgl3 may act as a positive regulator of YlTgl4 rather than a functional lipase. Thus, contrary to S. cerevisiae, Y. lipolytica possesses two intracellular lipases with distinct roles and with distinct localisations in the LB. © 2013. Published by Elsevier B.V. All rights reserved.

  8. In silico analysis of expressed sequence tags from Trichostrongylus vitrinus (Nematoda): comparison of the automated ESTExplorer workflow platform with conventional database searches.

    PubMed

    Nagaraj, Shivashankar H; Gasser, Robin B; Nisbet, Alasdair J; Ranganathan, Shoba

    2008-01-01

    The analysis of expressed sequence tags (EST) offers a rapid and cost effective approach to elucidate the transcriptome of an organism, but requires several computational methods for assembly and annotation. Researchers frequently analyse each step manually, which is laborious and time consuming. We have recently developed ESTExplorer, a semi-automated computational workflow system, in order to achieve the rapid analysis of EST datasets. In this study, we evaluated EST data analysis for the parasitic nematode Trichostrongylus vitrinus (order Strongylida) using ESTExplorer, compared with database matching alone. We functionally annotated 1776 ESTs obtained via suppressive-subtractive hybridisation from T. vitrinus, an important parasitic trichostrongylid of small ruminants. Cluster and comparative genomic analyses of the transcripts using ESTExplorer indicated that 290 (41%) sequences had homologues in Caenorhabditis elegans, 329 (42%) in parasitic nematodes, 202 (28%) in organisms other than nematodes, and 218 (31%) had no significant match to any sequence in the current databases. Of the C. elegans homologues, 90 were associated with 'non-wildtype' double-stranded RNA interference (RNAi) phenotypes, including embryonic lethality, maternal sterility, sterile progeny, larval arrest and slow growth. We could functionally classify 267 (38%) sequences using the Gene Ontologies (GO) and establish pathway associations for 230 (33%) sequences using the Kyoto Encyclopedia of Genes and Genomes (KEGG). Further examination of this EST dataset revealed a number of signalling molecules, proteases, protease inhibitors, enzymes, ion channels and immune-related genes. In addition, we identified 40 putative secreted proteins that could represent potential candidates for developing novel anthelmintics or vaccines. We further compared the automated EST sequence annotations, using ESTExplorer, with database search results for individual T. vitrinus ESTs. ESTExplorer reliably and rapidly annotated 301 ESTs, with pathway and GO information, eliminating 60 low quality hits from database searches. We evaluated the efficacy of ESTExplorer in analysing EST data, and demonstrate that computational tools can be used to accelerate the process of gene discovery in EST sequencing projects. The present study has elucidated sets of relatively conserved and potentially novel genes for biological investigation, and the annotated EST set provides further insight into the molecular biology of T. vitrinus, towards the identification of novel drug targets.

  9. Expressed sequence tag analysis of guinea pig (Cavia porcellus) eye tissues for NEIBank

    PubMed Central

    Simpanya, Mukoma F.; Wistow, Graeme; Gao, James; David, Larry L.; Giblin, Frank J.

    2008-01-01

    Purpose To characterize gene expression patterns in guinea pig ocular tissues and identify orthologs of human genes from NEIBank expressed sequence tags. Methods RNA was extracted from dissected eye tissues of 2.5-month-old guinea pigs to make three unamplified and unnormalized cDNA libraries in the pCMVSport-6 vector for the lens, retina, and eye minus lens and retina. Over 4,000 clones were sequenced from each library and were analyzed using GRIST for clustering and gene identification. Lens crystallin EST data were validated using two-dimensional electrophoresis (2-DE), matrix assisted laser desorption (MALDI), and electrospray ionization mass spectrometry (ESIMS). Results Combined data from the three libraries generated a total of 6,694 distinctive gene clusters, with each library having between 1,000 and 3,000 clusters. Approximately 60% of the total gene clusters were novel cDNA sequences and had significant homologies to other mammalian sequences in GenBank. Complete cDNA sequences were obtained for many guinea pig lens proteins, including αA/αAinsert-, γN-, and γS-crystallins, lengsin and GRIFIN. The ratio of αA- to αB-crystallin on 2-DE gels was 8: 1 in the lens nucleus and 6.5: 1 in the cortex. Analysis of ESTs, genome sequence, and proteins (by MALDI), did not reveal any evidence for the presence of γD-, γE-, and γF-crystallin in the guinea pig. Predicted masses of many guinea pig lens crystallins were confirmed by ESIMS analysis. For the retina, orthologs of human phototransduction genes were found, such as Rhodopsin, S-antigen (Sag, Arrestin), and Transducin. The guinea-pig ortholog of NRL, a key rod photoreceptor-specific transcription factor, was also represented in EST data. In the ‘rest-of-eye’ library, the most abundant transcripts included decorin and keratin 12, representative of the cornea. Conclusions Genomic analysis of guinea pig eye tissues provides sequence-verified clones for future studies. Guinea pig orthologs of many human eye specific genes were identified. Guinea pig gene structures were similar to their human and rodent gene counterparts. Surprisingly, no orthologs of γD-, γE-, and γF-crystallin were found in EST, proteomic, or the current guinea pig genome data. PMID:19104676

  10. Biochemical characterization of Yarrowia lipolytica LIP8, a secreted lipase with a cleavable C-terminal region.

    PubMed

    Kamoun, Jannet; Schué, Mathieu; Messaoud, Wala; Baignol, Justine; Point, Vanessa; Mateos-Diaz, Eduardo; Mansuelle, Pascal; Gargouri, Youssef; Parsiegla, Goetz; Cavalier, Jean-François; Carrière, Frédéric; Aloulou, Ahmed

    2015-02-01

    Yarrowia lipolytica is a lipolytic yeast possessing 16 paralog genes coding for lipases. Little information on these lipases has been obtained and only the major secreted lipase, namely YLLIP2, had been biochemically and structurally characterized. Another secreted lipase, YLLIP8, was isolated from Y. lipolytica culture medium and compared with the recombinant enzyme produced in Pichia pastoris. N-terminal sequencing showed that YLLIP8 is produced in its active form after the cleavage of a signal peptide. Mass spectrometry analysis revealed that YLLIP8 recovered from culture medium lacks a C-terminal part of 33 amino acids which are present in the coding sequence. A 3D model of YLLIP8 built from the X-ray structure of the homologous YLLIP2 lipase shows that these truncated amino acids in YLLIP8 belong to an additional C-terminal region predicted to be mainly helical. Western blot analysis shows that YLLIP8 C-tail is rapidly cleaved upon enzyme secretion since both cell-bound and culture supernatant lipases lack this extension. Mature recombinant YLLIP8 displays a true lipase activity on short-, medium- and long-chain triacylglycerols (TAG), with an optimum activity at alkaline pH on medium chain TAG. It has no apparent regioselectivity in TAG hydrolysis, thus generating glycerol and FFAs as final lipolysis products. YLLIP8 properties are distinct from those of the 1,3-regioselective YLLIP2, acting optimally at acidic pH. These lipases are tailored for complementary roles in fatty acid uptake by Y. lipolytica. Copyright © 2014 Elsevier B.V. All rights reserved.

  11. Analysis and functional annotation of expressed sequence tags (ESTs) from multiple tissues of oil palm (Elaeis guineensis Jacq.)

    PubMed Central

    Ho, Chai-Ling; Kwan, Yen-Yen; Choi, Mei-Chooi; Tee, Sue-Sean; Ng, Wai-Har; Lim, Kok-Ang; Lee, Yang-Ping; Ooi, Siew-Eng; Lee, Weng-Wah; Tee, Jin-Ming; Tan, Siang-Hee; Kulaveerasingam, Harikrishna; Alwee, Sharifah Shahrul Rabiah Syed; Abdullah, Meilina Ong

    2007-01-01

    Background Oil palm is the second largest source of edible oil which contributes to approximately 20% of the world's production of oils and fats. In order to understand the molecular biology involved in in vitro propagation, flowering, efficient utilization of nitrogen sources and root diseases, we have initiated an expressed sequence tag (EST) analysis on oil palm. Results In this study, six cDNA libraries from oil palm zygotic embryos, suspension cells, shoot apical meristems, young flowers, mature flowers and roots, were constructed. We have generated a total of 14537 expressed sequence tags (ESTs) from these libraries, from which 6464 tentative unique contigs (TUCs) and 2129 singletons were obtained. Approximately 6008 of these tentative unique genes (TUGs) have significant matches to the non-redundant protein database, from which 2361 were assigned to one or more Gene Ontology categories. Predominant transcripts and differentially expressed genes were identified in multiple oil palm tissues. Homologues of genes involved in many aspects of flower development were also identified among the EST collection, such as CONSTANS-like, AGAMOUS-like (AGL)2, AGL20, LFY-like, SQUAMOSA, SQUAMOSA binding protein (SBP) etc. Majority of them are the first representatives in oil palm, providing opportunities to explore the cause of epigenetic homeotic flowering abnormality in oil palm, given the importance of flowering in fruit production. The transcript levels of two flowering-related genes, EgSBP and EgSEP were analysed in the flower tissues of various developmental stages. Gene homologues for enzymes involved in oil biosynthesis, utilization of nitrogen sources, and scavenging of oxygen radicals, were also uncovered among the oil palm ESTs. Conclusion The EST sequences generated will allow comparative genomic studies between oil palm and other monocotyledonous and dicotyledonous plants, development of gene-targeted markers for the reference genetic map, design and fabrication of DNA array for future studies of oil palm. The outcomes of such studies will contribute to oil palm improvements through the establishment of breeding program using marker-assisted selection, development of diagnostic assays using gene targeted markers, and discovery of candidate genes related to important agronomic traits of oil palm. PMID:17953740

  12. Analysis of bacterial and archaeal diversity in coastal microbial mats using massive parallel 16S rRNA gene tag sequencing.

    PubMed

    Bolhuis, Henk; Stal, Lucas J

    2011-11-01

    Coastal microbial mats are small-scale and largely closed ecosystems in which a plethora of different functional groups of microorganisms are responsible for the biogeochemical cycling of the elements. Coastal microbial mats play an important role in coastal protection and morphodynamics through stabilization of the sediments and by initiating the development of salt-marshes. Little is known about the bacterial and especially archaeal diversity and how it contributes to the ecological functioning of coastal microbial mats. Here, we analyzed three different types of coastal microbial mats that are located along a tidal gradient and can be characterized as marine (ST2), brackish (ST3) and freshwater (ST3) systems. The mats were sampled during three different seasons and subjected to massive parallel tag sequencing of the V6 region of the 16S rRNA genes of Bacteria and Archaea. Sequence analysis revealed that the mats are among the most diverse marine ecosystems studied so far and consist of several novel taxonomic levels ranging from classes to species. The diversity between the different mat types was far more pronounced than the changes between the different seasons at one location. The archaeal community for these mats have not been studied before and revealed a strong reaction on a short period of draught during summer resulting in a massive increase in halobacterial sequences, whereas the bacterial community was barely affected. We concluded that the community composition and the microbial diversity were intrinsic of the mat type and depend on the location along the tidal gradient indicating a relation with salinity.

  13. Mass spectrometry-based protein identification by integrating de novo sequencing with database searching.

    PubMed

    Wang, Penghao; Wilson, Susan R

    2013-01-01

    Mass spectrometry-based protein identification is a very challenging task. The main identification approaches include de novo sequencing and database searching. Both approaches have shortcomings, so an integrative approach has been developed. The integrative approach firstly infers partial peptide sequences, known as tags, directly from tandem spectra through de novo sequencing, and then puts these sequences into a database search to see if a close peptide match can be found. However the current implementation of this integrative approach has several limitations. Firstly, simplistic de novo sequencing is applied and only very short sequence tags are used. Secondly, most integrative methods apply an algorithm similar to BLAST to search for exact sequence matches and do not accommodate sequence errors well. Thirdly, by applying these methods the integrated de novo sequencing makes a limited contribution to the scoring model which is still largely based on database searching. We have developed a new integrative protein identification method which can integrate de novo sequencing more efficiently into database searching. Evaluated on large real datasets, our method outperforms popular identification methods.

  14. Sampling and pyrosequencing methods for characterizing bacterial communities in the human gut using 16S sequence tags

    PubMed Central

    2010-01-01

    Intense interest centers on the role of the human gut microbiome in health and disease, but optimal methods for analysis are still under development. Here we present a study of methods for surveying bacterial communities in human feces using 454/Roche pyrosequencing of 16S rRNA gene tags. We analyzed fecal samples from 10 individuals and compared methods for storage, DNA purification and sequence acquisition. To assess reproducibility, we compared samples one cm apart on a single stool specimen for each individual. To analyze storage methods, we compared 1) immediate freezing at -80°C, 2) storage on ice for 24 or 3) 48 hours. For DNA purification methods, we tested three commercial kits and bead beating in hot phenol. Variations due to the different methodologies were compared to variation among individuals using two approaches--one based on presence-absence information for bacterial taxa (unweighted UniFrac) and the other taking into account their relative abundance (weighted UniFrac). In the unweighted analysis relatively little variation was associated with the different analytical procedures, and variation between individuals predominated. In the weighted analysis considerable variation was associated with the purification methods. Particularly notable was improved recovery of Firmicutes sequences using the hot phenol method. We also carried out surveys of the effects of different 454 sequencing methods (FLX versus Titanium) and amplification of different 16S rRNA variable gene segments. Based on our findings we present recommendations for protocols to collect, process and sequence bacterial 16S rDNA from fecal samples--some major points are 1) if feasible, bead-beating in hot phenol or use of the PSP kit improves recovery; 2) storage methods can be adjusted based on experimental convenience; 3) unweighted (presence-absence) comparisons are less affected by lysis method. PMID:20673359

  15. Short range spread-spectrum radiolocation system and method

    DOEpatents

    Smith, Stephen F.

    2003-04-29

    A short range radiolocation system and associated methods that allow the location of an item, such as equipment, containers, pallets, vehicles, or personnel, within a defined area. A small, battery powered, self-contained tag is provided to an item to be located. The tag includes a spread-spectrum transmitter that transmits a spread-spectrum code and identification information. A plurality of receivers positioned about the area receive signals from a transmitting tag. The position of the tag, and hence the item, is located by triangulation. The system employs three different ranging techniques for providing coarse, intermediate, and fine spatial position resolution. Coarse positioning information is provided by use of direct-sequence code phase transmitted as a spread-spectrum signal. Intermediate positioning information is provided by the use of a difference signal transmitted with the direct-sequence spread-spectrum code. Fine positioning information is provided by use of carrier phase measurements. An algorithm is employed to combine the three data sets to provide accurate location measurements.

  16. Expression and purification of recombinant proteins in Escherichia coli tagged with the metal-binding protein CusF.

    PubMed

    Cantu-Bustos, J Enrique; Vargas-Cortez, Teresa; Morones-Ramirez, Jose Ruben; Balderas-Renteria, Isaias; Galbraith, David W; McEvoy, Megan M; Zarate, Xristo

    2016-05-01

    Production of recombinant proteins in Escherichia coli has been improved considerably through the use of fusion proteins, because they increase protein solubility and facilitate purification via affinity chromatography. In this article, we propose the use of CusF as a new fusion partner for expression and purification of recombinant proteins in E. coli. Using a cell-free protein expression system, based on the E. coli S30 extract, Green Fluorescent Protein (GFP) was expressed with a series of different N-terminal tags, immobilized on self-assembled protein microarrays, and its fluorescence quantified. GFP tagged with CusF showed the highest fluorescence intensity, and this was greater than the intensities from corresponding GFP constructs that contained MBP or GST tags. Analysis of protein production in vivo showed that CusF produces large amounts of soluble protein with low levels of inclusion bodies. Furthermore, fusion proteins can be exported to the cellular periplasm, if CusF contains the signal sequence. Taking advantage of its ability to bind copper ions, recombinant proteins can be purified with readily available IMAC resins charged with this metal ion, producing pure proteins after purification and tag removal. We therefore recommend the use of CusF as a viable alternative to MBP or GST as a fusion protein/affinity tag for the production of soluble recombinant proteins in E. coli. Copyright © 2016 Elsevier Inc. All rights reserved.

  17. Expanding the versatility of phage display I: efficient display of peptide-tags on protein VII of the filamentous phage.

    PubMed

    Løset, Geir Åge; Bogen, Bjarne; Sandlie, Inger

    2011-02-24

    Phage display is a platform for selection of specific binding molecules and this is a clear-cut motivation for increasing its performance. Polypeptides are normally displayed as fusions to the major coat protein VIII (pVIII), or the minor coat protein III (pIII). Display on other coat proteins such as pVII allows for display of heterologous peptide sequences on the virions in addition to those displayed on pIII and pVIII. In addition, pVII display is an alternative to pIII or pVIII display. Here we demonstrate how standard pIII or pVIII display phagemids are complemented with a helper phage which supports production of virions that are tagged with octa FLAG, HIS(6) or AviTag on pVII. The periplasmic signal sequence required for pIII and pVIII display, and which has been added to pVII in earlier studies, is omitted altogether. Tagging on pVII is an important and very useful add-on feature to standard pIII and pVII display. Any phagemid bearing a protein of interest on either pIII or pVIII can be tagged with any of the tags depending simply on choice of helper phage. We show in this paper how such tags may be utilized for immobilization and separation as well as purification and detection of monoclonal and polyclonal phage populations.

  18. The use of sequence-based SSR mining for the development of a vast collection of microsatellites in Aquilegia Formosa

    Treesearch

    Brandon Schlautman; Vera Pfeiffer; Juan Zalapa; Johanne Brunet

    2014-01-01

    Numerous microsatellite markers were developed for Aquilegia formosafrom sequences deposited within the Expressed Sequence Tag (EST), Genomic Survey Sequence (GSS), and Nucleotide databases in NCBI. Microsatellites (SSRs) were identified and primers were designed for 9 SSR containing sequences in the Nucleotide database, 3803 sequences in the EST...

  19. Gene expression profiling via LongSAGE in a non-model plant species: a case study in seeds of Brassica napus

    PubMed Central

    Obermeier, Christian; Hosseini, Bashir; Friedt, Wolfgang; Snowdon, Rod

    2009-01-01

    Background Serial analysis of gene expression (LongSAGE) was applied for gene expression profiling in seeds of oilseed rape (Brassica napus ssp. napus). The usefulness of this technique for detailed expression profiling in a non-model organism was demonstrated for the highly complex, neither fully sequenced nor annotated genome of B. napus by applying a tag-to-gene matching strategy based on Brassica ESTs and the annotated proteome of the closely related model crucifer A. thaliana. Results Transcripts from 3,094 genes were detected at two time-points of seed development, 23 days and 35 days after pollination (DAP). Differential expression showed a shift from gene expression involved in diverse developmental processes including cell proliferation and seed coat formation at 23 DAP to more focussed metabolic processes including storage protein accumulation and lipid deposition at 35 DAP. The most abundant transcripts at 23 DAP were coding for diverse protease inhibitor proteins and proteases, including cysteine proteases involved in seed coat formation and a number of lipid transfer proteins involved in embryo pattern formation. At 35 DAP, transcripts encoding napin, cruciferin and oleosin storage proteins were most abundant. Over both time-points, 18.6% of the detected genes were matched by Brassica ESTs identified by LongSAGE tags in antisense orientation. This suggests a strong involvement of antisense transcript expression in regulatory processes during B. napus seed development. Conclusion This study underlines the potential of transcript tagging approaches for gene expression profiling in Brassica crop species via EST matching to annotated A. thaliana genes. Limits of tag detection for low-abundance transcripts can today be overcome by ultra-high throughput sequencing approaches, so that tag-based gene expression profiling may soon become the method of choice for global expression profiling in non-model species. PMID:19575793

  20. Expression and Stability of Foreign Epitopes Introduced into 3A Nonstructural Protein of Foot-and-Mouth Disease Virus

    PubMed Central

    Li, Pinghua; Bai, Xingwen; Cao, Yimei; Han, Chenghao; Lu, Zengjun; Sun, Pu; Yin, Hong; Liu, Zaixin

    2012-01-01

    Foot-and-mouth disease virus (FMDV) is an aphthovirus that belongs to the Picornaviridae family and causes one of the most important animal diseases worldwide. The capacity of other picornaviruses to express foreign antigens has been extensively reported, however, little is known about FMDV. To explore the potential of FMDV as a viral vector, an 11-amino-acid (aa) HSV epitope and an 8 aa FLAG epitope were introduced into the C-terminal different regions of 3A protein of FMDV full-length infectious cDNA clone. Recombinant viruses expressing the HSV or FLAG epitope were successfully rescued after transfection of both modified constructs. Immunofluorescence assay, Western blot and sequence analysis showed that the recombinant viruses stably maintained the foreign epitopes even after 11 serial passages in BHK-21 cells. The 3A-tagged viruses shared similar plaque phenotypes and replication kinetics to those of the parental virus. In addition, mice experimentally infected with the epitope-tagged viruses could induce tag-specific antibodies. Our results demonstrate that FMDV can be used effectively as a viral vector for the delivery of foreign tags. PMID:22848509

  1. Expressed sequence tag (EST) analysis of the pine wood nematode Bursaphelenchus xylophilus and B. mucronatus.

    PubMed

    Kikuchi, Taisei; Aikawa, Takuya; Kosaka, Hajime; Pritchard, Leighton; Ogura, Nobuo; Jones, John T

    2007-09-01

    Most Bursaphelenchus species feed on fungi that colonise dead or dying trees. However, Bursaphelenchus xylophilus is unique in that in addition to feeding on fungi it has the capacity to be a parasite of live pine trees. We present an analysis of over 13,000 expressed sequence tags (ESTs) from B. xylophilus and, by way of contrast, over 3000 ESTs from a closely related species that does not parasitise plants as readily; B. mucronatus. Four libraries from B. xylophilus, from a variety of life stages including fungal feeding nematodes, nematodes extracted from plants and dauer-like stage nematodes, and one library from B. mucronatus were constructed and used to generate ESTs. Contig analysis showed that the 13,327 B. xylophilus ESTs could be grouped into 2110 contigs and 4377 singletons giving a total of 6487 identified genes. Similarly the 3193 B. mucronatus ESTs yielded a total of 2219 identified genes from 425 contigs and 1794 singletons. A variety of proteins potentially important in the parasitic process of B. xylophilus and B. mucronatus, including plant and fungal cell wall degrading enzymes and a novel gene potentially encoding a expansin-like protein that may disrupt non-covalent bonds in the plant cell wall were identified in the libraries. Additionally several gene candidates potentially involved in dauer entry or maintenance were also identified in the EST dataset. The EST sequences from this study will provide a solid base for future research on the biology, pathogenicity and evolutionary history of this nematode group.

  2. Cytogenetic and molecular identification of a wheat-Leymus mollis alien multiple substitution line from octoploid Tritileymus x Triticum durum.

    PubMed

    Pang, Y H; Zhao, J X; Du, W L; Li, Y L; Wang, J; Wang, L M; Wu, J; Cheng, X N; Yang, Q H; Chen, X H

    2014-05-23

    Leymus mollis (Trin.) Pilger (NsNsXmXm, 2n = 28), a wild relative of common wheat, possesses many traits that are potentially valuable for wheat improvement. In order to exploit and utilize the useful genes of L. mollis, we developed a multiple alien substitution line, 10DM50, from the progenies of octoploid Tritileymus M842-16 x Triticum durum cv. D4286. Genomic in situ hybridization analysis of mitosis and meiosis (metaphase I), using labeled total DNA of Psathyrostachys huashanica as probe, showed that the substitution line 10DM50 was a cytogenetically stable alien substitution line with 36 chromosomes from wheat and three pairs of Ns genome chromosomes from L. mollis. Simple sequence repeat analysis showed that the chromosomes 3D, 6D, and 7D were absent in 10DM50. Expressed sequence tag-sequence tagged sites analysis showed that new chromatin from 3Ns, 6Ns, and 7Ns of L. mollis were detected in 10DM50. We deduced that the substitution line 10DM50 was a multiple alien substitution line with the 3D, 6D, and 7D chromosomes replaced by 3Ns, 6Ns, and 7Ns from L. mollis. 10DM50 showed high resistance to leaf rust and significantly improved spike length, spikes per plant, and kernels per spike, which are correlated with higher wheat yield. These results suggest that line 10DM50 could be used as intermediate material for transferring desirable traits from L. mollis into common wheat in breeding programs.

  3. Evolutionary view of acyl-CoA diacylglycerol acyltransferase (DGAT), a key enzyme in neutral lipid biosynthesis.

    PubMed

    Turchetto-Zolet, Andreia C; Maraschin, Felipe S; de Morais, Guilherme L; Cagliari, Alexandro; Andrade, Cláudia M B; Margis-Pinheiro, Marcia; Margis, Rogerio

    2011-09-20

    Triacylglycerides (TAGs) are a class of neutral lipids that represent the most important storage form of energy for eukaryotic cells. DGAT (acyl-CoA: diacylglycerol acyltransferase; EC 2.3.1.20) is a transmembrane enzyme that acts in the final and committed step of TAG synthesis, and it has been proposed to be the rate-limiting enzyme in plant storage lipid accumulation. In fact, two different enzymes identified in several eukaryotic species, DGAT1 and DGAT2, are the main enzymes responsible for TAG synthesis. These enzymes do not share high DNA or protein sequence similarities, and it has been suggested that they play non-redundant roles in different tissues and in some species in TAG synthesis. Despite a number of previous studies on the DGAT1 and DGAT2 genes, which have emphasized their importance as potential obesity treatment targets to increase triacylglycerol accumulation, little is known about their evolutionary timeline in eukaryotes. The goal of this study was to examine the evolutionary relationship of the DGAT1 and DGAT2 genes across eukaryotic organisms in order to infer their origin. We have conducted a broad survey of fully sequenced genomes, including representatives of Amoebozoa, yeasts, fungi, algae, musses, plants, vertebrate and invertebrate species, for the presence of DGAT1 and DGAT2 gene homologs. We found that the DGAT1 and DGAT2 genes are nearly ubiquitous in eukaryotes and are readily identifiable in all the major eukaryotic groups and genomes examined. Phylogenetic analyses of the DGAT1 and DGAT2 amino acid sequences revealed evolutionary partitioning of the DGAT protein family into two major DGAT1 and DGAT2 clades. Protein secondary structure and hydrophobic-transmembrane analysis also showed differences between these enzymes. The analysis also revealed that the MGAT2 and AWAT genes may have arisen from DGAT2 duplication events. In this study, we identified several DGAT1 and DGAT2 homologs in eukaryote taxa. Overall, the data show that DGAT1 and DGAT2 are present in most eukaryotic organisms and belong to two different gene families. The phylogenetic and evolutionary analyses revealed that DGAT1 and DGAT2 evolved separately, with functional convergence, despite their wide molecular and structural divergence.

  4. Evolutionary view of acyl-CoA diacylglycerol acyltransferase (DGAT), a key enzyme in neutral lipid biosynthesis

    PubMed Central

    2011-01-01

    Background Triacylglycerides (TAGs) are a class of neutral lipids that represent the most important storage form of energy for eukaryotic cells. DGAT (acyl-CoA: diacylglycerol acyltransferase; EC 2.3.1.20) is a transmembrane enzyme that acts in the final and committed step of TAG synthesis, and it has been proposed to be the rate-limiting enzyme in plant storage lipid accumulation. In fact, two different enzymes identified in several eukaryotic species, DGAT1 and DGAT2, are the main enzymes responsible for TAG synthesis. These enzymes do not share high DNA or protein sequence similarities, and it has been suggested that they play non-redundant roles in different tissues and in some species in TAG synthesis. Despite a number of previous studies on the DGAT1 and DGAT2 genes, which have emphasized their importance as potential obesity treatment targets to increase triacylglycerol accumulation, little is known about their evolutionary timeline in eukaryotes. The goal of this study was to examine the evolutionary relationship of the DGAT1 and DGAT2 genes across eukaryotic organisms in order to infer their origin. Results We have conducted a broad survey of fully sequenced genomes, including representatives of Amoebozoa, yeasts, fungi, algae, musses, plants, vertebrate and invertebrate species, for the presence of DGAT1 and DGAT2 gene homologs. We found that the DGAT1 and DGAT2 genes are nearly ubiquitous in eukaryotes and are readily identifiable in all the major eukaryotic groups and genomes examined. Phylogenetic analyses of the DGAT1 and DGAT2 amino acid sequences revealed evolutionary partitioning of the DGAT protein family into two major DGAT1 and DGAT2 clades. Protein secondary structure and hydrophobic-transmembrane analysis also showed differences between these enzymes. The analysis also revealed that the MGAT2 and AWAT genes may have arisen from DGAT2 duplication events. Conclusions In this study, we identified several DGAT1 and DGAT2 homologs in eukaryote taxa. Overall, the data show that DGAT1 and DGAT2 are present in most eukaryotic organisms and belong to two different gene families. The phylogenetic and evolutionary analyses revealed that DGAT1 and DGAT2 evolved separately, with functional convergence, despite their wide molecular and structural divergence. PMID:21933415

  5. Diacylglycerol acyltransferase 2 of Mortierella alpina with specificity on long-chain polyunsaturated fatty acids: A potential tool for reconstituting lipids with nutritional value.

    PubMed

    Jeennor, Sukanya; Veerana, Mayura; Anantayanon, Jutamas; Panchanawaporn, Sarocha; Chutrakul, Chanikul; Laoteng, Kobkul

    2017-12-10

    Based on available genome sequences and bioinformatics tools, we searched for an uncharacterized open reading frame of Mortierella alpina (MaDGAT2) using diacylglycerol acyltransferase sequence (fungal DGAT type 2B) as a query. Functional characterization of the identified native and codon-optimized M. alpina genes were then performed by heterologous expression in Saccharomyces cerevisiae strain defective in synthesis of neutral lipid (NL). Lipid analysis of the yeast tranformant carrying MaDGAT2 showed that the NL biosynthesis and lipid particle formation were restored by the gene complementation. Substrate specificity study of the fungal enzyme by fatty acid supplementation in the transformant cultures showed that it had a broad specificity on saturated and unsaturated fatty acid substrates for esterification into triacylglycerol (TAG). The n-6 polyunsaturated fatty acids (PUFAs) with 18 and 20 carbon atoms, including linoleic acid, γ-linolenic acid, dihomo γ-linolenic and arachidonic acid could be incorporated into TAG fraction in the yeast cells. Interestingly, among n-3 PUFAs tested, the MaDGAT2 enzyme preferred eicosapentaenoic acid (EPA) substrate as its highly proportional constituent found in TAG fraction. This study provides a potential genetic tool for reconstituting oils rich in long-chain PUFAs with nutritional value. Copyright © 2017 Elsevier B.V. All rights reserved.

  6. A Primary Assembly of a Bovine Haplotype Block Map Based on a 15,036-Single-Nucleotide Polymorphism Panel Genotyped in Holstein–Friesian Cattle

    PubMed Central

    Khatkar, Mehar S.; Zenger, Kyall R.; Hobbs, Matthew; Hawken, Rachel J.; Cavanagh, Julie A. L.; Barris, Wes; McClintock, Alexander E.; McClintock, Sara; Thomson, Peter C.; Tier, Bruce; Nicholas, Frank W.; Raadsma, Herman W.

    2007-01-01

    Analysis of data on 1000 Holstein–Friesian bulls genotyped for 15,036 single-nucleotide polymorphisms (SNPs) has enabled genomewide identification of haplotype blocks and tag SNPs. A final subset of 9195 SNPs in Hardy–Weinberg equilibrium and mapped on autosomes on the bovine sequence assembly (release Btau 3.1) was used in this study. The average intermarker spacing was 251.8 kb. The average minor allele frequency (MAF) was 0.29 (0.05–0.5). Following recent precedents in human HapMap studies, a haplotype block was defined where 95% of combinations of SNPs within a region are in very high linkage disequilibrium. A total of 727 haplotype blocks consisting of ≥3 SNPs were identified. The average block length was 69.7 ± 7.7 kb, which is ∼5–10 times larger than in humans. These blocks comprised a total of 2964 SNPs and covered 50,638 kb of the sequence map, which constitutes 2.18% of the length of all autosomes. A set of tag SNPs, which will be useful for further fine-mapping studies, has been identified. Overall, the results suggest that as many as 75,000–100,000 tag SNPs would be needed to track all important haplotype blocks in the bovine genome. This would require ∼250,000 SNPs in the discovery phase. PMID:17435229

  7. Analysis of expressed sequence tags from a NaHCO(3)-treated alkali-tolerant plant, Chloris virgata.

    PubMed

    Nishiuchi, Shunsaku; Fujihara, Kazumasa; Liu, Shenkui; Takano, Tetsuo

    2010-04-01

    Chloris virgata Swartz (C. virgata) is a gramineous wild plant that can survive in saline-alkali areas in northeast China. To examine the tolerance mechanisms of C. virgata, we constructed a cDNA library from whole plants of C. virgata that had been treated with 100 mM NaHCO(3) for 24 h and sequenced 3168 randomly selected clones. Most (2590) of the expressed sequence tags (ESTs) showed significant similarity to sequences in the NCBI database. Of the 2590 genes, 1893 were unique. Gene Ontology (GO) Slim annotations were obtained for 1081 ESTs by BLAST2GO and it was found that 75 genes of them were annotated with GO terms "response to stress", "response to abiotic stimulus", and "response to biotic stimulus", indicating these genes were likely to function in tolerance mechanism of C. virgata. In a separate experiment, 24 genes that are known from previous studies to be associated with abiotic stress tolerance were further examined by real-time RT-PCR to see how their expressions were affected by NaHCO(3) stress. NaHCO(3) treatment up-regulated the expressions of pathogenesis-related gene (DC998527), Win1 precursor gene (DC998617), catalase gene (DC999385), ribosome inactivating protein 1 (DC999555), Na(+)/H(+) antiporter gene (DC998043), and two-component regulator gene (DC998236). Copyright 2010 Elsevier Masson SAS. All rights reserved.

  8. RAD tag sequencing as a source of SNP markers in Cynara cardunculus L

    PubMed Central

    2012-01-01

    Background The globe artichoke (Cynara cardunculus L. var. scolymus) genome is relatively poorly explored, especially compared to those of the other major Asteraceae crops sunflower and lettuce. No SNP markers are in the public domain. We have combined the recently developed restriction-site associated DNA (RAD) approach with the Illumina DNA sequencing platform to effect the rapid and mass discovery of SNP markers for C. cardunculus. Results RAD tags were sequenced from the genomic DNA of three C. cardunculus mapping population parents, generating 9.7 million reads, corresponding to ~1 Gbp of sequence. An assembly based on paired ends produced ~6.0 Mbp of genomic sequence, separated into ~19,000 contigs (mean length 312 bp), of which ~21% were fragments of putative coding sequence. The shared sequences allowed for the discovery of ~34,000 SNPs and nearly 800 indels, equivalent to a SNP frequency of 5.6 per 1,000 nt, and an indel frequency of 0.2 per 1,000 nt. A sample of heterozygous SNP loci was mapped by CAPS assays and this exercise provided validation of our mining criteria. The repetitive fraction of the genome had a high representation of retrotransposon sequence, followed by simple repeats, AT-low complexity regions and mobile DNA elements. The genomic k-mers distribution and CpG rate of C. cardunculus, compared with data derived from three whole genome-sequenced dicots species, provided a further evidence of the random representation of the C. cardunculus genome generated by RAD sampling. Conclusion The RAD tag sequencing approach is a cost-effective and rapid method to develop SNP markers in a highly heterozygous species. Our approach permitted to generate a large and robust SNP datasets by the adoption of optimized filtering criteria. PMID:22214349

  9. Annotation and sequence diversity of transposable elements in common bean (Phaseolus vulgaris).

    PubMed

    Gao, Dongying; Abernathy, Brian; Rohksar, Daniel; Schmutz, Jeremy; Jackson, Scott A

    2014-01-01

    Common bean (Phaseolus vulgaris) is an important legume crop grown and consumed worldwide. With the availability of the common bean genome sequence, the next challenge is to annotate the genome and characterize functional DNA elements. Transposable elements (TEs) are the most abundant component of plant genomes and can dramatically affect genome evolution and genetic variation. Thus, it is pivotal to identify TEs in the common bean genome. In this study, we performed a genome-wide transposon annotation in common bean using a combination of homology and sequence structure-based methods. We developed a 2.12-Mb transposon database which includes 791 representative transposon sequences and is available upon request or from www.phytozome.org. Of note, nearly all transposons in the database are previously unrecognized TEs. More than 5,000 transposon-related expressed sequence tags (ESTs) were detected which indicates that some transposons may be transcriptionally active. Two Ty1-copia retrotransposon families were found to encode the envelope-like protein which has rarely been identified in plant genomes. Also, we identified an extra open reading frame (ORF) termed ORF2 from 15 Ty3-gypsy families that was located between the ORF encoding the retrotransposase and the 3'LTR. The ORF2 was in opposite transcriptional orientation to retrotransposase. Sequence homology searches and phylogenetic analysis suggested that the ORF2 may have an ancient origin, but its function is not clear. These transposon data provide a useful resource for understanding the genome organization and evolution and may be used to identify active TEs for developing transposon-tagging system in common bean and other related genomes.

  10. A SSR-based genetic linkage map of cultivated peanut (Arachis hypogaea L.)

    USDA-ARS?s Scientific Manuscript database

    The objective of this study was to construct a molecular linkage map of cultivated tetraploid peanut using simple sequence repeat (SSR) markers derived primarily from peanut genomic sequences, expressed sequence tags (ESTs), and by "data mining" sequences released in GenBank. Three recombinant inbre...

  11. Performance and precision of double digestion RAD (ddRAD) genotyping in large multiplexed datasets of marine fish species.

    PubMed

    Maroso, F; Hillen, J E J; Pardo, B G; Gkagkavouzis, K; Coscia, I; Hermida, M; Franch, R; Hellemans, B; Van Houdt, J; Simionati, B; Taggart, J B; Nielsen, E E; Maes, G; Ciavaglia, S A; Webster, L M I; Volckaert, F A M; Martinez, P; Bargelloni, L; Ogden, R

    2018-06-01

    The development of Genotyping-By-Sequencing (GBS) technologies enables cost-effective analysis of large numbers of Single Nucleotide Polymorphisms (SNPs), especially in "non-model" species. Nevertheless, as such technologies enter a mature phase, biases and errors inherent to GBS are becoming evident. Here, we evaluated the performance of double digest Restriction enzyme Associated DNA (ddRAD) sequencing in SNP genotyping studies including high number of samples. Datasets of sequence data were generated from three marine teleost species (>5500 samples, >2.5 × 10 12 bases in total), using a standardized protocol. A common bioinformatics pipeline based on STACKS was established, with and without the use of a reference genome. We performed analyses throughout the production and analysis of ddRAD data in order to explore (i) the loss of information due to heterogeneous raw read number across samples; (ii) the discrepancy between expected and observed tag length and coverage; (iii) the performances of reference based vs. de novo approaches; (iv) the sources of potential genotyping errors of the library preparation/bioinformatics protocol, by comparing technical replicates. Our results showed use of a reference genome and a posteriori genotype correction improved genotyping precision. Individual read coverage was a key variable for reproducibility; variance in sequencing depth between loci in the same individual was also identified as an important factor and found to correlate to tag length. A comparison of downstream analysis carried out with ddRAD vs single SNP allele specific assay genotypes provided information about the levels of genotyping imprecision that can have a significant impact on allele frequency estimations and population assignment. The results and insights presented here will help to select and improve approaches to the analysis of large datasets based on RAD-like methodologies. Crown Copyright © 2018. Published by Elsevier B.V. All rights reserved.

  12. Endophyte Microbiome Diversity in Micropropagated Atriplex canescens and Atriplex torreyi var griffithsii

    PubMed Central

    Lucero, Mary E.; Unc, Adrian; Cooke, Peter; Dowd, Scot; Sun, Shulei

    2011-01-01

    Microbial diversity associated with micropropagated Atriplex species was assessed using microscopy, isolate culturing, and sequencing. Light, electron, and confocal microscopy revealed microbial cells in aseptically regenerated leaves and roots. Clone libraries and tag-encoded FLX amplicon pyrosequencing (TEFAP) analysis amplified sequences from callus homologous to diverse fungal and bacterial taxa. Culturing isolated some seed borne endophyte taxa which could be readily propagated apart from the host. Microbial cells were observed within biofilm-like residues associated with plant cell surfaces and intercellular spaces. Various universal primers amplified both plant and microbial sequences, with different primers revealing different patterns of fungal diversity. Bacterial and fungal TEFAP followed by alignment with sequences from curated databases revealed 7 bacterial and 17 ascomycete taxa in A. canescens, and 5 bacterial taxa in A. torreyi. Additional diversity was observed among isolates and clone libraries. Micropropagated Atriplex retains a complex, intimately associated microbiome which includes diverse strains well poised to interact in manners that influence host physiology. Microbiome analysis was facilitated by high throughput sequencing methods, but primer biases continue to limit recovery of diverse sequences from even moderately complex communities. PMID:21437280

  13. A new fusion protein platform for quantitatively measuring activity of multiple proteases

    PubMed Central

    2014-01-01

    Background Recombinant proteins fused with specific cleavage sequences are widely used as substrate for quantitatively analyzing the activity of proteases. Here we propose a new fusion platform for multiple proteases, by using diaminopropionate ammonia-lyase (DAL) as the fusion protein. It was based on the finding that a fused His6-tag could significantly decreases the activities of DAL from E. coli (eDAL) and Salmonella typhimurium (sDAL). Previously, we have shown that His6GST-tagged eDAL could be used to determine the activity of tobacco etch virus protease (TEVp) under different temperatures or in the denaturant at different concentrations. In this report, we will assay different tags and cleavage sequences on DAL for expressing yield in E. coli, stability of the fused proteins and performance of substrate of other common proteases. Results We tested seven different protease cleavage sequences (rhinovirus 3C, TEV protease, factor Xa, Ssp DnaB intein, Sce VMA1 intein, thrombin and enterokinase), three different tags (His6, GST, CBD and MBP) and two different DALs (eDAL and sDAL), for their performance as substrate to the seven corresponding proteases. Among them, we found four active DAL-fusion substrates suitable for TEVp, factor Xa, thrombin and DnaB intein. Enterokinase cleaved eDAL at undesired positions and did not process sDAL. Substitution of GST with MBP increase the expression level of the fused eDAL and this fusion protein was suitable as a substrate for analyzing activity of rhinovirus 3C. We demonstrated that SUMO protease Ulp1 with a N-terminal His6-tag or MBP tag displayed different activity using the designed His6SUMO-eDAL as substrate. Finally, owing to the high level of the DAL-fusion protein in E. coli, these protein substrates can also be detected directly from the crude extract. Conclusion The results show that our designed DAL-fusion proteins can be used to quantify the activities of both sequence- and conformational-specific proteases, with sufficient substrate specificity. PMID:24649897

  14. Recurrence time statistics: versatile tools for genomic DNA sequence analysis.

    PubMed

    Cao, Yinhe; Tung, Wen-Wen; Gao, J B

    2004-01-01

    With the completion of the human and a few model organisms' genomes, and the genomes of many other organisms waiting to be sequenced, it has become increasingly important to develop faster computational tools which are capable of easily identifying the structures and extracting features from DNA sequences. One of the more important structures in a DNA sequence is repeat-related. Often they have to be masked before protein coding regions along a DNA sequence are to be identified or redundant expressed sequence tags (ESTs) are to be sequenced. Here we report a novel recurrence time based method for sequence analysis. The method can conveniently study all kinds of periodicity and exhaustively find all repeat-related features from a genomic DNA sequence. An efficient codon index is also derived from the recurrence time statistics, which has the salient features of being largely species-independent and working well on very short sequences. Efficient codon indices are key elements of successful gene finding algorithms, and are particularly useful for determining whether a suspected EST belongs to a coding or non-coding region. We illustrate the power of the method by studying the genomes of E. coli, the yeast S. cervisivae, the nematode worm C. elegans, and the human, Homo sapiens. Computationally, our method is very efficient. It allows us to carry out analysis of genomes on the whole genomic scale by a PC.

  15. Markers and mapping revisited: finding your gene.

    PubMed

    Jones, Neil; Ougham, Helen; Thomas, Howard; Pasakinskiene, Izolda

    2009-01-01

    This paper is an update of our earlier review (Jones et al., 1997, Markers and mapping: we are all geneticists now. New Phytologist 137: 165-177), which dealt with the genetics of mapping, in terms of recombination as the basis of the procedure, and covered some of the first generation of markers, including restriction fragment length polymorphisms (RFLPs), random amplified polymorphic DNA (RAPDs), simple sequence repeats (SSRs) and quantitative trait loci (QTLs). In the intervening decade there have been numerous developments in marker science with many new systems becoming available, which are herein described: cleavage amplification polymorphism (CAP), sequence-specific amplification polymorphism (S-SAP), inter-simple sequence repeat (ISSR), sequence tagged site (STS), sequence characterized amplification region (SCAR), selective amplification of microsatellite polymorphic loci (SAMPL), single nucleotide polymorphism (SNP), expressed sequence tag (EST), sequence-related amplified polymorphism (SRAP), target region amplification polymorphism (TRAP), microarrays, diversity arrays technology (DArT), single-strand conformation polymorphism (SSCP), denaturing gradient gel electrophoresis (DGGE), temperature gradient gel electrophoresis (TGGE) and methylation-sensitive PCR. In addition there has been an explosion of knowledge and databases in the area of genomics and bioinformatics. The number of flowering plant ESTs is c. 19 million and counting, with all the opportunity that this provides for gene-hunting, while the survey of bioinformatics and computer resources points to a rapid growth point for future activities in unravelling and applying the burst of new information on plant genomes. A case study is presented on tracking down a specific gene (stay-green (SGR), a post-transcriptional senescence regulator) using the full suite of mapping tools and comparative mapping resources. We end with a brief speculation on how genome analysis may progress into the future of this highly dynamic arena of plant science.

  16. Microsatellite DNA in genomic survey sequences and UniGenes of loblolly pine

    Treesearch

    Craig S Echt; Surya Saha; Dennis L Deemer; C Dana Nelson

    2011-01-01

    Genomic DNA sequence databases are a potential and growing resource for simple sequence repeat (SSR) marker development in loblolly pine (Pinus taeda L.). Loblolly pine also has many expressed sequence tags (ESTs) available for microsatellite (SSR) marker development. We compared loblolly pine SSR densities in genome survey sequences (GSSs) to those in non-redundant...

  17. Testing genotyping strategies for ultra-deep sequencing of a co-amplifying gene family: MHC class I in a passerine bird.

    PubMed

    Biedrzycka, Aleksandra; Sebastian, Alvaro; Migalska, Magdalena; Westerdahl, Helena; Radwan, Jacek

    2017-07-01

    Characterization of highly duplicated genes, such as genes of the major histocompatibility complex (MHC), where multiple loci often co-amplify, has until recently been hindered by insufficient read depths per amplicon. Here, we used ultra-deep Illumina sequencing to resolve genotypes at exon 3 of MHC class I genes in the sedge warbler (Acrocephalus schoenobaenus). We sequenced 24 individuals in two replicates and used this data, as well as a simulated data set, to test the effect of amplicon coverage (range: 500-20 000 reads per amplicon) on the repeatability of genotyping using four different genotyping approaches. A third replicate employed unique barcoding to assess the extent of tag jumping, that is swapping of individual tag identifiers, which may confound genotyping. The reliability of MHC genotyping increased with coverage and approached or exceeded 90% within-method repeatability of allele calling at coverages of >5000 reads per amplicon. We found generally high agreement between genotyping methods, especially at high coverages. High reliability of the tested genotyping approaches was further supported by our analysis of the simulated data set, although the genotyping approach relying primarily on replication of variants in independent amplicons proved sensitive to repeatable errors. According to the most repeatable genotyping method, the number of co-amplifying variants per individual ranged from 19 to 42. Tag jumping was detectable, but at such low frequencies that it did not affect the reliability of genotyping. We thus demonstrate that gene families with many co-amplifying genes can be reliably genotyped using HTS, provided that there is sufficient per amplicon coverage. © 2016 John Wiley & Sons Ltd.

  18. Recombinant Protein Truncation Strategy for Inducing Bactericidal Antibodies to the Macrophage Infectivity Potentiator Protein of Neisseria meningitidis and Circumventing Potential Cross-Reactivity with Human FK506-Binding Proteins

    PubMed Central

    Bielecka, Magdalena K.; Devos, Nathalie; Gilbert, Mélanie; Hung, Miao-Chiu; Weynants, Vincent; Heckels, John E.

    2014-01-01

    A recombinant macrophage infectivity potentiator (rMIP) protein of Neisseria meningitidis induces significant serum bactericidal antibody production in mice and is a candidate meningococcal vaccine antigen. However, bioinformatics analysis of MIP showed some amino acid sequence similarity to human FK506-binding proteins (FKBPs) in residues 166 to 252 located in the globular domain of the protein. To circumvent the potential concern over generating antibodies that could recognize human proteins, we immunized mice with recombinant truncated type I rMIP proteins that lacked the globular domain and the signal leader peptide (LP) signal sequence (amino acids 1 to 22) and contained the His purification tag at either the N or C terminus (C-term). The immunogenicity of truncated rMIP proteins was compared to that of full (i.e., full-length) rMIP proteins (containing the globular domain) with either an N- or C-terminal His tag and with or without the LP sequence. By comparing the functional murine antibody responses to these various constructs, we determined that C-term His truncated rMIP (−LP) delivered in liposomes induced high levels of antibodies that bound to the surface of wild-type but not Δmip mutant meningococci and showed bactericidal activity against homologous type I MIP (median titers of 128 to 256) and heterologous type II and III (median titers of 256 to 512) strains, thereby providing at least 82% serogroup B strain coverage. In contrast, in constructs lacking the LP, placement of the His tag at the N terminus appeared to abrogate bactericidal activity. The strategy used in this study would obviate any potential concerns regarding the use of MIP antigens for inclusion in bacterial vaccines. PMID:25452551

  19. Detection of polyomavirus simian virus 40 tumor antigen DNA in AIDS-related systemic non-Hodgkin lymphoma

    NASA Technical Reports Server (NTRS)

    Vilchez, Regis A.; Lednicky, John A.; Halvorson, Steven J.; White, Zoe S.; Kozinetz, Claudia A.; Butel, Janet S.

    2002-01-01

    Systemic non-Hodgkin lymphoma (S-NHL) is a common malignancy during HIV infection, and it is hypothesized that infectious agents may be involved in the etiology. Epstein-Barr virus DNA is found in <40% of patients with AIDS-related S-NHL, suggesting that other oncogenic viruses, such as polyomaviruses, may play a role in pathogenesis. We analyzed AIDS-related S-NHL samples, NHL samples from HIV-negative patients, peripheral blood leukocytes from HIV-infected and -uninfected patients without NHL, and lymph nodes without tumors from HIV-infected patients. Specimens were examined by polymerase chain reaction analysis with use of primers specific for an N-terminal region of the oncoprotein large tumor antigen ( T-ag ) gene conserved among all three polyomaviruses (simian virus 40 [SV40], JC virus, and BK virus). Polyomavirus T-ag DNA sequences, proven to be SV40-specific, were detected more frequently in AIDS-related S-NHL samples (6 of 26) than in peripheral blood leukocytes from HIV-infected patients (6 of 26 vs. 0 of 69; p =.0001), NHL samples from HIV-negative patients (6 of 26 vs. 0 of 10; p =.09), or lymph nodes (6 of 26 vs. 0 of 7; p =.16). Sequences of C-terminal T-ag DNA from SV40 were amplified from two AIDS-related S-NHL samples. Epstein-Barr virus DNA sequences were detected in 38% (10 of 26) AIDS-related S-NHL samples, 50% (5 of 10) HIV-negative S-NHL samples, and 57% (4 of 7) lymph nodes. None of the S-NHL samples were positive for both Epstein-Barr virus DNA and SV40 DNA. Further studies of the possible role of SV40 in the pathogenesis of S-NHL are warranted.

  20. An active ac/ds transposon system for activation tagging in tomato cultivar m82 using clonal propagation.

    PubMed

    Carter, Jared D; Pereira, Andy; Dickerman, Allan W; Veilleux, Richard E

    2013-05-01

    Tomato (Solanum lycopersicum) is a model organism for Solanaceae in both molecular and agronomic research. This project utilized Agrobacterium tumefaciens transformation and the transposon-tagging construct Activator (Ac)/Dissociator (Ds)-ATag-Bar_gosGFP to produce activation-tagged and knockout mutants in the processing tomato cultivar M82. The construct carried hygromycin resistance (hyg), green fluorescent protein (GFP), and the transposase (TPase) of maize (Zea mays) Activator major transcript X054214.1 on the stable Ac element, along with a 35S enhancer tetramer and glufosinate herbicide resistance (BAR) on the mobile Ds-ATag element. An in vitro propagation strategy was used to produce a population of 25 T0 plants from a single transformed plant regenerated in tissue culture. A T1 population of 11,000 selfed and cv M82 backcrossed progeny was produced from the functional T0 line. This population was screened using glufosinate herbicide, hygromycin leaf painting, and multiplex polymerase chain reaction (PCR). Insertion sites of transposed Ds-ATag elements were identified through thermal asymmetric interlaced PCR, and resulting product sequences were aligned to the recently published tomato genome. A population of 509 independent, Ds-only transposant lines spanning all 12 tomato chromosomes has been developed. Insertion site analysis demonstrated that more than 80% of these lines harbored Ds insertions conducive to activation tagging. The capacity of the Ds-ATag element to alter transcription was verified by quantitative real-time reverse transcription-PCR in two mutant lines. The transposon-tagged lines have been immortalized in seed stocks and can be accessed through an online database, providing a unique resource for tomato breeding and analysis of gene function in the background of a commercial tomato cultivar.

  1. SEAN: SNP prediction and display program utilizing EST sequence clusters.

    PubMed

    Huntley, Derek; Baldo, Angela; Johri, Saurabh; Sergot, Marek

    2006-02-15

    SEAN is an application that predicts single nucleotide polymorphisms (SNPs) using multiple sequence alignments produced from expressed sequence tag (EST) clusters. The algorithm uses rules of sequence identity and SNP abundance to determine the quality of the prediction. A Java viewer is provided to display the EST alignments and predicted SNPs.

  2. Deciphering the landscape of host barriers to Listeria monocytogenes infection.

    PubMed

    Zhang, Ting; Abel, Sören; Abel Zur Wiesch, Pia; Sasabe, Jumpei; Davis, Brigid M; Higgins, Darren E; Waldor, Matthew K

    2017-06-13

    Listeria monocytogenes is a common food-borne pathogen that can disseminate from the intestine and infect multiple organs. Here, we used sequence tag-based analysis of microbial populations (STAMP) to investigate L monocytogenes population dynamics during infection. We created a genetically barcoded library of murinized L monocytogenes and then used deep sequencing to track the pathogen's dissemination routes and quantify its founding population ( N b ) sizes in different organs. We found that the pathogen disseminates from the gastrointestinal tract to distal sites through multiple independent routes and that N b sizes vary greatly among tissues, indicative of diverse host barriers to infection. Unexpectedly, comparative analyses of sequence tags revealed that fecally excreted organisms are largely derived from the very small number of L. monocytogenes cells that colonize the gallbladder. Immune depletion studies suggest that distinct innate immune cells restrict the pathogen's capacity to establish replicative niches in the spleen and liver. Finally, studies in germ-free mice suggest that the microbiota plays a critical role in the development of the splenic, but not the hepatic, barriers that prevent L. monocytogenes from seeding these organs. Collectively, these observations illustrate the potency of the STAMP approach to decipher the impact of host factors on population dynamics of pathogens during infection.

  3. Deciphering the landscape of host barriers to Listeria monocytogenes infection

    PubMed Central

    Zhang, Ting; Abel, Sören; Abel zur Wiesch, Pia; Sasabe, Jumpei; Davis, Brigid M.; Higgins, Darren E.; Waldor, Matthew K.

    2017-01-01

    Listeria monocytogenes is a common food-borne pathogen that can disseminate from the intestine and infect multiple organs. Here, we used sequence tag-based analysis of microbial populations (STAMP) to investigate L. monocytogenes population dynamics during infection. We created a genetically barcoded library of murinized L. monocytogenes and then used deep sequencing to track the pathogen’s dissemination routes and quantify its founding population (Nb) sizes in different organs. We found that the pathogen disseminates from the gastrointestinal tract to distal sites through multiple independent routes and that Nb sizes vary greatly among tissues, indicative of diverse host barriers to infection. Unexpectedly, comparative analyses of sequence tags revealed that fecally excreted organisms are largely derived from the very small number of L. monocytogenes cells that colonize the gallbladder. Immune depletion studies suggest that distinct innate immune cells restrict the pathogen’s capacity to establish replicative niches in the spleen and liver. Finally, studies in germ-free mice suggest that the microbiota plays a critical role in the development of the splenic, but not the hepatic, barriers that prevent L. monocytogenes from seeding these organs. Collectively, these observations illustrate the potency of the STAMP approach to decipher the impact of host factors on population dynamics of pathogens during infection. PMID:28559314

  4. Recording high quality speech during tagged cine-MRI studies using a fiber optic microphone.

    PubMed

    NessAiver, Moriel S; Stone, Maureen; Parthasarathy, Vijay; Kahana, Yuvi; Paritsky, Alexander; Paritsky, Alex

    2006-01-01

    To investigate the feasibility of obtaining high quality speech recordings during cine imaging of tongue movement using a fiber optic microphone. A Complementary Spatial Modulation of Magnetization (C-SPAMM) tagged cine sequence triggered by an electrocardiogram (ECG) simulator was used to image a volunteer while speaking the syllable pairs /a/-/u/, /i/-/u/, and the words "golly" and "Tamil" in sync with the imaging sequence. A noise-canceling, optical microphone was fastened approximately 1-2 inches above the mouth of the volunteer. The microphone was attached via optical fiber to a laptop computer, where the speech was sampled at 44.1 kHz. A reference recording of gradient activity with no speech was subtracted from target recordings. Good quality speech was discernible above the background gradient sound using the fiber optic microphone without reference subtraction. The audio waveform of gradient activity was extremely stable and reproducible. Subtraction of the reference gradient recording further reduced gradient noise by roughly 21 dB, resulting in exceptionally high quality speech waveforms. It is possible to obtain high quality speech recordings using an optical microphone even during exceptionally loud cine imaging sequences. This opens up the possibility of more elaborate MRI studies of speech including spectral analysis of the speech signal in all types of MRI.

  5. Discovery a novel organic solvent tolerant esterase from Salinispora arenicola CNP193 through genome mining.

    PubMed

    Fang, Yaowei; Wang, Shujun; Liu, Shu; Jiao, Yuliang

    2015-09-01

    An esterase gene, encoding a 325-amino-acid protein (SAestA), was mined form obligate marine actinomycete strain Salinispora arenicola CNP193 genome sequence. Phylogenetic analysis of the deduced amino acid sequence showed that the enzyme belonged to the family IV of lipolytic enzymes. The gene was cloned, expressed in Escherichia coli as a His-tagged protein, purified and characterized. The molecular weight of His-tagged SAestA is ∼38 kDa. SAestA-His6 was active in a temperature (5-40 °C) and pH range (7.0-11.0), and maximal activity was determined at pH 9.0 and 30 °C. The activity was severely inhibited by Hg(2+), Cu(2+), and Zn(2+). In particular, this enzyme showed remarkable stability in presence of organic solvents (25%, v/v) with log P>2.0 even after incubation for 7 days. All these characteristics suggested that SAestA may be a potential candidate for application in industrial processes in aqueous/organic media. Copyright © 2015. Published by Elsevier B.V.

  6. A zebra-band phenotype in maize can be suppressed in constant light, and results from mutation of a PPOXlike gene (protophorphyrinogen oxidase IX-like) for porphyrin biosynthesis

    USDA-ARS?s Scientific Manuscript database

    A zebra-band phenotype was identified in a maize population of transposon-tagged mutants (UniformMu, searchable by sequence at MaizeGDB.org). Genotype-phenotype analysis of an F2 family showed that the zebra stripes co-segregated with a single Mu insertion in the second exon of a Protoporphyrinogen ...

  7. Analysis of Expressed Sequence Tags (EST) in Date Palm.

    PubMed

    Al-Faifi, Sulieman A; Migdadi, Hussein M; Algamdi, Salem S; Khan, Mohammad Altaf; Al-Obeed, Rashid S; Ammar, Megahed H; Jakse, Jerenj

    2017-01-01

    Expressed sequence tags (EST) were generated from a normalized cDNA library of the date palm Sukkari cv. to understand the high-quality and better field performance of this well-known commercial cultivar. A total of 6943 high-quality ESTs were generated, out of them 6671 are submitted to the GenBank dbEST (LIBEST_028537). The generated ESTs were assembled into 6362 unigenes, consisting of 494 (14.4%) contigs and 5868 (84.53%) singletons. The functional annotation shows that the majority of the ESTs are associated with binding (44%), catalytic (40%), transporter (5%), and structural molecular (5%) activities. The blastx results show that 73% of unigenes are significantly similar to known plant genes and 27% are novel. The latter could be of particular interest in date palm genetic studies. Further analysis shows that some ESTs are categorized as stress/defense- and fruit development-related genes. These newly generated ESTs could significantly enhance date palm EST databases in the public domain and are available to scientists and researchers across the globe. This knowledge will facilitate the discovery of candidate genes that govern important developmental and agronomical traits in date palm. It will provide important resources for developing genetic tools, comparative genomics, and genome evolution among date palm cultivars.

  8. Analysis of expressed sequence tags from the Ulva prolifera (Chlorophyta)

    NASA Astrophysics Data System (ADS)

    Niu, Jianfeng; Hu, Haiyan; Hu, Songnian; Wang, Guangce; Peng, Guang; Sun, Song

    2010-01-01

    In 2008, a green tide broke out before the sailing competition of the 29th Olympic Games in Qingdao. The causative species was determined to be Enteromorpha prolifera ( Ulva prolifera O. F. Müller), a familiar green macroalga along the coastline of China. Rapid accumulation of a large biomass of floating U. prolifera prompted research on different aspects of this species. In this study, we constructed a nonnormalized cDNA library from the thalli of U. prolifera and acquired 10 072 high-quality expressed sequence tags (ESTs). These ESTs were assembled into 3 519 nonredundant gene groups, including 1 446 clusters and 2 073 singletons. After annotation with the nr database, a large number of genes were found to be related with chloroplast and ribosomal protein, GO functional classification showed 1 418 ESTs participated in photosynthesis and 1 359 ESTs were responsible for the generation of precursor metabolites and energy. In addition, rather comprehensive carbon fixation pathways were found in U. prolifera using KEGG. Some stress-related and signal transduction-related genes were also found in this study. All the evidences displayed that U. prolifera had substance and energy foundation for the intense photosynthesis and the rapid proliferation. Phylogenetic analysis of cytochrome c oxidase subunit I revealed that this green-tide causative species is most closely affiliated to Pseudendoclonium akinetum (Ulvophyceae).

  9. Identification and biochemical characterization of an Arabidopsis indole-3-acetic acid glucosyltransferase.

    PubMed

    Jackson, R G; Lim, E K; Li, Y; Kowalczyk, M; Sandberg, G; Hoggett, J; Ashford, D A; Bowles, D J

    2001-02-09

    Biochemical characterization of recombinant gene products following a phylogenetic analysis of the UDP-glucosyltransferase (UGT) multigene family of Arabidopsis has identified one enzyme (UGT84B1) with high activity toward the plant hormone indole-3-acetic acid (IAA) and three related enzymes (UGT84B2, UGT75B1, and UGT75B2) with trace activities. The identity of the IAA conjugate has been confirmed to be 1-O-indole acetyl glucose ester. A sequence annotated as a UDP-glucose:IAA glucosyltransferase (IAA-UGT) in the Arabidopsis genome and expressed sequence tag data bases given its similarity to the maize iaglu gene sequence showed no activity toward IAA. This study describes the first biochemical analysis of a recombinant IAA-UGT and provides the foundation for future genetic approaches to understand the role of 1-O-indole acetyl glucose ester in Arabidopsis.

  10. Differently localized lysophosphatidic acid acyltransferases crucial for triacylglycerol biosynthesis in the oleaginous alga Nannochloropsis.

    PubMed

    Nobusawa, Takashi; Hori, Koichi; Mori, Hiroshi; Kurokawa, Ken; Ohta, Hiroyuki

    2017-05-01

    The production of renewable bioenergy will be necessary to meet rising global fossil fuel demands. Members of the marine microalgae genus Nannochloropsis produce large quantities of oils (triacylglycerols; TAGs), and this genus is regarded as one of the most promising for biodiesel production. Recent genome sequencing and transcriptomic studies on Nannochloropsis have provided a foundation for understanding its oleaginous trait, but the mechanism underlying oil accumulation remains to be clarified. Here we report Nannochloropsis knock-out strains of four extraplastidic lysophosphatidic acid acyltransferases (LPAT1-LPAT4) that catalyze a major de novo biosynthetic step of TAGs and membrane lipids. We found that the four LPATs are differently involved in lipid metabolic flow in Nannochloropsis. Double knock-outs among the LPATs revealed the pivotal LPATs for TAG biosynthesis, and localization analysis indicated that the stramenopile-specific LPATs (LPAT3 and LPAT4) associated with TAG synthesis reside at the perimeter of lipid droplets. No homologous region has been found with other lipid droplet-associated proteins, however. Lipid droplets are an organelle found in nearly all organisms, and recently they were shown to play important roles in cellular metabolism and signaling. Our results provide direct evidence for the importance of the perimeter of lipid droplet in TAG synthesis in addition to its known role in maintaining TAG stability, and these findings suggest that the oleaginous trait of Nannochloropsis is enabled by the acquisition of LPATs at the perimeter of lipid droplets. © 2017 The Authors. The Plant Journal published by John Wiley & Sons Ltd and Society for Experimental Biology.

  11. Deep sequencing reveals exceptional diversity and modes of transmission for bacterial sponge symbionts.

    PubMed

    Webster, Nicole S; Taylor, Michael W; Behnam, Faris; Lücker, Sebastian; Rattei, Thomas; Whalan, Stephen; Horn, Matthias; Wagner, Michael

    2010-08-01

    Marine sponges contain complex bacterial communities of considerable ecological and biotechnological importance, with many of these organisms postulated to be specific to sponge hosts. Testing this hypothesis in light of the recent discovery of the rare microbial biosphere, we investigated three Australian sponges by massively parallel 16S rRNA gene tag pyrosequencing. Here we show bacterial diversity that is unparalleled in an invertebrate host, with more than 250,000 sponge-derived sequence tags being assigned to 23 bacterial phyla and revealing up to 2996 operational taxonomic units (95% sequence similarity) per sponge species. Of the 33 previously described 'sponge-specific' clusters that were detected in this study, 48% were found exclusively in adults and larvae - implying vertical transmission of these groups. The remaining taxa, including 'Poribacteria', were also found at very low abundance among the 135,000 tags retrieved from surrounding seawater. Thus, members of the rare seawater biosphere may serve as seed organisms for widely occurring symbiont populations in sponges and their host association might have evolved much more recently than previously thought. © 2009 Society for Applied Microbiology and Blackwell Publishing Ltd.

  12. Deep sequencing reveals exceptional diversity and modes of transmission for bacterial sponge symbionts

    PubMed Central

    Webster, Nicole S; Taylor, Michael W; Behnam, Faris; Lücker, Sebastian; Rattei, Thomas; Whalan, Stephen; Horn, Matthias; Wagner, Michael

    2010-01-01

    Marine sponges contain complex bacterial communities of considerable ecological and biotechnological importance, with many of these organisms postulated to be specific to sponge hosts. Testing this hypothesis in light of the recent discovery of the rare microbial biosphere, we investigated three Australian sponges by massively parallel 16S rRNA gene tag pyrosequencing. Here we show bacterial diversity that is unparalleled in an invertebrate host, with more than 250 000 sponge-derived sequence tags being assigned to 23 bacterial phyla and revealing up to 2996 operational taxonomic units (95% sequence similarity) per sponge species. Of the 33 previously described ‘sponge-specific’ clusters that were detected in this study, 48% were found exclusively in adults and larvae – implying vertical transmission of these groups. The remaining taxa, including ‘Poribacteria’, were also found at very low abundance among the 135 000 tags retrieved from surrounding seawater. Thus, members of the rare seawater biosphere may serve as seed organisms for widely occurring symbiont populations in sponges and their host association might have evolved much more recently than previously thought. PMID:21966903

  13. Whole-genome sequence-based analysis of thyroid function.

    PubMed

    Taylor, Peter N; Porcu, Eleonora; Chew, Shelby; Campbell, Purdey J; Traglia, Michela; Brown, Suzanne J; Mullin, Benjamin H; Shihab, Hashem A; Min, Josine; Walter, Klaudia; Memari, Yasin; Huang, Jie; Barnes, Michael R; Beilby, John P; Charoen, Pimphen; Danecek, Petr; Dudbridge, Frank; Forgetta, Vincenzo; Greenwood, Celia; Grundberg, Elin; Johnson, Andrew D; Hui, Jennie; Lim, Ee M; McCarthy, Shane; Muddyman, Dawn; Panicker, Vijay; Perry, John R B; Bell, Jordana T; Yuan, Wei; Relton, Caroline; Gaunt, Tom; Schlessinger, David; Abecasis, Goncalo; Cucca, Francesco; Surdulescu, Gabriela L; Woltersdorf, Wolfram; Zeggini, Eleftheria; Zheng, Hou-Feng; Toniolo, Daniela; Dayan, Colin M; Naitza, Silvia; Walsh, John P; Spector, Tim; Davey Smith, George; Durbin, Richard; Richards, J Brent; Sanna, Serena; Soranzo, Nicole; Timpson, Nicholas J; Wilson, Scott G

    2015-03-06

    Normal thyroid function is essential for health, but its genetic architecture remains poorly understood. Here, for the heritable thyroid traits thyrotropin (TSH) and free thyroxine (FT4), we analyse whole-genome sequence data from the UK10K project (N=2,287). Using additional whole-genome sequence and deeply imputed data sets, we report meta-analysis results for common variants (MAF≥1%) associated with TSH and FT4 (N=16,335). For TSH, we identify a novel variant in SYN2 (MAF=23.5%, P=6.15 × 10(-9)) and a new independent variant in PDE8B (MAF=10.4%, P=5.94 × 10(-14)). For FT4, we report a low-frequency variant near B4GALT6/SLC25A52 (MAF=3.2%, P=1.27 × 10(-9)) tagging a rare TTR variant (MAF=0.4%, P=2.14 × 10(-11)). All common variants explain ≥20% of the variance in TSH and FT4. Analysis of rare variants (MAF<1%) using sequence kernel association testing reveals a novel association with FT4 in NRG1. Our results demonstrate that increased coverage in whole-genome sequence association studies identifies novel variants associated with thyroid function.

  14. Biosynthesis of Lipoic Acid in Arabidopsis: Cloning and Characterization of the cDNA for Lipoic Acid Synthase1

    PubMed Central

    Yasuno, Rie; Wada, Hajime

    1998-01-01

    Lipoic acid is a coenzyme that is essential for the activity of enzyme complexes such as those of pyruvate dehydrogenase and glycine decarboxylase. We report here the isolation and characterization of LIP1 cDNA for lipoic acid synthase of Arabidopsis. The Arabidopsis LIP1 cDNA was isolated using an expressed sequence tag homologous to the lipoic acid synthase of Escherichia coli. This cDNA was shown to code for Arabidopsis lipoic acid synthase by its ability to complement a lipA mutant of E. coli defective in lipoic acid synthase. DNA-sequence analysis of the LIP1 cDNA revealed an open reading frame predicting a protein of 374 amino acids. Comparisons of the deduced amino acid sequence with those of E. coli and yeast lipoic acid synthase homologs showed a high degree of sequence similarity and the presence of a leader sequence presumably required for import into the mitochondria. Southern-hybridization analysis suggested that LIP1 is a single-copy gene in Arabidopsis. Western analysis with an antibody against lipoic acid synthase demonstrated that this enzyme is located in the mitochondrial compartment in Arabidopsis cells as a 43-kD polypeptide. PMID:9808738

  15. Interferon-gamma of the giant panda (Ailuropoda melanoleuca): complementary DNA cloning, expression, and phylogenetic analysis.

    PubMed

    Tao, Yaqiong; Zeng, Bo; Xu, Liu; Yue, Bisong; Yang, Dong; Zou, Fangdong

    2010-01-01

    Interferon-gamma (IFN-gamma) is the only member of type II IFN and is vital in the regulation of immune and inflammatory responses. Herein we report the cloning, expression, and sequence analysis of IFN-gamma from the giant panda (Ailuropoda melanoleuca). The open reading frame of this gene is 501 base pair in length and encodes a polypeptide consisting of 166 amino acids. All conserved N-linked glycosylation sites and cysteine residues among carnivores were found in the predicted amino acid sequence of the giant panda. Recombinant giant panda IFN-gamma with a V5 epitope and polyhistidine tag was expressed in HEK293 host cells and confirmed by Western blotting. Phylogenetic analysis of mammalian IFN-gamma-coding sequences indicated that the giant panda IFN-gamma was closest to that of carnivores, then to ungulates and dolphin, and shared a distant relationship with mouse and human. These results represent a first step into the study of IFN-gamma in giant panda.

  16. Characterization and isolation of a T-DNA tagged banana promoter active during in vitro culture and low temperature stress.

    PubMed

    Santos, Efrén; Remy, Serge; Thiry, Els; Windelinckx, Saskia; Swennen, Rony; Sági, László

    2009-06-24

    Next-generation transgenic plants will require a more precise regulation of transgene expression, preferably under the control of native promoters. A genome-wide T-DNA tagging strategy was therefore performed for the identification and characterization of novel banana promoters. Embryogenic cell suspensions of a plantain-type banana were transformed with a promoterless, codon-optimized luciferase (luc+) gene and low temperature-responsive luciferase activation was monitored in real time. Around 16,000 transgenic cell colonies were screened for baseline luciferase activity at room temperature 2 months after transformation. After discarding positive colonies, cultures were re-screened in real-time at 26 degrees C followed by a gradual decrease to 8 degrees C. The baseline activation frequency was 0.98%, while the frequency of low temperature-responsive luciferase activity was 0.61% in the same population of cell cultures. Transgenic colonies with luciferase activity responsive to low temperature were regenerated to plantlets and luciferase expression patterns monitored during different regeneration stages. Twenty four banana DNA sequences flanking the right T-DNA borders in seven independent lines were cloned via PCR walking. RT-PCR analysis in one line containing five inserts allowed the identification of the sequence that had activated luciferase expression under low temperature stress in a developmentally regulated manner. This activating sequence was fused to the uidA reporter gene and back-transformed into a commercial dessert banana cultivar, in which its original expression pattern was confirmed. This promoter tagging and real-time screening platform proved valuable for the identification of novel promoters and genes in banana and for monitoring expression patterns throughout in vitro development and low temperature treatment. Combination of PCR walking techniques was efficient for the isolation of candidate promoters even in a multicopy T-DNA line. Qualitative and quantitative GUS expression analyses of one tagged promoter in a commercial cultivar demonstrated a reproducible promoter activity pattern during in vitro culture. Thus, this promoter could be used during in vitro selection and generation of commercial transgenic plants.

  17. Identification of immunity-related genes in the larvae of Protaetia brevitarsis seulensis (Coleoptera: Cetoniidae) by a next-generation sequencing-based transcriptome analysis.

    PubMed

    Bang, Kyeongrin; Hwang, Sejung; Lee, Jiae; Cho, Saeyoull

    2015-01-01

    To identify immune-related genes in the larvae of white-spotted flower chafers, next-generation sequencing was conducted with an Illumina HiSeq2000, resulting in 100 million cDNA reads with sequence information from over 10 billion base pairs (bp) and >50× transcriptome coverage. A subset of 77,336 contigs was created, and ∼35,532 sequences matched entries against the NCBI nonredundant database (cutoff, e < 10(-5)). Statistical analysis was performed on the 35,532 contigs. For profiling of the immune response, samples were analyzed by aligning 42 base sequence tags to the de novo reference assembly, comparing levels in immunized larvae to control levels of expression. Of the differentially expressed genes, 3,440 transcripts were upregulated and 3,590 transcripts were downregulated. Many of these genes were confirmed as immune-related genes such as pattern recognition proteins, immune-related signal transduction proteins, antimicrobial peptides, and cellular response proteins, by comparison to published data. © The Author 2015. Published by Oxford University Press on behalf of the Entomological Society of America.

  18. NABIC: A New Access Portal to Search, Visualize, and Share Agricultural Genomics Data.

    PubMed

    Seol, Young-Joo; Lee, Tae-Ho; Park, Dong-Suk; Kim, Chang-Kug

    2016-01-01

    The National Agricultural Biotechnology Information Center developed an access portal to search, visualize, and share agricultural genomics data with a focus on South Korean information and resources. The portal features an agricultural biotechnology database containing a wide range of omics data from public and proprietary sources. We collected 28.4 TB of data from 162 agricultural organisms, with 10 types of omics data comprising next-generation sequencing sequence read archive, genome, gene, nucleotide, DNA chip, expressed sequence tag, interactome, protein structure, molecular marker, and single-nucleotide polymorphism datasets. Our genomic resources contain information on five animals, seven plants, and one fungus, which is accessed through a genome browser. We also developed a data submission and analysis system as a web service, with easy-to-use functions and cutting-edge algorithms, including those for handling next-generation sequencing data.

  19. Microchannel DNA Sequencing by End-Labelled Free Solution Electrophoresis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Barron, A.

    2005-09-29

    The further development of End-Labeled Free-Solution Electrophoresis will greatly simplify DNA separation and sequencing on microfluidic devices. The development and optimization of drag-tags is critical to the success of this research.

  20. Transcriptome dynamics through alternative polyadenylation in developmental and environmental responses in plants revealed by deep sequencing

    PubMed Central

    Shen, Yingjia; Venu, R.C.; Nobuta, Kan; Wu, Xiaohui; Notibala, Varun; Demirci, Caghan; Meyers, Blake C.; Wang, Guo-Liang; Ji, Guoli; Li, Qingshun Q.

    2011-01-01

    Polyadenylation sites mark the ends of mRNA transcripts. Alternative polyadenylation (APA) may alter sequence elements and/or the coding capacity of transcripts, a mechanism that has been demonstrated to regulate gene expression and transcriptome diversity. To study the role of APA in transcriptome dynamics, we analyzed a large-scale data set of RNA “tags” that signify poly(A) sites and expression levels of mRNA. These tags were derived from a wide range of tissues and developmental stages that were mutated or exposed to environmental treatments, and generated using digital gene expression (DGE)–based protocols of the massively parallel signature sequencing (MPSS-DGE) and the Illumina sequencing-by-synthesis (SBS-DGE) sequencing platforms. The data offer a global view of APA and how it contributes to transcriptome dynamics. Upon analysis of these data, we found that ∼60% of Arabidopsis genes have multiple poly(A) sites. Likewise, ∼47% and 82% of rice genes use APA, supported by MPSS-DGE and SBS-DGE tags, respectively. In both species, ∼49%–66% of APA events were mapped upstream of annotated stop codons. Interestingly, 10% of the transcriptomes are made up of APA transcripts that are differentially distributed among developmental stages and in tissues responding to environmental stresses, providing an additional level of transcriptome dynamics. Examples of pollen-specific APA switching and salicylic acid treatment-specific APA clearly demonstrated such dynamics. The significance of these APAs is more evident in the 3034 genes that have conserved APA events between rice and Arabidopsis. PMID:21813626

  1. Synchronous detection of ebolavirus conserved RNA sequences and ebolavirus-encoded miRNA-like fragment based on a zwitterionic copper (II) metal-organic framework.

    PubMed

    Qiu, Gui-Hua; Weng, Zi-Hua; Hu, Pei-Pei; Duan, Wen-Jun; Xie, Bao-Ping; Sun, Bin; Tang, Xiao-Yan; Chen, Jin-Xiang

    2018-04-01

    From a three-dimensional (3D) metal-organic framework (MOF) of {[Cu(Cmdcp)(phen)(H 2 O)] 2 ·9H 2 O} n (1, H 3 CmdcpBr = N-carboxymethyl-(3,5-dicarboxyl)pyridinium bromide, phen = phenanthroline), a sensitive and selective fluorescence sensor has been developed for the simultaneous detection of ebolavirus conserved RNA sequences and ebolavirus-encoded microRNA-like (miRNA-like) fragment. The results from molecular dynamics simulation confirmed that MOF 1 absorbs carboxyfluorescein (FAM)-tagged and 5(6)-carboxyrhodamine, triethylammonium salt (ROX)-tagged probe ss-DNA (probe DNA, P-DNA) by π … π stacking and hydrogen bonding, as well as additional electrostatic interactions to form a sensing platform of P-DNAs@1 with quenched FAM and ROX fluorescence. In the presence of targeted ebolavirus conserved RNA sequences or ebolavirus-encoded miRNA-like fragment, the fluorophore-labeled P-DNA hybridizes with the analyte to give a P-DNA@RNA duplex and released from MOF 1, triggering a fluorescence recovery. Simultaneous detection of two target RNAs has also been realized by single and synchronous fluorescence analysis. The formed sensing platform shows high sensitivity for ebolavirus conserved RNA sequences and ebolavirus-encoded miRNA-like fragment with detection limits at the picomolar level and high selectivity without cross-reaction between the two probes. MOF 1 thus shows the potential as an effective fluorescent sensing platform for the synchronous detection of two ebolavirus-related sequences, and offer improved diagnostic accuracy of Ebola virus disease. Copyright © 2017 Elsevier B.V. All rights reserved.

  2. Generation of expressed sequence tags for discovery of genes responsible for floral traits of Chrysanthemum morifolium by next-generation sequencing technology.

    PubMed

    Sasaki, Katsutomo; Mitsuda, Nobutaka; Nashima, Kenji; Kishimoto, Kyutaro; Katayose, Yuichi; Kanamori, Hiroyuki; Ohmiya, Akemi

    2017-09-04

    Chrysanthemum morifolium is one of the most economically valuable ornamental plants worldwide. Chrysanthemum is an allohexaploid plant with a large genome that is commercially propagated by vegetative reproduction. New cultivars with different floral traits, such as color, morphology, and scent, have been generated mainly by classical cross-breeding and mutation breeding. However, only limited genetic resources and their genome information are available for the generation of new floral traits. To obtain useful information about molecular bases for floral traits of chrysanthemums, we read expressed sequence tags (ESTs) of chrysanthemums by high-throughput sequencing using the 454 pyrosequencing technology. We constructed normalized cDNA libraries, consisting of full-length, 3'-UTR, and 5'-UTR cDNAs derived from various tissues of chrysanthemums. These libraries produced a total number of 3,772,677 high-quality reads, which were assembled into 213,204 contigs. By comparing the data obtained with those of full genome-sequenced species, we confirmed that our chrysanthemum contig set contained the majority of all expressed genes, which was sufficient for further molecular analysis in chrysanthemums. We confirmed that our chrysanthemum EST set (contigs) contained a number of contigs that encoded transcription factors and enzymes involved in pigment and aroma compound metabolism that was comparable to that of other species. This information can serve as an informative resource for identifying genes involved in various biological processes in chrysanthemums. Moreover, the findings of our study will contribute to a better understanding of the floral characteristics of chrysanthemums including the myriad cultivars at the molecular level.

  3. Analysis of expressed sequence tags from Actinidia: applications of a cross species EST database for gene discovery in the areas of flavor, health, color and ripening

    PubMed Central

    Crowhurst, Ross N; Gleave, Andrew P; MacRae, Elspeth A; Ampomah-Dwamena, Charles; Atkinson, Ross G; Beuning, Lesley L; Bulley, Sean M; Chagne, David; Marsh, Ken B; Matich, Adam J; Montefiori, Mirco; Newcomb, Richard D; Schaffer, Robert J; Usadel, Björn; Allan, Andrew C; Boldingh, Helen L; Bowen, Judith H; Davy, Marcus W; Eckloff, Rheinhart; Ferguson, A Ross; Fraser, Lena G; Gera, Emma; Hellens, Roger P; Janssen, Bart J; Klages, Karin; Lo, Kim R; MacDiarmid, Robin M; Nain, Bhawana; McNeilage, Mark A; Rassam, Maysoon; Richardson, Annette C; Rikkerink, Erik HA; Ross, Gavin S; Schröder, Roswitha; Snowden, Kimberley C; Souleyre, Edwige JF; Templeton, Matt D; Walton, Eric F; Wang, Daisy; Wang, Mindy Y; Wang, Yanming Y; Wood, Marion; Wu, Rongmei; Yauk, Yar-Khing; Laing, William A

    2008-01-01

    Background Kiwifruit (Actinidia spp.) are a relatively new, but economically important crop grown in many different parts of the world. Commercial success is driven by the development of new cultivars with novel consumer traits including flavor, appearance, healthful components and convenience. To increase our understanding of the genetic diversity and gene-based control of these key traits in Actinidia, we have produced a collection of 132,577 expressed sequence tags (ESTs). Results The ESTs were derived mainly from four Actinidia species (A. chinensis, A. deliciosa, A. arguta and A. eriantha) and fell into 41,858 non redundant clusters (18,070 tentative consensus sequences and 23,788 EST singletons). Analysis of flavor and fragrance-related gene families (acyltransferases and carboxylesterases) and pathways (terpenoid biosynthesis) is presented in comparison with a chemical analysis of the compounds present in Actinidia including esters, acids, alcohols and terpenes. ESTs are identified for most genes in color pathways controlling chlorophyll degradation and carotenoid biosynthesis. In the health area, data are presented on the ESTs involved in ascorbic acid and quinic acid biosynthesis showing not only that genes for many of the steps in these pathways are represented in the database, but that genes encoding some critical steps are absent. In the convenience area, genes related to different stages of fruit softening are identified. Conclusion This large EST resource will allow researchers to undertake the tremendous challenge of understanding the molecular basis of genetic diversity in the Actinidia genus as well as provide an EST resource for comparative fruit genomics. The various bioinformatics analyses we have undertaken demonstrates the extent of coverage of ESTs for genes encoding different biochemical pathways in Actinidia. PMID:18655731

  4. Arginine kinase from the Tardigrade, Macrobiotus occidentalis: molecular cloning, phylogenetic analysis and enzymatic properties.

    PubMed

    Uda, Kouji; Ishida, Mikako; Matsui, Tohru; Suzuki, Tomohiko

    2010-10-01

    Arginine kinase (AK), which catalyzes the reversible transfer of phosphate from ATP to arginine to yield phosphoarginine and ADP, is widely distributed throughout the invertebrates. We determined the cDNA sequence of AK from the tardigrade (water bear) Macrobiotus occidentalis, cloned the sequence into pET30b plasmid, and expressed it in Escherichia coli as a 6x His-tag—fused protein. The cDNA is 1377 bp, has an open reading frame of 1080 bp, and has 5′- and 3′-untranslated regions of 116 and 297 bp, respectively. The open reading frame encodes a 359-amino acid protein containing the 12 residues considered necessary for substrate binding in Limulus AK. This is the first AK sequence from a tardigrade. From fragmented and non-annotated sequences available from DNA databases, we assembled 46 complete AK sequences: 26 from arthropods (including 19 from Insecta), 11 from nematodes, 4 from mollusks, 2 from cnidarians and 2 from onychophorans. No onychophoran sequences have been reported previously. The phylogenetic trees of 104 AKs indicated clearly that Macrobiotus AK (from the phylum Tardigrada) shows close affinity with Epiperipatus and Euperipatoides AKs (from the phylum Onychophora), and therefore forms a sister group with the arthropod AKs. Recombinant 6x His-tagged Macrobiotus AK was successfully expressed as a soluble protein, and the kinetic constants (K(m), K(d), V(ma) and k(cat)) were determined for the forward reaction. Comparison of these kinetic constants with those of AKs from other sources (arthropods, mollusks and nematodes) indicated that Macrobiotus AK is unique in that it has the highest values for k(cat) and K(d)K(m) (indicative of synergistic substrate binding) of all characterized AKs.

  5. Cytogenetic and molecular markers for detecting Aegilops uniaristata chromosomes in a wheat background.

    PubMed

    Gong, Wenping; Li, Guangrong; Zhou, Jianping; Li, Genying; Liu, Cheng; Huang, Chengyan; Zhao, Zhendong; Yang, Zujun

    2014-09-01

    Aegilops uniaristata has many agronomically useful traits that can be used for wheat breeding. So far, a Triticum turgidum - Ae. uniaristata amphiploid and one set of Chinese Spring (CS) - Ae. uniaristata addition lines have been produced. To guide Ae. uniaristata chromatin transformation from these lines into cultivated wheat through chromosome engineering, reliable cytogenetic and molecular markers specific for Ae. uniaristata chromosomes need to be developed. Standard C-banding shows that C-bands mainly exist in the centromeric regions of Ae. uniaristata but rarely at the distal ends. Fluorescence in situ hybridization (FISH) using (GAA)8 as a probe showed that the hybridization signal of chromosomes 1N-7N are different, thus (GAA)8 can be used to identify all Ae. uniaristata chromosomes in wheat background simultaneously. Moreover, a total of 42 molecular markers specific for Ae. uniaristata chromosomes were developed by screening expressed sequence tag - sequence tagged site (EST-STS), expressed sequence tag - simple sequence repeat (EST-SSR), and PCR-based landmark unique gene (PLUG) primers. The markers were subsequently localized using the CS - Ae. uniaristata addition lines and different wheat cultivars as controls. The cytogenetic and molecular markers developed herein will be helpful for screening and identifying wheat - Ae. uniaristata progeny.

  6. DNA sequencing using fluorescence background electroblotting membrane

    DOEpatents

    Caldwell, Karin D.; Chu, Tun-Jen; Pitt, William G.

    1992-01-01

    A method for the multiplex sequencing on DNA is disclosed which comprises the electroblotting or specific base terminated DNA fragments, which have been resolved by gel electrophoresis, onto the surface of a neutral non-aromatic polymeric microporous membrane exhibiting low background fluorescence which has been surface modified to contain amino groups. Polypropylene membranes are preferably and the introduction of amino groups is accomplished by subjecting the membrane to radio or microwave frequency plasma discharge in the presence of an aminating agent, preferably ammonia. The membrane, containing physically adsorbed DNA fragments on its surface after the electroblotting, is then treated with crosslinking means such as UV radiation or a glutaraldehyde spray to chemically bind the DNA fragments to the membrane through said smino groups contained on the surface thereof. The DNA fragments chemically bound to the membrane are subjected to hybridization probing with a tagged probe specific to the sequence of the DNA fragments. The tagging may be by either fluorophores or radioisotopes. The tagged probes hybridized to said target DNA fragments are detected and read by laser induced fluorescence detection or autoradiograms. The use of aminated low fluorescent background membranes allows the use of fluorescent detection and reading even when the available amount of DNA to be sequenced is small. The DNA bound to the membrances may be reprobed numerous times.

  7. DNA sequencing using fluorescence background electroblotting membrane

    DOEpatents

    Caldwell, K.D.; Chu, T.J.; Pitt, W.G.

    1992-05-12

    A method for the multiplex sequencing on DNA is disclosed which comprises the electroblotting or specific base terminated DNA fragments, which have been resolved by gel electrophoresis, onto the surface of a neutral non-aromatic polymeric microporous membrane exhibiting low background fluorescence which has been surface modified to contain amino groups. Polypropylene membranes are preferably and the introduction of amino groups is accomplished by subjecting the membrane to radio or microwave frequency plasma discharge in the presence of an aminating agent, preferably ammonia. The membrane, containing physically adsorbed DNA fragments on its surface after the electroblotting, is then treated with crosslinking means such as UV radiation or a glutaraldehyde spray to chemically bind the DNA fragments to the membrane through amino groups contained on the surface. The DNA fragments chemically bound to the membrane are subjected to hybridization probing with a tagged probe specific to the sequence of the DNA fragments. The tagging may be by either fluorophores or radioisotopes. The tagged probes hybridized to the target DNA fragments are detected and read by laser induced fluorescence detection or autoradiograms. The use of aminated low fluorescent background membranes allows the use of fluorescent detection and reading even when the available amount of DNA to be sequenced is small. The DNA bound to the membranes may be reprobed numerous times. No Drawings

  8. Mining and gene ontology based annotation of SSR markers from expressed sequence tags of Humulus lupulus

    PubMed Central

    Singh, Swati; Gupta, Sanchita; Mani, Ashutosh; Chaturvedi, Anoop

    2012-01-01

    Humulus lupulus is commonly known as hops, a member of the family moraceae. Currently many projects are underway leading to the accumulation of voluminous genomic and expressed sequence tag sequences in public databases. The genetically characterized domains in these databases are limited due to non-availability of reliable molecular markers. The large data of EST sequences are available in hops. The simple sequence repeat markers extracted from EST data are used as molecular markers for genetic characterization, in the present study. 25,495 EST sequences were examined and assembled to get full-length sequences. Maximum frequency distribution was shown by mononucleotide SSR motifs i.e. 60.44% in contig and 62.16% in singleton where as minimum frequency are observed for hexanucleotide SSR in contig (0.09%) and pentanucleotide SSR in singletons (0.12%). Maximum trinucleotide motifs code for Glutamic acid (GAA) while AT/TA were the most frequent repeat of dinucleotide SSRs. Flanking primer pairs were designed in-silico for the SSR containing sequences. Functional categorization of SSRs containing sequences was done through gene ontology terms like biological process, cellular component and molecular function. PMID:22368382

  9. Compression-based distance (CBD): a simple, rapid, and accurate method for microbiota composition comparison

    PubMed Central

    2013-01-01

    Background Perturbations in intestinal microbiota composition have been associated with a variety of gastrointestinal tract-related diseases. The alleviation of symptoms has been achieved using treatments that alter the gastrointestinal tract microbiota toward that of healthy individuals. Identifying differences in microbiota composition through the use of 16S rRNA gene hypervariable tag sequencing has profound health implications. Current computational methods for comparing microbial communities are usually based on multiple alignments and phylogenetic inference, making them time consuming and requiring exceptional expertise and computational resources. As sequencing data rapidly grows in size, simpler analysis methods are needed to meet the growing computational burdens of microbiota comparisons. Thus, we have developed a simple, rapid, and accurate method, independent of multiple alignments and phylogenetic inference, to support microbiota comparisons. Results We create a metric, called compression-based distance (CBD) for quantifying the degree of similarity between microbial communities. CBD uses the repetitive nature of hypervariable tag datasets and well-established compression algorithms to approximate the total information shared between two datasets. Three published microbiota datasets were used as test cases for CBD as an applicable tool. Our study revealed that CBD recaptured 100% of the statistically significant conclusions reported in the previous studies, while achieving a decrease in computational time required when compared to similar tools without expert user intervention. Conclusion CBD provides a simple, rapid, and accurate method for assessing distances between gastrointestinal tract microbiota 16S hypervariable tag datasets. PMID:23617892

  10. Compression-based distance (CBD): a simple, rapid, and accurate method for microbiota composition comparison.

    PubMed

    Yang, Fang; Chia, Nicholas; White, Bryan A; Schook, Lawrence B

    2013-04-23

    Perturbations in intestinal microbiota composition have been associated with a variety of gastrointestinal tract-related diseases. The alleviation of symptoms has been achieved using treatments that alter the gastrointestinal tract microbiota toward that of healthy individuals. Identifying differences in microbiota composition through the use of 16S rRNA gene hypervariable tag sequencing has profound health implications. Current computational methods for comparing microbial communities are usually based on multiple alignments and phylogenetic inference, making them time consuming and requiring exceptional expertise and computational resources. As sequencing data rapidly grows in size, simpler analysis methods are needed to meet the growing computational burdens of microbiota comparisons. Thus, we have developed a simple, rapid, and accurate method, independent of multiple alignments and phylogenetic inference, to support microbiota comparisons. We create a metric, called compression-based distance (CBD) for quantifying the degree of similarity between microbial communities. CBD uses the repetitive nature of hypervariable tag datasets and well-established compression algorithms to approximate the total information shared between two datasets. Three published microbiota datasets were used as test cases for CBD as an applicable tool. Our study revealed that CBD recaptured 100% of the statistically significant conclusions reported in the previous studies, while achieving a decrease in computational time required when compared to similar tools without expert user intervention. CBD provides a simple, rapid, and accurate method for assessing distances between gastrointestinal tract microbiota 16S hypervariable tag datasets.

  11. Digital gene expression analysis of gene expression differences within Brassica diploids and allopolyploids.

    PubMed

    Jiang, Jinjin; Wang, Yue; Zhu, Bao; Fang, Tingting; Fang, Yujie; Wang, Youping

    2015-01-27

    Brassica includes many successfully cultivated crop species of polyploid origin, either by ancestral genome triplication or by hybridization between two diploid progenitors, displaying complex repetitive sequences and transposons. The U's triangle, which consists of three diploids and three amphidiploids, is optimal for the analysis of complicated genomes after polyploidization. Next-generation sequencing enables the transcriptome profiling of polyploids on a global scale. We examined the gene expression patterns of three diploids (Brassica rapa, B. nigra, and B. oleracea) and three amphidiploids (B. napus, B. juncea, and B. carinata) via digital gene expression analysis. In total, the libraries generated between 5.7 and 6.1 million raw reads, and the clean tags of each library were mapped to 18547-21995 genes of B. rapa genome. The unambiguous tag-mapped genes in the libraries were compared. Moreover, the majority of differentially expressed genes (DEGs) were explored among diploids as well as between diploids and amphidiploids. Gene ontological analysis was performed to functionally categorize these DEGs into different classes. The Kyoto Encyclopedia of Genes and Genomes analysis was performed to assign these DEGs into approximately 120 pathways, among which the metabolic pathway, biosynthesis of secondary metabolites, and peroxisomal pathway were enriched. The non-additive genes in Brassica amphidiploids were analyzed, and the results indicated that orthologous genes in polyploids are frequently expressed in a non-additive pattern. Methyltransferase genes showed differential expression pattern in Brassica species. Our results provided an understanding of the transcriptome complexity of natural Brassica species. The gene expression changes in diploids and allopolyploids may help elucidate the morphological and physiological differences among Brassica species.

  12. Next-Generation Sequencing of the Chrysanthemum nankingense (Asteraceae) Transcriptome Permits Large-Scale Unigene Assembly and SSR Marker Discovery

    PubMed Central

    Wang, Haibin; Jiang, Jiafu; Chen, Sumei; Qi, Xiangyu; Peng, Hui; Li, Pirui; Song, Aiping; Guan, Zhiyong; Fang, Weimin; Liao, Yuan; Chen, Fadi

    2013-01-01

    Background Simple sequence repeats (SSRs) are ubiquitous in eukaryotic genomes. Chrysanthemum is one of the largest genera in the Asteraceae family. Only few Chrysanthemum expressed sequence tag (EST) sequences have been acquired to date, so the number of available EST-SSR markers is very low. Methodology/Principal Findings Illumina paired-end sequencing technology produced over 53 million sequencing reads from C. nankingense mRNA. The subsequent de novo assembly yielded 70,895 unigenes, of which 45,789 (64.59%) unigenes showed similarity to the sequences in NCBI database. Out of 45,789 sequences, 107 have hits to the Chrysanthemum Nr protein database; 679 and 277 sequences have hits to the database of Helianthus and Lactuca species, respectively. MISA software identified a large number of putative EST-SSRs, allowing 1,788 primer pairs to be designed from the de novo transcriptome sequence and a further 363 from archival EST sequence. Among 100 primer pairs randomly chosen, 81 markers have amplicons and 20 are polymorphic for genotypes analysis in Chrysanthemum. The results showed that most (but not all) of the assays were transferable across species and that they exposed a significant amount of allelic diversity. Conclusions/Significance SSR markers acquired by transcriptome sequencing are potentially useful for marker-assisted breeding and genetic analysis in the genus Chrysanthemum and its related genera. PMID:23626799

  13. The Alveolate Perkinsus marinus: Biological Insights from EST Gene Discovery

    PubMed Central

    2010-01-01

    Background Perkinsus marinus, a protozoan parasite of the eastern oyster Crassostrea virginica, has devastated natural and farmed oyster populations along the Atlantic and Gulf coasts of the United States. It is classified as a member of the Perkinsozoa, a recently established phylum considered close to the ancestor of ciliates, dinoflagellates, and apicomplexans, and a key taxon for understanding unique adaptations (e.g. parasitism) within the Alveolata. Despite intense parasite pressure, no disease-resistant oysters have been identified and no effective therapies have been developed to date. Results To gain insight into the biological basis of the parasite's virulence and pathogenesis mechanisms, and to identify genes encoding potential targets for intervention, we generated >31,000 5' expressed sequence tags (ESTs) derived from four trophozoite libraries generated from two P. marinus strains. Trimming and clustering of the sequence tags yielded 7,863 unique sequences, some of which carry a spliced leader. Similarity searches revealed that 55% of these had hits in protein sequence databases, of which 1,729 had their best hit with proteins from the chromalveolates (E-value ≤ 1e-5). Some sequences are similar to those proven to be targets for effective intervention in other protozoan parasites, and include not only proteases, antioxidant enzymes, and heat shock proteins, but also those associated with relict plastids, such as acetyl-CoA carboxylase and methyl erythrithol phosphate pathway components, and those involved in glycan assembly, protein folding/secretion, and parasite-host interactions. Conclusions Our transcriptome analysis of P. marinus, the first for any member of the Perkinsozoa, contributes new insight into its biology and taxonomic position. It provides a very informative, albeit preliminary, glimpse into the expression of genes encoding functionally relevant proteins as potential targets for chemotherapy, and evidence for the presence of a relict plastid. Further, although P. marinus sequences display significant similarity to those from both apicomplexans and dinoflagellates, the presence of trans-spliced transcripts confirms the previously established affinities with the latter. The EST analysis reported herein, together with the recently completed sequence of the P. marinus genome and the development of transfection methodology, should result in improved intervention strategies against dermo disease. PMID:20374649

  14. Expression of CB2 cannabinoid receptor in Pichia pastoris.

    PubMed

    Feng, Wenke; Cai, Jian; Pierce, William M; Song, Zhao-Hui

    2002-12-01

    To facilitate purification and structural characterization, the CB2 cannabinoid receptor is expressed in methylotrophic yeast Pichia pastoris. The expression plasmids were constructed in which the CB2 gene is under the control of the highly inducible promoter of P. pastoris alcohol oxidase 1 gene. A c-myc epitope and a hexahistidine tag were introduced at the C-terminal of the CB2 to permit easy detection and purification. In membrane preparations of CB2 gene transformed yeast cells, Western blot analysis detected the expression of CB2 proteins. Radioligand binding assays demonstrated that the CB2 receptors expressed in P. pastoris have a pharmacological profile similar to that of the receptors expressed in mammalian systems. Furthermore, the epitope-tagged receptor was purified by metal chelating chromatography and the purified CB2 preparations were subjected to digestion by trypsin. MALDI/TOF mass spectrometry analysis of the peptides extracted from tryptic digestions detected 14 peptide fragments derived from the CB2 receptor. ESI mass spectrometry was used to sequence one of these peptide fragments, thus, further confirming the identity of the purified receptor. In conclusion, these data demonstrated for the first time that epitope-tagged, functional CB2 cannabinoid receptor can be expressed in P. pastoris for purification.

  15. Insight into the coordination and the binding sites of Cu(2+) by the histidyl-6-tag using experimental and computational tools.

    PubMed

    Watly, Joanna; Simonovsky, Eyal; Wieczorek, Robert; Barbosa, Nuno; Miller, Yifat; Kozlowski, Henryk

    2014-07-07

    His-tags are specific sequences containing six to nine subsequent histydyl residues, and they are used for purification of recombinant proteins by use of IMAC chromatography. Such polyhistydyl tags, often used in molecular biology, can be also found in nature. Proteins containing histidine-rich domains play a critical role in many life functions in both prokaryote and eukaryote organisms. Binding mode and the thermodynamic properties of the system depend on the specific metal ion and the histidine sequence. Despite the wide application of the His-tag for purification of proteins, little is known about the properties of metal-binding to such tag domains. This inspired us to undertake detailed studies on the coordination of Cu(2+) ion to hexa-His-tag. Experiments were performed using the potentiometric, UV-visible, CD, and EPR techniques. In addition, molecular dynamics (MD) simulations and density functional theory (DFT) calculations were applied. The experimental studies have shown that the Cu(2+) ion binds most likely to two imidazoles and one, two, or three amide nitrogens, depending on the pH. The structures and stabilities of the complexes for the Cu(2+)-Ac-(His)6-NH2 system using experimental and computational tools were established. Polymorphic binding states are suggested, with a possibility of the formation of α-helix structure induced by metal ion coordination. Metal ion is bound to various pairs of imidazole moieties derived from the tag with different efficiencies. The coordination sphere around the metal ion is completed by molecules of water. Finally, the Cu(2+) binding by Ac-(His)6-NH2 is much more efficient compared to other multihistidine protein domains.

  16. Activation tagging in indica rice identifies ribosomal proteins as potential targets for manipulation of water-use efficiency and abiotic stress tolerance in plants.

    PubMed

    Moin, Mazahar; Bakshi, Achala; Saha, Anusree; Udaya Kumar, M; Reddy, Attipalli R; Rao, K V; Siddiq, E A; Kirti, P B

    2016-11-01

    We have generated 3900 enhancer-based activation-tagged plants, in addition to 1030 stable Dissociator-enhancer plants in a widely cultivated indica rice variety, BPT-5204. Of them, 3000 were screened for water-use efficiency (WUE) by analysing photosynthetic quantum efficiency and yield-related attributes under water-limiting conditions that identified 200 activation-tagged mutants, which were analysed for flanking sequences at the site of enhancer integration in the genome. We have further selected five plants with low Δ 13 C, high quantum efficiency and increased plant yield compared with wild type for a detailed investigation. Expression studies of 18 genes in these mutants revealed that in four plants one of the three to four tagged genes became activated, while two genes were concurrently up-regulated in the fifth plant. Two genes coding for proteins involved in 60S ribosomal assembly, RPL6 and RPL23A, were among those that became activated by enhancers. Quantitative expression analysis of these two genes also corroborated the results on activating-tagging. The high up-regulation of RPL6 and RPL23A in various stress treatments and the presence of significant cis-regulatory elements in their promoter regions along with the high up-regulation of several of RPL genes in various stress treatments indicate that they are potential targets for manipulating WUE/abiotic stress tolerance. © 2016 John Wiley & Sons Ltd.

  17. C-terminal tyrosine residues modulate the fusion activity of the Hendra virus fusion protein

    PubMed Central

    Popa, Andreea; Pager, Cara Teresia; Dutch, Rebecca Ellis

    2011-01-01

    The paramyxovirus family includes important human pathogens such as measles, mumps, respiratory syncytial virus and the recently emerged, highly pathogenic Hendra and Nipah viruses. The viral fusion (F) protein plays critical roles in infection, promoting both the viral-cell membrane fusion events needed for viral entry as well as cell-cell fusion events leading to syncytia formation. We describe the surprising finding that addition of the short epitope HA tag to the cytoplasmic tail (CT) of the Hendra virus F protein leads to a significant increase in cell-cell membrane fusion. This increase was not due to alterations in surface expression, cleavage state, or association with lipid microdomains. Addition of a Myc tag of similar length did not alter Hendra F fusion activity, indicating that the observed stimulation was not solely a result of lengthening the CT. Three tyrosine residues within the HA tag were critical for the increase in fusion, suggesting C-terminal tyrosines may modulate Hendra fusion activity. The effects of HA tag addition varied with other fusion proteins, as parainfluenza virus 5 F-HA showed decreased surface expression and no stimulation in fusion. These results indicate that additions to the C-terminal end of the F protein CT can modulate protein function in a sequence specific manner, reinforcing the need for careful analysis of epitope tagged glycoproteins. In addition, our results implicate C-terminal tyrosine residues in modulation of the membrane fusion reaction promoted by these viral glycoproteins. PMID:21175223

  18. Transposon tagging and the study of root development in Arabidopsis

    NASA Technical Reports Server (NTRS)

    Tsugeki, R.; Olson, M. L.; Fedoroff, N. V.

    1998-01-01

    The maize Ac-Ds transposable element family has been used as the basis of transposon mutagenesis systems that function in a variety of plants, including Arabidopsis. We have developed modified transposons and methods which simplify the detection, cloning and analysis of insertion mutations. We have identified and are analyzing two plant lines in which genes expressed either in the root cap cells or in the quiescent cells, cortex/endodermal initial cells and columella cells of the root cap have been tagged with a transposon carrying a reporter gene. A gene expressed in root cap cells tagged with an enhancer-trap Ds was isolated and its corresponding EST cDNA was identified. Nucleotide and deduced amino acid sequences of the gene show no significant similarity to other genes in the database. Genetic ablation experiments have been done by fusing a root cap-specific promoter to the diphtheria toxin A-chain gene and introducing the fusion construct into Arabidopsis plants. We find that in addition to eliminating gravitropism, root cap ablation inhibits elongation of roots by lowering root meristematic activities.

  19. Generation of a total of 6483 expressed sequence tags from 60 day-old bovine whole fetus and fetal placenta.

    PubMed

    Oishi, M; Gohma, H; Lejukole, H Y; Taniguchi, Y; Yamada, T; Suzuki, K; Shinkai, H; Uenishi, H; Yasue, H; Sasaki, Y

    2004-05-01

    Expressed sequence tags (ESTs) generated based on characterization of clones isolated randomly from cDNA libraries are used to study gene expression profiles in specific tissues and to provide useful information for characterizing tissue physiology. In this study, two directionally cloned cDNA libraries were constructed from 60 day-old bovine whole fetus and fetal placenta. We have characterized 5357 and 1126 clones, and then identified 3464 and 795 unique sequences for the fetus and placenta cDNA libraries: 1851 and 504 showed homology to already identified genes, and 1613 and 291 showed no significant matches to any of the sequences in DNA databases, respectively. Further, we found 94 unique sequences overlapping in both the fetus and the placenta, leading to a catalog of 4165 genes expressed in 60 day-old fetus and placenta. The catalog is used to examine expression profile of genes in 60 day-old bovine fetus and placenta.

  20. Sequencing-based gene network analysis provides a core set of gene resource for understanding thermal adaptation in Zhikong scallop Chlamys farreri.

    PubMed

    Fu, X; Sun, Y; Wang, J; Xing, Q; Zou, J; Li, R; Wang, Z; Wang, S; Hu, X; Zhang, L; Bao, Z

    2014-01-01

    Marine organisms are commonly exposed to variable environmental conditions, and many of them are under threat from increased sea temperatures caused by global climate change. Generating transcriptomic resources under different stress conditions are crucial for understanding molecular mechanisms underlying thermal adaptation. In this study, we conducted transcriptome-wide gene expression profiling of the scallop Chlamys farreri challenged by acute and chronic heat stress. Of the 13 953 unique tags, more than 850 were significantly differentially expressed at each time point after acute heat stress, which was more than the number of tags differentially expressed (320-350) under chronic heat stress. To obtain a systemic view of gene expression alterations during thermal stress, a weighted gene coexpression network was constructed. Six modules were identified as acute heat stress-responsive modules. Among them, four modules involved in apoptosis regulation, mRNA binding, mitochondrial envelope formation and oxidation reduction were downregulated. The remaining two modules were upregulated. One was enriched with chaperone and the other with microsatellite sequences, whose coexpression may originate from a transcription factor binding site. These results indicated that C. farreri triggered several cellular processes to acclimate to elevated temperature. No modules responded to chronic heat stress, suggesting that the scallops might have acclimated to elevated temperature within 3 days. This study represents the first sequencing-based gene network analysis in a nonmodel aquatic species and provides valuable gene resources for the study of thermal adaptation, which should assist in the development of heat-tolerant scallop lines for aquaculture. © 2013 John Wiley & Sons Ltd.

  1. Porcine transcriptome analysis based on 97 non-normalized cDNA libraries and assembly of 1,021,891 expressed sequence tags

    PubMed Central

    Gorodkin, Jan; Cirera, Susanna; Hedegaard, Jakob; Gilchrist, Michael J; Panitz, Frank; Jørgensen, Claus; Scheibye-Knudsen, Karsten; Arvin, Troels; Lumholdt, Steen; Sawera, Milena; Green, Trine; Nielsen, Bente J; Havgaard, Jakob H; Rosenkilde, Carina; Wang, Jun; Li, Heng; Li, Ruiqiang; Liu, Bin; Hu, Songnian; Dong, Wei; Li, Wei; Yu, Jun; Wang, Jian; Stærfeldt, Hans-Henrik; Wernersson, Rasmus; Madsen, Lone B; Thomsen, Bo; Hornshøj, Henrik; Bujie, Zhan; Wang, Xuegang; Wang, Xuefei; Bolund, Lars; Brunak, Søren; Yang, Huanming; Bendixen, Christian; Fredholm, Merete

    2007-01-01

    Background Knowledge of the structure of gene expression is essential for mammalian transcriptomics research. We analyzed a collection of more than one million porcine expressed sequence tags (ESTs), of which two-thirds were generated in the Sino-Danish Pig Genome Project and one-third are from public databases. The Sino-Danish ESTs were generated from one normalized and 97 non-normalized cDNA libraries representing 35 different tissues and three developmental stages. Results Using the Distiller package, the ESTs were assembled to roughly 48,000 contigs and 73,000 singletons, of which approximately 25% have a high confidence match to UniProt. Approximately 6,000 new porcine gene clusters were identified. Expression analysis based on the non-normalized libraries resulted in the following findings. The distribution of cluster sizes is scaling invariant. Brain and testes are among the tissues with the greatest number of different expressed genes, whereas tissues with more specialized function, such as developing liver, have fewer expressed genes. There are at least 65 high confidence housekeeping gene candidates and 876 cDNA library-specific gene candidates. We identified differential expression of genes between different tissues, in particular brain/spinal cord, and found patterns of correlation between genes that share expression in pairs of libraries. Finally, there was remarkable agreement in expression between specialized tissues according to Gene Ontology categories. Conclusion This EST collection, the largest to date in pig, represents an essential resource for annotation, comparative genomics, assembly of the pig genome sequence, and further porcine transcription studies. PMID:17407547

  2. [Methods, challenges and opportunities for big data analyses of microbiome].

    PubMed

    Sheng, Hua-Fang; Zhou, Hong-Wei

    2015-07-01

    Microbiome is a novel research field related with a variety of chronic inflamatory diseases. Technically, there are two major approaches to analysis of microbiome: metataxonome by sequencing the 16S rRNA variable tags, and metagenome by shot-gun sequencing of the total microbial (mainly bacterial) genome mixture. The 16S rRNA sequencing analyses pipeline includes sequence quality control, diversity analyses, taxonomy and statistics; metagenome analyses further includes gene annotation and functional analyses. With the development of the sequencing techniques, the cost of sequencing will decrease, and big data analyses will become the central task. Data standardization, accumulation, modeling and disease prediction are crucial for future exploit of these data. Meanwhile, the information property in these data, and the functional verification with culture-dependent and culture-independent experiments remain the focus in future research. Studies of human microbiome will bring a better understanding of the relations between the human body and the microbiome, especially in the context of disease diagnosis and therapy, which promise rich research opportunities.

  3. Sequence analysis of 497 mouse brain ESTs expressed in the substantia nigra

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Stewart, G.J.; Savioz, A.; Davies, R.W.

    1997-01-15

    The use of subtracted, region-specific cDNA libraries combined with single-pass cDNA sequencing allows the discovery of novel genes and facilitates molecular description of the tissue or region involved. We report the sequence of 497 mouse expressed sequence tags (ESTs) from two subtracted libraries enriched for cDNAs expressed in the substantia nigra, a brain region with important roles in movement control and Parkinson disease. Of these, 238 ESTs give no database matches and therefore derive from novel genes. A further 115 ESTs show sequence similarity to ESTs from other organisms, which themselves do not yield any significant database matches to genesmore » of known function. Fifty-six ESTs show sequence similarity to previously identified genes whose mouse homologues have not been reported. The total number of ESTs reported that are new for the mouse is 407, which, together with the 90 ESTs corresponding to known mouse genes or cDNAs, contributes to the molecular description of the substantia nigra. 21 refs., 4 tabs.« less

  4. Flexible, fast and accurate sequence alignment profiling on GPGPU with PaSWAS.

    PubMed

    Warris, Sven; Yalcin, Feyruz; Jackson, Katherine J L; Nap, Jan Peter

    2015-01-01

    To obtain large-scale sequence alignments in a fast and flexible way is an important step in the analyses of next generation sequencing data. Applications based on the Smith-Waterman (SW) algorithm are often either not fast enough, limited to dedicated tasks or not sufficiently accurate due to statistical issues. Current SW implementations that run on graphics hardware do not report the alignment details necessary for further analysis. With the Parallel SW Alignment Software (PaSWAS) it is possible (a) to have easy access to the computational power of NVIDIA-based general purpose graphics processing units (GPGPUs) to perform high-speed sequence alignments, and (b) retrieve relevant information such as score, number of gaps and mismatches. The software reports multiple hits per alignment. The added value of the new SW implementation is demonstrated with two test cases: (1) tag recovery in next generation sequence data and (2) isotype assignment within an immunoglobulin 454 sequence data set. Both cases show the usability and versatility of the new parallel Smith-Waterman implementation.

  5. Multiple tag labeling method for DNA sequencing

    DOEpatents

    Mathies, Richard A.; Huang, Xiaohua C.; Quesada, Mark A.

    1995-01-01

    A DNA sequencing method described which uses single lane or channel electrophoresis. Sequencing fragments are separated in said lane and detected using a laser-excited, confocal fluorescence scanner. Each set of DNA sequencing fragments is separated in the same lane and then distinguished using a binary coding scheme employing only two different fluorescent labels. Also described is a method of using radio-isotope labels.

  6. Changes in coral-associated microbial communities during a bleaching event.

    PubMed

    Bourne, David; Iida, Yuki; Uthicke, Sven; Smith-Keune, Carolyn

    2008-04-01

    Environmental stressors such as increased sea surface temperatures are well-known for contributing to coral bleaching; however, the effect of increased temperatures and subsequent bleaching on coral-associated microbial communities is poorly understood. Colonies of the hard coral Acropora millepora were tagged on a reef flat off Magnetic Island (Great Barrier Reef) and surveyed over 2.5 years, which included a severe bleaching event in January/February 2002. Daily average water temperatures exceeded the previous 10-year average by more than 1 degrees C for extended periods with field-based visual surveys recording all tagged colonies displaying signs of bleaching. During the bleaching period, direct counts of coral zooxanthellae densities decreased by approximately 64%, before recovery to pre-bleaching levels after the thermal stress event. A subset of three tagged coral colonies were sampled through the bleaching event and changes in the microbial community elucidated. Denaturing gradient gel electrophoresis (DGGE) analysis demonstrated conserved bacterial banding profiles between the three coral colonies, confirming previous studies highlighting specific microbial associations. As coral colonies bleached, the microbial community shifted and redundancy analysis (RDA) of DGGE banding patterns revealed a correlation of increasing temperature with the appearance of Vibrio-affiliated sequences. Interestingly, this shift to a Vibrio-dominated community commenced prior to visual signs of bleaching. Clone libraries hybridized with Vibrio-specific oligonucleotide probes confirmed an increase in the fraction of Vibrio-affiliated clones during the bleaching period. Post bleaching, the coral microbial associations again shifted, returning to a profile similar to the fingerprints prior to bleaching. This provided further evidence for corals selecting and shaping their microbial partners. For non-bleached samples, a close association with Spongiobacter-related sequences were revealed by both clone libraries and DGGE profiling. Despite Vibrio species being previously implicated in bleaching of specific coral species, it is unsure if the relative increase in retrieved Vibrio sequences is due to bacterial infection or an opportunistic response to compromised health and changing environmental parameters of the coral host. This study provides the first molecular-based study demonstrating changes in coral-associated bacterial assemblages during a bleaching event on a natural reef system.

  7. Developing expressed sequence tag libraries and the discovery of simple sequence repeat markers for two species of raspberry (Rubus L.)

    USDA-ARS?s Scientific Manuscript database

    Background: Due to a relatively high level of codominant inheritance and transferability within and among taxonomic groups, simple sequence repeat (SSR) markers are important elements in comparative mapping and delineation of genomic regions associated with traits of economic importance. Expressed S...

  8. Transferable green fluorescence-tagged pEI2 in Edwardsiella ictaluri

    USDA-ARS?s Scientific Manuscript database

    The pEI2 plasmid of Edwardsiella ictaluri isolate, I49, was tagged using a Tn10-GFP-kan cassette to create the green fluorescence-expressing derivative I49-gfp. The Tn10-GFP-kan insertion site was mapped by plasmid sequencing to 663 bp upstream of orf2 and appeared to be at a neutral site in the pla...

  9. NABIC: A New Access Portal to Search, Visualize, and Share Agricultural Genomics Data

    PubMed Central

    Seol, Young-Joo; Lee, Tae-Ho; Park, Dong-Suk; Kim, Chang-Kug

    2016-01-01

    The National Agricultural Biotechnology Information Center developed an access portal to search, visualize, and share agricultural genomics data with a focus on South Korean information and resources. The portal features an agricultural biotechnology database containing a wide range of omics data from public and proprietary sources. We collected 28.4 TB of data from 162 agricultural organisms, with 10 types of omics data comprising next-generation sequencing sequence read archive, genome, gene, nucleotide, DNA chip, expressed sequence tag, interactome, protein structure, molecular marker, and single-nucleotide polymorphism datasets. Our genomic resources contain information on five animals, seven plants, and one fungus, which is accessed through a genome browser. We also developed a data submission and analysis system as a web service, with easy-to-use functions and cutting-edge algorithms, including those for handling next-generation sequencing data. PMID:26848255

  10. Population structure of pigs determined by single nucleotide polymorphisms observed in assembled expressed sequence tags.

    PubMed

    Matsumoto, Toshimi; Okumura, Naohiko; Uenishi, Hirohide; Hayashi, Takeshi; Hamasima, Noriyuki; Awata, Takashi

    2012-01-01

    We have collected more than 190000 porcine expressed sequence tags (ESTs) from full-length complementary DNA (cDNA) libraries and identified more than 2800 single nucleotide polymorphisms (SNPs). In this study, we tentatively chose 222 SNPs observed in assembled ESTs to study pigs of different breeds; 104 were selected by comparing the cDNA sequences of a Meishan pig and samples of three-way cross pigs (Landrace, Large White, and Duroc: LWD), and 118 were selected from LWD samples. To evaluate the genetic variation between the chosen SNPs from pig breeds, we determined the genotypes for 192 pig samples (11 pig groups) from our DNA reference panel with matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Of the 222 reference SNPs, 186 were successfully genotyped. A neighbor-joining tree showed that the pig groups were classified into two large clusters, namely, Euro-American and East Asian pig populations. F-statistics and the analysis of molecular variance of Euro-American pig groups revealed that approximately 25% of the genetic variations occurred because of intergroup differences. As the F(IS) values were less than the F(ST) values(,) the clustering, based on the Bayesian inference, implied that there was strong genetic differentiation among pig groups and less divergence within the groups in our samples. © 2011 The Authors. Animal Science Journal © 2011 Japanese Society of Animal Science.

  11. Global Analysis of Transcription Factor-Binding Sites in Yeast Using ChIP-Seq

    PubMed Central

    Lefrançois, Philippe; Gallagher, Jennifer E. G.; Snyder, Michael

    2016-01-01

    Transcription factors influence gene expression through their ability to bind DNA at specific regulatory elements. Specific DNA-protein interactions can be isolated through the chromatin immunoprecipitation (ChIP) procedure, in which DNA fragments bound by the protein of interest are recovered. ChIP is followed by high-throughput DNA sequencing (Seq) to determine the genomic provenance of ChIP DNA fragments and their relative abundance in the sample. This chapter describes a ChIP-Seq strategy adapted for budding yeast to enable the genome-wide characterization of binding sites of transcription factors (TFs) and other DNA-binding proteins in an efficient and cost-effective way. Yeast strains with epitope-tagged TFs are most commonly used for ChIP-Seq, along with their matching untagged control strains. The initial step of ChIP involves the cross-linking of DNA and proteins. Next, yeast cells are lysed and sonicated to shear chromatin into smaller fragments. An antibody against an epitope-tagged TF is used to pull down chromatin complexes containing DNA and the TF of interest. DNA is then purified and proteins degraded. Specific barcoded adapters for multiplex DNA sequencing are ligated to ChIP DNA. Short DNA sequence reads (28–36 base pairs) are parsed according to the barcode and aligned against the yeast reference genome, thus generating a nucleotide-resolution map of transcription factor-binding sites and their occupancy. PMID:25213249

  12. Stable isotope, site-specific mass tagging for protein identification

    DOEpatents

    Chen, Xian

    2006-10-24

    Proteolytic peptide mass mapping as measured by mass spectrometry provides an important method for the identification of proteins, which are usually identified by matching the measured and calculated m/z values of the proteolytic peptides. A unique identification is, however, heavily dependent upon the mass accuracy and sequence coverage of the fragment ions generated by peptide ionization. The present invention describes a method for increasing the specificity, accuracy and efficiency of the assignments of particular proteolytic peptides and consequent protein identification, by the incorporation of selected amino acid residue(s) enriched with stable isotope(s) into the protein sequence without the need for ultrahigh instrumental accuracy. Selected amino acid(s) are labeled with .sup.13C/.sup.15N/.sup.2H and incorporated into proteins in a sequence-specific manner during cell culturing. Each of these labeled amino acids carries a defined mass change encoded in its monoisotopic distribution pattern. Through their characteristic patterns, the peptides with mass tag(s) can then be readily distinguished from other peptides in mass spectra. The present method of identifying unique proteins can also be extended to protein complexes and will significantly increase data search specificity, efficiency and accuracy for protein identifications.

  13. Expressed sequence tags from the flower pathogen Claviceps purpurea.

    PubMed

    Oeser, Birgitt; Beaussart, François; Haarmann, Thomas; Lorenz, Nicole; Nathues, Eva; Rolke, Yvonne; Scheffer, Jan; Weiner, January; Tudzynski, Paul

    2009-09-01

    SUMMARY The ascomycete Claviceps purpurea (ergot) is a biotrophic flower pathogen of rye and other grasses. The deleterious toxic effects of infected rye seeds on humans and grazing animals have been known since the Middle Ages. To gain further insight into the molecular basis of this disease, we generated about 10 000 expressed sequence tags (ESTs)-about 25% originating from axenic fungal culture and about 75% from tissues collected 6-20 days after infection of rye spikes. The pattern of axenic vs. in planta gene expression was compared. About 200 putative plant genes were identified within the in planta library. A high percentage of these were predicted to function in plant defence against the ergot fungus and other pathogens, for example pathogenesis-related proteins. Potential fungal pathogenicity and virulence genes were found via comparison with the pathogen-host interaction database (PHI-base; http://www.phi-base.org) and with genes known to be highly expressed in the haustoria of the bean rust fungus. Comparative analysis of Claviceps and two other fungal flower pathogens (necrotrophic Fusarium graminearum and biotrophic Ustilago maydis) highlighted similarities and differences in their lifestyles, for example all three fungi have signalling components and cell wall-degrading enzymes in their arsenal. In summary, the analysis of axenic and in planta ESTs yielded a collection of candidate genes to be evaluated for functional roles in this plant-microbe interaction.

  14. Multiple tag labeling method for DNA sequencing

    DOEpatents

    Mathies, R.A.; Huang, X.C.; Quesada, M.A.

    1995-07-25

    A DNA sequencing method is described which uses single lane or channel electrophoresis. Sequencing fragments are separated in the lane and detected using a laser-excited, confocal fluorescence scanner. Each set of DNA sequencing fragments is separated in the same lane and then distinguished using a binary coding scheme employing only two different fluorescent labels. Also described is a method of using radioisotope labels. 5 figs.

  15. Reproducibility of Illumina platform deep sequencing errors allows accurate determination of DNA barcodes in cells.

    PubMed

    Beltman, Joost B; Urbanus, Jos; Velds, Arno; van Rooij, Nienke; Rohr, Jan C; Naik, Shalin H; Schumacher, Ton N

    2016-04-02

    Next generation sequencing (NGS) of amplified DNA is a powerful tool to describe genetic heterogeneity within cell populations that can both be used to investigate the clonal structure of cell populations and to perform genetic lineage tracing. For applications in which both abundant and rare sequences are biologically relevant, the relatively high error rate of NGS techniques complicates data analysis, as it is difficult to distinguish rare true sequences from spurious sequences that are generated by PCR or sequencing errors. This issue, for instance, applies to cellular barcoding strategies that aim to follow the amount and type of offspring of single cells, by supplying these with unique heritable DNA tags. Here, we use genetic barcoding data from the Illumina HiSeq platform to show that straightforward read threshold-based filtering of data is typically insufficient to filter out spurious barcodes. Importantly, we demonstrate that specific sequencing errors occur at an approximately constant rate across different samples that are sequenced in parallel. We exploit this observation by developing a novel approach to filter out spurious sequences. Application of our new method demonstrates its value in the identification of true sequences amongst spurious sequences in biological data sets.

  16. Diversity analysis in Cannabis sativa based on large-scale development of expressed sequence tag-derived simple sequence repeat markers.

    PubMed

    Gao, Chunsheng; Xin, Pengfei; Cheng, Chaohua; Tang, Qing; Chen, Ping; Wang, Changbiao; Zang, Gonggu; Zhao, Lining

    2014-01-01

    Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SSRs were identified from 3,624 ESTs. From there, a total of 3,442 complementary primer pairs were designed as SSR markers. Among these markers, trinucleotide repeat motifs (50.99%) were the most abundant, followed by hexanucleotide (25.13%), dinucleotide (16.34%), tetranucloetide (3.8%), and pentanucleotide (3.74%) repeat motifs, respectively. The AAG/CTT trinucleotide repeat (17.96%) was the most abundant motif detected in the SSRs. One hundred and seventeen EST-SSR markers were randomly selected to evaluate primer quality in 24 cannabis varieties. Among these 117 markers, 108 (92.31%) were successfully amplified and 87 (74.36%) were polymorphic. Forty-five polymorphic primer pairs were selected to evaluate genetic diversity and relatedness among the 115 cannabis genotypes. The results showed that 115 varieties could be divided into 4 groups primarily based on geography: Northern China, Europe, Central China, and Southern China. Moreover, the coefficient of similarity when comparing cannabis from Northern China with the European group cannabis was higher than that when comparing with cannabis from the other two groups, owing to a similar climate. This study outlines the first large-scale development of SSR markers for cannabis. These data may serve as a foundation for the development of genetic linkage, quantitative trait loci mapping, and marker-assisted breeding of cannabis.

  17. Diversity Analysis in Cannabis sativa Based on Large-Scale Development of Expressed Sequence Tag-Derived Simple Sequence Repeat Markers

    PubMed Central

    Cheng, Chaohua; Tang, Qing; Chen, Ping; Wang, Changbiao; Zang, Gonggu; Zhao, Lining

    2014-01-01

    Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SSRs were identified from 3,624 ESTs. From there, a total of 3,442 complementary primer pairs were designed as SSR markers. Among these markers, trinucleotide repeat motifs (50.99%) were the most abundant, followed by hexanucleotide (25.13%), dinucleotide (16.34%), tetranucloetide (3.8%), and pentanucleotide (3.74%) repeat motifs, respectively. The AAG/CTT trinucleotide repeat (17.96%) was the most abundant motif detected in the SSRs. One hundred and seventeen EST-SSR markers were randomly selected to evaluate primer quality in 24 cannabis varieties. Among these 117 markers, 108 (92.31%) were successfully amplified and 87 (74.36%) were polymorphic. Forty-five polymorphic primer pairs were selected to evaluate genetic diversity and relatedness among the 115 cannabis genotypes. The results showed that 115 varieties could be divided into 4 groups primarily based on geography: Northern China, Europe, Central China, and Southern China. Moreover, the coefficient of similarity when comparing cannabis from Northern China with the European group cannabis was higher than that when comparing with cannabis from the other two groups, owing to a similar climate. This study outlines the first large-scale development of SSR markers for cannabis. These data may serve as a foundation for the development of genetic linkage, quantitative trait loci mapping, and marker-assisted breeding of cannabis. PMID:25329551

  18. Draft genome sequence of bitter gourd (Momordica charantia), a vegetable and medicinal plant in tropical and subtropical regions.

    PubMed

    Urasaki, Naoya; Takagi, Hiroki; Natsume, Satoshi; Uemura, Aiko; Taniai, Naoki; Miyagi, Norimichi; Fukushima, Mai; Suzuki, Shouta; Tarora, Kazuhiko; Tamaki, Moritoshi; Sakamoto, Moriaki; Terauchi, Ryohei; Matsumura, Hideo

    2017-02-01

    Bitter gourd (Momordica charantia) is an important vegetable and medicinal plant in tropical and subtropical regions globally. In this study, the draft genome sequence of a monoecious bitter gourd inbred line, OHB3-1, was analyzed. Through Illumina sequencing and de novo assembly, scaffolds of 285.5 Mb in length were generated, corresponding to ∼84% of the estimated genome size of bitter gourd (339 Mb). In this draft genome sequence, 45,859 protein-coding gene loci were identified, and transposable elements accounted for 15.3% of the whole genome. According to synteny mapping and phylogenetic analysis of conserved genes, bitter gourd was more related to watermelon (Citrullus lanatus) than to cucumber (Cucumis sativus) or melon (C. melo). Using RAD-seq analysis, 1507 marker loci were genotyped in an F2 progeny of two bitter gourd lines, resulting in an improved linkage map, comprising 11 linkage groups. By anchoring RAD tag markers, 255 scaffolds were assigned to the linkage map. Comparative analysis of genome sequences and predicted genes determined that putative trypsin-inhibitor and ribosome-inactivating genes were distinctive in the bitter gourd genome. These genes could characterize the bitter gourd as a medicinal plant. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  19. Transcriptome analysis of eyestalk and hemocytes in the ridgetail white prawn Exopalaemon carinicauda: assembly, annotation and marker discovery.

    PubMed

    Li, Jitao; Li, Jian; Chen, Ping; Liu, Ping; He, Yuying

    2015-01-01

    The ridgetail white prawn Exopalaemon carinicauda is one of major economic mariculture species in eastern China. The deficiency of genomic and transcriptomic data is becoming the bottleneck of further researches on its good traits. In the present study, 454 pyrosequencing was undertaken to investigate the transcriptome profiles of E. carinicauda. A collection of 1,028,710 sequence reads (459.59 Mb) obtained from cDNA prepared from eyestalk and hemocytes was assembled into 162,056 expressed sequence tags (ESTs). Of these, 29.88 % of 48,428 contigs and 70.12 % of 113,628 singlets possessed high similarities to sequences in the GenBank non-redundant database, with most significant (E value <1e(-10)) unigenes matches occurring with crustacean and insect sequences. KEGG analysis of unigenes identified putative members of biological pathways related to growth and immunity. In addition, we obtained a total of putative 125,112 SNPs and 13,467 microsatellites. These results will contribute to the understanding of the genome makeup and provide useful information for future functional genomic research in E. carinicauda.

  20. Molecular cloning of a putative gene encoding isopentenyltransferase from pingyitiancha (Malus hupehensis) and characterization of its response to nitrate.

    PubMed

    Peng, Jing; Peng, Futian; Zhu, Chunfu; Wei, Shaochong

    2008-06-01

    A putative isopentenyltransferase (IPT) encoding gene was identified from a pingyitiancha (Malus hupehensis Rehd.) expressed sequence tag database, and the full-length gene was cloned by RACE. Based on expression profile and sequence alignment, the nucleotide sequence of the clone, named MhIPT3, was most similar to AtIPT3, an IPT gene in Arabidopsis. The full-length cDNA contained a 963-bp open reading frame encoding a protein of 321 amino acids with a molecular mass of 37.3 kDa. Sequence analysis of genomic DNA revealed the absence of introns in the frame. Quantitative real-time PCR analysis demonstrated that the gene was expressed in roots, stems and leaves. Application of nitrate to roots of nitrogen-deprived seedlings strongly induced expression of MhIPT3 and was accompanied by the accumulation of cytokinins, whereas MhIPT3 expression was little affected by ammonium application to roots of nitrogen-deprived seedlings. Application of nitrate to leaves also up-regulated the expression of MhIPT3 and corresponded closely with the accumulation of isopentyladenine and isopentyladenosine in leaves.

  1. MytiBase: a knowledgebase of mussel (M. galloprovincialis) transcribed sequences

    PubMed Central

    Venier, Paola; De Pittà, Cristiano; Bernante, Filippo; Varotto, Laura; De Nardi, Barbara; Bovo, Giuseppe; Roch, Philippe; Novoa, Beatriz; Figueras, Antonio; Pallavicini, Alberto; Lanfranchi, Gerolamo

    2009-01-01

    Background Although Bivalves are among the most studied marine organisms due to their ecological role, economic importance and use in pollution biomonitoring, very little information is available on the genome sequences of mussels. This study reports the functional analysis of a large-scale Expressed Sequence Tag (EST) sequencing from different tissues of Mytilus galloprovincialis (the Mediterranean mussel) challenged with toxic pollutants, temperature and potentially pathogenic bacteria. Results We have constructed and sequenced seventeen cDNA libraries from different Mediterranean mussel tissues: gills, digestive gland, foot, anterior and posterior adductor muscle, mantle and haemocytes. A total of 24,939 clones were sequenced from these libraries generating 18,788 high-quality ESTs which were assembled into 2,446 overlapping clusters and 4,666 singletons resulting in a total of 7,112 non-redundant sequences. In particular, a high-quality normalized cDNA library (Nor01) was constructed as determined by the high rate of gene discovery (65.6%). Bioinformatic screening of the non-redundant M. galloprovincialis sequences identified 159 microsatellite-containing ESTs. Clusters, consensuses, related similarities and gene ontology searches have been organized in a dedicated, searchable database . Conclusion We defined the first species-specific catalogue of M. galloprovincialis ESTs including 7,112 unique transcribed sequences. Putative microsatellite markers were identified. This annotated catalogue represents a valuable platform for expression studies, marker validation and genetic linkage analysis for investigations in the biology of Mediterranean mussels. PMID:19203376

  2. Dive patterns of tagged right whales in the Great South Channel

    NASA Astrophysics Data System (ADS)

    Winn, Howard E.; Goodyear, Jeffrey D.; Kenney, Robert D.; Petricig, Richard O.

    Right whales were tagged in 1988 and 1989 with radio and sonic telemetry tags as part of a multidisciplinary investigation of right whales and their habitat in the Great South Channel region east of Cape Cod. The tags yielded data on the durations of 6456 dives and 6482 surfacings, as well as 23,538 measurements of the depth of a diving whale. Log-survivorship analysis of the 1988 data showed a clear separation between the durations of dives between blows within a single surfacing sequence or bout (intea-bout dives) and longer dives between surfacing sequences (interbout dives) at 27 s, which was also applied to the 1989 data. Inter-bout dives averaged 127.3 s, and were significantly longer in 1988 than in 1989. Inter-bout dives were significantly longer during the day than night in 1988, and longer at night in 1989. The average intea-bout dive duration was 11.8 s, with 1989 dives longer than those in 1988. Surface durations averaged 6.2 s, and were also significantly longer in 1989. Dive depths were recorded only in 1989. Mean dive depth was 7.3 m, and only 12 dives went deeper than 30 m. The typical right whale dive pattern in 1988 included relatively short surfacings, long dives during the day, and shorter dives at night. This correlated with strong diel vertical migration by the dense zooplankton patches on which they were presumed to be feeding based on indirect evidence-from near the surface at night to near the bottom during the day. The 1989 pattern included longer dives during the night, as well as some exceptionally long surfacings. Zooplankton in 1989 did not migrate vertically, and remained near the surface day and night in right whale feeding areas. Right whale dive patterns in the Great South Channel are closely correlated with the horizontal and vertical distributions and movements of dense patches of their zooplankton prey.

  3. Finding similar nucleotide sequences using network BLAST searches.

    PubMed

    Ladunga, Istvan

    2009-06-01

    The Basic Local Alignment Search Tool (BLAST) is a keystone of bioinformatics due to its performance and user-friendliness. Beginner and intermediate users will learn how to design and submit blastn and Megablast searches on the Web pages at the National Center for Biotechnology Information. We map nucleic acid sequences to genomes, find identical or similar mRNA, expressed sequence tag, and noncoding RNA sequences, and run Megablast searches, which are much faster than blastn. Understanding results is assisted by taxonomy reports, genomic views, and multiple alignments. We interpret expected frequency thresholds, biological significance, and statistical significance. Weak hits provide no evidence, but hints for further analyses. We find genes that may code for homologous proteins by translated BLAST. We reduce false positives by filtering out low-complexity regions. Parsed BLAST results can be integrated into analysis pipelines. Links in the output connect to Entrez, PUBMED, structural, sequence, interaction, and expression databases. This facilitates integration with a wide spectrum of biological knowledge.

  4. Functional analysis of Pacific oyster (Crassostrea gigas) β-thymosin: Focus on antimicrobial activity.

    PubMed

    Nam, Bo-Hye; Seo, Jung-Kil; Lee, Min Jeong; Kim, Young-Ok; Kim, Dong-Gyun; An, Cheul Min; Park, Nam Gyu

    2015-07-01

    An antimicrobial peptide, ∼5 kDa in size, was isolated and purified in its active form from the mantle of the Pacific oyster Crassostrea gigas by C18 reversed-phase high-performance liquid chromatography. Matrix-assisted laser desorption ionisation time-of-flight analysis revealed 4656.4 Da of the purified and unreduced peptide. A comparison of the N-terminal amino acid sequence of oyster antimicrobial peptide with deduced amino acid sequences in our local expressed sequence tag (EST) database of C. gigas (unpublished data) revealed that the oyster antimicrobial peptide sequence entirely matched the deduced amino acid sequence of an EST clone (HM-8_A04), which was highly homologous with the β-thymosin of other species. The cDNA possessed a 126-bp open reading frame that encoded a protein of 41 amino acids. To confirm the antimicrobial activity of C. gigas β-thymosin, we overexpressed a recombinant β-thymosin (rcgTβ) using a pET22 expression plasmid in an Escherichia coli system. The antimicrobial activity of rcgTβ was evaluated and demonstrated using a bacterial growth inhibition test in both liquid and solid cultures. Copyright © 2015 Elsevier Ltd. All rights reserved.

  5. Study of cnidarian-algal symbiosis in the "omics" age.

    PubMed

    Meyer, Eli; Weis, Virginia M

    2012-08-01

    The symbiotic associations between cnidarians and dinoflagellate algae (Symbiodinium) support productive and diverse ecosystems in coral reefs. Many aspects of this association, including the mechanistic basis of host-symbiont recognition and metabolic interaction, remain poorly understood. The first completed genome sequence for a symbiotic anthozoan is now available (the coral Acropora digitifera), and extensive expressed sequence tag resources are available for a variety of other symbiotic corals and anemones. These resources make it possible to profile gene expression, protein abundance, and protein localization associated with the symbiotic state. Here we review the history of "omics" studies of cnidarian-algal symbiosis and the current availability of sequence resources for corals and anemones, identifying genes putatively involved in symbiosis across 10 anthozoan species. The public availability of candidate symbiosis-associated genes leaves the field of cnidarian-algal symbiosis poised for in-depth comparative studies of sequence diversity and gene expression and for targeted functional studies of genes associated with symbiosis. Reviewing the progress to date suggests directions for future investigations of cnidarian-algal symbiosis that include (i) sequencing of Symbiodinium, (ii) proteomic analysis of the symbiosome membrane complex, (iii) glycomic analysis of Symbiodinium cell surfaces, and (iv) expression profiling of the gastrodermal cells hosting Symbiodinium.

  6. Defective in Cuticular Ridges (DCR) of Arabidopsis thaliana, a Gene Associated with Surface Cutin Formation, Encodes a Soluble Diacylglycerol Acyltransferase*

    PubMed Central

    Rani, Sapa Hima; Krishna, T. H. Anantha; Saha, Saikat; Negi, Arvind Singh; Rajasekharan, Ram

    2010-01-01

    A key step in the triacylglycerol (TAG) biosynthetic pathway is the final acylation of diacylglycerol (DAG) by DAG acyltransferase. In silico analysis has revealed that the DCR (defective in cuticular ridges) (At5g23940) gene has a typical HX4D acyltransferase motif at the N-terminal end and a lipid binding motif VX2GF at the middle of the sequence. To understand the biochemical function, the gene was overexpressed in Escherichia coli, and the purified recombinant protein was found to acylate DAG specifically in an acyl-CoA-dependent manner. Overexpression of At5g23940 in a Saccharomyces cerevisiae quadruple mutant deficient in DAG acyltransferases resulted in TAG accumulation. At5g23940 rescued the growth of this quadruple mutant in the oleate-containing medium, whereas empty vector control did not. Lipid particles were localized in the cytosol of At5g23940-transformed quadruple mutant cells, as observed by oil red O staining. There was an incorporation of 16-hydroxyhexadecanoic acid into TAG in At5g23940-transformed cells of quadruple mutant. Here we report a soluble acyl-CoA-dependent DAG acyltransferase from Arabidopsis thaliana. Taken together, these data suggest that a broad specific DAG acyltransferase may be involved in the cutin as well as in the TAG biosynthesis by supplying hydroxy fatty acid. PMID:20921218

  7. Defective in cuticular ridges (DCR) of Arabidopsis thaliana, a gene associated with surface cutin formation, encodes a soluble diacylglycerol acyltransferase.

    PubMed

    Rani, Sapa Hima; Krishna, T H Anantha; Saha, Saikat; Negi, Arvind Singh; Rajasekharan, Ram

    2010-12-03

    A key step in the triacylglycerol (TAG) biosynthetic pathway is the final acylation of diacylglycerol (DAG) by DAG acyltransferase. In silico analysis has revealed that the DCR (defective in cuticular ridges) (At5g23940) gene has a typical HX(4)D acyltransferase motif at the N-terminal end and a lipid binding motif VX(2)GF at the middle of the sequence. To understand the biochemical function, the gene was overexpressed in Escherichia coli, and the purified recombinant protein was found to acylate DAG specifically in an acyl-CoA-dependent manner. Overexpression of At5g23940 in a Saccharomyces cerevisiae quadruple mutant deficient in DAG acyltransferases resulted in TAG accumulation. At5g23940 rescued the growth of this quadruple mutant in the oleate-containing medium, whereas empty vector control did not. Lipid particles were localized in the cytosol of At5g23940-transformed quadruple mutant cells, as observed by oil red O staining. There was an incorporation of 16-hydroxyhexadecanoic acid into TAG in At5g23940-transformed cells of quadruple mutant. Here we report a soluble acyl-CoA-dependent DAG acyltransferase from Arabidopsis thaliana. Taken together, these data suggest that a broad specific DAG acyltransferase may be involved in the cutin as well as in the TAG biosynthesis by supplying hydroxy fatty acid.

  8. Building and using a statistical 3D motion atlas for analyzing myocardial contraction in MRI

    NASA Astrophysics Data System (ADS)

    Rougon, Nicolas F.; Petitjean, Caroline; Preteux, Francoise J.

    2004-05-01

    We address the issue of modeling and quantifying myocardial contraction from 4D MR sequences, and present an unsupervised approach for building and using a statistical 3D motion atlas for the normal heart. This approach relies on a state-of-the-art variational non rigid registration (NRR) technique using generalized information measures, which allows for robust intra-subject motion estimation and inter-subject anatomical alignment. The atlas is built from a collection of jointly acquired tagged and cine MR exams in short- and long-axis views. Subject-specific non parametric motion estimates are first obtained by incremental NRR of tagged images onto the end-diastolic (ED) frame. Individual motion data are then transformed into the coordinate system of a reference subject using subject-to-reference mappings derived by NRR of cine ED images. Finally, principal component analysis of aligned motion data is performed for each cardiac phase, yielding a mean model and a set of eigenfields encoding kinematic ariability. The latter define an organ-dedicated hierarchical motion basis which enables parametric motion measurement from arbitrary tagged MR exams. To this end, the atlas is transformed into subject coordinates by reference-to-subject NRR of ED cine frames. Atlas-based motion estimation is then achieved by parametric NRR of tagged images onto the ED frame, yielding a compact description of myocardial contraction during diastole.

  9. Comparison of Sanger and next generation sequencing performance for genotyping Cryptosporidium isolates at the 18S rRNA and actin loci.

    PubMed

    Paparini, Andrea; Gofton, Alexander; Yang, Rongchang; White, Nicole; Bunce, Michael; Ryan, Una M

    2015-01-01

    Cryptosporidium is an important enteric pathogen that infects a wide range of humans and animals. Rapid and reliable detection and characterisation methods are essential for understanding the transmission dynamics of the parasite. Sanger sequencing, and high-throughput sequencing (HTS) on an Ion Torrent platform, were compared with each other for their sensitivity and accuracy in detecting and characterising 25 Cryptosporidium-positive human and animal faecal samples. Ion Torrent reads (n = 123,857) were obtained at both 18S rRNA and actin loci for 21 of the 25 samples. Of these, one isolate at the actin locus (Cattle 05) and three at the 18S rRNA locus (HTS 10, HTS 11 and HTS 12), suffered PCR drop-out (i.e. PCR failures) when using fusion-tagged PCR. Sanger sequences were obtained for both loci for 23 of the 25 samples and showed good agreement with Ion Torrent-based genotyping. Two samples both from pythons (SK 02 and SK 05) produced mixed 18S and actin chromatograms by Sanger sequencing but were clearly identified by Ion Torrent sequencing as C. muris. One isolate (SK 03) was typed as C. muris by Sanger sequencing but was identified as a mixed C. muris and C. tyzzeri infection by HTS. 18S rRNA Type B sequences were identified in 4/6 C. parvum isolates when deep sequenced but were undetected in Sanger sequencing. Sanger was cheaper than Ion Torrent when sequencing a small numbers of samples, but when larger numbers of samples are considered (n = 60), the costs were comparative. Fusion-tagged amplicon based approaches are a powerful way of approaching mixtures, the only draw-back being the loss of PCR efficiency on low-template samples when using primers coupled to MID tags and adaptors. Taken together these data show that HTS has excellent potential for revealing the "true" composition of species/types in a Cryptosporidium infection, but that HTS workflows need to be carefully developed to ensure sensitivity, accuracy and contamination are controlled. Copyright © 2015 Elsevier Inc. All rights reserved.

  10. Short Communication: Genetic linkage map of Cucurbita maxima with molecular and morphological markers.

    PubMed

    Ge, Y; Li, X; Yang, X X; Cui, C S; Qu, S P

    2015-05-22

    Cucurbita maxima is one of the most widely cultivated vegetables in China and exhibits distinct morphological characteristics. In this study, genetic linkage analysis with 57 simple-sequence repeats, 21 amplified fragment length polymorphisms, 3 random-amplified polymorphic DNA, and one morphological marker revealed 20 genetic linkage groups of C. maxima covering a genetic distance of 991.5 cM with an average of 12.1 cM between adjacent markers. Genetic linkage analysis identified the simple-sequence repeat marker 'PU078072' 5.9 cM away from the locus 'Rc', which controls rind color. The genetic map in the present study will be useful for better mapping, tagging, and cloning of quantitative trait loci/gene(s) affecting economically important traits and for breeding new varieties of C. maxima through marker-assisted selection.

  11. Changes in rat spinal cord gene expression after inflammatory hyperalgesia of the joint and manual therapy.

    PubMed

    Ruhlen, Rachel L; Singh, Vineet K; Pazdernik, Vanessa K; Towns, Lex C; Snider, Eric J; Sargentini, Neil J; Degenhardt, Brian F

    2014-10-01

    Mobilization of a joint affects local tissue directly but may also have other effects that are mediated through the central nervous system. To identify differential gene expression in the spinal cords of rats with or without inflammatory joint injury after manual therapy or no treatment. Rats were randomly assigned to 1 of 4 treatment groups: no injury and no touch (NI/NT), injury and no touch (I/NT), no injury and manual therapy (NI/MT), and injury and manual therapy (I/MT). We induced acute inflammatory joint injury in the rats by injecting carrageenan into an ankle. Rats in the no-injury groups did not receive carrageenan injection. One day after injury, rats received manual therapy to the knee of the injured limb. Rats in the no-touch groups were anesthetized without receiving manual therapy. Spinal cords were harvested 30 minutes after therapy or no touch, and spinal cord gene expression was analyzed by microarray for 3 comparisons: NI/NT vs I/NT, I/MT vs I/NT, and NI/NT vs NI/MT. Three rats were assigned to each group. Of 38,875 expressed sequence tags, 755 were differentially expressed in the NI/NT vs I/NT comparison. For the other comparisons, no expressed sequence tags were differentially expressed. Cluster analysis revealed that the differentially expressed sequence tags were over-represented in several categories, including ion homeostasis (enrichment score, 2.29), transmembrane (enrichment score, 1.55), and disulfide bond (enrichment score, 2.04). An inflammatory injury to the ankle of rats caused differential expression of genes in the spinal cord. Consistent with other studies, genes involved in ion transport were among those affected. However, manual therapy to the knees of injured limbs or to rats without injury did not alter gene expression in the spinal cord. Thus, evidence for central nervous system mediation of manual therapy was not observed. © 2014 The American Osteopathic Association.

  12. The use of archived tags in retrospective genetic analysis of fish.

    PubMed

    Bonanomi, Sara; Therkildsen, Nina Overgaard; Hedeholm, Rasmus Berg; Hemmer-Hansen, Jakob; Nielsen, Einar E

    2014-05-01

    Collections of historical tissue samples from fish (e.g. scales and otoliths) stored in museums and fisheries institutions are precious sources of DNA for conducting retrospective genetic analysis. However, in some cases, only external tags used for documentation of spatial dynamics of fish populations have been preserved. Here, we test the usefulness of fish tags as a source of DNA for genetic analysis. We extract DNA from historical tags from cod collected in Greenlandic waters between 1950 and 1968. We show that the quantity and quality of DNA recovered from tags is comparable to DNA from archived otoliths from the same individuals. Surprisingly, levels of cross-contamination do not seem to be significantly higher in DNA from external (tag) than internal (otolith) sources. Our study therefore demonstrates that historical tags can be a highly valuable source of DNA for retrospective genetic analysis of fish. © 2013 John Wiley & Sons Ltd.

  13. Genotyping variability of computationally categorized peach microsatellite markers

    USDA-ARS?s Scientific Manuscript database

    Numerous expressed sequence tag (EST) simple sequence repeat (SSR) primers can be easily mined out. The obstacle to develop them into usable markers is how to optimally select downsized subsets of the primers for genotyping, which accordingly reduces amplification failure and monomorphism often occu...

  14. Molecular barcodes detect redundancy and contamination in hairpin-bisulfite PCR

    PubMed Central

    Miner, Brooks E.; Stöger, Reinhard J.; Burden, Alice F.; Laird, Charles D.; Hansen, R. Scott

    2004-01-01

    PCR amplification of limited amounts of DNA template carries an increased risk of product redundancy and contamination. We use molecular barcoding to label each genomic DNA template with an individual sequence tag prior to PCR amplification. In addition, we include molecular ‘batch-stamps’ that effectively label each genomic template with a sample ID and analysis date. This highly sensitive method identifies redundant and contaminant sequences and serves as a reliable method for positive identification of desired sequences; we can therefore capture accurately the genomic template diversity in the sample analyzed. Although our application described here involves the use of hairpin-bisulfite PCR for amplification of double-stranded DNA, the method can readily be adapted to single-strand PCR. Useful applications will include analyses of limited template DNA for biomedical, ancient DNA and forensic purposes. PMID:15459281

  15. The rescue and evaluation of FLAG and HIS epitope-tagged Asia 1 type foot-and-mouth disease viruses.

    PubMed

    Yang, Bo; Yang, Fan; Zhang, Yan; Liu, Huanan; Jin, Ye; Cao, Weijun; Zhu, Zixiang; Zheng, Haixue; Yin, Hong

    2016-02-02

    The VP1 G-H loop of the foot-and-mouth disease virus (FMDV) contains the primary antigenic site, as well as an Arg-Gly-Asp (RGD) binding motif for the αv-integrin family of cell surface receptors. We anticipated that introducing a foreign epitope tag sequence downstream of the RGD motif would be tolerated by the viral capsid and would not destroy the antigenic site of FMDV. In this study, we have designed, generated, and characterized two recombinant FMDVs with a FLAG tag or histidine (HIS) inserted in the VP1 G-H loop downstream of the RGD motif +9 position. The tagged viruses were genetically stable and exhibited similar growth properties with their parental virus. What is more, the recombinant viruses rFMDV-FLAG and rFMDV-HIS showed neutralization sensitivity to FMDV type Asia1-specific mAbs, as well as to polyclonal antibodies. Additionally, the r1 values of the recombinant viruses were similar to that of the parental virus, indicating that the insertion of FLAG or HIS tag sequences downstream of the RGD motif +9 position do not eradicate the antigenic site of FMDV and do not affect its antigenicity. These results indicated that the G-H loop of Asia1 FMDV is able to effectively display the foreign epitopes, making this a potential approach for novel FMDV vaccines development. Copyright © 2015 Elsevier B.V. All rights reserved.

  16. Association between hMLH1 hypermethylation and JC virus (JCV) infection in human colorectal cancer (CRC).

    PubMed

    Vilkin, Alex; Niv, Yaron

    2011-04-01

    Incorporation of viral DNA may interfere with the normal sequence of human DNA bases on the genetic level or cause secondary epigenetic changes such as gene promoter methylation or histone acetylation. Colorectal cancer (CRC) is the second leading cause of cancer mortality in the USA. Chromosomal instability (CIN) was established as the key mechanism in cancer development. Later, it was found that CRC results not only from the progressive accumulation of genetic alterations but also from epigenetic changes. JC virus (JCV) is a candidate etiologic factor in sporadic CRC. It may act by stabilizing β-catenin, facilitating its entrance to the cell nucleus, initialing proliferation and cancer development. Diploid CRC cell lines transfected with JCV-containing plasmids developed CIN. This result provides direct experimental evidence for the ability of JCV T-Ag to induce CIN in the genome of colonic epithelial cells. The association of CRC hMLH1 methylation and tumor positivity for JCV was recently documented. JC virus T-Ag DNA sequences were found in 77% of CRCs and are associated with promoter methylation of multiple genes. hMLH1 was methylated in 25 out of 80 CRC patients positive for T-Ag (31%) in comparison with only one out of 11 T-Ag negative cases (9%). Thus, JCV can mediate both CIN and aberrant methylation in CRC. Like other viruses, chronic infection with JCV may induce CRC by different mechanisms which should be further investigated. Thus, gene promoter methylation induced by JCV may be an important process in CRC and the polyp-carcinoma sequence.

  17. Alternative Splicing Studies of the Reactive Oxygen Species Gene Network in Populus Reveal Two Isoforms of High-Isoelectric-Point Superoxide Dismutase1[C][W

    PubMed Central

    Srivastava, Vaibhav; Srivastava, Manoj Kumar; Chibani, Kamel; Nilsson, Robert; Rouhier, Nicolas; Melzer, Michael; Wingsle, Gunnar

    2009-01-01

    Recent evidence has shown that alternative splicing (AS) is widely involved in the regulation of gene expression, substantially extending the diversity of numerous proteins. In this study, a subset of expressed sequence tags representing members of the reactive oxygen species gene network was selected from the PopulusDB database to investigate AS mechanisms in Populus. Examples of all known types of AS were detected, but intron retention was the most common. Interestingly, the closest Arabidopsis (Arabidopsis thaliana) homologs of half of the AS genes identified in Populus are not reportedly alternatively spliced. Two genes encoding the protein of most interest in our study (high-isoelectric-point superoxide dismutase [hipI-SOD]) have been found in black cottonwood (Populus trichocarpa), designated PthipI-SODC1 and PthipI-SODC2. Analysis of the expressed sequence tag libraries has indicated the presence of two transcripts of PthipI-SODC1 (hipI-SODC1b and hipI-SODC1s). Alignment of these sequences with the PthipI-SODC1 gene showed that hipI-SODC1b was 69 bp longer than hipI-SODC1s due to an AS event involving the use of an alternative donor splice site in the sixth intron. Transcript analysis showed that the splice variant hipI-SODC1b was differentially expressed, being clearly expressed in cambial and xylem, but not phloem, regions. In addition, immunolocalization and mass spectrometric data confirmed the presence of hipI-SOD proteins in vascular tissue. The functionalities of the spliced gene products were assessed by expressing recombinant hipI-SOD proteins and in vitro SOD activity assays. PMID:19176719

  18. Alternative splicing studies of the reactive oxygen species gene network in Populus reveal two isoforms of high-isoelectric-point superoxide dismutase.

    PubMed

    Srivastava, Vaibhav; Srivastava, Manoj Kumar; Chibani, Kamel; Nilsson, Robert; Rouhier, Nicolas; Melzer, Michael; Wingsle, Gunnar

    2009-04-01

    Recent evidence has shown that alternative splicing (AS) is widely involved in the regulation of gene expression, substantially extending the diversity of numerous proteins. In this study, a subset of expressed sequence tags representing members of the reactive oxygen species gene network was selected from the PopulusDB database to investigate AS mechanisms in Populus. Examples of all known types of AS were detected, but intron retention was the most common. Interestingly, the closest Arabidopsis (Arabidopsis thaliana) homologs of half of the AS genes identified in Populus are not reportedly alternatively spliced. Two genes encoding the protein of most interest in our study (high-isoelectric-point superoxide dismutase [hipI-SOD]) have been found in black cottonwood (Populus trichocarpa), designated PthipI-SODC1 and PthipI-SODC2. Analysis of the expressed sequence tag libraries has indicated the presence of two transcripts of PthipI-SODC1 (hipI-SODC1b and hipI-SODC1s). Alignment of these sequences with the PthipI-SODC1 gene showed that hipI-SODC1b was 69 bp longer than hipI-SODC1s due to an AS event involving the use of an alternative donor splice site in the sixth intron. Transcript analysis showed that the splice variant hipI-SODC1b was differentially expressed, being clearly expressed in cambial and xylem, but not phloem, regions. In addition, immunolocalization and mass spectrometric data confirmed the presence of hipI-SOD proteins in vascular tissue. The functionalities of the spliced gene products were assessed by expressing recombinant hipI-SOD proteins and in vitro SOD activity assays.

  19. Transcriptome analysis and identification of induced genes in the response of Harmonia axyridis to cold hardiness.

    PubMed

    Tang, Bin; Liu, Xiao-Jun; Shi, Zuo-Kun; Shen, Qi-Da; Xu, Yan-Xia; Wang, Su; Zhang, Fan; Wang, Shi-Gui

    2017-06-01

    Harmonia axyridis is an important predatory lady beetle that is a natural enemy of agricultural and forestry pests. In this research, the cold hardiness induced genes and their expression changes in H. axyridis were screened and detected by the way of the transcriptome and qualitative real-time PCR under normal and low temperatures, using high-throughput transcriptome and digital gene-expression-tag technologies. We obtained a 10Gb transcriptome and an 8Mb gene expression tag pool using Illumina deep sequencing technology and RNA-Seq analysis (accession number SRX540102). Of the 46,980 non-redundant unigenes identified, 28,037 (59.7%) were matched to known genes in GenBank, 21,604 (46.0%) in Swiss-Prot, 19,482 (41.5%) in Kyoto Encyclopedia of Genes and Genomes and 13,193 (28.1%) in Gene Ontology databases. Seventy-five percent of the unigene sequences had top matches with gene sequences from Tribolium castaneum. Results indicated that 60 genes regulated the entire cold-acclimation response, and, of these, seven genes were always up-regulated and five genes always down-regulated. Further screening revealed that six cold-resistant genes, E3 ubiquitin-protein ligase, transketolase, trehalase, serine/arginine repetitive matrix protein 2, glycerol kinase and sugar transporter SWEET1-like, play key roles in the response. Expression from a number of the differentially expressed genes was confirmed with quantitative real-time PCR (HaCS_Trans). The paper attempted to identify cold-resistance response genes, and study the potential mechanism by which cold acclimation enhances the insect's cold endurance. Information on these cold-resistance response genes will improve the development of low-temperature storage technology of natural enemy insects for future use in biological control. Copyright © 2017 Elsevier Inc. All rights reserved.

  20. Crosstalk between mTORC1 and cAMP Signaling

    DTIC Science & Technology

    2014-07-01

    based genome editing to endogenously tag the V1 subunit and introduce point mutations (T175A; phospho-defective and T175D; phospho-mimetic). By...analysis ap- proach, and the other screened small GTPases using RNAi in Drosophila cells [41,48]. There are four Rag proteins in mammals: RagA and RagB (!98...at the lysosome The Rag proteins lack membrane-targeting sequences , unlike other typical small GTPases such as Rheb. Thus, the Rag–mTORC1 complex is

  1. Production of human antimicrobial peptide LL-37 in Escherichia coli using a thioredoxin-SUMO dual fusion system.

    PubMed

    Li, Yifeng

    2013-02-01

    LL-37 is a human antimicrobial peptide that has been shown to possess multiple functions in host defense. In this report, the peptide was expressed as a fusion with a thioredoxin-SUMO dual-tag. Upon SUMO protease mediated cleavage at the SUMO/peptide junction, LL-37 with its native N-terminus was generated. The released peptide was separated from the dual-tag and cleavage enzyme by size-exclusion chromatography. Mass spectrometry analysis proves that the recombinant peptide has a molecular weight as theoretically expected for its native form. The produced peptide displayed antimicrobial activity against Escherichia coli K-12. On average, 2.4 mg peptide was obtained from one liter of bacterial culture. Thus, the described approach provides an effective alternative for producing active recombinant LL-37 with its natural amino acid sequence in E. coli. Copyright © 2012 Elsevier Inc. All rights reserved.

  2. Using Next Generation Sequencing for Multiplexed Trait-Linked Markers in Wheat

    PubMed Central

    Bernardo, Amy; Wang, Shan; St. Amand, Paul; Bai, Guihua

    2015-01-01

    With the advent of next generation sequencing (NGS) technologies, single nucleotide polymorphisms (SNPs) have become the major type of marker for genotyping in many crops. However, the availability of SNP markers for important traits of bread wheat ( Triticum aestivum L.) that can be effectively used in marker-assisted selection (MAS) is still limited and SNP assays for MAS are usually uniplex. A shift from uniplex to multiplex assays will allow the simultaneous analysis of multiple markers and increase MAS efficiency. We designed 33 locus-specific markers from SNP or indel-based marker sequences that linked to 20 different quantitative trait loci (QTL) or genes of agronomic importance in wheat and analyzed the amplicon sequences using an Ion Torrent Proton Sequencer and a custom allele detection pipeline to determine the genotypes of 24 selected germplasm accessions. Among the 33 markers, 27 were successfully multiplexed and 23 had 100% SNP call rates. Results from analysis of "kompetitive allele-specific PCR" (KASP) and sequence tagged site (STS) markers developed from the same loci fully verified the genotype calls of 23 markers. The NGS-based multiplexed assay developed in this study is suitable for rapid and high-throughput screening of SNPs and some indel-based markers in wheat. PMID:26625271

  3. De Novo Transcriptome Sequencing Analysis of cDNA Library and Large-Scale Unigene Assembly in Japanese Red Pine (Pinus densiflora)

    PubMed Central

    Liu, Le; Zhang, Shijie; Lian, Chunlan

    2015-01-01

    Japanese red pine (Pinus densiflora) is extensively cultivated in Japan, Korea, China, and Russia and is harvested for timber, pulpwood, garden, and paper markets. However, genetic information and molecular markers were very scarce for this species. In this study, over 51 million sequencing clean reads from P. densiflora mRNA were produced using Illumina paired-end sequencing technology. It yielded 83,913 unigenes with a mean length of 751 bp, of which 54,530 (64.98%) unigenes showed similarity to sequences in the NCBI database. Among which the best matches in the NCBI Nr database were Picea sitchensis (41.60%), Amborella trichopoda (9.83%), and Pinus taeda (4.15%). A total of 1953 putative microsatellites were identified in 1784 unigenes using MISA (MicroSAtellite) software, of which the tri-nucleotide repeats were most abundant (50.18%) and 629 EST-SSR (expressed sequence tag- simple sequence repeats) primer pairs were successfully designed. Among 20 EST-SSR primer pairs randomly chosen, 17 markers yielded amplification products of the expected size in P. densiflora. Our results will provide a valuable resource for gene-function analysis, germplasm identification, molecular marker-assisted breeding and resistance-related gene(s) mapping for pine for P. densiflora. PMID:26690126

  4. Analysis of expressed sequence tags (ESTs) from cocoa (Theobroma cacao L) upon infection with Phytophthora megakarya.

    PubMed

    Naganeeswaran, Sudalaimuthu Asari; Subbian, Elain Apshara; Ramaswamy, Manimekalai

    2012-01-01

    Phytophthora megakarya, the causative agent of cacao black pod disease in West African countries causes an extensive loss of yield. In this study we have analyzed 4 libraries of ESTs derived from Phytophthora megakarya infected cocoa leaf and pod tissues. Totally 6379 redundant sequences were retrieved from ESTtik database and EST processing was performed using seqclean tool. Clustering and assembling using CAP3 generated 3333 non-redundant (907 contigs and 2426 singletons) sequences. The primary sequence analysis of 3333 non-redundant sequences showed that the GC percentage was 42.7 and the sequence length ranged from 101 - 2576 nucleotides. Further, functional analysis (Blast, Interproscan, Gene ontology and KEGG search) were executed and 1230 orthologous genes were annotated. Totally 272 enzymes corresponding to 114 metabolic pathways were identified. Functional annotation revealed that most of the sequences are related to molecular function, stress response and biological processes. The annotated enzymes are aldehyde dehydrogenase (E.C: 1.2.1.3), catalase (E.C: 1.11.1.6), acetyl-CoA C-acetyltransferase (E.C: 2.3.1.9), threonine ammonia-lyase (E.C: 4.3.1.19), acetolactate synthase (E.C: 2.2.1.6), O-methyltransferase (E.C: 2.1.1.68) which play an important role in amino acid biosynthesis and phenyl propanoid biosynthesis. All this information was stored in MySQL database management system to be used in future for reconstruction of biotic stress response pathway in cocoa.

  5. GrigoraSNPs: Optimized Analysis of SNPs for DNA Forensics.

    PubMed

    Ricke, Darrell O; Shcherbina, Anna; Michaleas, Adam; Fremont-Smith, Philip

    2018-04-16

    High-throughput sequencing (HTS) of single nucleotide polymorphisms (SNPs) enables additional DNA forensic capabilities not attainable using traditional STR panels. However, the inclusion of sets of loci selected for mixture analysis, extended kinship, phenotype, biogeographic ancestry prediction, etc., can result in large panel sizes that are difficult to analyze in a rapid fashion. GrigoraSNP was developed to address the allele-calling bottleneck that was encountered when analyzing SNP panels with more than 5000 loci using HTS. GrigoraSNPs uses a MapReduce parallel data processing on multiple computational threads plus a novel locus-identification hashing strategy leveraging target sequence tags. This tool optimizes the SNP calling module of the DNA analysis pipeline with runtimes that scale linearly with the number of HTS reads. Results are compared with SNP analysis pipelines implemented with SAMtools and GATK. GrigoraSNPs removes a computational bottleneck for processing forensic samples with large HTS SNP panels. Published 2018. This article is a U.S. Government work and is in the public domain in the USA.

  6. The First Molecular Identification of an Olive Collection Applying Standard Simple Sequence Repeats and Novel Expressed Sequence Tag Markers.

    PubMed

    Mousavi, Soraya; Mariotti, Roberto; Regni, Luca; Nasini, Luigi; Bufacchi, Marina; Pandolfi, Saverio; Baldoni, Luciana; Proietti, Primo

    2017-01-01

    Germplasm collections of tree crop species represent fundamental tools for conservation of diversity and key steps for its characterization and evaluation. For the olive tree, several collections were created all over the world, but only few of them have been fully characterized and molecularly identified. The olive collection of Perugia University (UNIPG), established in the years' 60, represents one of the first attempts to gather and safeguard olive diversity, keeping together cultivars from different countries. In the present study, a set of 370 olive trees previously uncharacterized was screened with 10 standard simple sequence repeats (SSRs) and nine new EST-SSR markers, to correctly and thoroughly identify all genotypes, verify their representativeness of the entire cultivated olive variation, and validate the effectiveness of new markers in comparison to standard genotyping tools. The SSR analysis revealed the presence of 59 genotypes, corresponding to 72 well known cultivars, 13 of them resulting exclusively present in this collection. The new EST-SSRs have shown values of diversity parameters quite similar to those of best standard SSRs. When compared to hundreds of Mediterranean cultivars, the UNIPG olive accessions were splitted into the three main populations (East, Center and West Mediterranean), confirming that the collection has a good representativeness of the entire olive variability. Furthermore, Bayesian analysis, performed on the 59 genotypes of the collection by the use of both sets of markers, have demonstrated their splitting into four clusters, with a well balanced membership obtained by EST respect to standard SSRs. The new OLEST ( Olea expressed sequence tags) SSR markers resulted as effective as the best standard markers. The information obtained from this study represents a high valuable tool for ex situ conservation and management of olive genetic resources, useful to build a common database from worldwide olive cultivar collections, also based on recently developed markers.

  7. Mass fingerprinting of the venom and transcriptome of venom gland of scorpion Centruroides tecomanus.

    PubMed

    Valdez-Velázquez, Laura L; Quintero-Hernández, Verónica; Romero-Gutiérrez, Maria Teresa; Coronas, Fredy I V; Possani, Lourival D

    2013-01-01

    Centruroides tecomanus is a Mexican scorpion endemic of the State of Colima, that causes human fatalities. This communication describes a proteome analysis obtained from milked venom and a transcriptome analysis from a cDNA library constructed from two pairs of venom glands of this scorpion. High perfomance liquid chromatography separation of soluble venom produced 80 fractions, from which at least 104 individual components were identified by mass spectrometry analysis, showing to contain molecular masses from 259 to 44,392 Da. Most of these components are within the expected molecular masses for Na(+)- and K(+)-channel specific toxic peptides, supporting the clinical findings of intoxication, when humans are stung by this scorpion. From the cDNA library 162 clones were randomly chosen, from which 130 sequences of good quality were identified and were clustered in 28 contigs containing, each, two or more expressed sequence tags (EST) and 49 singlets with only one EST. Deduced amino acid sequence analysis from 53% of the total ESTs showed that 81% (24 sequences) are similar to known toxic peptides that affect Na(+)-channel activity, and 19% (7 unique sequences) are similar to K(+)-channel especific toxins. Out of the 31 sequences, at least 8 peptides were confirmed by direct Edman degradation, using components isolated directly from the venom. The remaining 19%, 4%, 4%, 15% and 5% of the ESTs correspond respectively to proteins involved in cellular processes, antimicrobial peptides, venom components, proteins without defined function and sequences without similarity in databases. Among the cloned genes are those similar to metalloproteinases.

  8. Generation and analysis of expressed sequence tags in the extreme large genomes Lilium and Tulipa.

    PubMed

    Shahin, Arwa; van Kaauwen, Martijn; Esselink, Danny; Bargsten, Joachim W; van Tuyl, Jaap M; Visser, Richard G F; Arens, Paul

    2012-11-20

    Bulbous flowers such as lily and tulip (Liliaceae family) are monocot perennial herbs that are economically very important ornamental plants worldwide. However, there are hardly any genetic studies performed and genomic resources are lacking. To build genomic resources and develop tools to speed up the breeding in both crops, next generation sequencing was implemented. We sequenced and assembled transcriptomes of four lily and five tulip genotypes using 454 pyro-sequencing technology. Successfully, we developed the first set of 81,791 contigs with an average length of 514 bp for tulip, and enriched the very limited number of 3,329 available ESTs (Expressed Sequence Tags) for lily with 52,172 contigs with an average length of 555 bp. The contigs together with singletons covered on average 37% of lily and 39% of tulip estimated transcriptome. Mining lily and tulip sequence data for SSRs (Simple Sequence Repeats) showed that di-nucleotide repeats were twice more abundant in UTRs (UnTranslated Regions) compared to coding regions, while tri-nucleotide repeats were equally spread over coding and UTR regions. Two sets of single nucleotide polymorphism (SNP) markers suitable for high throughput genotyping were developed. In the first set, no SNPs flanking the target SNP (50 bp on either side) were allowed. In the second set, one SNP in the flanking regions was allowed, which resulted in a 2 to 3 fold increase in SNP marker numbers compared with the first set. Orthologous groups between the two flower bulbs: lily and tulip (12,017 groups) and among the three monocot species: lily, tulip, and rice (6,900 groups) were determined using OrthoMCL. Orthologous groups were screened for common SNP markers and EST-SSRs to study synteny between lily and tulip, which resulted in 113 common SNP markers and 292 common EST-SSR. Lily and tulip contigs generated were annotated and described according to Gene Ontology terminology. Two transcriptome sets were built that are valuable resources for marker development, comparative genomic studies and candidate gene approaches. Next generation sequencing of leaf transcriptome is very effective; however, deeper sequencing and using more tissues and stages is advisable for extended comparative studies.

  9. Bioinformatics and reanalysis of subtracted expressed sequence tags from the human ciliary body: Identification of novel biological functions.

    PubMed

    Escribano, Julio; Coca-Prados, Miguel

    2002-08-28

    The ciliary body is largely known for its major roles in the regulation of aqueous humor secretion, intraocular pressure, and accommodation of the lens. In this review article we applied bioinformatics to re-examine hundreds of expressed sequence tags (ESTs) previously isolated by subtractive hybridization from a human ciliary body library [1]. The DNA sequences of these clones have been recently added to the web site of NEIBank. DNA sequence comparisons of subtracted ESTs were performed against all entries in the last available release of the non-redundant database containing GenBank, EMBL, DDBJ and PDB sequences using the BlastN program accessed through NCBI's BLAST services on the internet (NCBI). Sequences were also compared and mapped using the Blast search program provided through the Internet by the Human Genome Project (UCSC). A total number of 284 independent ESTs were classified in 17 functional groups. Analysis of their relationships allowed to define the expression of five major groups of known genes: (i) protein synthesis, folding, secretion and degradation (20%); (ii) energy supply and biosynthesis (12%); (iii) contractility and cytoskeleton structure (6%); (iv) cellular signaling and cell cycle regulation (7%); and (v) nerve cell related tasks (2%), including neuropeptide processing and putative non-visual phototransduction and circadian rhythm control. The largest group contain unidentified sequences, a total of 105 sequences, accounting for 37% of ESTs. The unidentified sequences show similarity to genomic non-coding regions, or genes of unknown function. The most highly represented EST, correspond to myocilin, a gene involved in glaucoma. The data also confirms the secretory functions of the ciliary epithelium, and its high metabolism; the presence of a neuroendocrine peptidergic system presumably involved in the regulation of the intraocular pressure and/or aqueous humor secretion. Additional genes may be related to a non-visual phototransduction cascade and/or to circadian rhythms. Overall this initial group of subtracted ESTs can lead to uncover novel physiological functions of the ciliary body in normal and in disease, as well as novel candidate genes for ocular diseases.

  10. Association study of IL10, IL1beta, and IL1RN and schizophrenia using tag SNPs from a comprehensive database: suggestive association with rs16944 at IL1beta.

    PubMed

    Shirts, Brian H; Wood, Joel; Yolken, Robert H; Nimgaonkar, Vishwajit L

    2006-12-01

    Genetic association studies of several candidate cytokine genes have been motivated by evidence of immune dysfunction among patients with schizophrenia. Intriguing but inconsistent associations have been reported with polymorphisms of three positional candidate genes, namely IL1beta, IL1RN, and IL10. We used comprehensive sequencing data from the Seattle SNPs database to select tag SNPs that represent all common polymorphisms in the Caucasian population at these loci. Associations with 28 tag SNPs were evaluated in 478 cases and 501 unscreened control individuals, while accounting for population sub-structure using the genomic control method. The samples were also stratified by gender, diagnostic category, and exposure to infectious agents. Significant association was not detected after correcting for multiple comparisons. However, meta-analysis of our data combined with previously published association studies of rs16944 (IL1beta -511) suggests that the C allele confers modest risk for schizophrenia among individuals reporting Caucasian ancestry, but not Asians (Caucasians, n=819 cases, 1292 controls; p=0.0013, OR=1.24, 95% CI 1.09, 1.41).

  11. Expression of recombinant CD59 with an N-terminal peptide epitope facilitates analysis of residues contributing to its complement-inhibitory function.

    PubMed

    Zhou, Q; Zhao, J; Hüsler, T; Sims, P J

    1996-10-01

    CD59 is a plasma membrane-anchored glycoprotein that serves to protect human cells from lysis by the C5b-9 complex of complement. The immunodominant epitopes of CD59 are known to be sensitive to disruption of native tertiary structure, complicating immunological measurement of expressed mutant constructs for structure function analysis. In order to quantify cell-surface expression of wild-type and mutant forms of this complement inhibitor, independent of CD59 antigen, an 11-residue peptide (TAG) recognized by monoclonal antibody (mAb) 9E10 was inserted before the N-terminal codon (L1) of mature CD59, in a pcDNA3 expression plasmid. SV-T2 cells were transfected with this plasmid, yielding cell lines expressing 0 to > 10(5) CD59/cell. The TAG-CD59 fusion protein was confirmed to be GPI-anchored, N-glycosylated and showed identical complement-inhibitory function to wild-type CD59, lacking the TAG peptide sequence. Using this construct, the contribution of each of four surface-localized aromatic residues (4Y, 47F, 61Y, and 62Y) to CD59's complement-inhibitory function was examined. These assays revealed normal surface expression with complete loss of complement-inhibitory function in the 4Y --> S, 47F --> G and 61Y --> S mutants. By contrast, 62Y --> S mutants retained approximately 40% of function of wild-type CD59. These studies confirmed the utility of the TAG-CD59 construct for quantifying CD59 surface expression and activity, and implicate surface aromatic residues 4Y, 47F, 61Y and 62Y as essential to maintenance of CD59's normal complement-regulatory function.

  12. Yeast Inner-Subunit PA-NZ-1 Labeling Strategy for Accurate Subunit Identification in a Macromolecular Complex through Cryo-EM Analysis.

    PubMed

    Wang, Huping; Han, Wenyu; Takagi, Junichi; Cong, Yao

    2018-05-11

    Cryo-electron microscopy (cryo-EM) has been established as one of the central tools in the structural study of macromolecular complexes. Although intermediate- or low-resolution structural information through negative staining or cryo-EM analysis remains highly valuable, we lack general and efficient ways to achieve unambiguous subunit identification in these applications. Here, we took advantage of the extremely high affinity between a dodecapeptide "PA" tag and the NZ-1 antibody Fab fragment to develop an efficient "yeast inner-subunit PA-NZ-1 labeling" strategy that when combined with cryo-EM could precisely identify subunits in macromolecular complexes. Using this strategy combined with cryo-EM 3D reconstruction, we were able to visualize the characteristic NZ-1 Fab density attached to the PA tag inserted into a surface-exposed loop in the middle of the sequence of CCT6 subunit present in the Saccharomyces cerevisiae group II chaperonin TRiC/CCT. This procedure facilitated the unambiguous localization of CCT6 in the TRiC complex. The PA tag was designed to contain only 12 amino acids and a tight turn configuration; when inserted into a loop, it usually has a high chance of maintaining the epitope structure and low likelihood of perturbing the native structure and function of the target protein compared to other tagging systems. We also found that the association between PA and NZ-1 can sustain the cryo freezing conditions, resulting in very high occupancy of the Fab in the final cryo-EM images. Our study demonstrated the robustness of this strategy combined with cryo-EM in efficient and accurate subunit identification in challenging multi-component complexes. Copyright © 2018 Elsevier Ltd. All rights reserved.

  13. Understanding why users tag: A survey of tagging motivation literature and results from an empirical study.

    PubMed

    Strohmaier, Markus; Körner, Christian; Kern, Roman

    2012-12-01

    While recent progress has been achieved in understanding the structure and dynamics of social tagging systems, we know little about the underlying user motivations for tagging, and how they influence resulting folksonomies and tags. This paper addresses three issues related to this question. (1) What distinctions of user motivations are identified by previous research, and in what ways are the motivations of users amenable to quantitative analysis? (2) To what extent does tagging motivation vary across different social tagging systems? (3) How does variability in user motivation influence resulting tags and folksonomies? In this paper, we present measures to detect whether a tagger is primarily motivated by categorizing or describing resources, and apply these measures to datasets from seven different tagging systems. Our results show that (a) users' motivation for tagging varies not only across, but also within tagging systems, and that (b) tag agreement among users who are motivated by categorizing resources is significantly lower than among users who are motivated by describing resources . Our findings are relevant for (1) the development of tag-based user interfaces, (2) the analysis of tag semantics and (3) the design of search algorithms for social tagging systems.

  14. Understanding why users tag: A survey of tagging motivation literature and results from an empirical study

    PubMed Central

    Strohmaier, Markus; Körner, Christian; Kern, Roman

    2012-01-01

    While recent progress has been achieved in understanding the structure and dynamics of social tagging systems, we know little about the underlying user motivations for tagging, and how they influence resulting folksonomies and tags. This paper addresses three issues related to this question. (1) What distinctions of user motivations are identified by previous research, and in what ways are the motivations of users amenable to quantitative analysis? (2) To what extent does tagging motivation vary across different social tagging systems? (3) How does variability in user motivation influence resulting tags and folksonomies? In this paper, we present measures to detect whether a tagger is primarily motivated by categorizing or describing resources, and apply these measures to datasets from seven different tagging systems. Our results show that (a) users’ motivation for tagging varies not only across, but also within tagging systems, and that (b) tag agreement among users who are motivated by categorizing resources is significantly lower than among users who are motivated by describing resources. Our findings are relevant for (1) the development of tag-based user interfaces, (2) the analysis of tag semantics and (3) the design of search algorithms for social tagging systems. PMID:23471473

  15. Single-step affinity and cost-effective purification of recombinant proteins using the Sepharose-binding lectin-tag from the mushroom Laetiporus sulphureus as fusion partner.

    PubMed

    Li, Xiao-Jing; Liu, Jin-Ling; Gao, Dong-Sheng; Wan, Wen-Yan; Yang, Xia; Li, Yong-Tao; Chang, Hong-Tao; Chen, Lu; Wang, Chuan-Qing; Zhao, Jun

    2016-03-01

    Previous research showed that a lectin from the mushroom Laetiporus sulphureus, designed LSL, bound to Sepharose and could be eluted by lactose. In this study, by taking advantage of the strong affinity of LSL-tag for Sepharose, we developed a single-step purification method for LSL-tagged fusion proteins. We utilized unmodified Sepharose-4B as a specific adsorbent and 0.2 M lactose solution as an elution buffer. Fusion proteins of LSL-tag and porcine circovirus capsid protein, designated LSL-Cap was recovered with purity of 90 ± 4%, and yield of 87 ± 3% from crude extract of recombinant Escherichia coli. To enable the remove of LSL-tag, tobacco etch virus (TEV) protease recognition sequence was placed downstream of LSL-tag in the expression vector, and LSL-tagged TEV protease, designated LSL-TEV, was also expressed in E. coli., and was recovered with purity of 82 ± 5%, and yield of 85 ± 2% from crude extract of recombinant E. coli. After digestion of LSL-tagged recombinant proteins with LSL-TEV, the LSL tag and LSL-TEV can be easily removed by passing the digested products through the Sepharose column. It is of worthy noting that the Sepharose can be reused after washing with PBS. The LSL affinity purification method enables rapid and inexpensive purification of LSL-tagged fusion proteins and scale-up production of native proteins. Copyright © 2015 Elsevier Inc. All rights reserved.

  16. Monitoring Arthrobacter protophormiae RKJ100 in a 'tag and chase' method during p-nitrophenol bio-remediation in soil microcosms.

    PubMed

    Pandey, Gunjan; Pandey, Janmejay; Jain, Rakesh K

    2006-05-01

    Monitoring of micro-organisms released deliberately into the environment is essential to assess their movement during the bio-remediation process. During the last few years, DNA-based genetic methods have emerged as the preferred method for such monitoring; however, their use is restricted in cases where organisms used for bio-remediation are not well characterized or where the public domain databases do not provide sufficient information regarding their sequence. For monitoring of such micro-organisms, alternate approaches have to be undertaken. In this study, we have specifically monitored a p-nitrophenol (PNP)-degrading organism, Arthrobacter protophormiae RKJ100, using molecular methods during PNP degradation in soil microcosm. Cells were tagged with a transposon-based foreign DNA sequence prior to their introduction into PNP-contaminated microcosms. Later, this artificially introduced DNA sequence was PCR-amplified to distinguish the bio-augmented organism from the indigenous microflora during PNP bio-remediation.

  17. Distorted Immunodominance by Linker Sequences or other Epitopes from a Second Protein Antigen During Antigen-Processing

    PubMed Central

    Kim, AeRyon; Boronina, Tatiana N.; Cole, Robert N.; Darrah, Erika; Sadegh-Nasseri, Scheherazade

    2017-01-01

    The immune system focuses on and responds to very few representative immunodominant epitopes from pathogenic insults. However, due to the complexity of the antigen processing, understanding the parameters that lead to immunodominance has proved difficult. In an attempt to uncover the determinants of immunodominance among several dominant epitopes, we utilized a cell free antigen processing system and allowed the system to identify the hierarchies among potential determinants. We then tested the results in vivo; in mice and in human. We report here, that immunodominance of known sequences in a given protein can change if two or more proteins are being processed and presented simultaneously. Surprisingly, we find that new spacer/tag sequences commonly added to proteins for purification purposes can distort the capture of the physiological immunodominant epitopes. We warn against adding tags and spacers to candidate vaccines, or recommend cleaving it off before using for vaccination. PMID:28422163

  18. VariantBam: filtering and profiling of next-generational sequencing data using region-specific rules.

    PubMed

    Wala, Jeremiah; Zhang, Cheng-Zhong; Meyerson, Matthew; Beroukhim, Rameen

    2016-07-01

    We developed VariantBam, a C ++ read filtering and profiling tool for use with BAM, CRAM and SAM sequencing files. VariantBam provides a flexible framework for extracting sequencing reads or read-pairs that satisfy combinations of rules, defined by any number of genomic intervals or variant sites. We have implemented filters based on alignment data, sequence motifs, regional coverage and base quality. For example, VariantBam achieved a median size reduction ratio of 3.1:1 when applied to 10 lung cancer whole genome BAMs by removing large tags and selecting for only high-quality variant-supporting reads and reads matching a large dictionary of sequence motifs. Thus VariantBam enables efficient storage of sequencing data while preserving the most relevant information for downstream analysis. VariantBam and full documentation are available at github.com/jwalabroad/VariantBam rameen@broadinstitute.org Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  19. Identification of tissue-specific, abiotic stress-responsive gene expression patterns in wine grape (Vitis vinifera L.) based on curation and mining of large-scale EST data sets

    PubMed Central

    2011-01-01

    Background Abiotic stresses, such as water deficit and soil salinity, result in changes in physiology, nutrient use, and vegetative growth in vines, and ultimately, yield and flavor in berries of wine grape, Vitis vinifera L. Large-scale expressed sequence tags (ESTs) were generated, curated, and analyzed to identify major genetic determinants responsible for stress-adaptive responses. Although roots serve as the first site of perception and/or injury for many types of abiotic stress, EST sequencing in root tissues of wine grape exposed to abiotic stresses has been extremely limited to date. To overcome this limitation, large-scale EST sequencing was conducted from root tissues exposed to multiple abiotic stresses. Results A total of 62,236 expressed sequence tags (ESTs) were generated from leaf, berry, and root tissues from vines subjected to abiotic stresses and compared with 32,286 ESTs sequenced from 20 public cDNA libraries. Curation to correct annotation errors, clustering and assembly of the berry and leaf ESTs with currently available V. vinifera full-length transcripts and ESTs yielded a total of 13,278 unique sequences, with 2302 singletons and 10,976 mapped to V. vinifera gene models. Of these, 739 transcripts were found to have significant differential expression in stressed leaves and berries including 250 genes not described previously as being abiotic stress responsive. In a second analysis of 16,452 ESTs from a normalized root cDNA library derived from roots exposed to multiple, short-term, abiotic stresses, 135 genes with root-enriched expression patterns were identified on the basis of their relative EST abundance in roots relative to other tissues. Conclusions The large-scale analysis of relative EST frequency counts among a diverse collection of 23 different cDNA libraries from leaf, berry, and root tissues of wine grape exposed to a variety of abiotic stress conditions revealed distinct, tissue-specific expression patterns, previously unrecognized stress-induced genes, and many novel genes with root-enriched mRNA expression for improving our understanding of root biology and manipulation of rootstock traits in wine grape. mRNA abundance estimates based on EST library-enriched expression patterns showed only modest correlations between microarray and quantitative, real-time reverse transcription-polymerase chain reaction (qRT-PCR) methods highlighting the need for deep-sequencing expression profiling methods. PMID:21592389

  20. Identification and validation of Asteraceae miRNAs by the expressed sequence tag analysis.

    PubMed

    Monavar Feshani, Aboozar; Mohammadi, Saeed; Frazier, Taylor P; Abbasi, Abbas; Abedini, Raha; Karimi Farsad, Laleh; Ehya, Farveh; Salekdeh, Ghasem Hosseini; Mardi, Mohsen

    2012-02-10

    MicroRNAs (miRNAs) are small non-coding RNA molecules that play a vital role in the regulation of gene expression. Despite their identification in hundreds of plant species, few miRNAs have been identified in the Asteraceae, a large family that comprises approximately one tenth of all flowering plants. In this study, we used the expressed sequence tag (EST) analysis to identify potential conserved miRNAs and their putative target genes in the Asteraceae. We applied quantitative Real-Time PCR (qRT-PCR) to confirm the expression of eight potential miRNAs in Carthamus tinctorius and Helianthus annuus. We also performed qRT-PCR analysis to investigate the differential expression pattern of five newly identified miRNAs during five different cotyledon growth stages in safflower. Using these methods, we successfully identified and characterized 151 potentially conserved miRNAs, belonging to 26 miRNA families, in 11 genus of Asteraceae. EST analysis predicted that the newly identified conserved Asteraceae miRNAs target 130 total protein-coding ESTs in sunflower and safflower, as well as 433 additional target genes in other plant species. We experimentally confirmed the existence of seven predicted miRNAs, (miR156, miR159, miR160, miR162, miR166, miR396, and miR398) in safflower and sunflower seedlings. We also observed that five out of eight miRNAs are differentially expressed during cotyledon development. Our results indicate that miRNAs may be involved in the regulation of gene expression during seed germination and the formation of the cotyledons in the Asteraceae. The findings of this study might ultimately help in the understanding of miRNA-mediated gene regulation in important crop species. Copyright © 2011 Elsevier B.V. All rights reserved.

  1. Analyses of Expressed Sequence Tags from Apple1

    PubMed Central

    Newcomb, Richard D.; Crowhurst, Ross N.; Gleave, Andrew P.; Rikkerink, Erik H.A.; Allan, Andrew C.; Beuning, Lesley L.; Bowen, Judith H.; Gera, Emma; Jamieson, Kim R.; Janssen, Bart J.; Laing, William A.; McArtney, Steve; Nain, Bhawana; Ross, Gavin S.; Snowden, Kimberley C.; Souleyre, Edwige J.F.; Walton, Eric F.; Yauk, Yar-Khing

    2006-01-01

    The domestic apple (Malus domestica; also known as Malus pumila Mill.) has become a model fruit crop in which to study commercial traits such as disease and pest resistance, grafting, and flavor and health compound biosynthesis. To speed the discovery of genes involved in these traits, develop markers to map genes, and breed new cultivars, we have produced a substantial expressed sequence tag collection from various tissues of apple, focusing on fruit tissues of the cultivar Royal Gala. Over 150,000 expressed sequence tags have been collected from 43 different cDNA libraries representing 34 different tissues and treatments. Clustering of these sequences results in a set of 42,938 nonredundant sequences comprising 17,460 tentative contigs and 25,478 singletons, together representing what we predict are approximately one-half the expressed genes from apple. Many potential molecular markers are abundant in the apple transcripts. Dinucleotide repeats are found in 4,018 nonredundant sequences, mainly in the 5′-untranslated region of the gene, with a bias toward one repeat type (containing AG, 88%) and against another (repeats containing CG, 0.1%). Trinucleotide repeats are most common in the predicted coding regions and do not show a similar degree of sequence bias in their representation. Bi-allelic single-nucleotide polymorphisms are highly abundant with one found, on average, every 706 bp of transcribed DNA. Predictions of the numbers of representatives from protein families indicate the presence of many genes involved in disease resistance and the biosynthesis of flavor and health-associated compounds. Comparisons of some of these gene families with Arabidopsis (Arabidopsis thaliana) suggest instances where there have been duplications in the lineages leading to apple of biosynthetic and regulatory genes that are expressed in fruit. This resource paves the way for a concerted functional genomics effort in this important temperate fruit crop. PMID:16531485

  2. Comparison of direct boiling method with commercial kits for extracting fecal microbiome DNA by Illumina sequencing of 16S rRNA tags.

    PubMed

    Peng, Xin; Yu, Ke-Qiang; Deng, Guan-Hua; Jiang, Yun-Xia; Wang, Yu; Zhang, Guo-Xia; Zhou, Hong-Wei

    2013-12-01

    Low cost and high throughput capacity are major advantages of using next generation sequencing (NGS) techniques to determine metagenomic 16S rRNA tag sequences. These methods have significantly changed our view of microorganisms in the fields of human health and environmental science. However, DNA extraction using commercial kits has shortcomings of high cost and time constraint. In the present study, we evaluated the determination of fecal microbiomes using a direct boiling method compared with 5 different commercial extraction methods, e.g., Qiagen and MO BIO kits. Principal coordinate analysis (PCoA) using UniFrac distances and clustering showed that direct boiling of a wide range of feces concentrations gave a similar pattern of bacterial communities as those obtained from most of the commercial kits, with the exception of the MO BIO method. Fecal concentration by boiling method affected the estimation of α-diversity indices, otherwise results were generally comparable between boiling and commercial methods. The operational taxonomic units (OTUs) determined through direct boiling showed highly consistent frequencies with those determined through most of the commercial methods. Even those for the MO BIO kit were also obtained by the direct boiling method with high confidence. The present study suggested that direct boiling could be used to determine the fecal microbiome and using this method would significantly reduce the cost and improve the efficiency of the sample preparation for studying gut microbiome diversity. © 2013 Elsevier B.V. All rights reserved.

  3. An efficient procedure for the expression and purification of HIV-1 protease from inclusion bodies.

    PubMed

    Nguyen, Hong-Loan Thi; Nguyen, Thuy Thi; Vu, Quy Thi; Le, Hang Thi; Pham, Yen; Trinh, Phuong Le; Bui, Thuan Phuong; Phan, Tuan-Nghia

    2015-12-01

    Several studies have focused on HIV-1 protease for developing drugs for treating AIDS. Recombinant HIV-1 protease is used to screen new drugs from synthetic compounds or natural substances. However, large-scale expression and purification of this enzyme is difficult mainly because of its low expression and solubility. In this study, we constructed 9 recombinant plasmids containing a sequence encoding HIV-1 protease along with different fusion tags and examined the expression of the enzyme from these plasmids. Of the 9 plasmids, pET32a(+) plasmid containing the HIV-1 protease-encoding sequence along with sequences encoding an autocleavage site GTVSFNF at the N-terminus and TEV plus 6× His tag at the C-terminus showed the highest expression of the enzyme and was selected for further analysis. The recombinant protein was isolated from inclusion bodies by using 2 tandem Q- and Ni-Sepharose columns. SDS-PAGE of the obtained HIV-1 protease produced a single band of approximately 13 kDa. The enzyme was recovered efficiently (4 mg protein/L of cell culture) and had high specific activity of 1190 nmol min(-1) mg(-1) at an optimal pH of 4.7 and optimal temperature of 37 °C. This procedure for expressing and purifying HIV-1 protease is now being scaled up to produce the enzyme on a large scale for its application. Copyright © 2015 Elsevier Inc. All rights reserved.

  4. The bovine lactation genome: Insights into the evolution of mammalian milk

    USDA-ARS?s Scientific Manuscript database

    The newly assembled Bos Taurus genome sequence enables the linkage of bovine milk and lactation data with other mammalian genomes. Using publicly available milk proteome data and mammary expressed sequence tags, 197 milk protein genes and over 6,000 mammary genes were identified in the bovine genome...

  5. Differential transferability of EST-SSR primers developed from diploid species Pseudoroegneria spicata, Thinopyrum bessarabicum, and Th. elongatum

    USDA-ARS?s Scientific Manuscript database

    Simple sequence repeat technology based on expressed sequence tag (EST-SSR) is a useful genomic tool for genome mapping, characterizing plant species relationships, elucidating genome evolution, and tracing genes on alien chromosome segments. EST-SSR primers developed from three perennial diploid T...

  6. Comparative analysis of viral RNA signatures on different RIG-I-like receptors

    PubMed Central

    Sanchez David, Raul Y; Combredet, Chantal; Sismeiro, Odile; Dillies, Marie-Agnès; Jagla, Bernd; Coppée, Jean-Yves; Mura, Marie; Guerbois Galla, Mathilde; Despres, Philippe; Tangy, Frédéric; Komarova, Anastassia V

    2016-01-01

    The RIG-I-like receptors (RLRs) play a major role in sensing RNA virus infection to initiate and modulate antiviral immunity. They interact with particular viral RNAs, most of them being still unknown. To decipher the viral RNA signature on RLRs during viral infection, we tagged RLRs (RIG-I, MDA5, LGP2) and applied tagged protein affinity purification followed by next-generation sequencing (NGS) of associated RNA molecules. Two viruses with negative- and positive-sense RNA genome were used: measles (MV) and chikungunya (CHIKV). NGS analysis revealed that distinct regions of MV genome were specifically recognized by distinct RLRs: RIG-I recognized defective interfering genomes, whereas MDA5 and LGP2 specifically bound MV nucleoprotein-coding region. During CHIKV infection, RIG-I associated specifically to the 3’ untranslated region of viral genome. This study provides the first comparative view of the viral RNA ligands for RIG-I, MDA5 and LGP2 in the presence of infection. DOI: http://dx.doi.org/10.7554/eLife.11275.001 PMID:27011352

  7. SAGE analysis of early oogenesis in the silkworm, Bombyx mori.

    PubMed

    Funaguma, Shunsuke; Hashimoto, Shin-ichi; Suzuki, Yutaka; Omuro, Naoko; Sugano, Sumio; Mita, Kazuei; Katsuma, Susumu; Shimada, Toru

    2007-02-01

    To identify genes involved in the differentiation of Bombyx cystoblast, we constructed two 3' long serial analysis of gene expression (Long SAGE) libraries from stage 1-3 or stage 2-3 egg chambers and compared their gene expression profiles. In both libraries, the most frequent tags were derived from the same novel transcript. The transcript does not have any open reading frame capable of encoding a protein with over 100 amino acids in length. RNA blot analysis revealed that this transcript is specifically and abundantly expressed in the Bombyx ovary, mainly the germ line cells in the ovarioles. These results suggest that Bombyx oogenesis may be regulated by a previously unidentified non-coding RNA. Comparison of the gene expression profiles between the stage 1-3 and stage 2-3 egg chamber libraries revealed that 272 tags were significantly more abundant in stage 1-3 egg chambers (p<0.05 and at least two-fold change) than in library 2. Among the differentially expressed transcripts were the sequences that correspond to ATP synthase subunit d (3.1-fold enriched) and ATP synthase coupling factor 6 (9.1-fold enriched), suggesting that they are involved in regulation of cell cycle of cystocytes.

  8. Programmable polyproteams built using twin peptide superglues

    PubMed Central

    Veggiani, Gianluca; Nakamura, Tomohiko; Brenner, Michael D.; Yan, Jun; Robinson, Carol V.; Howarth, Mark

    2016-01-01

    Programmed connection of amino acids or nucleotides into chains introduced a revolution in control of biological function. Reacting proteins together is more complex because of the number of reactive groups and delicate stability. Here we achieved sequence-programmed irreversible connection of protein units, forming polyprotein teams by sequential amidation and transamidation. SpyTag peptide is engineered to spontaneously form an isopeptide bond with SpyCatcher protein. By engineering the adhesin RrgA from Streptococcus pneumoniae, we developed the peptide SnoopTag, which formed a spontaneous isopeptide bond to its protein partner SnoopCatcher with >99% yield and no cross-reaction to SpyTag/SpyCatcher. Solid-phase attachment followed by sequential SpyTag or SnoopTag reaction between building-blocks enabled iterative extension. Linear, branched, and combinatorial polyproteins were synthesized, identifying optimal combinations of ligands against death receptors and growth factor receptors for cancer cell death signal activation. This simple and modular route to programmable “polyproteams” should enable exploration of a new area of biological space. PMID:26787909

  9. Programmable polyproteams built using twin peptide superglues.

    PubMed

    Veggiani, Gianluca; Nakamura, Tomohiko; Brenner, Michael D; Gayet, Raphaël V; Yan, Jun; Robinson, Carol V; Howarth, Mark

    2016-02-02

    Programmed connection of amino acids or nucleotides into chains introduced a revolution in control of biological function. Reacting proteins together is more complex because of the number of reactive groups and delicate stability. Here we achieved sequence-programmed irreversible connection of protein units, forming polyprotein teams by sequential amidation and transamidation. SpyTag peptide is engineered to spontaneously form an isopeptide bond with SpyCatcher protein. By engineering the adhesin RrgA from Streptococcus pneumoniae, we developed the peptide SnoopTag, which formed a spontaneous isopeptide bond to its protein partner SnoopCatcher with >99% yield and no cross-reaction to SpyTag/SpyCatcher. Solid-phase attachment followed by sequential SpyTag or SnoopTag reaction between building-blocks enabled iterative extension. Linear, branched, and combinatorial polyproteins were synthesized, identifying optimal combinations of ligands against death receptors and growth factor receptors for cancer cell death signal activation. This simple and modular route to programmable "polyproteams" should enable exploration of a new area of biological space.

  10. Determining Zebrafish Epitope Reactivity to Commercially Available Antibodies.

    PubMed

    Villarreal, Michael A; Biediger, Nicole M; Bonner, Natalie A; Miller, Jennifer N; Zepeda, Samantha K; Ricard, Benjamin J; García, Dana M; Lewis, Karen A

    2017-08-01

    Antibodies raised against mammalian proteins may exhibit cross-reactivity with zebrafish proteins, making these antibodies useful for fish studies. However, zebrafish may express multiple paralogues of similar sequence and size, making them difficult to distinguish by traditional Western blot analysis. To identify the zebrafish proteins that are recognized by an antimammalian antibody, we developed a system to screen putative epitopes by cloning the sequences between the yeast SUMO protein and a C-terminal 6xHis tag. The recombinant fusion protein was expressed in Escherichia coli and analyzed by Western blot to conclusively identify epitopes that exhibit cross-reactivity with the antibodies of interest. This approach can be used to determine the species cross-reactivity and epitope specificity of a wide variety of peptide antigen-derived antibodies.

  11. Assessing the effects of subject motion on T2 relaxation under spin tagging (TRUST) cerebral oxygenation measurements using volume navigators.

    PubMed

    Stout, Jeffrey N; Tisdall, M Dylan; McDaniel, Patrick; Gagoski, Borjan; Bolar, Divya S; Grant, Patricia Ellen; Adalsteinsson, Elfar

    2017-12-01

    Subject motion may cause errors in estimates of blood T 2 when using the T 2 -relaxation under spin tagging (TRUST) technique on noncompliant subjects like neonates. By incorporating 3D volume navigators (vNavs) into the TRUST pulse sequence, independent measurements of motion during scanning permit evaluation of these errors. The effects of integrated vNavs on TRUST-based T 2 estimates were evaluated using simulations and in vivo subject data. Two subjects were scanned with the TRUST+vNav sequence during prescribed movements. Mean motion scores were derived from vNavs and TRUST images, along with a metric of exponential fit quality. Regression analysis was performed between T 2 estimates and mean motion scores. Also, motion scores were determined from independent neonatal scans. vNavs negligibly affected venous blood T 2 estimates and better detected subject motion than fit quality metrics. Regression analysis showed that T 2 is biased upward by 4.1 ms per 1 mm of mean motion score. During neonatal scans, mean motion scores of 0.6 to 2.0 mm were detected. Motion during TRUST causes an overestimate of T 2 , which suggests a cautious approach when comparing TRUST-based cerebral oxygenation measurements of noncompliant subjects. Magn Reson Med 78:2283-2289, 2017. © 2017 International Society for Magnetic Resonance in Medicine. © 2017 International Society for Magnetic Resonance in Medicine.

  12. Detection of protein modifications and counterfeit protein pharmaceuticals using isotope tags for relative and absolute quantification and matrix-assisted laser desorption/ionization tandem time-of-flight mass spectrometry: studies of insulins.

    PubMed

    Ye, Hongping; Hill, John; Kauffman, John; Gryniewicz, Connie; Han, Xianlin

    2008-08-15

    Isotope tags for relative and absolute quantification (iTRAQ) reagent coupled with matrix-assisted laser desorption/ionization tandem time-of-flight (MALDI-TOF/TOF) mass spectrometric analysis has been evaluated as both a qualitative and quantitative method for the detection of modifications to active pharmaceutical ingredients derived from recombinant DNA technologies and as a method to detect counterfeit drug products. Five types of insulin (human, bovine, porcine, Lispro, and Lantus) were used as model products in the study because of their minor variations in amino acid sequence. Several experiments were conducted in which each insulin variant was separately digested with Glu-C, and the digestate was labeled with one of four different iTRAQ reagents. All digestates were then combined for desalting and MALDI-TOF/TOF mass spectrometric analysis. When the digestion procedure was optimized, the insulin sequence coverage was 100%. Five different types of insulin were readily differentiated, including human insulin (P28K29) and Lispro insulin (K28P29), which differ only by the interchange of two contiguous residues. Moreover, quantitative analyses show that the results obtained from the iTRAQ method agree well with those determined by other conventional methods. Collectively, the iTRAQ method can be used as a qualitative and quantitative technique for the detection of protein modification and counterfeiting.

  13. Genome-wide analysis of promoter architecture in Drosophila melanogaster

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hoskins, Roger A.; Landolin, Jane M.; Brown, James B.

    2010-10-20

    Core promoters are critical regions for gene regulation in higher eukaryotes. However, the boundaries of promoter regions, the relative rates of initiation at the transcription start sites (TSSs) distributed within them, and the functional significance of promoter architecture remain poorly understood. We produced a high-resolution map of promoters active in the Drosophila melanogaster embryo by integrating data from three independent and complementary methods: 21 million cap analysis of gene expression (CAGE) tags, 1.2 million RNA ligase mediated rapid amplification of cDNA ends (RLMRACE) reads, and 50,000 cap-trapped expressed sequence tags (ESTs). We defined 12,454 promoters of 8037 genes. Our analysismore » indicates that, due to non-promoter-associated RNA background signal, previous studies have likely overestimated the number of promoter-associated CAGE clusters by fivefold. We show that TSS distributions form a complex continuum of shapes, and that promoters active in the embryo and adult have highly similar shapes in 95% of cases. This suggests that these distributions are generally determined by static elements such as local DNA sequence and are not modulated by dynamic signals such as histone modifications. Transcription factor binding motifs are differentially enriched as a function of promoter shape, and peaked promoter shape is correlated with both temporal and spatial regulation of gene expression. Our results contribute to the emerging view that core promoters are functionally diverse and control patterning of gene expression in Drosophila and mammals.« less

  14. Comparative expression profiling in grape (Vitis vinifera) berries derived from frequency analysis of ESTs and MPSS signatures.

    PubMed

    Iandolino, Alberto; Nobuta, Kan; da Silva, Francisco Goes; Cook, Douglas R; Meyers, Blake C

    2008-05-12

    Vitis vinifera (V. vinifera) is the primary grape species cultivated for wine production, with an industry valued annually in the billions of dollars worldwide. In order to sustain and increase grape production, it is necessary to understand the genetic makeup of grape species. Here we performed mRNA profiling using Massively Parallel Signature Sequencing (MPSS) and combined it with available Expressed Sequence Tag (EST) data. These tag-based technologies, which do not require a priori knowledge of genomic sequence, are well-suited for transcriptional profiling. The sequence depth of MPSS allowed us to capture and quantify almost all the transcripts at a specific stage in the development of the grape berry. The number and relative abundance of transcripts from stage II grape berries was defined using Massively Parallel Signature Sequencing (MPSS). A total of 2,635,293 17-base and 2,259,286 20-base signatures were obtained, representing at least 30,737 and 26,878 distinct sequences. The average normalized abundance per signature was approximately 49 TPM (Transcripts Per Million). Comparisons of the MPSS signatures with available Vitis species' ESTs and a unigene set demonstrated that 6,430 distinct contigs and 2,190 singletons have a perfect match to at least one MPSS signature. Among the matched sequences, ESTs were identified from tissues other than berries or from berries at different developmental stages. Additional MPSS signatures not matching to known grape ESTs can extend our knowledge of the V. vinifera transcriptome, particularly when these data are used to assist in annotation of whole genome sequences from Vitis vinifera. The MPSS data presented here not only achieved a higher level of saturation than previous EST based analyses, but in doing so, expand the known set of transcripts of grape berries during the unique stage in development that immediately precedes the onset of ripening. The MPSS dataset also revealed evidence of antisense expression not previously reported in grapes but comparable to that reported in other plant species. Finally, we developed a novel web-based, public resource for utilization of the grape MPSS data [1].

  15. An unusual plant triterpene synthase with predominant α-amyrin-producing activity identified by characterizing oxidosqualene cyclases from Malus × domestica.

    PubMed

    Brendolise, Cyril; Yauk, Yar-Khing; Eberhard, Ellen D; Wang, Mindy; Chagne, David; Andre, Christelle; Greenwood, David R; Beuning, Lesley L

    2011-07-01

    The pentacyclic triterpenes, in particular ursolic acid and oleanolic acid and their derivatives, exist abundantly in the plant kingdom, where they are well known for their anti-inflammatory, antitumour and antimicrobial properties. α-Amyrin and β-amyrin are the precursors of ursolic and oleanolic acids, respectively, formed by concerted cyclization of squalene epoxide by a complex synthase reaction. We identified three full-length expressed sequence tag sequences in cDNA libraries constructed from apple (Malus × domestica 'Royal Gala') that were likely to encode triterpene synthases. Two of these expressed sequence tag sequences were essentially identical (> 99% amino acid similarity; MdOSC1 and MdOSC3). MdOSC1 and MdOSC2 were expressed by transient expression in Nicotiana benthamiana leaves and by expression in the yeast Pichia methanolica. The resulting products were analysed by GC and GC-MS. MdOSC1 was shown to be a mixed amyrin synthase (a 5 : 1 ratio of α-amyrin to β-amyrin). MdOSC1 is the only triterpene synthase so far identified in which the level of α-amyrin produced is > 80% of the total product and is, therefore, primarily an α-amyrin synthase. No product was evident for MdOSC2 when expressed either transiently or in yeast, suggesting that this putative triterpene synthase is either encoded by a pseudogene or does not express well in these systems. Transcript expression analysis in Royal Gala indicated that the genes are mostly expressed in apple peel, and that the MdOSC2 expression level was much lower than that of MdOSC1 and MdOSC3 in all the tissues tested. Amyrin content analysis was undertaken by LC-MS, and demonstrated that levels and ratios differ between tissues, but that the true consequence of synthase activity is reflected in the ursolic/oleanolic acid content and in further triterpenoids derived from them. Phylogenetic analysis placed the three triterpene synthase sequences with other triterpene synthases that encoded either α-amyrin and/or β-amyrin synthase. MdOSC1 and MdOSC3 clustered with the multifunctional triterpene synthases, whereas MdOSC2 was most similar to the β-amyrin synthases. © 2011 The New Zealand Institute for Plant and Food Research Limited. Journal compilation © 2011 FEBS.

  16. The application of new software tools to quantitative protein profiling via isotope-coded affinity tag (ICAT) and tandem mass spectrometry: I. Statistically annotated datasets for peptide sequences and proteins identified via the application of ICAT and tandem mass spectrometry to proteins copurifying with T cell lipid rafts.

    PubMed

    von Haller, Priska D; Yi, Eugene; Donohoe, Samuel; Vaughn, Kelly; Keller, Andrew; Nesvizhskii, Alexey I; Eng, Jimmy; Li, Xiao-jun; Goodlett, David R; Aebersold, Ruedi; Watts, Julian D

    2003-07-01

    Lipid rafts were prepared according to standard protocols from Jurkat T cells stimulated via T cell receptor/CD28 cross-linking and from control (unstimulated) cells. Co-isolating proteins from the control and stimulated cell preparations were labeled with isotopically normal (d0) and heavy (d8) versions of the same isotope-coded affinity tag (ICAT) reagent, respectively. Samples were combined, proteolyzed, and resultant peptides fractionated via cation exchange chromatography. Cysteine-containing (ICAT-labeled) peptides were recovered via the biotin tag component of the ICAT reagents by avidin-affinity chromatography. On-line micro-capillary liquid chromatography tandem mass spectrometry was performed on both avidin-affinity (ICAT-labeled) and flow-through (unlabeled) fractions. Initial peptide sequence identification was by searching recorded tandem mass spectrometry spectra against a human sequence data base using SEQUEST software. New statistical data modeling algorithms were then applied to the SEQUEST search results. These allowed for discrimination between likely "correct" and "incorrect" peptide assignments, and from these the inferred proteins that they collectively represented, by calculating estimated probabilities that each peptide assignment and subsequent protein identification was a member of the "correct" population. For convenience, the resultant lists of peptide sequences assigned and the proteins to which they corresponded were filtered at an arbitrarily set cut-off of 0.5 (i.e. 50% likely to be "correct") and above and compiled into two separate datasets. In total, these data sets contained 7667 individual peptide identifications, which represented 2669 unique peptide sequences, corresponding to 685 proteins and related protein groups.

  17. Bio-inorganic synthesis of ZnO powders using recombinant His-tagged ZnO binding peptide as a promoter.

    PubMed

    Song, Lei; Liu, Yingying; Zhang, Zhifang; Wang, Xi; Chen, Jinchun

    2010-10-01

    Inorganic-binding peptides termed as genetically engineered polypeptides for inorganics (GEPIs), are small peptide sequences selected via combinatorial biology-based protocols of phage or cell surface display technologies. Recent advances in nanotechnology and molecular biology allow the engineering of these peptides with specific affinity to inorganics, often used as molecular linkers or assemblers, to facilitate materials synthesis, which provides a new insight into the material science and engineering field. As a case study on this biomimetic application, here we report a novel biosynthetic ZnO binding protein and its application in promoting bio-inorganic materials synthesis. In brief, the gene encoding a ZnO binding peptide(ZBP) was genetically fused with His(6)-tag and GST-tag using E.coli expression vector pET-28a (+) and pGEX-4T-3. The recombinant protein GST-His-ZBP was expressed, purified with Ni-NTA system, identified by SDS-PAGE electrophoresis and Western blot analysis and confirmed by liquid chromatography-mass spectrometry/mass spectrometry (LC-MS/MS) analysis. Affinity adsorption test demonstrated that the fusion protein had a specific avidity for ZnO nanoparticles (NPs). Results from the bio-inorganic synthesis experiment indicated that the new protein played a promoting part in grain refinement and accelerated precipitation during the formation of the ultra-fine precursor powders in the Zn(OH)(2) sol. X-ray diffraction (XRD) analysis on the final products after calcining the precursor powders showed that hexagonal wurtzite ZnO crystals were obtained. Our work suggested a novel approach to the application about the organic-inorganic interactions.

  18. Construction and characterization of 3A-epitope-tagged foot-and-mouth disease virus.

    PubMed

    Ma, Xueqing; Li, Pinghua; Sun, Pu; Bai, Xingwen; Bao, Huifang; Lu, Zengjun; Fu, Yuanfang; Cao, Yimei; Li, Dong; Chen, Yingli; Qiao, Zilin; Liu, Zaixin

    2015-04-01

    Nonstructural protein 3A of foot-and-mouth disease virus (FMDV) is a partially conserved protein of 153 amino acids (aa) in most FMDVs examined to date. Specific deletion in the FMDV 3A protein has been associated with the inability of FMDV to grow in primary bovine cells and cause disease in cattle. However, the aa residues playing key roles in these processes are poorly understood. In this study, we constructed epitope-tagged FMDVs containing an 8 aa FLAG epitope, a 9 aa haemagglutinin (HA) epitope, and a 10 aa c-Myc epitope to substitute residues 94-101, 93-101, and 93-102 of 3A protein, respectively, using a recently developed O/SEA/Mya-98 FMDV infectious cDNA clone. Immunofluorescence assay (IFA), Western blot and sequence analysis showed that the epitope-tagged viruses stably maintained and expressed the foreign epitopes even after 10 serial passages in BHK-21 cells. The epitope-tagged viruses displayed growth properties and plaque phenotypes similar to those of the parental virus in BHK-21 cells. However, the epitope-tagged viruses exhibited lower growth rates and smaller plaque size phenotypes than those of the parental virus in primary fetal bovine kidney (FBK) cells, but similar growth properties and plaque phenotypes to those of the recombinant viruses harboring 93-102 deletion in 3A. These results demonstrate that the decreased ability of FMDV to replicate in primary bovine cells was not associated with the length of 3A, and the genetic determinant thought to play key role in decreased ability to replicate in primary bovine cells could be reduced from 93-102 residues to 8 aa residues at positions 94-101 in 3A protein. Copyright © 2015 Elsevier B.V. All rights reserved.

  19. Comparison of NxTAG Respiratory Pathogen Panel and Anyplex II RV16 Tests for Multiplex Detection of Respiratory Pathogens in Hospitalized Children

    PubMed Central

    Brotons, Pedro; Henares, Desiree; Latorre, Irene; Cepillo, Antonio; Launes, Cristian

    2016-01-01

    Multiplex molecular techniques can detect a diversity of respiratory viruses and bacteria that cause childhood acute respiratory infection rapidly and conveniently. However, currently available techniques show high variation in performance. We sought to compare the diagnostic accuracy of the novel multiplex NxTAG respiratory pathogen panel (RPP) RUO test versus a routine multiplex Anyplex II RV16 assay in respiratory specimens collected from children <18 years of age hospitalized with nonspecific symptoms of acute lower respiratory infection. Parallel testing was performed on nasopharyngeal aspirates prospectively collected at referral Children's Hospital Sant Joan de Déu (Barcelona, Spain) between June and November 2015. Agreement values between the two tests and kappa coefficients were assessed. Bidirectional sequencing was performed for the resolution of discordant results. A total of 319 samples were analyzed by both techniques. A total of 268 (84.0%) of them yielded concordant results. Positive percent agreement values ranged from 83.3 to 100%, while the negative percent agreement was more than 99% for all targets except for enterovirus/rhinovirus (EV/RV; 94.4%). Kappa coefficients ranged from 0.83 to 1.00. Discrepancy analysis confirmed 66.0% of NxTAG RPP RUO results. A total of 260 viruses were detected, with EV/RV (n = 105, 40.4%) being the most prevalent target. Viral coinfections were found in 44 (14.2%) samples. In addition, NxTAG RPP RUO detected single bacterial and mixed viral-bacterial infections in seven samples. NxTAG RPP RUO showed high positive and negative agreement with Anyplex II RV16 for main viruses that cause acute respiratory infections in children, coupled with an additional capability to detect some respiratory bacteria. PMID:27629904

  20. Development of 23 novel polymorphic EST-SSR markers for the endangered relict conifer Metasequoia glyptostroboides.

    PubMed

    Jin, Yuqing; Bi, Quanxin; Guan, Wenbin; Mao, Jian-Feng

    2015-09-01

    Metasequoia glyptostroboides is an endangered relict conifer species endemic to China. In this study, expressed sequence tag-simple sequence repeat (EST-SSR) markers were developed using transcriptome mining for future genetic and functional studies. We collected 97,565 unigene sequences generated by 454 pyrosequencing. A bioinformatics analysis identified 2087 unique and putative microsatellites, from which 96 novel microsatellite markers were developed. Fifty-three of the 96 primer sets successfully amplified clear fragments of the expected sizes; 23 of those loci were polymorphic. The number of alleles per locus ranged from two to eight, with an average of three, and the observed and expected heterozygosity values ranged from 0 to 1.0 and 0.117 to 0.813, respectively. These microsatellite loci will enrich the genetic resources to develop functional studies and conservation strategies for this endangered relict species.

  1. Cadmium sulfide nanocluster-based electrochemical stripping detection of DNA hybridization.

    PubMed

    Zhu, Ningning; Zhang, Aiping; He, Pingang; Fang, Yuzhi

    2003-03-01

    A novel, sensitive electrochemical DNA hybridization detection assay, using cadmium sulfide (CdS) nanoclusters as the oligonucleotide labeling tag, is described. The assay relies on the hybridization of the target DNA with the CdS nanocluster oligonucleotide DNA probe, followed by the dissolution of the CdS nanoclusters anchored on the hybrids and the indirect determination of the dissolved cadmium ions by sensitive anodic stripping voltammetry (ASV) at a mercury-coated glassy carbon electrode (GCE). The results showed that only a complementary sequence could form a double-stranded dsDNA-CdS with the DNA probe and give an obvious electrochemical response. A three-base mismatch sequence and non-complementary sequence had negligible response. The combination of the large number of cadmium ions released from each dsDNA hybrid with the remarkable sensitivity of the electrochemical stripping analysis for cadmium at mercury-film GCE allows detection at levels as low as 0.2 pmol L(-1) of the complementary sequence of DNA.

  2. Survey of duckweed diversity in Lake Chao and total fatty acid, triacylglycerol, profiles of representative strains.

    PubMed

    Tang, J; Li, Y; Ma, J; Cheng, J J

    2015-09-01

    Lemnaceae (duckweeds) are widely distributed aquatic flowering plants. Their high growth rate, starch content and suitability for bioremediation make them potential feedstock for biofuels. However, few natural duckweed resources have been investigated in China, and there is no information about total fatty acid (TFA) and triacylglycerol (TAG) composition of duckweeds from China. Here, the genetic diversity of a natural duckweed population collected from Lake Chao, China, was investigated using multilocus sequence typing (MLST). The 54 strains were categorised into four species in four genera, representing 12 distinct sequence types. Strains representing Lemna aequinoctialis and Spirodela polyrhiza were predominant. Interestingly, a surprisingly high degree of genetic diversification within L. aequinoctialis was observed. The four duckweed species revealed a uniform fatty acid composition, with three fatty acids, palmitic acid, linoleic acid and linolenic acid, accounting for more than 80% of the TFA. The TFA in biomass varied among species, ranging from 1.05% (of dry weight, DW) for L. punctata and S. polyrhiza to 1.62% for Wolffia globosa. The four duckweed species contained similar TAG contents, 0.02% mg · DW(-1). The fatty acid profiles of TAG were different from those of TFA, and also varied among the four species. The survey investigated the genetic diversity of duckweeds from Lake Chao, and provides an initial insight into TFA and TAG of four duckweed species, indicating that intraspecific and interspecific variations exist in the content and composition of both TFA and TAG in comparison with other studies. © 2015 German Botanical Society and The Royal Botanical Society of the Netherlands.

  3. In-plane "superresolution" MRI with phaseless sub-pixel encoding.

    PubMed

    Hennel, Franciszek; Tian, Rui; Engel, Maria; Pruessmann, Klaas P

    2018-04-15

    Acquisition of high-resolution imaging data using multiple excitations without the sensitivity to fluctuations of the transverse magnetization phase, which is a major problem of multi-shot MRI. The concept of superresolution MRI based on microscopic tagging is analyzed using an analogy with the optical method of structured illumination. Sinusoidal tagging is shown to provide subpixel resolution by mixing of neighboring spatial frequency (k-space) bands. It represents a phaseless modulation added on top of the standard Fourier encoding, which allows the phase fluctuations to be discarded at an intermediate reconstruction step. Improvements are proposed to correct for tag distortions due to magnetic field inhomogeneity and to avoid the propagation of Gibbs ringing from intermediate low-resolution images to the final image. The method was applied to diffusion-weighted EPI. Artifact-free superresolution images can be obtained despite a finite duration of the tagging sequence and related pattern distortions by a field map based phase correction of band-wise reconstructed images. The ringing effect present in the intermediate images can be suppressed by partial overlapping of the mixed k-space bands in combination with an adapted filter. High-resolution diffusion-weighted images of the human head were obtained with a three-shot EPI sequence despite motion-related phase fluctuations between the shots. Due to its phaseless character, tagging-based sub-pixel encoding is an alternative to k-space segmenting in the presence of unknown phase fluctuations, in particular those due to motion under strong diffusion gradients. Proposed improvements render the method practicable in realistic conditions. © 2018 International Society for Magnetic Resonance in Medicine.

  4. Exploring the Structure of Library and Information Science Web Space Based on Multivariate Analysis of Social Tags

    ERIC Educational Resources Information Center

    Joo, Soohyung; Kipp, Margaret E. I.

    2015-01-01

    Introduction: This study examines the structure of Web space in the field of library and information science using multivariate analysis of social tags from the Website, Delicious.com. A few studies have examined mathematical modelling of tags, mainly examining tagging in terms of tripartite graphs, pattern tracing and descriptive statistics. This…

  5. Discovery of a new mechanism for regulation of plant triacylglycerol metabolism: The peanut diacylglycerol acyltransferase-1 gene family transcriptome is highly enriched in alternative splicing variants.

    PubMed

    Zheng, Ling; Shockey, Jay; Guo, Feng; Shi, Lingmin; Li, Xinguo; Shan, Lei; Wan, Shubo; Peng, Zhenying

    2017-12-01

    Triacylglycerols (TAGs) are the most important energy storage form in oilseed crops. Diacylglycerol acyltransferase (DGAT) catalyzes the rate-limiting step of the Kennedy pathway of TAG biosynthesis. To date, little is known about the regulation of DGAT activity in peanut (Arachis hypogaea), an agronomically important oilseed crop that is cultivated in many parts of the world. In this study, seven distinct forms of type 1 DGAT (AhDGAT1.1-AhDGAT1.7) were identified, cloned, and characterized. Comparisons of the nucleotide sequences and gene structures revealed many different splicing variants of AhDGAT1, some of which displayed different organ-specific expression patterns. A representative gene (AhDGAT1.1) was transformed into wild-type tobacco and was shown to increase seed fatty acid (FA) content by 14.7%-20.9%. All seven AhDGAT1s were expressed in TAG-deficient Saccharomyces cerevisiae strain H1246; the five longest AhDGAT1 variants generated high levels of acyltransferase activity and complemented the free fatty acid lethality phenotype in this strain. The alternative splicing that gives rise to AhDGAT1.2 and AhDGAT1.4 creates predicted protein C-terminal truncations. The proteins encoded by these two variants were not active and did not complement the fatty acid sensitivity in H1246. These results were verified by visualization of intracellular lipid droplets using Nile Red staining. Collectively, the results presented here represent the first comprehensive analysis of the peanut DGAT1 gene family, which, unlike in other published plant DGAT1 sequences, shows widespread alternative splicing that may affect the expression patterns and enzyme activities of some members of the gene family. Copyright © 2017. Published by Elsevier GmbH.

  6. A Medicago truncatula Tobacco Retrotransposon Insertion Mutant Collection with Defects in Nodule Development and Symbiotic Nitrogen Fixation1[W][OA

    PubMed Central

    Pislariu, Catalina I.; D. Murray, Jeremy; Wen, JiangQi; Cosson, Viviane; Muni, RajaSekhara Reddy Duvvuru; Wang, Mingyi; A. Benedito, Vagner; Andriankaja, Andry; Cheng, Xiaofei; Jerez, Ivone Torres; Mondy, Samuel; Zhang, Shulan; Taylor, Mark E.; Tadege, Million; Ratet, Pascal; Mysore, Kirankumar S.; Chen, Rujin; Udvardi, Michael K.

    2012-01-01

    A Tnt1-insertion mutant population of Medicago truncatula ecotype R108 was screened for defects in nodulation and symbiotic nitrogen fixation. Primary screening of 9,300 mutant lines yielded 317 lines with putative defects in nodule development and/or nitrogen fixation. Of these, 230 lines were rescreened, and 156 lines were confirmed with defective symbiotic nitrogen fixation. Mutants were sorted into six distinct phenotypic categories: 72 nonnodulating mutants (Nod−), 51 mutants with totally ineffective nodules (Nod+ Fix−), 17 mutants with partially ineffective nodules (Nod+ Fix+/−), 27 mutants defective in nodule emergence, elongation, and nitrogen fixation (Nod+/− Fix−), one mutant with delayed and reduced nodulation but effective in nitrogen fixation (dNod+/− Fix+), and 11 supernodulating mutants (Nod++Fix+/−). A total of 2,801 flanking sequence tags were generated from the 156 symbiotic mutant lines. Analysis of flanking sequence tags revealed 14 insertion alleles of the following known symbiotic genes: NODULE INCEPTION (NIN), DOESN’T MAKE INFECTIONS3 (DMI3/CCaMK), ERF REQUIRED FOR NODULATION, and SUPERNUMERARY NODULES (SUNN). In parallel, a polymerase chain reaction-based strategy was used to identify Tnt1 insertions in known symbiotic genes, which revealed 25 additional insertion alleles in the following genes: DMI1, DMI2, DMI3, NIN, NODULATION SIGNALING PATHWAY1 (NSP1), NSP2, SUNN, and SICKLE. Thirty-nine Nod− lines were also screened for arbuscular mycorrhizal symbiosis phenotypes, and 30 mutants exhibited defects in arbuscular mycorrhizal symbiosis. Morphological and developmental features of several new symbiotic mutants are reported. The collection of mutants described here is a source of novel alleles of known symbiotic genes and a resource for cloning novel symbiotic genes via Tnt1 tagging. PMID:22679222

  7. A Medicago truncatula tobacco retrotransposon insertion mutant collection with defects in nodule development and symbiotic nitrogen fixation.

    PubMed

    Pislariu, Catalina I; Murray, Jeremy D; Wen, JiangQi; Cosson, Viviane; Muni, RajaSekhara Reddy Duvvuru; Wang, Mingyi; Benedito, Vagner A; Andriankaja, Andry; Cheng, Xiaofei; Jerez, Ivone Torres; Mondy, Samuel; Zhang, Shulan; Taylor, Mark E; Tadege, Million; Ratet, Pascal; Mysore, Kirankumar S; Chen, Rujin; Udvardi, Michael K

    2012-08-01

    A Tnt1-insertion mutant population of Medicago truncatula ecotype R108 was screened for defects in nodulation and symbiotic nitrogen fixation. Primary screening of 9,300 mutant lines yielded 317 lines with putative defects in nodule development and/or nitrogen fixation. Of these, 230 lines were rescreened, and 156 lines were confirmed with defective symbiotic nitrogen fixation. Mutants were sorted into six distinct phenotypic categories: 72 nonnodulating mutants (Nod-), 51 mutants with totally ineffective nodules (Nod+ Fix-), 17 mutants with partially ineffective nodules (Nod+ Fix+/-), 27 mutants defective in nodule emergence, elongation, and nitrogen fixation (Nod+/- Fix-), one mutant with delayed and reduced nodulation but effective in nitrogen fixation (dNod+/- Fix+), and 11 supernodulating mutants (Nod++Fix+/-). A total of 2,801 flanking sequence tags were generated from the 156 symbiotic mutant lines. Analysis of flanking sequence tags revealed 14 insertion alleles of the following known symbiotic genes: NODULE INCEPTION (NIN), DOESN'T MAKE INFECTIONS3 (DMI3/CCaMK), ERF REQUIRED FOR NODULATION, and SUPERNUMERARY NODULES (SUNN). In parallel, a polymerase chain reaction-based strategy was used to identify Tnt1 insertions in known symbiotic genes, which revealed 25 additional insertion alleles in the following genes: DMI1, DMI2, DMI3, NIN, NODULATION SIGNALING PATHWAY1 (NSP1), NSP2, SUNN, and SICKLE. Thirty-nine Nod- lines were also screened for arbuscular mycorrhizal symbiosis phenotypes, and 30 mutants exhibited defects in arbuscular mycorrhizal symbiosis. Morphological and developmental features of several new symbiotic mutants are reported. The collection of mutants described here is a source of novel alleles of known symbiotic genes and a resource for cloning novel symbiotic genes via Tnt1 tagging.

  8. Evolutionary characterization of pig interferon-inducible transmembrane gene family and member expression dynamics in tracheobronchial lymph nodes of pigs infected with swine respiratory disease viruses.

    PubMed

    Miller, Laura C; Jiang, Zhihua; Sang, Yongming; Harhay, Gregory P; Lager, Kelly M

    2014-06-15

    Studies have found that a cluster of duplicated gene loci encoding the interferon-inducible transmembrane proteins (IFITMs) family have antiviral activity against several viruses, including influenza A virus. The gene family has 5 and 7 members in humans and mice, respectively. Here, we confirm the current annotation of pig IFITM1, IFITM2, IFITM3, IFITM5, IFITM1L1 and IFITM1L4, manually annotated IFITM1L2, IFITM1L3, IFITM5L, IFITM3L1 and IFITM3L2, and provide expressed sequence tag (EST) and/or mRNA evidence, not contained with the NCBI Reference Sequence database (RefSeq), for the existence of IFITM6, IFITM7 and a new IFITM1-like (IFITM1LN) gene in pigs. Phylogenic analyses showed seven porcine IFITM genes with highly conserved human/mouse orthologs known to have anti-viral activity. Digital Gene Expression Tag Profiling (DGETP) of swine tracheobronchial lymph nodes (TBLN) of pigs infected with swine influenza virus (SIV), porcine pseudorabies virus, porcine reproductive and respiratory syndrome virus or porcine circovirus type 2 over 14 days post-inoculation (dpi) showed that gene expression abundance differs dramatically among pig IFITM family members, ranging from 0 to over 3000 tags per million. In particular, SIV up-regulated IFITM1 by 5.9 fold at 3 dpi. Bayesian framework further identified pig IFITM1 and IFITM3 as differentially expressed genes in the overall transcriptome analysis. In addition to being a component of protein complexes involved in homotypic adhesion, the IFITM1 is also associated with pathways related to regulation of cell proliferation and IFITM3 is involved in immune responses. Published by Elsevier B.V.

  9. Molecular diversity and distribution of marine fungi across 130 European environmental samples.

    PubMed

    Richards, Thomas A; Leonard, Guy; Mahé, Frédéric; Del Campo, Javier; Romac, Sarah; Jones, Meredith D M; Maguire, Finlay; Dunthorn, Micah; De Vargas, Colomban; Massana, Ramon; Chambouvet, Aurélie

    2015-11-22

    Environmental DNA and culture-based analyses have suggested that fungi are present in low diversity and in low abundance in many marine environments, especially in the upper water column. Here, we use a dual approach involving high-throughput diversity tag sequencing from both DNA and RNA templates and fluorescent cell counts to evaluate the diversity and relative abundance of fungi across marine samples taken from six European near-shore sites. We removed very rare fungal operational taxonomic units (OTUs) selecting only OTUs recovered from multiple samples for a detailed analysis. This approach identified a set of 71 fungal 'OTU clusters' that account for 66% of all the sequences assigned to the Fungi. Phylogenetic analyses demonstrated that this diversity includes a significant number of chytrid-like lineages that had not been previously described, indicating that the marine environment encompasses a number of zoosporic fungi that are new to taxonomic inventories. Using the sequence datasets, we identified cases where fungal OTUs were sampled across multiple geographical sites and between different sampling depths. This was especially clear in one relatively abundant and diverse phylogroup tentatively named Novel Chytrid-Like-Clade 1 (NCLC1). For comparison, a subset of the water column samples was also investigated using fluorescent microscopy to examine the abundance of eukaryotes with chitin cell walls. Comparisons of relative abundance of RNA-derived fungal tag sequences and chitin cell-wall counts demonstrate that fungi constitute a low fraction of the eukaryotic community in these water column samples. Taken together, these results demonstrate the phylogenetic position and environmental distribution of 71 lineages, improving our understanding of the diversity and abundance of fungi in marine environments. © 2015 The Authors.

  10. Molecular diversity and distribution of marine fungi across 130 European environmental samples

    PubMed Central

    Richards, Thomas A.; Leonard, Guy; Mahé, Frédéric; del Campo, Javier; Romac, Sarah; Jones, Meredith D. M.; Maguire, Finlay; Dunthorn, Micah; De Vargas, Colomban; Massana, Ramon; Chambouvet, Aurélie

    2015-01-01

    Environmental DNA and culture-based analyses have suggested that fungi are present in low diversity and in low abundance in many marine environments, especially in the upper water column. Here, we use a dual approach involving high-throughput diversity tag sequencing from both DNA and RNA templates and fluorescent cell counts to evaluate the diversity and relative abundance of fungi across marine samples taken from six European near-shore sites. We removed very rare fungal operational taxonomic units (OTUs) selecting only OTUs recovered from multiple samples for a detailed analysis. This approach identified a set of 71 fungal ‘OTU clusters' that account for 66% of all the sequences assigned to the Fungi. Phylogenetic analyses demonstrated that this diversity includes a significant number of chytrid-like lineages that had not been previously described, indicating that the marine environment encompasses a number of zoosporic fungi that are new to taxonomic inventories. Using the sequence datasets, we identified cases where fungal OTUs were sampled across multiple geographical sites and between different sampling depths. This was especially clear in one relatively abundant and diverse phylogroup tentatively named Novel Chytrid-Like-Clade 1 (NCLC1). For comparison, a subset of the water column samples was also investigated using fluorescent microscopy to examine the abundance of eukaryotes with chitin cell walls. Comparisons of relative abundance of RNA-derived fungal tag sequences and chitin cell-wall counts demonstrate that fungi constitute a low fraction of the eukaryotic community in these water column samples. Taken together, these results demonstrate the phylogenetic position and environmental distribution of 71 lineages, improving our understanding of the diversity and abundance of fungi in marine environments. PMID:26582030

  11. Screening for Glycosylphosphatidylinositol-Modified Cell Wall Proteins in Pichia pastoris and Their Recombinant Expression on the Cell Surface

    PubMed Central

    Zhang, Li; Liang, Shuli; Zhou, Xinying; Jin, Zi; Jiang, Fengchun; Han, Shuangyan; Zheng, Suiping

    2013-01-01

    Glycosylphosphatidylinositol (GPI)-anchored glycoproteins have various intrinsic functions in yeasts and different uses in vitro. In the present study, the genome of Pichia pastoris GS115 was screened for potential GPI-modified cell wall proteins. Fifty putative GPI-anchored proteins were selected on the basis of (i) the presence of a C-terminal GPI attachment signal sequence, (ii) the presence of an N-terminal signal sequence for secretion, and (iii) the absence of transmembrane domains in mature protein. The predicted GPI-anchored proteins were fused to an alpha-factor secretion signal as a substitute for their own N-terminal signal peptides and tagged with the chimeric reporters FLAG tag and mature Candida antarctica lipase B (CALB). The expression of fusion proteins on the cell surface of P. pastoris GS115 was determined by whole-cell flow cytometry and immunoblotting analysis of the cell wall extracts obtained by β-1,3-glucanase digestion. CALB displayed on the cell surface of P. pastoris GS115 with the predicted GPI-anchored proteins was examined on the basis of potential hydrolysis of p-nitrophenyl butyrate. Finally, 13 proteins were confirmed to be GPI-modified cell wall proteins in P. pastoris GS115, which can be used to display heterologous proteins on the yeast cell surface. PMID:23835174

  12. Skin Microbiome Surveys Are Strongly Influenced by Experimental Design.

    PubMed

    Meisel, Jacquelyn S; Hannigan, Geoffrey D; Tyldsley, Amanda S; SanMiguel, Adam J; Hodkinson, Brendan P; Zheng, Qi; Grice, Elizabeth A

    2016-05-01

    Culture-independent studies to characterize skin microbiota are increasingly common, due in part to affordable and accessible sequencing and analysis platforms. Compared to culture-based techniques, DNA sequencing of the bacterial 16S ribosomal RNA (rRNA) gene or whole metagenome shotgun (WMS) sequencing provides more precise microbial community characterizations. Most widely used protocols were developed to characterize microbiota of other habitats (i.e., gastrointestinal) and have not been systematically compared for their utility in skin microbiome surveys. Here we establish a resource for the cutaneous research community to guide experimental design in characterizing skin microbiota. We compare two widely sequenced regions of the 16S rRNA gene to WMS sequencing for recapitulating skin microbiome community composition, diversity, and genetic functional enrichment. We show that WMS sequencing most accurately recapitulates microbial communities, but sequencing of hypervariable regions 1-3 of the 16S rRNA gene provides highly similar results. Sequencing of hypervariable region 4 poorly captures skin commensal microbiota, especially Propionibacterium. WMS sequencing, which is resource and cost intensive, provides evidence of a community's functional potential; however, metagenome predictions based on 16S rRNA sequence tags closely approximate WMS genetic functional profiles. This study highlights the importance of experimental design for downstream results in skin microbiome surveys. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.

  13. Skin microbiome surveys are strongly influenced by experimental design

    PubMed Central

    Meisel, Jacquelyn S.; Hannigan, Geoffrey D.; Tyldsley, Amanda S.; SanMiguel, Adam J.; Hodkinson, Brendan P.; Zheng, Qi; Grice, Elizabeth A.

    2016-01-01

    Culture-independent studies to characterize skin microbiota are increasingly common, due in part to affordable and accessible sequencing and analysis platforms. Compared to culture-based techniques, DNA sequencing of the bacterial 16S ribosomal RNA (rRNA) gene or whole metagenome shotgun (WMS) sequencing provide more precise microbial community characterizations. Most widely used protocols were developed to characterize microbiota of other habitats (i.e. gastrointestinal), and have not been systematically compared for their utility in skin microbiome surveys. Here we establish a resource for the cutaneous research community to guide experimental design in characterizing skin microbiota. We compare two widely sequenced regions of the 16S rRNA gene to WMS sequencing for recapitulating skin microbiome community composition, diversity, and genetic functional enrichment. We show that WMS sequencing most accurately recapitulates microbial communities, but sequencing of hypervariable regions 1-3 of the 16S rRNA gene provides highly similar results. Sequencing of hypervariable region 4 poorly captures skin commensal microbiota, especially Propionibacterium. WMS sequencing, which is resource- and cost-intensive, provides evidence of a community’s functional potential; however, metagenome predictions based on 16S rRNA sequence tags closely approximate WMS genetic functional profiles. This work highlights the importance of experimental design for downstream results in skin microbiome surveys. PMID:26829039

  14. Draft Sequences of the Radish (Raphanus sativus L.) Genome

    PubMed Central

    Kitashiba, Hiroyasu; Li, Feng; Hirakawa, Hideki; Kawanabe, Takahiro; Zou, Zhongwei; Hasegawa, Yoichi; Tonosaki, Kaoru; Shirasawa, Sachiko; Fukushima, Aki; Yokoi, Shuji; Takahata, Yoshihito; Kakizaki, Tomohiro; Ishida, Masahiko; Okamoto, Shunsuke; Sakamoto, Koji; Shirasawa, Kenta; Tabata, Satoshi; Nishio, Takeshi

    2014-01-01

    Radish (Raphanus sativus L., n = 9) is one of the major vegetables in Asia. Since the genomes of Brassica and related species including radish underwent genome rearrangement, it is quite difficult to perform functional analysis based on the reported genomic sequence of Brassica rapa. Therefore, we performed genome sequencing of radish. Short reads of genomic sequences of 191.1 Gb were obtained by next-generation sequencing (NGS) for a radish inbred line, and 76,592 scaffolds of ≥300 bp were constructed along with the bacterial artificial chromosome-end sequences. Finally, the whole draft genomic sequence of 402 Mb spanning 75.9% of the estimated genomic size and containing 61,572 predicted genes was obtained. Subsequently, 221 single nucleotide polymorphism markers and 768 PCR-RFLP markers were used together with the 746 markers produced in our previous study for the construction of a linkage map. The map was combined further with another radish linkage map constructed mainly with expressed sequence tag-simple sequence repeat markers into a high-density integrated map of 1,166 cM with 2,553 DNA markers. A total of 1,345 scaffolds were assigned to the linkage map, spanning 116.0 Mb. Bulked PCR products amplified by 2,880 primer pairs were sequenced by NGS, and SNPs in eight inbred lines were identified. PMID:24848699

  15. TaGS5-3A, a grain size gene selected during wheat improvement for larger kernel and yield.

    PubMed

    Ma, Lin; Li, Tian; Hao, Chenyang; Wang, Yuquan; Chen, Xinhong; Zhang, Xueyong

    2016-05-01

    Grain size is a dominant component of grain weight in cereals. Earlier studies have shown that OsGS5 plays a major role in regulating both grain size and weight in rice via promotion of cell division. In this study, we isolated TaGS5 homoeologues in wheat and mapped them on chromosomes 3A, 3B and 3D. Temporal and spatial expression analysis showed that TaGS5 homoeologues were preferentially expressed in young spikes and developing grains. Two alleles of TaGS5-3A, TaGS5-3A-T and TaGS5-3A-G were identified in wheat accessions, and a functional marker was developed to discriminate them. Association analysis revealed that TaGS5-3A-T was significantly correlated with larger grain size and higher thousand kernel weight. Biochemical assays showed that TaGS5-3A-T possesses a higher enzymatic activity than TaGS5-3A-G. Transgenic rice lines overexpressing TaGS5-3A-T also exhibited larger grain size and higher thousand kernel weight than TaGS5-3A-G lines, and the transcript levels of cell cycle-related genes in TaGS5-3A-T lines were higher than those in TaGS5-3A-G lines. Furthermore, systematic evolution analysis in diploid, tetraploid and hexaploid wheat showed that TaGS5-3A underwent strong artificial selection during wheat polyploidization events and the frequency changes of two alleles demonstrated that TaGS5-3A-T was favoured in global modern wheat cultivars. These results suggest that TaGS5-3A is a positive regulator of grain size and its favoured allele TaGS5-3A-T exhibits a larger potential application in wheat high-yield breeding. © 2015 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.

  16. Generation of Recombinant Polioviruses Harboring RNA Affinity Tags in the 5′ and 3′ Noncoding Regions of Genomic RNAs

    PubMed Central

    Flather, Dylan; Cathcart, Andrea L.; Cruz, Casey; Baggs, Eric; Ngo, Tuan; Gershon, Paul D.; Semler, Bert L.

    2016-01-01

    Despite being intensely studied for more than 50 years, a complete understanding of the enterovirus replication cycle remains elusive. Specifically, only a handful of cellular proteins have been shown to be involved in the RNA replication cycle of these viruses. In an effort to isolate and identify additional cellular proteins that function in enteroviral RNA replication, we have generated multiple recombinant polioviruses containing RNA affinity tags within the 3′ or 5′ noncoding region of the genome. These recombinant viruses retained RNA affinity sequences within the genome while remaining viable and infectious over multiple passages in cell culture. Further characterization of these viruses demonstrated that viral protein production and growth kinetics were unchanged or only slightly altered relative to wild type poliovirus. However, attempts to isolate these genetically-tagged viral genomes from infected cells have been hindered by high levels of co-purification of nonspecific proteins and the limited matrix-binding efficiency of RNA affinity sequences. Regardless, these recombinant viruses represent a step toward more thorough characterization of enterovirus ribonucleoprotein complexes involved in RNA replication. PMID:26861382

  17. Expressed sequence tags from the plant trypanosomatid Phytomonas serpens.

    PubMed

    Pappas, Georgios J; Benabdellah, Karim; Zingales, Bianca; González, Antonio

    2005-08-01

    We have generated 2190 expressed sequence tags (ESTs) from a cDNA library of the plant trypanosomatid Phytomonas serpens. Upon processing and clustering the set of 1893 accepted sequences was reduced to 697 clusters consisting of 452 singletons and 245 contigs. Functional categories were assigned based on BLAST searches against a database of the eukaryotic orthologous groups of proteins (KOG). Thirty six percent of the generated sequences showed no hits against the KOG database and 39.6% presented similarity to the KOG classes corresponding to translation, ribosomal structure and biogenesis. The most populated cluster contained 45 ESTs homologous to members of the glucose transporter family. This fact can be immediately correlated to the reported Phytomonas dependence on anaerobic glycolytic ATP production due to the lack of cytochrome-mediated respiratory chain. In this context, not only a number of enzymes of the glycolytic pathway were identified but also of the Krebs cycle as well as specific components of the respiratory chain. The data here reported, including a few hundred unique sequences and the description of tandemly repeated motifs and putative transcript stability motifs at untranslated mRNA ends, represent an initial approach to overcome the lack of information on the molecular biology of this organism.

  18. Poly(A)-tag deep sequencing data processing to extract poly(A) sites.

    PubMed

    Wu, Xiaohui; Ji, Guoli; Li, Qingshun Quinn

    2015-01-01

    Polyadenylation [poly(A)] is an essential posttranscriptional processing step in the maturation of eukaryotic mRNA. The advent of next-generation sequencing (NGS) technology has offered feasible means to generate large-scale data and new opportunities for intensive study of polyadenylation, particularly deep sequencing of the transcriptome targeting the junction of 3'-UTR and the poly(A) tail of the transcript. To take advantage of this unprecedented amount of data, we present an automated workflow to identify polyadenylation sites by integrating NGS data cleaning, processing, mapping, normalizing, and clustering. In this pipeline, a series of Perl scripts are seamlessly integrated to iteratively map the single- or paired-end sequences to the reference genome. After mapping, the poly(A) tags (PATs) at the same genome coordinate are grouped into one cleavage site, and the internal priming artifacts removed. Then the ambiguous region is introduced to parse the genome annotation for cleavage site clustering. Finally, cleavage sites within a close range of 24 nucleotides and from different samples can be clustered into poly(A) clusters. This procedure could be used to identify thousands of reliable poly(A) clusters from millions of NGS sequences in different tissues or treatments.

  19. A long natural-antisense RNA is accumulated in the conidia of Aspergillus oryzae.

    PubMed

    Tsujii, Masaru; Okuda, Satoshi; Ishi, Kazutomo; Madokoro, Kana; Takeuchi, Michio; Yamagata, Youhei

    2016-01-01

    Analysis of expressed sequence tag libraries from various culture conditions revealed the existence of conidia-specific transcripts assembled to putative conidiation-specific reductase gene (csrA) in Aspergillus oryzae. However, the all transcripts were transcribed with opposite direction to the gene csrA. The sequence analysis of the transcript revealed that the RNA overlapped mRNA of csrA with 3'-end, and did not code protein longer than 60 amino acid residues. We designated the transcript Conidia Specific Long Natural-antisense RNA (CSLNR). The real-time PCR analysis demonstrated that the CSLNR is conidia-specific transcript, which cannot be transcribed in the absence of brlA, and the amount of CSLNR was much more than that of the transcript from csrA in conidia. Furthermore, the csrA deletion, also lacking coding region of CSLNR in A. oryzae reduced the number of conidia. Overexpression of CsrA demonstrated the inhibition of growth and conidiation, while CSLNR did not affect conidiation.

  20. Basic Helix-Loop-Helix Transcription Factor Gene Family Phylogenetics and Nomenclature

    PubMed Central

    Skinner, Michael K.; Rawls, Alan; Wilson-Rawls, Jeanne; Roalson, Eric H.

    2010-01-01

    A phylogenetic analysis of the basic helix-loop-helix (bHLH) gene superfamily was performed using seven different species (human, mouse, rat, worm, fly, yeast, and plant Arabidopsis) and involving over 600 bHLH genes [1]. All bHLH genes were identified in the genomes of the various species, including expressed sequence tags, and the entire coding sequence was used in the analysis. Nearly 15% of the gene family has been updated or added since the original publication. A super-tree involving six clades and all structural relationships was established and is now presented for four of the species. The wealth of functional data available for members of the bHLH gene superfamily provides us with the opportunity to use this exhaustive phylogenetic tree to predict potential functions of uncharacterized members of the family. This phylogenetic and genomic analysis of the bHLH gene family has revealed unique elements of the evolution and functional relationships of the different genes in the bHLH gene family. PMID:20219281

  1. Preparing and Analyzing Expressed Sequence Tags (ESTs) Library for the Mammary Tissue of Local Turkish Kivircik Sheep

    PubMed Central

    Omeroglu Ulu, Zehra; Ulu, Salih; Un, Cemal; Ozdem Oztabak, Kemal; Altunatmaz, Kemal

    2017-01-01

    Kivircik sheep is an important local Turkish sheep according to its meat quality and milk productivity. The aim of this study was to analyze gene expression profiles of both prenatal and postnatal stages for the Kivircik sheep. Therefore, two different cDNA libraries, which were taken from the same Kivircik sheep mammary gland tissue at prenatal and postnatal stages, were constructed. Total 3072 colonies which were randomly selected from the two libraries were sequenced for developing a sheep ESTs collection. We used Phred/Phrap computer programs for analysis of the raw EST and readable EST sequences were assembled with the CAP3 software. Putative functions of all unique sequences and statistical analysis were determined by Geneious software. Total 422 ESTs have over 80% similarity to known sequences of other organisms in NCBI classified by Panther database for the Gene Ontology (GO) category. By comparing gene expression profiles, we observed some putative genes that may be relative to reproductive performance or play important roles in milk synthesis and secretion. A total of 2414 ESTs have been deposited to the NCBI GenBank database (GW996847–GW999260). EST data in this study have provided a new source of information to functional genome studies of sheep. PMID:28239610

  2. Protein Information Resource: a community resource for expert annotation of protein data

    PubMed Central

    Barker, Winona C.; Garavelli, John S.; Hou, Zhenglin; Huang, Hongzhan; Ledley, Robert S.; McGarvey, Peter B.; Mewes, Hans-Werner; Orcutt, Bruce C.; Pfeiffer, Friedhelm; Tsugita, Akira; Vinayaka, C. R.; Xiao, Chunlin; Yeh, Lai-Su L.; Wu, Cathy

    2001-01-01

    The Protein Information Resource, in collaboration with the Munich Information Center for Protein Sequences (MIPS) and the Japan International Protein Information Database (JIPID), produces the most comprehensive and expertly annotated protein sequence database in the public domain, the PIR-International Protein Sequence Database. To provide timely and high quality annotation and promote database interoperability, the PIR-International employs rule-based and classification-driven procedures based on controlled vocabulary and standard nomenclature and includes status tags to distinguish experimentally determined from predicted protein features. The database contains about 200 000 non-redundant protein sequences, which are classified into families and superfamilies and their domains and motifs identified. Entries are extensively cross-referenced to other sequence, classification, genome, structure and activity databases. The PIR web site features search engines that use sequence similarity and database annotation to facilitate the analysis and functional identification of proteins. The PIR-Inter­national databases and search tools are accessible on the PIR web site at http://pir.georgetown.edu/ and at the MIPS web site at http://www.mips.biochem.mpg.de. The PIR-International Protein Sequence Database and other files are also available by FTP. PMID:11125041

  3. Analysis of expressed sequence tags from Uromyces appendiculatus hyphae and haustoria and their comparison to sequences from other rust fungi.

    PubMed

    Puthoff, D P; Neelam, A; Ehrenfried, M L; Scheffler, B E; Ballard, L; Song, Q; Campbell, K B; Cooper, B; Tucker, M L

    2008-10-01

    Hyphae, 2 to 8 days postinoculation (dpi), and haustoria, 5 dpi, were isolated from Uromyces appendiculatus infected bean leaves (Phaseolus vulgaris cv. Pinto 111) and a separate cDNA library prepared for each fungal preparation. Approximately 10,000 hyphae and 2,700 haustoria clones were sequenced from both the 5' and 3' ends. Assembly of all of the fungal sequences yielded 3,359 contigs and 927 singletons. The U. appendiculatus sequences were compared with sequence data for other rust fungi, Phakopsora pachyrhizi, Uromyces fabae, and Puccinia graminis. The U. appendiculatus haustoria library included a large number of genes with unknown cellular function; however, summation of sequences of known cellular function suggested that haustoria at 5 dpi had fewer transcripts linked to protein synthesis in favor of energy metabolism and nutrient uptake. In addition, open reading frames in the U. appendiculatus data set with an N-terminal signal peptide were identified and compared with other proteins putatively secreted from rust fungi. In this regard, a small family of putatively secreted RTP1-like proteins was identified in U. appendiculatus and P. graminis.

  4. An efficient tag derived from the common epitope of tospoviral NSs proteins for monitoring recombinant proteins expressed in both bacterial and plant systems.

    PubMed

    Cheng, Hao-Wen; Chen, Kuan-Chun; Raja, Joseph A J; Li, Jian-Xian; Yeh, Shyi-Dong

    2013-04-15

    NSscon (23 aa), a common epitope in the gene silencing suppressor NSs proteins of the members of the Watermelon silver mottle virus (WSMoV) serogroup, was previously identified. In this investigation, we expressed different green fluorescent protein (GFP)-fused deletions of NSscon in bacteria and reacted with NSscon monoclonal antibody (MAb). Our results indicated that the core 9 amino acids, "(109)KFTMHNQIF(117)", denoted as "nss", retain the reactivity of NSscon. In bacterial pET system, four different recombinant proteins labeled with nss, either at N- or C-extremes, were readily detectable without position effects, with sensitivity superior to that for the polyhistidine-tag. When the nss-tagged Zucchini yellow mosaic virus (ZYMV) helper component-protease (HC-Pro) and WSMoV nucleocapsid protein were transiently expressed by agroinfiltration in tobacco, they were readily detectable and the tag's possible efficacy for gene silencing suppression was not noticed. Co-immunoprecipitation of nss-tagged and non-tagged proteins expressed from bacteria confirmed the interaction of potyviral HC-Pro and coat protein. Thus, we conclude that this novel nss sequence is highly valuable for tagging recombinant proteins in both bacterial and plant expression systems. Copyright © 2013 Elsevier B.V. All rights reserved.

  5. Identification and characterization of TCRgamma and TCRdelta chains in channel catfish, Ictalurus punctatus

    USDA-ARS?s Scientific Manuscript database

    Channel catfish, Ictalurus punctatus, T cell receptors (TCR) gamma and delta were identified by mining of expressed sequence tag databases and full length sequences were obtained by 5'-RACE and RT-PCR protocols. cDNAs for each of these TCR chains encode typical variable (V), (diversity; D), joining ...

  6. Characterization of genic microsatellite markers derived from expressed sequence tags in Pacific abalone ( Haliotis discus hannai)

    NASA Astrophysics Data System (ADS)

    Li, Qi; Shu, Jing; Zhao, Cui; Liu, Shikai; Kong, Lingfeng; Zheng, Xiaodong

    2010-01-01

    Simple sequence repeat (SSR) markers were developed from the expressed sequence tags (ESTs) of Pacific abalone ( Haliotis discus hannai). Repeat motifs were found in 4.95% of the ESTs at a frequency of one repeat every 10.04 kb of EST sequences, after redundancy elimination. Seventeen polymorphic EST-SSRs were developed. The number of alleles per locus varied from 2-17, with an average of 6.8 alleles per locus. The expected and observed heterozygosities ranged from 0.159 to 0.928 and from 0.132 to 0.922, respectively. Twelve of the 17 loci (70.6%) were successfully amplified in H. diversicolor. Seventeen loci segregated in three families, with three showing the presence of null alleles (17.6%). The adequate level of variability and low frequency of null alleles observed in H. discus hannai, together with the high rate of transportability across Haliotis species, make this set of EST-SSR markers an important tool for comparative mapping, marker-assisted selection, and evolutionary studies, not only in the Pacific abalone, but also in related species.

  7. Discovery and mapping of a new expressed sequence tag-single nucleotide polymorphism and simple sequence repeat panel for large-scale genetic studies and breeding of Theobroma cacao L.

    PubMed Central

    Allegre, Mathilde; Argout, Xavier; Boccara, Michel; Fouet, Olivier; Roguet, Yolande; Bérard, Aurélie; Thévenin, Jean Marc; Chauveau, Aurélie; Rivallan, Ronan; Clement, Didier; Courtois, Brigitte; Gramacho, Karina; Boland-Augé, Anne; Tahi, Mathias; Umaharan, Pathmanathan; Brunel, Dominique; Lanaud, Claire

    2012-01-01

    Theobroma cacao is an economically important tree of several tropical countries. Its genetic improvement is essential to provide protection against major diseases and improve chocolate quality. We discovered and mapped new expressed sequence tag-single nucleotide polymorphism (EST-SNP) and simple sequence repeat (SSR) markers and constructed a high-density genetic map. By screening 149 650 ESTs, 5246 SNPs were detected in silico, of which 1536 corresponded to genes with a putative function, while 851 had a clear polymorphic pattern across a collection of genetic resources. In addition, 409 new SSR markers were detected on the Criollo genome. Lastly, 681 new EST-SNPs and 163 new SSRs were added to the pre-existing 418 co-dominant markers to construct a large consensus genetic map. This high-density map and the set of new genetic markers identified in this study are a milestone in cocoa genomics and for marker-assisted breeding. The data are available at http://tropgenedb.cirad.fr. PMID:22210604

  8. Candidate Genes Expressed in Tolerant Common Wheat With Resistant to English Grain Aphid.

    PubMed

    Luo, Kun; Zhang, Gaisheng; Wang, Chunping; Ouellet, Thérèse; Wu, Jingjing; Zhu, Qidi; Zhao, Huiyan

    2014-10-01

    The English grain aphid, Sitobion avenae (F.) (Hemiptera: Aphididae), is a common worldwide pest of wheat (Triticum aestivum L.). The use of improved resistant cultivars by the farmers is the most effective and environmentally friendly method to control this aphid in the field. The winter wheat genotypes 98-10-35 and Amigo are resistant to S. avenae. To identify genes responsible for resistance to S. avenae in these genotypes, differential-display reverse transcription-polymerase chain reaction was used to identify the corresponding differentially expressed sequences in current study. Two backcross progenies were obtained by crossing the two resistant genotypes with the susceptible genotype 1376. Six potential expected-differential bands were sequenced. Lengths of the expressed sequence tags ranged from 128 to 532 bp. Although these expressed sequences were likely associated with S. avenae resistance, there was one expressed sequence tag located on 7DL chromosome, and its potential function may associate with the ability to maintain photosynthesis in wheat. That serves as an active way for tolerant common wheat with resistant to S. avenae. Cloning the full length of these sequences would help us thoroughly understand the mechanism of wheat resistance to S. avenae and be valuable for breeding cultivars with S. avenae resistance. © 2014 Entomological Society of America.

  9. MO-G-18C-03: Evaluation of Deformable Image Registration for Lung Motion Estimation Using Hyperpolarized Gas Tagging MRI

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Huang, Q; Zhang, Y; Liu, Y

    2014-06-15

    Purpose: Hyperpolarized gas (HP) tagging MRI is a novel imaging technique for direct measurement of lung motion during breathing. This study aims to quantitatively evaluate the accuracy of deformable image registration (DIR) in lung motion estimation using HP tagging MRI as references. Methods: Three healthy subjects were imaged using the HP MR tagging, as well as a high-resolution 3D proton MR sequence (TrueFISP) at the end-of-inhalation (EOI) and the end-of-exhalation (EOE). Ground truth of lung motion and corresponding displacement vector field (tDVF) was derived from HP tagging MRI by manually tracking the displacement of tagging grids between EOI and EOE.more » Seven different DIR methods were applied to the high-resolution TrueFISP MR images (EOI and EOE) to generate the DIR-based DVFs (dDVF). The DIR methods include Velocity (VEL), MIM, Mirada, multi-grid B-spline from Elastix (MGB) and 3 other algorithms from DIRART toolbox (Double Force Demons (DFD), Improved Lucas-Kanade (ILK), and Iterative Optical Flow (IOF)). All registrations were performed by independent experts. Target registration error (TRE) was calculated as tDVF – dDVF. Analysis was performed for the entire lungs, and separately for the upper and lower lungs. Results: Significant differences between tDVF and dDVF were observed. Besides the DFD and IOF algorithms, all other dDVFs showed similarity in deformation magnitude distribution but away from the ground truth. The average TRE for entire lung ranged 2.5−23.7mm (mean=8.8mm), depending on the DIR method and subject's breathing amplitude. Larger TRE (13.3–23.7mm) was found in subject with larger breathing amplitude of 45.6mm. TRE was greater in lower lung (2.5−33.9 mm, mean=12.4mm) than that in upper lung (2.5−11.9 mm, mean=5.8mm). Conclusion: Significant differences were observed in lung motion estimation between the HP gas tagging MRI method and the DIR methods, especially when lung motion is large. Large variation among different DIR methods was also observed.« less

  10. Contig Maps and Genomic Sequencing Identify Candidate Genes in the Usher 1C Locus

    PubMed Central

    Higgins, Michael J.; Day, Colleen D.; Smilinich, Nancy J.; Ni, L.; Cooper, Paul R.; Nowak, Norma J.; Davies, Chris; de Jong, Pieter J.; Hejtmancik, Fielding; Evans, Glen A.; Smith, Richard J.H.; Shows, Thomas B.

    1998-01-01

    Usher syndrome 1C (USH1C) is a congenital condition manifesting profound hearing loss, the absence of vestibular function, and eventual retinal degeneration. The USH1C locus has been mapped genetically to a 2- to 3-cM interval in 11p14–15.1 between D11S899 and D11S861. In an effort to identify the USH1C disease gene we have isolated the region between these markers in yeast artificial chromosomes (YACs) using a combination of STS content mapping and Alu–PCR hybridization. The YAC contig is ∼3.5 Mb and has located several other loci within this interval, resulting in the order CEN-LDHA-SAA1-TPH-D11S1310-(D11S1888/KCNC1)-MYOD1-D11S902D11S921-D11S1890-TEL. Subsequent haplotyping and homozygosity analysis refined the location of the disease gene to a 400-kb interval between D11S902 and D11S1890 with all affected individuals being homozygous for the internal marker D11S921. To facilitate gene identification, the critical region has been converted into P1 artificial chromosome (PAC) clones using sequence-tagged sites (STSs) mapped to the YAC contig, Alu–PCR products generated from the YACs, and PAC end probes. A contig of >50 PAC clones has been assembled between D11S1310 and D11S1890, confirming the order of markers used in haplotyping. Three PAC clones representing nearly two-thirds of the USH1C critical region have been sequenced. PowerBLAST analysis identified six clusters of expressed sequence tags (ESTs), two known genes (BIR,SUR1) mapped previously to this region, and a previously characterized but unmapped gene NEFA (DNA binding/EF hand/acidic amino-acid-rich). GRAIL analysis identified 11 CpG islands and 73 exons of excellent quality. These data allowed the construction of a transcription map for the USH1C critical region, consisting of three known genes and six or more novel transcripts. Based on their map location, these loci represent candidate disease loci for USH1C. The NEFA gene was assessed as the USH1C locus by the sequencing of an amplified NEFA cDNA from an USH1C patient; however, no mutations were detected. [The sequence data described in this paper have been submitted to GenBank under accession numbers AC000406–AC000407.] PMID:9445488

  11. Cloning and expression of Bartonella henselae sucB gene encoding an immunogenic dihydrolipoamide succinyltransferase homologous protein.

    PubMed

    Kabeya, Hidenori; Maruyama, Soichi; Hirano, Kouji; Mikami, Takeshi

    2003-01-01

    Immunoscreening of a ZAP genomic library of Bartonella henselae strain Houston-1 expressed in Escherichia coli resulted in the isolation of a clone containing 3.5 kb BamHI genomic DNA fragment. This 3.5 kb DNA fragment was found to contain a sequence of a gene encoding a protein with significant homology to the dihydrolipoamide succinyltransferase of Brucella melitensis (sucB). Subsequent cloning and DNA sequence analysis revealed that the deduced amino acid sequence from the cloned gene showed 66.5% identity to SucB protein of B. melitensis, and 43.4 and 47.2% identities to those of Coxiella burnetii and E. coli, respectively. The gene was expressed as a His-Nus A-tagged fusion protein. The recombinant SucB protein (rSucB) was shown to be an immunoreactive protein of about 115 kDa by Western blot analysis with sera from B. henselae-immunized mice. Therefore the rSucB may be a candidate antigen for a specific serological diagnosis of B. henselae infection.

  12. Insights into the Melipona scutellaris (Hymenoptera, Apidae, Meliponini) fat body transcriptome.

    PubMed

    de Sousa, Cristina Soares; Serrão, José Eduardo; Bonetti, Ana Maria; Amaral, Isabel Marques Rodrigues; Kerr, Warwick Estevam; Maranhão, Andréa Queiroz; Ueira-Vieira, Carlos

    2013-07-01

    The insect fat body is a multifunctional organ analogous to the vertebrate liver. The fat body is involved in the metabolism of juvenile hormone, regulation of environmental stress, production of immunity regulator-like proteins in cells and protein storage. However, very little is known about the molecular mechanisms involved in fat body physiology in stingless bees. In this study, we analyzed the transcriptome of the fat body from the stingless bee Melipona scutellaris. In silico analysis of a set of cDNA library sequences yielded 1728 expressed sequence tags (ESTs) and 997 high-quality sequences that were assembled into 29 contigs and 117 singlets. The BLAST X tool showed that 86% of the ESTs shared similarity with Apis mellifera (honeybee) genes. The M. scutellaris fat body ESTs encoded proteins with roles in numerous physiological processes, including anti-oxidation, phosphorylation, metabolism, detoxification, transmembrane transport, intracellular transport, cell proliferation, protein hydrolysis and protein synthesis. This is the first report to describe a transcriptomic analysis of specific organs of M. scutellaris. Our findings provide new insights into the physiological role of the fat body in stingless bees.

  13. Insights into the Melipona scutellaris (Hymenoptera, Apidae, Meliponini) fat body transcriptome

    PubMed Central

    de Sousa, Cristina Soares; Serrão, José Eduardo; Bonetti, Ana Maria; Amaral, Isabel Marques Rodrigues; Kerr, Warwick Estevam; Maranhão, Andréa Queiroz; Ueira-Vieira, Carlos

    2013-01-01

    The insect fat body is a multifunctional organ analogous to the vertebrate liver. The fat body is involved in the metabolism of juvenile hormone, regulation of environmental stress, production of immunity regulator-like proteins in cells and protein storage. However, very little is known about the molecular mechanisms involved in fat body physiology in stingless bees. In this study, we analyzed the transcriptome of the fat body from the stingless bee Melipona scutellaris. In silico analysis of a set of cDNA library sequences yielded 1728 expressed sequence tags (ESTs) and 997 high-quality sequences that were assembled into 29 contigs and 117 singlets. The BLAST X tool showed that 86% of the ESTs shared similarity with Apis mellifera (honeybee) genes. The M. scutellaris fat body ESTs encoded proteins with roles in numerous physiological processes, including anti-oxidation, phosphorylation, metabolism, detoxification, transmembrane transport, intracellular transport, cell proliferation, protein hydrolysis and protein synthesis. This is the first report to describe a transcriptomic analysis of specific organs of M. scutellaris. Our findings provide new insights into the physiological role of the fat body in stingless bees. PMID:23885214

  14. Aligning a New Reference Genetic Map of Lupinus angustifolius with the Genome Sequence of the Model Legume, Lotus japonicus

    PubMed Central

    Nelson, Matthew N.; Moolhuijzen, Paula M.; Boersma, Jeffrey G.; Chudy, Magdalena; Lesniewska, Karolina; Bellgard, Matthew; Oliver, Richard P.; Święcicki, Wojciech; Wolko, Bogdan; Cowling, Wallace A.; Ellwood, Simon R.

    2010-01-01

    We have developed a dense reference genetic map of Lupinus angustifolius (2n = 40) based on a set of 106 publicly available recombinant inbred lines derived from a cross between domesticated and wild parental lines. The map comprised 1090 loci in 20 linkage groups and three small clusters, drawing together data from several previous mapping publications plus almost 200 new markers, of which 63 were gene-based markers. A total of 171 mainly gene-based, sequence-tagged site loci served as bridging points for comparing the Lu. angustifolius genome with the genome sequence of the model legume, Lotus japonicus via BLASTn homology searching. Comparative analysis indicated that the genomes of Lu. angustifolius and Lo. japonicus are highly diverged structurally but with significant regions of conserved synteny including the region of the Lu. angustifolius genome containing the pod-shatter resistance gene, lentus. We discuss the potential of synteny analysis for identifying candidate genes for domestication traits in Lu. angustifolius and in improving our understanding of Fabaceae genome evolution. PMID:20133394

  15. Electrostatically driven immobilization of peptides onto (Maleic anhydride-alt-methyl vinyl ether) copolymers in aqueous media.

    PubMed

    Ladavière, C; Lorenzo, C; Elaïssari, A; Mandrand, B; Delair, T

    2000-01-01

    The covalent immobilization of a model peptide onto the MAMVE copolymer, via the formation of amide bonds, occurred in moderate yields in aqueous conditions. The improvement of the grafting reaction was achieved by adding at the amino terminus of the model peptide a sequence (tag) of three positively charged amino acids, lysine or arginine, and by taking profit of electrostatic attractive interactions between the negatively charged copolymer and the tagged peptides. The arginine tag was more efficient than the lysine tag for enhancing the immobilization reaction, proving that the effect was due to an electrostic driving force. On the basis of these results, a tentative mechanism is discussed, and Scatchard plots pointed out two regimes of binding. With the first, at low polymer load (up to 50% of saturation for a lysine tag and 60-70% for an arginine tag), the binding occurred with a positive cooperative effect, the already bound peptide participating to the binding of others. A second one for higher coverages, for which the binding occurred with a negative cooperativity, and saturation was reached in the presence of a large excess of peptide.

  16. Molecular cloning and analysis of Schizosaccharomyces pombe Reb1p: sequence-specific recognition of two sites in the far upstream rDNA intergenic spacer.

    PubMed Central

    Zhao, A; Guo, A; Liu, Z; Pape, L

    1997-01-01

    The coding sequences for a Schizosaccharomyces pombe sequence-specific DNA binding protein, Reb1p, have been cloned. The predicted S. pombe Reb1p is 24-29% identical to mouse TTF-1 (transcription termination factor-1) and Saccharomyces cerevisiae REB1 protein, both of which direct termination of RNA polymerase I catalyzed transcripts. The S.pombe Reb1 cDNA encodes a predicted polypeptide of 504 amino acids with a predicted molecular weight of 58.4 kDa. The S. pombe Reb1p is unusual in that the bipartite DNA binding motif identified originally in S.cerevisiae and Klyveromyces lactis REB1 proteins is uninterrupted and thus S.pombe Reb1p may contain the smallest natural REB1 homologous DNA binding domain. Its genomic coding sequences were shown to be interrupted by two introns. A recombinant histidine-tagged Reb1 protein bearing the rDNA binding domain has two homologous, sequence-specific binding sites in the S. pomber DNA intergenic spacer, located between 289 and 480 nt downstream of the end of the approximately 25S rRNA coding sequences. Each binding site is 13-14 bp downstream of two of the three proposed in vivo termination sites. The core of this 17 bp site, AGGTAAGGGTAATGCAC, is specifically protected by Reb1p in footprinting analysis. PMID:9016645

  17. Fluorescence turn-on detection of target sequence DNA based on silicon nanodot-mediated quenching.

    PubMed

    Zhang, Yanan; Ning, Xinping; Mao, Guobin; Ji, Xinghu; He, Zhike

    2018-05-01

    We have developed a new enzyme-free method for target sequence DNA detection based on the dynamic quenching of fluorescent silicon nanodots (SiNDs) toward Cy5-tagged DNA probe. Fascinatingly, the water-soluble SiNDs can quench the fluorescence of cyanine (Cy5) in Cy5-tagged DNA probe in homogeneous solution, and the fluorescence of Cy5-tagged DNA probe can be restored in the presence of target sequence DNA (the synthetic target miRNA-27a). Based on this phenomenon, a SiND-featured fluorescent sensor has been constructed for "turn-on" detection of the synthetic target miRNA-27a for the first time. This newly developed approach possesses the merits of low cost, simple design, and convenient operation since no enzymatic reaction, toxic reagents, or separation procedures are involved. The established method achieves a detection limit of 0.16 nM, and the relative standard deviation of this method is 9% (1 nM, n = 5). The linear range is 0.5-20 nM, and the recoveries in spiked human fluids are in the range of 90-122%. This protocol provides a new tactic in the development of the nonenzymic miRNA biosensors and opens a promising avenue for early diagnosis of miRNA-associated disease. Graphical abstract The SiND-based fluorescent sensor for detection of S-miR-27a.

  18. Parallel tagged next-generation sequencing on pooled samples - a new approach for population genetics in ecology and conservation.

    PubMed

    Zavodna, Monika; Grueber, Catherine E; Gemmell, Neil J

    2013-01-01

    Next-generation sequencing (NGS) on pooled samples has already been broadly applied in human medical diagnostics and plant and animal breeding. However, thus far it has been only sparingly employed in ecology and conservation, where it may serve as a useful diagnostic tool for rapid assessment of species genetic diversity and structure at the population level. Here we undertake a comprehensive evaluation of the accuracy, practicality and limitations of parallel tagged amplicon NGS on pooled population samples for estimating species population diversity and structure. We obtained 16S and Cyt b data from 20 populations of Leiopelma hochstetteri, a frog species of conservation concern in New Zealand, using two approaches - parallel tagged NGS on pooled population samples and individual Sanger sequenced samples. Data from each approach were then used to estimate two standard population genetic parameters, nucleotide diversity (π) and population differentiation (FST), that enable population genetic inference in a species conservation context. We found a positive correlation between our two approaches for population genetic estimates, showing that the pooled population NGS approach is a reliable, rapid and appropriate method for population genetic inference in an ecological and conservation context. Our experimental design also allowed us to identify both the strengths and weaknesses of the pooled population NGS approach and outline some guidelines and suggestions that might be considered when planning future projects.

  19. Genome sequence determination and metagenomic characterization of a Dehalococcoides mixed culture grown on cis-1,2-dichloroethene.

    PubMed

    Yohda, Masafumi; Yagi, Osami; Takechi, Ayane; Kitajima, Mizuki; Matsuda, Hisashi; Miyamura, Naoaki; Aizawa, Tomoko; Nakajima, Mutsuyasu; Sunairi, Michio; Daiba, Akito; Miyajima, Takashi; Teruya, Morimi; Teruya, Kuniko; Shiroma, Akino; Shimoji, Makiko; Tamotsu, Hinako; Juan, Ayaka; Nakano, Kazuma; Aoyama, Misako; Terabayashi, Yasunobu; Satou, Kazuhito; Hirano, Takashi

    2015-07-01

    A Dehalococcoides-containing bacterial consortium that performed dechlorination of 0.20 mM cis-1,2-dichloroethene to ethene in 14 days was obtained from the sediment mud of the lotus field. To obtain detailed information of the consortium, the metagenome was analyzed using the short-read next-generation sequencer SOLiD 3. Matching the obtained sequence tags with the reference genome sequences indicated that the Dehalococcoides sp. in the consortium was highly homologous to Dehalococcoides mccartyi CBDB1 and BAV1. Sequence comparison with the reference sequence constructed from 16S rRNA gene sequences in a public database showed the presence of Sedimentibacter, Sulfurospirillum, Clostridium, Desulfovibrio, Parabacteroides, Alistipes, Eubacterium, Peptostreptococcus and Proteocatella in addition to Dehalococcoides sp. After further enrichment, the members of the consortium were narrowed down to almost three species. Finally, the full-length circular genome sequence of the Dehalococcoides sp. in the consortium, D. mccartyi IBARAKI, was determined by analyzing the metagenome with the single-molecule DNA sequencer PacBio RS. The accuracy of the sequence was confirmed by matching it to the tag sequences obtained by SOLiD 3. The genome is 1,451,062 nt and the number of CDS is 1566, which includes 3 rRNA genes and 47 tRNA genes. There exist twenty-eight RDase genes that are accompanied by the genes for anchor proteins. The genome exhibits significant sequence identity with other Dehalococcoides spp. throughout the genome, but there exists significant difference in the distribution RDase genes. The combination of a short-read next-generation DNA sequencer and a long-read single-molecule DNA sequencer gives detailed information of a bacterial consortium. Copyright © 2014 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.

  20. The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome

    PubMed Central

    Camargo, Anamaria A.; Samaia, Helena P. B.; Dias-Neto, Emmanuel; Simão, Daniel F.; Migotto, Italo A.; Briones, Marcelo R. S.; Costa, Fernando F.; Aparecida Nagai, Maria; Verjovski-Almeida, Sergio; Zago, Marco A.; Andrade, Luis Eduardo C.; Carrer, Helaine; El-Dorry, Hamza F. A.; Espreafico, Enilza M.; Habr-Gama, Angelita; Giannella-Neto, Daniel; Goldman, Gustavo H.; Gruber, Arthur; Hackel, Christine; Kimura, Edna T.; Maciel, Rui M. B.; Marie, Suely K. N.; Martins, Elizabeth A. L.; Nóbrega, Marina P.; Paçó-Larson, Maria Luisa; Pardini, Maria Inês M. C.; Pereira, Gonçalo G.; Pesquero, João Bosco; Rodrigues, Vanderlei; Rogatto, Silvia R.; da Silva, Ismael D. C. G.; Sogayar, Mari C.; Sonati, Maria de Fátima; Tajara, Eloiza H.; Valentini, Sandro R.; Alberto, Fernando L.; Amaral, Maria Elisabete J.; Aneas, Ivy; Arnaldi, Liliane A. T.; de Assis, Angela M.; Bengtson, Mário Henrique; Bergamo, Nadia Aparecida; Bombonato, Vanessa; de Camargo, Maria E. R.; Canevari, Renata A.; Carraro, Dirce M.; Cerutti, Janete M.; Corrêa, Maria Lucia C.; Corrêa, Rosana F. R.; Costa, Maria Cristina R.; Curcio, Cyntia; Hokama, Paula O. M.; Ferreira, Ari J. S.; Furuzawa, Gilberto K.; Gushiken, Tsieko; Ho, Paulo L.; Kimura, Elza; Krieger, José E.; Leite, Luciana C. C.; Majumder, Paromita; Marins, Mozart; Marques, Everaldo R.; Melo, Analy S. A.; Melo, Monica; Mestriner, Carlos Alberto; Miracca, Elisabete C.; Miranda, Daniela C.; Nascimento, Ana Lucia T. O.; Nóbrega, Francisco G.; Ojopi, Élida P. B.; Pandolfi, José Rodrigo C.; Pessoa, Luciana G.; Prevedel, Aline C.; Rahal, Paula; Rainho, Claudia A.; Reis, Eduardo M. R.; Ribeiro, Marcelo L.; da Rós, Nancy; de Sá, Renata G.; Sales, Magaly M.; Sant'anna, Simone Cristina; dos Santos, Mariana L.; da Silva, Aline M.; da Silva, Neusa P.; Silva, Wilson A.; da Silveira, Rosana A.; Sousa, Josane F.; Stecconi, Daniella; Tsukumo, Fernando; Valente, Valéria; Soares, Fernando; Moreira, Eloisa S.; Nunes, Diana N.; Correa, Ricardo G.; Zalcberg, Heloisa; Carvalho, Alex F.; Reis, Luis F. L.; Brentani, Ricardo R.; Simpson, Andrew J. G.; de Souza, Sandro J.

    2001-01-01

    Open reading frame expressed sequences tags (ORESTES) differ from conventional ESTs by providing sequence data from the central protein coding portion of transcripts. We generated a total of 696,745 ORESTES sequences from 24 human tissues and used a subset of the data that correspond to a set of 15,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most thoroughly sequenced tissue, the breast, the 130,000 ORESTES generated are derived from transcripts from an estimated 70% of all genes expressed in that tissue, with an equally efficient representation of both highly and poorly expressed genes. In this respect, we find that the capacity of the ORESTES strategy both for gene discovery and shotgun transcript sequence generation significantly exceeds that of conventional ESTs. The distribution of ORESTES is such that many human transcripts are now represented by a scaffold of partial sequences distributed along the length of each gene product. The experimental joining of the scaffold components, by reverse transcription–PCR, represents a direct route to transcript finishing that may represent a useful alternative to full-length cDNA cloning. PMID:11593022

  1. The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome.

    PubMed

    Camargo, A A; Samaia, H P; Dias-Neto, E; Simão, D F; Migotto, I A; Briones, M R; Costa, F F; Nagai, M A; Verjovski-Almeida, S; Zago, M A; Andrade, L E; Carrer, H; El-Dorry, H F; Espreafico, E M; Habr-Gama, A; Giannella-Neto, D; Goldman, G H; Gruber, A; Hackel, C; Kimura, E T; Maciel, R M; Marie, S K; Martins, E A; Nobrega, M P; Paco-Larson, M L; Pardini, M I; Pereira, G G; Pesquero, J B; Rodrigues, V; Rogatto, S R; da Silva, I D; Sogayar, M C; Sonati, M F; Tajara, E H; Valentini, S R; Alberto, F L; Amaral, M E; Aneas, I; Arnaldi, L A; de Assis, A M; Bengtson, M H; Bergamo, N A; Bombonato, V; de Camargo, M E; Canevari, R A; Carraro, D M; Cerutti, J M; Correa, M L; Correa, R F; Costa, M C; Curcio, C; Hokama, P O; Ferreira, A J; Furuzawa, G K; Gushiken, T; Ho, P L; Kimura, E; Krieger, J E; Leite, L C; Majumder, P; Marins, M; Marques, E R; Melo, A S; Melo, M B; Mestriner, C A; Miracca, E C; Miranda, D C; Nascimento, A L; Nobrega, F G; Ojopi, E P; Pandolfi, J R; Pessoa, L G; Prevedel, A C; Rahal, P; Rainho, C A; Reis, E M; Ribeiro, M L; da Ros, N; de Sa, R G; Sales, M M; Sant'anna, S C; dos Santos, M L; da Silva, A M; da Silva, N P; Silva, W A; da Silveira, R A; Sousa, J F; Stecconi, D; Tsukumo, F; Valente, V; Soares, F; Moreira, E S; Nunes, D N; Correa, R G; Zalcberg, H; Carvalho, A F; Reis, L F; Brentani, R R; Simpson, A J; de Souza, S J; Melo, M

    2001-10-09

    Open reading frame expressed sequences tags (ORESTES) differ from conventional ESTs by providing sequence data from the central protein coding portion of transcripts. We generated a total of 696,745 ORESTES sequences from 24 human tissues and used a subset of the data that correspond to a set of 15,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most thoroughly sequenced tissue, the breast, the 130,000 ORESTES generated are derived from transcripts from an estimated 70% of all genes expressed in that tissue, with an equally efficient representation of both highly and poorly expressed genes. In this respect, we find that the capacity of the ORESTES strategy both for gene discovery and shotgun transcript sequence generation significantly exceeds that of conventional ESTs. The distribution of ORESTES is such that many human transcripts are now represented by a scaffold of partial sequences distributed along the length of each gene product. The experimental joining of the scaffold components, by reverse transcription-PCR, represents a direct route to transcript finishing that may represent a useful alternative to full-length cDNA cloning.

  2. Expressed sequence tag analysis of adult human optic nerve for NEIBank: Identification of cell type and tissue markers

    PubMed Central

    Bernstein, Steven L; Guo, Yan; Peterson, Katherine; Wistow, Graeme

    2009-01-01

    Background The optic nerve is a pure white matter central nervous system (CNS) tract with an isolated blood supply, and is widely used in physiological studies of white matter response to various insults. We examined the gene expression profile of human optic nerve (ON) and, through the NEIBANK online resource, to provide a resource of sequenced verified cDNA clones. An un-normalized cDNA library was constructed from pooled human ON tissues and was used in expressed sequence tag (EST) analysis. Location of an abundant oligodendrocyte marker was examined by immunofluorescence. Quantitative real time polymerase chain reaction (qRT-PCR) and Western analysis were used to compare levels of expression for key calcium channel protein genes and protein product in primate and rodent ON. Results Our analyses revealed a profile similar in many respects to other white matter related tissues, but significantly different from previously available ON cDNA libraries. The previous libraries were found to include specific markers for other eye tissues, suggesting contamination. Immune/inflammatory markers were abundant in the new ON library. The oligodendrocyte marker QKI was abundant at the EST level. Immunofluorescence revealed that this protein is a useful oligodendrocyte cell-type marker in rodent and primate ONs. L-type calcium channel EST abundance was found to be particularly low. A qRT-PCR-based comparative mammalian species analysis reveals that L-type calcium channel expression levels are significantly lower in primate than in rodent ON, which may help account for the class-specific difference in responsiveness to calcium channel blocking agents. Several known eye disease genes are abundantly expressed in ON. Many genes associated with normal axonal function, mRNAs associated with axonal transport, inflammation and neuroprotection are observed. Conclusion We conclude that the new cDNA library is a faithful representation of human ON and EST data provide an initial overview of gene expression patterns in this tissue. The data provide clues for tissue-specific and species-specific properties of human ON that will help in design of therapeutic models. PMID:19778450

  3. Quantitative Tracking of Salmonella Enteritidis Transmission Routes Using Barcode-Tagged Isogenic Strains in Chickens: Proof-of-Concept Study

    PubMed Central

    Yang, Yichao; Ricke, Steven C.; Tellez, Guillermo; Kwon, Young Min

    2017-01-01

    Salmonella is an important foodborne bacterial pathogen, however, a fundamental understanding on Salmonella transmission routes within a poultry flock remains unclear. In this study, a series of barcode-tagged strains were constructed by inserting six random nucleotides into a functionally neutral region on the chromosome of S. Enteritidis as a tool for quantitative tracking of Salmonella transmission in chickens. Six distinct barcode-tagged strains were used for infection or contamination at either low dose (103 CFUs; three strains) or high dose (105 CFUs; three strains) in three independent experiments (Experiment 1 oral gavage; Experiment 2 contaminated feed; Experiment 3 contaminated water). For all chick experiments, cecal and foot-wash samples were collected from a subset of the chickens at days 7 or/and 14, from which genomic DNA was extracted and used to amplify the barcode regions. After the resulting PCR amplicons were pooled and analyzed by MiSeq sequencing, a total of approximately 1.5 million reads containing the barcode sequences were analyzed to determine the relative frequency of every barcode-tagged strain in each sample. In Experiment 1, the high dose of oral infection was correlated with greater dominance of the strains in the ceca of the respective seeder chickens and also in the contact chickens yet at lesser degrees. When chicks were exposed to contaminated feed (Experiment 2) or water (Experiment 3), there were no clear patterns of the barcode-tagged strains in relation to the dosage, except that the strains introduced at low dose required a longer time to colonize the ceca with contaminated feed. Most foot-wash samples contained only one to three strains for the majority of the samples, suggesting potential existence of an unknown mechanism(s) for strain exclusion. These results demonstrated the proof of concept of using barcode tagged to investigate transmission dynamics of Salmonella in chickens in a quantitative manner. PMID:28261587

  4. Highly Selective End-Tagged Antimicrobial Peptides Derived from PRELP

    PubMed Central

    Malmsten, Martin; Kasetty, Gopinath; Pasupuleti, Mukesh; Alenfall, Jan; Schmidtchen, Artur

    2011-01-01

    Background Antimicrobial peptides (AMPs) are receiving increasing attention due to resistance development against conventional antibiotics. Pseudomonas aeruginosa and Staphylococcus aureus are two major pathogens involved in an array of infections such as ocular infections, cystic fibrosis, wound and post-surgery infections, and sepsis. The goal of the study was to design novel AMPs against these pathogens. Methodology and Principal Findings Antibacterial activity was determined by radial diffusion, viable count, and minimal inhibitory concentration assays, while toxicity was evaluated by hemolysis and effects on human epithelial cells. Liposome and fluorescence studies provided mechanistic information. Protease sensitivity was evaluated after subjection to human leukocyte elastase, staphylococcal aureolysin and V8 proteinase, as well as P. aeruginosa elastase. Highly active peptides were evaluated in ex vivo skin infection models. C-terminal end-tagging by W and F amino acid residues increased antimicrobial potency of the peptide sequences GRRPRPRPRP and RRPRPRPRP, derived from proline arginine-rich and leucine-rich repeat protein (PRELP). The optimized peptides were antimicrobial against a range of Gram-positive S. aureus and Gram-negative P. aeruginosa clinical isolates, also in the presence of human plasma and blood. Simultaneously, they showed low toxicity against mammalian cells. Particularly W-tagged peptides displayed stability against P. aeruginosa elastase, and S. aureus V8 proteinase and aureolysin, and the peptide RRPRPRPRPWWWW-NH2 was effective against various “superbugs” including vancomycin-resistant enterococci, multi-drug resistant P. aeruginosa, and methicillin-resistant S. aureus, as well as demonstrated efficiency in an ex vivo skin wound model of S. aureus and P. aeruginosa infection. Conclusions/Significance Hydrophobic C-terminal end-tagging of the cationic sequence RRPRPRPRP generates highly selective AMPs with potent activity against multiresistant bacteria and efficiency in ex vivo wound infection models. A precise “tuning” of toxicity and proteolytic stability may be achieved by changing tag-length and adding W- or F-amino acid tags. PMID:21298015

  5. Analysis of expressed sequence tags of the cyclically parthenogenetic rotifer Brachionus plicatilis.

    PubMed

    Suga, Koushirou; Welch, David Mark; Tanaka, Yukari; Sakakura, Yoshitaka; Hagiwara, Atsushi

    2007-08-01

    Rotifers are among the most common non-arthropod animals and are the most experimentally tractable members of the basal assemblage of metazoan phyla known as Gnathifera. The monogonont rotifer Brachionus plicatilis is a developing model system for ecotoxicology, aquatic ecology, cryptic speciation, and the evolution of sex, and is an important food source for finfish aquaculture. However, basic knowledge of the genome and transcriptome of any rotifer species has been lacking. We generated and partially sequenced a cDNA library from B. plicatilis and constructed a database of over 2300 expressed sequence tags corresponding to more than 450 transcripts. About 20% of the transcripts had no significant similarity to database sequences by BLAST; most of these contained open reading frames of significant length but few had recognized Pfam motifs. Sixteen transcripts accounted for 25% of the ESTs; four of these had no significant similarity to BLAST or Pfam databases. Putative up- and downstream untranslated regions are relatively short and AT rich. In contrast to bdelloid rotifers, there was no evidence of a conserved trans-spliced leader sequence among the transcripts and most genes were single-copy. Despite the small size of this EST project it revealed several important features of the rotifer transcriptome and of individual monogonont genes. Because there is little genomic data for Gnathifera, the transcripts we found with no known function may represent genes that are species-, class-, phylum- or even superphylum-specific; the fact that some are among the most highly expressed indicates their importance. The absence of trans-spliced leader exons in this monogonont species contrasts with their abundance in bdelloid rotifers and indicates that the presence of this phenomenon can vary at the subphylum level. Our EST database provides a relatively large quantity of transcript-level data for B. plicatilis, and more generally of rotifers and other gnathiferan phyla, and can be browsed and searched at gmod.mbl.edu.

  6. Analysis of Expressed Sequence Tags of the Cyclically Parthenogenetic Rotifer Brachionus plicatilis

    PubMed Central

    Suga, Koushirou; Mark Welch, David; Tanaka, Yukari; Sakakura, Yoshitaka; Hagiwara, Atsushi

    2007-01-01

    Background Rotifers are among the most common non-arthropod animals and are the most experimentally tractable members of the basal assemblage of metazoan phyla known as Gnathifera. The monogonont rotifer Brachionus plicatilis is a developing model system for ecotoxicology, aquatic ecology, cryptic speciation, and the evolution of sex, and is an important food source for finfish aquaculture. However, basic knowledge of the genome and transcriptome of any rotifer species has been lacking. Methodology/Principal Findings We generated and partially sequenced a cDNA library from B. plicatilis and constructed a database of over 2300 expressed sequence tags corresponding to more than 450 transcripts. About 20% of the transcripts had no significant similarity to database sequences by BLAST; most of these contained open reading frames of significant length but few had recognized Pfam motifs. Sixteen transcripts accounted for 25% of the ESTs; four of these had no significant similarity to BLAST or Pfam databases. Putative up- and downstream untranslated regions are relatively short and AT rich. In contrast to bdelloid rotifers, there was no evidence of a conserved trans-spliced leader sequence among the transcripts and most genes were single-copy. Conclusions/Significance Despite the small size of this EST project it revealed several important features of the rotifer transcriptome and of individual monogonont genes. Because there is little genomic data for Gnathifera, the transcripts we found with no known function may represent genes that are species-, class-, phylum- or even superphylum-specific; the fact that some are among the most highly expressed indicates their importance. The absence of trans-spliced leader exons in this monogonont species contrasts with their abundance in bdelloid rotifers and indicates that the presence of this phenomenon can vary at the subphylum level. Our EST database provides a relatively large quantity of transcript-level data for B. plicatilis, and more generally of rotifers and other gnathiferan phyla, and can be browsed and searched at gmod.mbl.edu. PMID:17668053

  7. Mitochondrial tRNA cleavage by tRNA-targeting ribonuclease causes mitochondrial dysfunction observed in mitochondrial disease

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ogawa, Tetsuhiro, E-mail: atetsu@mail.ecc.u-tokyo.ac.jp; Shimizu, Ayano; Takahashi, Kazutoshi

    2014-08-15

    Highlights: • MTS-tagged ribonuclease was translocated successfully to the mitochondrial matrix. • MTS-tagged ribonuclease cleaved mt tRNA and reduced COX activity. • Easy and reproducible method of inducing mt tRNA dysfunction. - Abstract: Mitochondrial DNA (mtDNA) is a genome possessed by mitochondria. Since reactive oxygen species (ROS) are generated during aerobic respiration in mitochondria, mtDNA is commonly exposed to the risk of DNA damage. Mitochondrial disease is caused by mitochondrial dysfunction, and mutations or deletions on mitochondrial tRNA (mt tRNA) genes are often observed in mtDNA of patients with the disease. Hence, the correlation between mt tRNA activity and mitochondrialmore » dysfunction has been assessed. Then, cybrid cells, which are constructed by the fusion of an enucleated cell harboring altered mtDNA with a ρ{sup 0} cell, have long been used for the analysis due to difficulty in mtDNA manipulation. Here, we propose a new method that involves mt tRNA cleavage by a bacterial tRNA-specific ribonuclease. The ribonuclease tagged with a mitochondrial-targeting sequence (MTS) was successfully translocated to the mitochondrial matrix. Additionally, mt tRNA cleavage, which resulted in the decrease of cytochrome c oxidase (COX) activity, was observed.« less

  8. Small RNA Library Preparation Method for Next-Generation Sequencing Using Chemical Modifications to Prevent Adapter Dimer Formation.

    PubMed

    Shore, Sabrina; Henderson, Jordana M; Lebedev, Alexandre; Salcedo, Michelle P; Zon, Gerald; McCaffrey, Anton P; Paul, Natasha; Hogrefe, Richard I

    2016-01-01

    For most sample types, the automation of RNA and DNA sample preparation workflows enables high throughput next-generation sequencing (NGS) library preparation. Greater adoption of small RNA (sRNA) sequencing has been hindered by high sample input requirements and inherent ligation side products formed during library preparation. These side products, known as adapter dimer, are very similar in size to the tagged library. Most sRNA library preparation strategies thus employ a gel purification step to isolate tagged library from adapter dimer contaminants. At very low sample inputs, adapter dimer side products dominate the reaction and limit the sensitivity of this technique. Here we address the need for improved specificity of sRNA library preparation workflows with a novel library preparation approach that uses modified adapters to suppress adapter dimer formation. This workflow allows for lower sample inputs and elimination of the gel purification step, which in turn allows for an automatable sRNA library preparation protocol.

  9. Identification and characterization of 43 microsatellite markers derived from expressed sequence tags of the sea cucumber ( Apostichopus japonicus)

    NASA Astrophysics Data System (ADS)

    Jiang, Qun; Li, Qi; Yu, Hong; Kong, Lingfeng

    2011-06-01

    The sea cucumber Apostichopus japonicus is a commercially and ecologically important species in China. A total of 3056 potential unigenes were generated after assembling 7597 A. japonicus expressed sequence tags (ESTs) downloaded from Gen-Bank. Two hundred and fifty microsatellite-containing ESTs (8.18%) and 299 simple sequence repeats (SSRs) were detected. The average density of SSRs was 1 per 7.403 kb of EST after redundancy elimination. Di-nucleotide repeat motifs appeared to be the most abundant type with a percentage of 69.90%. Of the 126 primer pairs designed, 90 amplified the expected products and 43 showed polymorphism in 30 individuals tested. The number of alleles per locus ranged from 2 to 26 with an average of 7.0 alleles, and the observed and expected heterozygosities varied from 0.067 to 1.000 and from 0.066 to 0.959, respectively. These new EST-derived microsatellite markers would provide sufficient polymorphism for population genetic studies and genome mapping of this sea cucumber species.

  10. TCC: an R package for comparing tag count data with robust normalization strategies

    PubMed Central

    2013-01-01

    Background Differential expression analysis based on “next-generation” sequencing technologies is a fundamental means of studying RNA expression. We recently developed a multi-step normalization method (called TbT) for two-group RNA-seq data with replicates and demonstrated that the statistical methods available in four R packages (edgeR, DESeq, baySeq, and NBPSeq) together with TbT can produce a well-ranked gene list in which true differentially expressed genes (DEGs) are top-ranked and non-DEGs are bottom ranked. However, the advantages of the current TbT method come at the cost of a huge computation time. Moreover, the R packages did not have normalization methods based on such a multi-step strategy. Results TCC (an acronym for Tag Count Comparison) is an R package that provides a series of functions for differential expression analysis of tag count data. The package incorporates multi-step normalization methods, whose strategy is to remove potential DEGs before performing the data normalization. The normalization function based on this DEG elimination strategy (DEGES) includes (i) the original TbT method based on DEGES for two-group data with or without replicates, (ii) much faster methods for two-group data with or without replicates, and (iii) methods for multi-group comparison. TCC provides a simple unified interface to perform such analyses with combinations of functions provided by edgeR, DESeq, and baySeq. Additionally, a function for generating simulation data under various conditions and alternative DEGES procedures consisting of functions in the existing packages are provided. Bioinformatics scientists can use TCC to evaluate their methods, and biologists familiar with other R packages can easily learn what is done in TCC. Conclusion DEGES in TCC is essential for accurate normalization of tag count data, especially when up- and down-regulated DEGs in one of the samples are extremely biased in their number. TCC is useful for analyzing tag count data in various scenarios ranging from unbiased to extremely biased differential expression. TCC is available at http://www.iu.a.u-tokyo.ac.jp/~kadota/TCC/ and will appear in Bioconductor (http://bioconductor.org/) from ver. 2.13. PMID:23837715

  11. Genetic variation patterns of American chestnut populations at EST-SSRs

    Treesearch

    Oliver Gailing; C. Dana Nelson

    2017-01-01

    The objective of this study is to analyze patterns of genetic variation at genic expressed sequence tag - simple sequence repeats (EST-SSRs) and at chloroplast DNA markers in populations of American chestnut (Castanea dentata Borkh.) to assist in conservation and breeding efforts. Allelic diversity at EST-SSRs decreased significantly from southwest to northeast along...

  12. Identification of genotyping-by-sequencing sequence tags associated with milling performance and end-use quality traits in hard red spring wheat (Triticum aestivum L.)

    USDA-ARS?s Scientific Manuscript database

    Wheat quality is defined by culinary end-uses and processing characteristics. Wheat breeders are interested to identify quantitative trait loci for grain, milling, and end-use quality traits because it is imperative to understand the genetic complexity underlying quantitatively inherited traits to ...

  13. Genome Comparisons Reveal a Dominant Mechanism of Chromosome Number Reduction in Grasses and Accelerated Genome Evolution in Triticeae

    USDA-ARS?s Scientific Manuscript database

    Single nucleotide polymorphism was employed in the construction of a high-resolution, expressed sequence tag (EST) map of Aegilops tauschii, the diploid source of the wheat D genome. Comparison of the map with the rice and sorghum genome sequences revealed 50 inversions and translocations; 2, 8, and...

  14. Passive wireless tags for tongue controlled assistive technology interfaces.

    PubMed

    Rakibet, Osman O; Horne, Robert J; Kelly, Stephen W; Batchelor, John C

    2016-03-01

    Tongue control with low profile, passive mouth tags is demonstrated as a human-device interface by communicating values of tongue-tag separation over a wireless link. Confusion matrices are provided to demonstrate user accuracy in targeting by tongue position. Accuracy is found to increase dramatically after short training sequences with errors falling close to 1% in magnitude with zero missed targets. The rate at which users are able to learn accurate targeting with high accuracy indicates that this is an intuitive device to operate. The significance of the work is that innovative very unobtrusive, wireless tags can be used to provide intuitive human-computer interfaces based on low cost and disposable mouth mounted technology. With the development of an appropriate reading system, control of assistive devices such as computer mice or wheelchairs could be possible for tetraplegics and others who retain fine motor control capability of their tongues. The tags contain no battery and are intended to fit directly on the hard palate, detecting tongue position in the mouth with no need for tongue piercings.

  15. Soybean oil biosynthesis: role of diacylglycerol acyltransferases.

    PubMed

    Li, Runzhi; Hatanaka, Tomoko; Yu, Keshun; Wu, Yongmei; Fukushige, Hirotada; Hildebrand, David

    2013-03-01

    Diacylglycerol acyltransferase (DGAT) catalyzes the acyl-CoA-dependent acylation of sn-1,2-diacylglycerol to form seed oil triacylglycerol (TAG). To understand the features of genes encoding soybean (Glycine max) DGATs and possible roles in soybean seed oil synthesis and accumulation, two full-length cDNAs encoding type 1 diacylglycerol acyltransferases (GmDGAT1A and GmDGAT1B) were cloned from developing soybean seeds. These coding sequences share identities of 94 % and 95 % in protein and DNA sequences. The genomic architectures of GmDGAT1A and GmDGAT1B both contain 15 introns and 16 exons. Differences in the lengths of the first exon and most of the introns were found between GmDGAT1A and GmDGAT1B genomic sequences. Furthermore, detailed in silico analysis revealed a third predicted DGAT1, GmDGAT1C. GmDGAT1A and GmDGAT1B were found to have similar activity levels and substrate specificities. Oleoyl-CoA and sn-1,2-diacylglycerol were preferred substrates over vernoloyl-CoA and sn-1,2-divernoloylglycerol. Both transcripts are much more abundant in developing seeds than in other tissues including leaves, stem, roots, and flowers. Both soybean DGAT1A and DGAT1B are highly expressed at developing seed stages of maximal TAG accumulation with DGAT1B showing highest expression at somewhat later stages than DGAT1A. DGAT1A and DGAT1B show expression profiles consistent with important roles in soybean seed oil biosynthesis and accumulation.

  16. EST-PAC a web package for EST annotation and protein sequence prediction

    PubMed Central

    Strahm, Yvan; Powell, David; Lefèvre, Christophe

    2006-01-01

    With the decreasing cost of DNA sequencing technology and the vast diversity of biological resources, researchers increasingly face the basic challenge of annotating a larger number of expressed sequences tags (EST) from a variety of species. This typically consists of a series of repetitive tasks, which should be automated and easy to use. The results of these annotation tasks need to be stored and organized in a consistent way. All these operations should be self-installing, platform independent, easy to customize and amenable to using distributed bioinformatics resources available on the Internet. In order to address these issues, we present EST-PAC a web oriented multi-platform software package for expressed sequences tag (EST) annotation. EST-PAC provides a solution for the administration of EST and protein sequence annotations accessible through a web interface. Three aspects of EST annotation are automated: 1) searching local or remote biological databases for sequence similarities using Blast services, 2) predicting protein coding sequence from EST data and, 3) annotating predicted protein sequences with functional domain predictions. In practice, EST-PAC integrates the BLASTALL suite, EST-Scan2 and HMMER in a relational database system accessible through a simple web interface. EST-PAC also takes advantage of the relational database to allow consistent storage, powerful queries of results and, management of the annotation process. The system allows users to customize annotation strategies and provides an open-source data-management environment for research and education in bioinformatics. PMID:17147782

  17. Flexbar 3.0 - SIMD and multicore parallelization.

    PubMed

    Roehr, Johannes T; Dieterich, Christoph; Reinert, Knut

    2017-09-15

    High-throughput sequencing machines can process many samples in a single run. For Illumina systems, sequencing reads are barcoded with an additional DNA tag that is contained in the respective sequencing adapters. The recognition of barcode and adapter sequences is hence commonly needed for the analysis of next-generation sequencing data. Flexbar performs demultiplexing based on barcodes and adapter trimming for such data. The massive amounts of data generated on modern sequencing machines demand that this preprocessing is done as efficiently as possible. We present Flexbar 3.0, the successor of the popular program Flexbar. It employs now twofold parallelism: multi-threading and additionally SIMD vectorization. Both types of parallelism are used to speed-up the computation of pair-wise sequence alignments, which are used for the detection of barcodes and adapters. Furthermore, new features were included to cover a wide range of applications. We evaluated the performance of Flexbar based on a simulated sequencing dataset. Our program outcompetes other tools in terms of speed and is among the best tools in the presented quality benchmark. https://github.com/seqan/flexbar. johannes.roehr@fu-berlin.de or knut.reinert@fu-berlin.de. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  18. High production of llama variable heavy-chain antibody fragment (VHH) fused to various reader proteins by Aspergillus oryzae.

    PubMed

    Hisada, Hiromoto; Tsutsumi, Hiroko; Ishida, Hiroki; Hata, Yoji

    2013-01-01

    Llama variable heavy-chain antibody fragment (VHH) fused to four different reader proteins was produced and secreted in culture medium by Aspergillus oryzae. These fusion proteins consisted of N-terminal reader proteins, VHH, and a C-terminal his-tag sequence which facilitated purification using one-step his-tag affinity chromatography. SDS-PAGE analysis of the deglycosylated purified fusion proteins confirmed that the molecular weight of each corresponded to the expected sum of VHH and the respective reader proteins. The apparent high molecular weight reader protein glucoamylase (GlaB) was found to be suitable for efficient VHH production. The GlaB-VHH-His protein bound its antigen, human chorionic gonadotropin, and was detectable by a new ELISA-based method using a coupled assay with glucoamylase, glucose oxidase, peroxidase, maltose, and 3,3',5,5'-tetramethylbenzidine as substrates. Addition of potassium phosphate to the culture medium induced secretion of 0.61 mg GlaB-VHH-His protein/ml culture medium in 5 days.

  19. Analysis of JC virus DNA replication using a quantitative and high-throughput assay

    PubMed Central

    Shin, Jong; Phelan, Paul J.; Chhum, Panharith; Bashkenova, Nazym; Yim, Sung; Parker, Robert; Gagnon, David; Gjoerup, Ole; Archambault, Jacques; Bullock, Peter A.

    2015-01-01

    Progressive Multifocal Leukoencephalopathy (PML) is caused by lytic replication of JC virus (JCV) in specific cells of the central nervous system. Like other polyomaviruses, JCV encodes a large T-antigen helicase needed for replication of the viral DNA. Here, we report the development of a luciferase-based, quantitative and high-throughput assay of JCV DNA replication in C33A cells, which, unlike the glial cell lines Hs 683 and U87, accumulate high levels of nuclear T-ag needed for robust replication. Using this assay, we investigated the requirement for different domains of T-ag, and for specific sequences within and flanking the viral origin, in JCV DNA replication. Beyond providing validation of the assay, these studies revealed an important stimulatory role of the transcription factor NF1 in JCV DNA replication. Finally, we show that the assay can be used for inhibitor testing, highlighting its value for the identification of antiviral drugs targeting JCV DNA replication. PMID:25155200

  20. Construction and heterologous expression of a truncated Haemagglutinin (HA) protein from the avian influenza virus H5N1 in Escherichia coli.

    PubMed

    Chee Wei, T; Nurul Wahida, A G; Shaharum, S

    2014-12-01

    Malaysia first reported H5N1 poultry case in 2004 and subsequently outbreak in poultry population in 2007. Here, a recombinant gene encoding of peptide epitopes, consisting fragments of HA1, HA2 and a polybasic cleavage site of H5N1 strain Malaysia, was amplified and cloned into pET-47b(+) bacterial expression vector. DNA sequencing and alignment analysis confirmed that the gene had no alteration and in-frame to the vector. Then, His-tagged truncated HA protein was expressed in Escherichia coli BL21 (DE3) under 1 mM IPTG induction. The protein expression was optimized under a time-course induction study and further purified using Ni-NTA agarose under reducing condition. Migration size of protein was detected at 15 kDa by Western blot using anti-His tag monoclonal antibody and demonstrated no discrepancy compared to its calculated molecular weight.

  1. Generation and analysis of expressed sequence tags in the extreme large genomes Lilium and Tulipa

    PubMed Central

    2012-01-01

    Background Bulbous flowers such as lily and tulip (Liliaceae family) are monocot perennial herbs that are economically very important ornamental plants worldwide. However, there are hardly any genetic studies performed and genomic resources are lacking. To build genomic resources and develop tools to speed up the breeding in both crops, next generation sequencing was implemented. We sequenced and assembled transcriptomes of four lily and five tulip genotypes using 454 pyro-sequencing technology. Results Successfully, we developed the first set of 81,791 contigs with an average length of 514 bp for tulip, and enriched the very limited number of 3,329 available ESTs (Expressed Sequence Tags) for lily with 52,172 contigs with an average length of 555 bp. The contigs together with singletons covered on average 37% of lily and 39% of tulip estimated transcriptome. Mining lily and tulip sequence data for SSRs (Simple Sequence Repeats) showed that di-nucleotide repeats were twice more abundant in UTRs (UnTranslated Regions) compared to coding regions, while tri-nucleotide repeats were equally spread over coding and UTR regions. Two sets of single nucleotide polymorphism (SNP) markers suitable for high throughput genotyping were developed. In the first set, no SNPs flanking the target SNP (50 bp on either side) were allowed. In the second set, one SNP in the flanking regions was allowed, which resulted in a 2 to 3 fold increase in SNP marker numbers compared with the first set. Orthologous groups between the two flower bulbs: lily and tulip (12,017 groups) and among the three monocot species: lily, tulip, and rice (6,900 groups) were determined using OrthoMCL. Orthologous groups were screened for common SNP markers and EST-SSRs to study synteny between lily and tulip, which resulted in 113 common SNP markers and 292 common EST-SSR. Lily and tulip contigs generated were annotated and described according to Gene Ontology terminology. Conclusions Two transcriptome sets were built that are valuable resources for marker development, comparative genomic studies and candidate gene approaches. Next generation sequencing of leaf transcriptome is very effective; however, deeper sequencing and using more tissues and stages is advisable for extended comparative studies. PMID:23167289

  2. Complementary DNA cloning, sequence analysis, and tissue transcription profile of a novel U2AF2 gene from the Chinese Banna mini-pig inbred line.

    PubMed

    Wang, S Y; Huo, J L; Miao, Y W; Cheng, W M; Zeng, Y Z

    2013-04-02

    U2 small nuclear RNA auxiliary factor 2 (U2AF2) is an important gene for pre-messenger RNA splicing in higher eukaryotes. In this study, the Banna mini-pig inbred line (BMI) U2AF2 coding sequence (CDS) was cloned, sequenced, and characterized. The U2AF2 complete CDS was amplified using the reverse transcription-polymerase chain reaction (RT-PCR) technique based on the conserved sequence information of cattle and known highly homologous swine expressed sequence tags. This novel gene was deposited into the National Center for Biotechnology Information database (Accession No. JQ839267). Sequence analysis revealed that the BMI U2AF2 coding sequence consisted of 1416 bp and encoded 471 amino acids with a molecular weight of 53.12 kDa. The protein sequence has high sequence homology with U2AF65 of 6 species - Homo sapiens (100%), Equus caballus (100%), Canis lupus (100%), Macaca mulatta (99.8%), Bos taurus (74.4%), and Mus musculus (74.4%). The phylogenetic tree analysis revealed that BMI U2AF65 has a closer genetic relationship with B. taurus U2AF65 than with U2AF65 of E. caballus, C. lupus, M. mulatta, H. sapiens, and M. musculus. RT-PCR analysis showed that BMI U2AF2 was most highly expressed in the brain; moderately expressed in the spleen, lung, muscle, and skin; and weakly expressed in the liver, kidney, and ovary. Its expression was nearly silent in the spinal cord, nerve fiber, heart, stomach, pancreas, and intestine. Three microRNA target sites were predicted in the CDS of BMI U2AF2 messenger RNA. Our results establish a foundation for further insight into this swine gene.

  3. Distinguishing aspartic and isoaspartic acids in peptides by several mass spectrometric fragmentation methods

    PubMed Central

    DeGraan-Weber, Nick; Zhang, Jun; Reilly, James P.

    2016-01-01

    Six ion fragmentation techniques that can distinguish aspartic acid from its isomer, isoaspartic acid, were compared. MALDI post source decay (PSD), MALDI 157 nm photodissociation, TMPP charge tagging in PSD and photodissociation, ESI collision-induced dissociation (CID), electron transfer dissociation (ETD), and free-radical initiated peptide sequencing (FRIPS) with CID were applied to peptides containing either aspartic or isoaspartic acid. Diagnostic ions, such as the y-46 and b+H2O, are present in PSD, photodissociation, and charge tagging. c•+57 and z-57 ions are observed in ETD and FRIPS experiments. For some molecules, aspartic and isoaspartic acid yield ion fragments with significantly different intensities. ETD and charge tagging appear to be most effective at distinguishing these residues. PMID:27613306

  4. Genetic diversity analysis in Malaysian giant prawns using expressed sequence tag microsatellite markers for stock improvement program.

    PubMed

    Atin, K H; Christianus, A; Fatin, N; Lutas, A C; Shabanimofrad, M; Subha, B

    2017-08-17

    The Malaysian giant prawn is among the most commonly cultured species of the genus Macrobrachium. Stocks of giant prawns from four rivers in Peninsular Malaysia have been used for aquaculture over the past 25 years, which has led to repeated harvesting, restocking, and transplantation between rivers. Consequently, a stock improvement program is now important to avoid the depletion of wild stocks and the loss of genetic diversity. However, the success of such an improvement program depends on our knowledge of the genetic variation of these base populations. The aim of the current study was to estimate genetic variation and differentiation of these riverine sources using novel expressed sequence tag-microsatellite (EST-SSR) markers, which not only are informative on genetic diversity but also provide information on immune and metabolic traits. Our findings indicated that the tested stocks have inbreeding depression due to a significant deficiency in heterozygotes, and F IS was estimated as 0.15538 to 0.31938. An F-statistics analysis suggested that the stocks are composed of one large panmictic population. Among the four locations, stocks from Johor, in the southern region of the peninsular, showed higher allelic and genetic diversity than the other stocks. To overcome inbreeding problems, the Johor population could be used as a base population in a stock improvement program by crossing to the other populations. The study demonstrated that EST-SSR markers can be incorporated in future marker assisted breeding to aid the proper management of the stocks by breeders and stakeholders in Malaysia.

  5. Neural net controlled tag gas sampling system for nuclear reactors

    DOEpatents

    Gross, Kenneth C.; Laug, Matthew T.; Lambert, John D. B.; Herzog, James P.

    1997-01-01

    A method and system for providing a tag gas identifier to a nuclear fuel rod and analyze escaped tag gas to identify a particular failed nuclear fuel rod. The method and system include disposing a unique tag gas composition into a plenum of a nuclear fuel rod, monitoring gamma ray activity, analyzing gamma ray signals to assess whether a nuclear fuel rod has failed and is emitting tag gas, activating a tag gas sampling and analysis system upon sensing tag gas emission from a failed nuclear rod and evaluating the escaped tag gas to identify the particular failed nuclear fuel rod.

  6. Venom proteomic and venomous glands transcriptomic analysis of the Egyptian scorpion Scorpio maurus palmatus (Arachnida: Scorpionidae).

    PubMed

    Abdel-Rahman, Mohamed A; Quintero-Hernandez, Veronica; Possani, Lourival D

    2013-11-01

    Proteomic analysis of the scorpion venom Scorpio maurus palmatus was performed using reverse-phase HPLC separation followed by mass spectrometry determination. Sixty five components were identified with molecular masses varying from 413 to 14,009 Da. The high percentage of peptides (41.5%) was from 3 to 5 KDa which may represent linear antimicrobial peptides and KScTxs. Also, 155 expressed sequence tags (ESTs) were analyzed through construction the cDNA library prepared from a pair of venomous gland. About 77% of the ESTs correspond to toxin-like peptides and proteins with definite open reading frames. The cDNA sequencing results also show the presence of sequences whose putative products have sequence similarity with antimicrobial peptides (24%), insecticidal toxins, β-NaScTxs, κ-KScTxs, α-KScTxs, calcines and La1-like peptides. Also, we have obtained 23 atypical types of venom molecules not recorded in other scorpion species. Moreover, 9% of the total ESTs revealed significant similarities with proteins involved in the cellular processes of these scorpion venomous glands. This is the first set of molecular masses and transcripts described from this species, in which various venom molecules have been identified. They belong to either known or unassigned types of scorpion venom peptides and proteins, and provide valuable information for evolutionary analysis and venomics. Copyright © 2013 Elsevier Ltd. All rights reserved.

  7. Tandem MS Analysis of Selenamide-Derivatized Peptide Ions

    NASA Astrophysics Data System (ADS)

    Zhang, Yun; Zhang, Hao; Cui, Weidong; Chen, Hao

    2011-09-01

    Our previous study showed that selenamide reagents such as ebselen and N-(phenylseleno)phthalimide (NPSP) can be used for selective and rapid derivatization of protein/peptide thiols in high conversion yield. This paper reports the systematic investigation of MS/MS dissociation behaviors of selenamide-derivatized peptide ions upon collision induced dissociation (CID) and electron transfer dissociation (ETD). In the positive ion mode, derivatized peptide ions exhibit tag-dependent CID dissociation pathways. For instance, ebselen-derivatized peptide ions preferentially undergo Se-S bond cleavage upon CID to produce a characteristic fragment ion, the protonated ebselen ( m/z 276), which allows selective identification of thiol peptides from protein digest as well as selective detection of thiol proteins from protein mixture using precursor ion scan (PIS). In contrast, NPSP-derivatized peptide ions retain their phenylselenenyl tags during CID, which is useful in sequencing peptides and locating cysteine residues. In the negative ion CID mode, both types of tags are preferentially lost via the Se-S cleavage, analogous to the S-S bond cleavage during CID of disulfide-containing peptide anions. In consideration of the convenience in preparing selenamide-derivatized peptides and the similarity of Se-S of the tag to the S-S bond, we also examined ETD of the derivatized peptide ions to probe the mechanism for electron-based ion dissociation. Interestingly, facile cleavage of Se-S bond occurs to the peptide ions carrying either protons or alkali metal ions, while backbone cleavage to form c/z ions is severely inhibited. These results are in agreement with the Utah-Washington mechanism proposed for depicting electron-based ion dissociation processes.

  8. A resource of large-scale molecular markers for monitoring Agropyron cristatum chromatin introgression in wheat background based on transcriptome sequences.

    PubMed

    Zhang, Jinpeng; Liu, Weihua; Lu, Yuqing; Liu, Qunxing; Yang, Xinming; Li, Xiuquan; Li, Lihui

    2017-09-20

    Agropyron cristatum is a wild grass of the tribe Triticeae and serves as a gene donor for wheat improvement. However, very few markers can be used to monitor A. cristatum chromatin introgressions in wheat. Here, we reported a resource of large-scale molecular markers for tracking alien introgressions in wheat based on transcriptome sequences. By aligning A. cristatum unigenes with the Chinese Spring reference genome sequences, we designed 9602 A. cristatum expressed sequence tag-sequence-tagged site (EST-STS) markers for PCR amplification and experimental screening. As a result, 6063 polymorphic EST-STS markers were specific for the A. cristatum P genome in the single-receipt wheat background. A total of 4956 randomly selected polymorphic EST-STS markers were further tested in eight wheat variety backgrounds, and 3070 markers displaying stable and polymorphic amplification were validated. These markers covered more than 98% of the A. cristatum genome, and the marker distribution density was approximately 1.28 cM. An application case of all EST-STS markers was validated on the A. cristatum 6 P chromosome. These markers were successfully applied in the tracking of alien A. cristatum chromatin. Altogether, this study provided a universal method of large-scale molecular marker development to monitor wild relative chromatin in wheat.

  9. In vivo system for analyzing the function of the PsbP protein using Chlamydomonas reinhardtii.

    PubMed

    Nishimura, Taishi; Sato, Fumihiko; Ifuku, Kentaro

    2017-09-01

    The PsbP protein is an extrinsic subunit of photosystem II (PSII) specifically developed in green-plant species including land plants and green algae. The protein-protein interactions involving PsbP and its effect on oxygen evolution have been investigated in vitro using isolated PSII membranes. However, the importance of those interactions needs to be examined at the cellular level. To this end, we developed a system expressing exogenous PsbP in the background of the Chlamydomonas BF25 mutant lacking native PsbP. Expression of His-tagged PsbP successfully restored the oxygen-evolving activity and photoautotrophic growth of the mutant, while PsbP-∆15 lacking the N-terminal 15 residues, which are crucial for the oxygen-evolving activity of spinach PSII in vitro, only partially did. This demonstrated the importance of N-terminal sequence of PsbP for the photosynthetic activity in vivo. Furthermore, the PSII-LHCII supercomplex can be specifically purified from the Chlamydomonas cells having His-tagged PsbP using a metal affinity chromatography. This study provides a platform not only for the functional analysis of PsbP in vivo but also for structural analysis of the PSII-LHCII supercomplex from green algae.

  10. Identification of the maize gravitropism gene lazy plant1 by a transposon-tagging genome resequencing strategy.

    PubMed

    Howard, Thomas P; Hayward, Andrew P; Tordillos, Anthony; Fragoso, Christopher; Moreno, Maria A; Tohme, Joe; Kausch, Albert P; Mottinger, John P; Dellaporta, Stephen L

    2014-01-01

    Since their initial discovery, transposons have been widely used as mutagens for forward and reverse genetic screens in a range of organisms. The problems of high copy number and sequence divergence among related transposons have often limited the efficiency at which tagged genes can be identified. A method was developed to identity the locations of Mutator (Mu) transposons in the Zea mays genome using a simple enrichment method combined with genome resequencing to identify transposon junction fragments. The sequencing library was prepared from genomic DNA by digesting with a restriction enzyme that cuts within a perfectly conserved motif of the Mu terminal inverted repeats (TIR). Paired-end reads containing Mu TIR sequences were computationally identified and chromosomal sequences flanking the transposon were mapped to the maize reference genome. This method has been used to identify Mu insertions in a number of alleles and to isolate the previously unidentified lazy plant1 (la1) gene. The la1 gene is required for the negatively gravitropic response of shoots and mutant plants lack the ability to sense gravity. Using bioinformatic and fluorescence microscopy approaches, we show that the la1 gene encodes a cell membrane and nuclear localized protein. Our Mu-Taq method is readily adaptable to identify the genomic locations of any insertion of a known sequence in any organism using any sequencing platform.

  11. Identification of the Maize Gravitropism Gene lazy plant1 by a Transposon-Tagging Genome Resequencing Strategy

    PubMed Central

    Howard, Thomas P.; Hayward, Andrew P.; Tordillos, Anthony; Fragoso, Christopher; Moreno, Maria A.; Tohme, Joe; Kausch, Albert P.; Mottinger, John P.; Dellaporta, Stephen L.

    2014-01-01

    Since their initial discovery, transposons have been widely used as mutagens for forward and reverse genetic screens in a range of organisms. The problems of high copy number and sequence divergence among related transposons have often limited the efficiency at which tagged genes can be identified. A method was developed to identity the locations of Mutator (Mu) transposons in the Zea mays genome using a simple enrichment method combined with genome resequencing to identify transposon junction fragments. The sequencing library was prepared from genomic DNA by digesting with a restriction enzyme that cuts within a perfectly conserved motif of the Mu terminal inverted repeats (TIR). Paired-end reads containing Mu TIR sequences were computationally identified and chromosomal sequences flanking the transposon were mapped to the maize reference genome. This method has been used to identify Mu insertions in a number of alleles and to isolate the previously unidentified lazy plant1 (la1) gene. The la1 gene is required for the negatively gravitropic response of shoots and mutant plants lack the ability to sense gravity. Using bioinformatic and fluorescence microscopy approaches, we show that the la1 gene encodes a cell membrane and nuclear localized protein. Our Mu-Taq method is readily adaptable to identify the genomic locations of any insertion of a known sequence in any organism using any sequencing platform. PMID:24498020

  12. Needles in the EST Haystack: Large-Scale Identification and Analysis of Excretory-Secretory (ES) Proteins in Parasitic Nematodes Using Expressed Sequence Tags (ESTs)

    PubMed Central

    Nagaraj, Shivashankar H.; Gasser, Robin B.; Ranganathan, Shoba

    2008-01-01

    Background Parasitic nematodes of humans, other animals and plants continue to impose a significant public health and economic burden worldwide, due to the diseases they cause. Promising antiparasitic drug and vaccine candidates have been discovered from excreted or secreted (ES) proteins released from the parasite and exposed to the immune system of the host. Mining the entire expressed sequence tag (EST) data available from parasitic nematodes represents an approach to discover such ES targets. Methods and Findings In this study, we predicted, using EST2Secretome, a novel, high-throughput, computational workflow system, 4,710 ES proteins from 452,134 ESTs derived from 39 different species of nematodes, parasitic in animals (including humans) or plants. In total, 2,632, 786, and 1,292 ES proteins were predicted for animal-, human-, and plant-parasitic nematodes. Subsequently, we systematically analysed ES proteins using computational methods. Of these 4,710 proteins, 2,490 (52.8%) had orthologues in Caenorhabditis elegans, whereas 621 (13.8%) appeared to be novel, currently having no significant match to any molecule available in public databases. Of the C. elegans homologues, 267 had strong “loss-of-function” phenotypes by RNA interference (RNAi) in this nematode. We could functionally classify 1,948 (41.3%) sequences using the Gene Ontology (GO) terms, establish pathway associations for 573 (12.2%) sequences using Kyoto Encyclopaedia of Genes and Genomes (KEGG), and identify protein interaction partners for 1,774 (37.6%) molecules. We also mapped 758 (16.1%) proteins to protein domains including the nematode-specific protein family “transthyretin-like” and “chromadorea ALT,” considered as vaccine candidates against filariasis in humans. Conclusions We report the large-scale analysis of ES proteins inferred from EST data for a range of parasitic nematodes. This set of ES proteins provides an inventory of known and novel members of ES proteins as a foundation for studies focused on understanding the biology of parasitic nematodes and their interactions with their hosts, as well as for the development of novel drugs or vaccines for parasite intervention and control. PMID:18820748

  13. High-resolution phylogenetic microbial community profiling

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Singer, Esther; Coleman-Derr, Devin; Bowman, Brett

    2014-03-17

    The representation of bacterial and archaeal genome sequences is strongly biased towards cultivated organisms, which belong to merely four phylogenetic groups. Functional information and inter-phylum level relationships are still largely underexplored for candidate phyla, which are often referred to as microbial dark matter. Furthermore, a large portion of the 16S rRNA gene records in the GenBank database are labeled as environmental samples and unclassified, which is in part due to low read accuracy, potential chimeric sequences produced during PCR amplifications and the low resolution of short amplicons. In order to improve the phylogenetic classification of novel species and advance ourmore » knowledge of the ecosystem function of uncultivated microorganisms, high-throughput full length 16S rRNA gene sequencing methodologies with reduced biases are needed. We evaluated the performance of PacBio single-molecule real-time (SMRT) sequencing in high-resolution phylogenetic microbial community profiling. For this purpose, we compared PacBio and Illumina metagenomic shotgun and 16S rRNA gene sequencing of a mock community as well as of an environmental sample from Sakinaw Lake, British Columbia. Sakinaw Lake is known to contain a large age of microbial species from candidate phyla. Sequencing results show that community structure based on PacBio shotgun and 16S rRNA gene sequences is highly similar in both the mock and the environmental communities. Resolution power and community representation accuracy from SMRT sequencing data appeared to be independent of GC content of microbial genomes and was higher when compared to Illumina-based metagenome shotgun and 16S rRNA gene (iTag) sequences, e.g. full-length sequencing resolved all 23 OTUs in the mock community, while iTags did not resolve closely related species. SMRT sequencing hence offers various potential benefits when characterizing uncharted microbial communities.« less

  14. Trans-dimensional MCMC methods for fully automatic motion analysis in tagged MRI.

    PubMed

    Smal, Ihor; Carranza-Herrezuelo, Noemí; Klein, Stefan; Niessen, Wiro; Meijering, Erik

    2011-01-01

    Tagged magnetic resonance imaging (tMRI) is a well-known noninvasive method allowing quantitative analysis of regional heart dynamics. Its clinical use has so far been limited, in part due to the lack of robustness and accuracy of existing tag tracking algorithms in dealing with low (and intrinsically time-varying) image quality. In this paper, we propose a novel probabilistic method for tag tracking, implemented by means of Bayesian particle filtering and a trans-dimensional Markov chain Monte Carlo (MCMC) approach, which efficiently combines information about the imaging process and tag appearance with prior knowledge about the heart dynamics obtained by means of non-rigid image registration. Experiments using synthetic image data (with ground truth) and real data (with expert manual annotation) from preclinical (small animal) and clinical (human) studies confirm that the proposed method yields higher consistency, accuracy, and intrinsic tag reliability assessment in comparison with other frequently used tag tracking methods.

  15. A High-Throughput Data Mining of Single Nucleotide Polymorphisms in Coffea Species Expressed Sequence Tags Suggests Differential Homeologous Gene Expression in the Allotetraploid Coffea arabica1[W

    PubMed Central

    Vidal, Ramon Oliveira; Mondego, Jorge Maurício Costa; Pot, David; Ambrósio, Alinne Batista; Andrade, Alan Carvalho; Pereira, Luiz Filipe Protasio; Colombo, Carlos Augusto; Vieira, Luiz Gonzaga Esteves; Carazzolle, Marcelo Falsarella; Pereira, Gonçalo Amarante Guimarães

    2010-01-01

    Polyploidization constitutes a common mode of evolution in flowering plants. This event provides the raw material for the divergence of function in homeologous genes, leading to phenotypic novelty that can contribute to the success of polyploids in nature or their selection for use in agriculture. Mounting evidence underlined the existence of homeologous expression biases in polyploid genomes; however, strategies to analyze such transcriptome regulation remained scarce. Important factors regarding homeologous expression biases remain to be explored, such as whether this phenomenon influences specific genes, how paralogs are affected by genome doubling, and what is the importance of the variability of homeologous expression bias to genotype differences. This study reports the expressed sequence tag assembly of the allopolyploid Coffea arabica and one of its direct ancestors, Coffea canephora. The assembly was used for the discovery of single nucleotide polymorphisms through the identification of high-quality discrepancies in overlapped expressed sequence tags and for gene expression information indirectly estimated by the transcript redundancy. Sequence diversity profiles were evaluated within C. arabica (Ca) and C. canephora (Cc) and used to deduce the transcript contribution of the Coffea eugenioides (Ce) ancestor. The assignment of the C. arabica haplotypes to the C. canephora (CaCc) or C. eugenioides (CaCe) ancestral genomes allowed us to analyze gene expression contributions of each subgenome in C. arabica. In silico data were validated by the quantitative polymerase chain reaction and allele-specific combination TaqMAMA-based method. The presence of differential expression of C. arabica homeologous genes and its implications in coffee gene expression, ontology, and physiology are discussed. PMID:20864545

  16. Genome-Scale Transcriptome Analysis in Response to Nitric Oxide in Birch Cells: Implications of the Triterpene Biosynthetic Pathway

    PubMed Central

    Zeng, Fansuo; Sun, Fengkun; Li, Leilei; Liu, Kun; Zhan, Yaguang

    2014-01-01

    Evidence supporting nitric oxide (NO) as a mediator of plant biochemistry continues to grow, but its functions at the molecular level remains poorly understood and, in some cases, controversial. To study the role of NO at the transcriptional level in Betula platyphylla cells, we conducted a genome-scale transcriptome analysis of these cells. The transcriptome of untreated birch cells and those treated by sodium nitroprusside (SNP) were analyzed using the Solexa sequencing. Data were collected by sequencing cDNA libraries of birch cells, which had a long period to adapt to the suspension culture conditions before SNP-treated cells and untreated cells were sampled. Among the 34,100 UniGenes detected, BLASTX search revealed that 20,631 genes showed significant (E-values≤10−5) sequence similarity with proteins from the NR-database. Numerous expressed sequence tags (i.e., 1374) were identified as differentially expressed between the 12 h SNP-treated cells and control cells samples: 403 up-regulated and 971 down-regulated. From this, we specifically examined a core set of NO-related transcripts. The altered expression levels of several transcripts, as determined by transcriptome analysis, was confirmed by qRT-PCR. The results of transcriptome analysis, gene expression quantification, the content of triterpenoid and activities of defensive enzymes elucidated NO has a significant effect on many processes including triterpenoid production, carbohydrate metabolism and cell wall biosynthesis. PMID:25551661

  17. The transcriptome of Lutzomyia longipalpis (Diptera: Psychodidae) male reproductive organs.

    PubMed

    Azevedo, Renata V D M; Dias, Denise B S; Bretãs, Jorge A C; Mazzoni, Camila J; Souza, Nataly A; Albano, Rodolpho M; Wagner, Glauber; Davila, Alberto M R; Peixoto, Alexandre A

    2012-01-01

    It has been suggested that genes involved in the reproductive biology of insect disease vectors are potential targets for future alternative methods of control. Little is known about the molecular biology of reproduction in phlebotomine sand flies and there is no information available concerning genes that are expressed in male reproductive organs of Lutzomyia longipalpis, the main vector of American visceral leishmaniasis and a species complex. We generated 2678 high quality ESTs ("Expressed Sequence Tags") of L. longipalpis male reproductive organs that were grouped in 1391 non-redundant sequences (1136 singlets and 255 clusters). BLAST analysis revealed that only 57% of these sequences share similarity with a L. longipalpis female EST database. Although no more than 36% of the non-redundant sequences showed similarity to protein sequences deposited in databases, more than half of them presented the best-match hits with mosquito genes. Gene ontology analysis identified subsets of genes involved in biological processes such as protein biosynthesis and DNA replication, which are probably associated with spermatogenesis. A number of non-redundant sequences were also identified as putative male reproductive gland proteins (mRGPs), also known as male accessory gland protein genes (Acps). The transcriptome analysis of L. longipalpis male reproductive organs is one step further in the study of the molecular basis of the reproductive biology of this important species complex. It has allowed the identification of genes potentially involved in spermatogenesis as well as putative mRGPs sequences, which have been studied in many insect species because of their effects on female post-mating behavior and physiology and their potential role in sexual selection and speciation. These data open a number of new avenues for further research in the molecular and evolutionary reproductive biology of sand flies.

  18. Re-Ranking Sequencing Variants in the Post-GWAS Era for Accurate Causal Variant Identification

    PubMed Central

    Faye, Laura L.; Machiela, Mitchell J.; Kraft, Peter; Bull, Shelley B.; Sun, Lei

    2013-01-01

    Next generation sequencing has dramatically increased our ability to localize disease-causing variants by providing base-pair level information at costs increasingly feasible for the large sample sizes required to detect complex-trait associations. Yet, identification of causal variants within an established region of association remains a challenge. Counter-intuitively, certain factors that increase power to detect an associated region can decrease power to localize the causal variant. First, combining GWAS with imputation or low coverage sequencing to achieve the large sample sizes required for high power can have the unintended effect of producing differential genotyping error among SNPs. This tends to bias the relative evidence for association toward better genotyped SNPs. Second, re-use of GWAS data for fine-mapping exploits previous findings to ensure genome-wide significance in GWAS-associated regions. However, using GWAS findings to inform fine-mapping analysis can bias evidence away from the causal SNP toward the tag SNP and SNPs in high LD with the tag. Together these factors can reduce power to localize the causal SNP by more than half. Other strategies commonly employed to increase power to detect association, namely increasing sample size and using higher density genotyping arrays, can, in certain common scenarios, actually exacerbate these effects and further decrease power to localize causal variants. We develop a re-ranking procedure that accounts for these adverse effects and substantially improves the accuracy of causal SNP identification, often doubling the probability that the causal SNP is top-ranked. Application to the NCI BPC3 aggressive prostate cancer GWAS with imputation meta-analysis identified a new top SNP at 2 of 3 associated loci and several additional possible causal SNPs at these loci that may have otherwise been overlooked. This method is simple to implement using R scripts provided on the author's website. PMID:23950724

  19. Fluorescence Recovery After Photobleaching Analysis of the Diffusional Mobility of Plasma Membrane Proteins: HER3 Mobility in Breast Cancer Cell Membranes.

    PubMed

    Sarkar, Mitul; Koland, John G

    2016-01-01

    The fluorescence recovery after photobleaching (FRAP) method is a straightforward means of assessing the diffusional mobility of membrane-associated proteins that is readily performed with current confocal microscopy instrumentation. We describe here the specific application of the FRAP method in characterizing the lateral diffusion of genetically encoded green fluorescence protein (GFP)-tagged plasma membrane receptor proteins. The method is exemplified in an examination of whether the previously observed segregation of the mammalian HER3 receptor protein in discrete plasma membrane microdomains results from its physical interaction with cellular entities that restrict its mobility. Our FRAP measurements of the diffusional mobility of GFP-tagged HER3 reporters expressed in MCF7 cultured breast cancer cells showed that despite the observed segregation of HER3 receptors within plasma membrane microdomains their diffusion on the macroscopic scale is not spatially restricted. Thus, in FRAP analyses of various HER3 reporters a near-complete recovery of fluorescence after photobleaching was observed, indicating that HER3 receptors are not immobilized by long-lived physical interactions with intracellular species. An examination of HER3 proteins with varying intracellular domain sequence truncations also indicated that a proposed formation of oligomeric HER3 networks, mediated by physical interactions involving specific HER3 intracellular domain sequences, either does not occur or does not significantly reduce HER3 mobility on the macroscopic scale.

  20. Leaf margin phenotype-specific restriction-site-associated DNA-derived markers for pineapple (Ananas comosus L.)

    PubMed Central

    Urasaki, Naoya; Goeku, Satoko; Kaneshima, Risa; Takamine, Tomonori; Tarora, Kazuhiko; Takeuchi, Makoto; Moromizato, Chie; Yonamine, Kaname; Hosaka, Fumiko; Terakami, Shingo; Matsumura, Hideo; Yamamoto, Toshiya; Shoda, Moriyuki

    2015-01-01

    To explore genome-wide DNA polymorphisms and identify DNA markers for leaf margin phenotypes, a restriction-site-associated DNA sequencing analysis was employed to analyze three bulked DNAs of F1 progeny from a cross between a ‘piping-leaf-type’ cultivar, ‘Yugafu’, and a ‘spiny-tip-leaf-type’ variety, ‘Yonekura’. The parents were both Ananas comosus var. comosus. From the analysis, piping-leaf and spiny-tip-leaf gene-specific restriction-site-associated DNA sequencing tags were obtained and designated as PLSTs and STLSTs, respectively. The five PLSTs and two STSLTs were successfully converted to cleaved amplified polymorphic sequence (CAPS) or simple sequence repeat (SSR) markers using the sequence differences between alleles. Based on the genotyping of the F1 with two SSR and three CAPS markers, the five PLST markers were mapped in the vicinity of the P locus, with the closest marker, PLST1_SSR, being located 1.5 cM from the P locus. The two CAPS markers from STLST1 and STLST3 perfectly assessed the ‘spiny-leaf type’ as homozygotes of the recessive s allele of the S gene. The recombination value between the S locus and STLST loci was 2.4, and STLSTs were located 2.2 cM from the S locus. SSR and CAPS markers are applicable to marker-assisted selection of leaf margin phenotypes in pineapple breeding. PMID:26175625

  1. Leaf margin phenotype-specific restriction-site-associated DNA-derived markers for pineapple (Ananas comosus L.).

    PubMed

    Urasaki, Naoya; Goeku, Satoko; Kaneshima, Risa; Takamine, Tomonori; Tarora, Kazuhiko; Takeuchi, Makoto; Moromizato, Chie; Yonamine, Kaname; Hosaka, Fumiko; Terakami, Shingo; Matsumura, Hideo; Yamamoto, Toshiya; Shoda, Moriyuki

    2015-06-01

    To explore genome-wide DNA polymorphisms and identify DNA markers for leaf margin phenotypes, a restriction-site-associated DNA sequencing analysis was employed to analyze three bulked DNAs of F1 progeny from a cross between a 'piping-leaf-type' cultivar, 'Yugafu', and a 'spiny-tip-leaf-type' variety, 'Yonekura'. The parents were both Ananas comosus var. comosus. From the analysis, piping-leaf and spiny-tip-leaf gene-specific restriction-site-associated DNA sequencing tags were obtained and designated as PLSTs and STLSTs, respectively. The five PLSTs and two STSLTs were successfully converted to cleaved amplified polymorphic sequence (CAPS) or simple sequence repeat (SSR) markers using the sequence differences between alleles. Based on the genotyping of the F1 with two SSR and three CAPS markers, the five PLST markers were mapped in the vicinity of the P locus, with the closest marker, PLST1_SSR, being located 1.5 cM from the P locus. The two CAPS markers from STLST1 and STLST3 perfectly assessed the 'spiny-leaf type' as homozygotes of the recessive s allele of the S gene. The recombination value between the S locus and STLST loci was 2.4, and STLSTs were located 2.2 cM from the S locus. SSR and CAPS markers are applicable to marker-assisted selection of leaf margin phenotypes in pineapple breeding.

  2. Generation and Analysis of Expressed Sequence Tags from Olea europaea L.

    PubMed Central

    Ozdemir Ozgenturk, Nehir; Oruç, Fatma; Sezerman, Ugur; Kuçukural, Alper; Vural Korkut, Senay; Toksoz, Feriha; Un, Cemal

    2010-01-01

    Olive (Olea europaea L.) is an important source of edible oil which was originated in Near-East region. In this study, two cDNA libraries were constructed from young olive leaves and immature olive fruits for generation of ESTs to discover the novel genes and search the function of unknown genes of olive. The randomly selected 3840 colonies were sequenced for EST collection from both libraries. Readable 2228 sequences for olive leaf and 1506 sequences for olive fruit were assembled into 205 and 69 contigs, respectively, whereas 2478 were singletons. Putative functions of all 2752 differentially expressed unique sequences were designated by gene homology based on BLAST and annotated using BLAST2GO. While 1339 ESTs show no homology to the database, 2024 ESTs have homology (under 80%) with hypothetical proteins, putative proteins, expressed proteins, and unknown proteins in NCBI-GenBank. 635 EST's unique genes sequence have been identified by over 80% homology to known function in other species which were not previously described in Olea family. Only 3.1% of total EST's was shown similarity with olive database existing in NCBI. This generated EST's data and consensus sequences were submitted to NCBI as valuable source for functional genome studies of olive. PMID:21197085

  3. Efficient use of unlabeled data for protein sequence classification: a comparative study.

    PubMed

    Kuksa, Pavel; Huang, Pai-Hsi; Pavlovic, Vladimir

    2009-04-29

    Recent studies in computational primary protein sequence analysis have leveraged the power of unlabeled data. For example, predictive models based on string kernels trained on sequences known to belong to particular folds or superfamilies, the so-called labeled data set, can attain significantly improved accuracy if this data is supplemented with protein sequences that lack any class tags-the unlabeled data. In this study, we present a principled and biologically motivated computational framework that more effectively exploits the unlabeled data by only using the sequence regions that are more likely to be biologically relevant for better prediction accuracy. As overly-represented sequences in large uncurated databases may bias the estimation of computational models that rely on unlabeled data, we also propose a method to remove this bias and improve performance of the resulting classifiers. Combined with state-of-the-art string kernels, our proposed computational framework achieves very accurate semi-supervised protein remote fold and homology detection on three large unlabeled databases. It outperforms current state-of-the-art methods and exhibits significant reduction in running time. The unlabeled sequences used under the semi-supervised setting resemble the unpolished gemstones; when used as-is, they may carry unnecessary features and hence compromise the classification accuracy but once cut and polished, they improve the accuracy of the classifiers considerably.

  4. Genes are differentially expressed at transcriptional level of Neocaridina denticulata following short-term exposure to nonylphenol.

    PubMed

    Liu, Chang-Lun; Sung, Hung-Hung

    2011-09-01

    To assess the toxicity of nonylphenol towards aquatic crustaceans, Neocaridina denticulata were exposed short-term to sublethal concentration (0.001-0.5 mg/L). Following treatment, differentially expressed genes were identified using suppression subtractive hybridization on samples prepared from whole specimens. There were 20 differentially expressed sequence tags that corresponded to known genes and could be divided into six functional classes: defence, translation, metabolism, ribosomal gene expression, respiration, and genes involved in the stress response. Using semi-quantitative RT-PCR, we found that 14 of the differentially expressed sequence tags significantly responded to nonylphenol, including six at a nominal concentration of 0.01 mg/L; among them, 12 genes were down-regulated. These results suggest that under non-lethal concentrations of nonylphenol, the polluted aquatic environment may still present a potential risk to N. denticulata.

  5. Absolute Configuration of 3-METHYLCYCLOHEXANONE by Chiral Tag Rotational Spectroscopy and Vibrational Circular Dichroism

    NASA Astrophysics Data System (ADS)

    Evangelisti, Luca; Holdren, Martin S.; Mayer, Kevin J.; Smart, Taylor; West, Channing; Pate, Brooks

    2017-06-01

    The absolute configuration of 3-methylcyclohexanone was established by chiral tag rotational spectroscopy measurements using 3-butyn-2-ol as the tag partner. This molecule was chosen because it is a benchmark measurement for vibrational circular dichroism (VCD). A comparison of the analysis approaches of chiral tag rotational spectroscopy and VCD will be presented. One important issue in chiral analysis by both methods is the conformational flexibility of the molecule being analyzed. The analysis of conformational composition of samples will be illustrated. In this case, the high spectral resolution of molecular rotational spectroscopy and potential for spectral simplification by conformational cooling in the pulsed jet expansion are advantages for chiral tag spectroscopy. The computational chemistry requirements for the two methods will also be discussed. In this case, the need to perform conformer searches for weakly bound complexes and to perform reasonably high level quantum chemistry geometry optimizations on these complexes makes the computational time requirements less favorable for chiral tag rotational spectroscopy. Finally, the issue of reliability of the determination of the absolute configuration will be considered. In this case, rotational spectroscopy offers a "gold standard" analysis method through the determination of the ^{13}C-subsitution structure of the complex between 3-methylcyclohexanone and an enantiopure sample of the 3-butyn-2-ol tag.

  6. Systems Biology of Lipid Body Formation in the Green Alga Chlamydomonas reinhardtii

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Goodenough, Ursula

    The project aimed to deepen our understanding of alga triacylglycerol (TAG) production to undergird explorations of using algal TAG as a source of biodiesel fuel. Our published contributions included the following: 1) Development of a rapid assay for TAG in algal cultures which was widely distributed to the algal community. 2) A comprehensive transcriptome analysis of the development of the ultra-high-TAG “obese” phenotype In Chlamydomonas reinhardtii. 3) A comprehensive biochemical and ultrastructural analysis of the cell wall of Nannochloropsis gaditana, whose walls render it both growth-hardy and difficult to rupture for TAG recovery. A manuscript in preparation considers the autophagymore » response in C. reinhardtii and its entrance into stationary phase, both having an impact on TAG production.« less

  7. Neural net controlled tag gas sampling system for nuclear reactors

    DOEpatents

    Gross, K.C.; Laug, M.T.; Lambert, J.B.; Herzog, J.P.

    1997-02-11

    A method and system are disclosed for providing a tag gas identifier to a nuclear fuel rod and analyze escaped tag gas to identify a particular failed nuclear fuel rod. The method and system include disposing a unique tag gas composition into a plenum of a nuclear fuel rod, monitoring gamma ray activity, analyzing gamma ray signals to assess whether a nuclear fuel rod has failed and is emitting tag gas, activating a tag gas sampling and analysis system upon sensing tag gas emission from a failed nuclear rod and evaluating the escaped tag gas to identify the particular failed nuclear fuel rod. 12 figs.

  8. Analysis of Protein Adduction Kinetics by Quantitative Mass Spectrometry. Competing Adduction Reactions of Glutathione-S-Transferase P1-1 with Electrophiles

    PubMed Central

    Orton, Christopher R.; Liebler, Daniel C.

    2007-01-01

    Defining the mechanisms and consequences of protein adduction is crucial to understanding the toxicity of reactive electrophiles. Application of tandem mass spectrometry and data analysis algorithms enables detection and mapping of chemical adducts at the level of amino acid sequence. Nevertheless, detection of adducts does not indicate relative reactivity of different sites. Here we describe a method to measure the kinetics of competing adduction reactions at different sites on the same protein. Adducts are formed by electrophiles at Cys14 and Cys47 on the metabolic enzyme glutathione-S-transferase P1-1 and modification is accompanied by a loss of enzymatic activity. Relative quantitation of protein adducts was done by tagging N-termini of peptide digests with isotopically labeled phenyl isocyanate and tracking the ratio of light-tagged peptide adducts to heavy-tagged reference samples in liquid chromatography-tandem mass spectrometry analyses using a multiple reaction monitoring method. This approach was used to measure rate constants for adduction at both positions with two different model electrophiles, N-iodoacetyl-N-biotinylhexylenediamine and 1-biotinamido-4-(4′-[maleimidoethyl-cyclohexane]-carboxamido)butane. The results indicate that Cys47 was approximately 2–3-fold more reactive toward both electrophiles than was Cys14. This result was consistent with the relative reactivity of these electrophiles in a complex proteome system and with previously reported trends in reactivity of these sites. Kinetic analyses of protein modification reactions provide a means of evaluating the selectivity of reactive mediators of chemical toxicity. PMID:17433278

  9. Structured oligonucleotides for target indexing to allow single-vessel PCR amplification and solid support microarray hybridization.

    PubMed

    Girard, Laurie D; Boissinot, Karel; Peytavi, Régis; Boissinot, Maurice; Bergeron, Michel G

    2015-02-07

    The combination of molecular diagnostic technologies is increasingly used to overcome limitations on sensitivity, specificity or multiplexing capabilities, and provide efficient lab-on-chip devices. Two such techniques, PCR amplification and microarray hybridization are used serially to take advantage of the high sensitivity and specificity of the former combined with high multiplexing capacities of the latter. These methods are usually performed in different buffers and reaction chambers. However, these elaborate methods have high complexity and cost related to reagent requirements, liquid storage and the number of reaction chambers to integrate into automated devices. Furthermore, microarray hybridizations have a sequence dependent efficiency not always predictable. In this work, we have developed the concept of a structured oligonucleotide probe which is activated by cleavage from polymerase exonuclease activity. This technology is called SCISSOHR for Structured Cleavage Induced Single-Stranded Oligonucleotide Hybridization Reaction. The SCISSOHR probes enable indexing the target sequence to a tag sequence. The SCISSOHR technology also allows the combination of nucleic acid amplification and microarray hybridization in a single vessel in presence of the PCR buffer only. The SCISSOHR technology uses an amplification probe that is irreversibly modified in presence of the target, releasing a single-stranded DNA tag for microarray hybridization. Each tag is composed of a 3-nucleotide sequence-dependent segment and a unique "target sequence-independent" 14-nucleotide segment allowing for optimal hybridization with minimal cross-hybridization. We evaluated the performance of five (5) PCR buffers to support microarray hybridization, compared to a conventional hybridization buffer. Finally, as a proof of concept, we developed a multiplexed assay for the amplification, detection, and identification of three (3) DNA targets. This new technology will facilitate the design of lab-on-chip microfluidic devices, while also reducing consumable costs. At term, it will allow the cost-effective automation of highly multiplexed assays for detection and identification of genetic targets.

  10. Research advances based on mass spectrometry for profiling of triacylglycerols in oils and fats and their applications.

    PubMed

    Xu, Shu-Ling; Wei, Fang; Xie, Ya; Lv, Xin; Dong, Xu-Yan; Chen, Hong

    2018-03-23

    Vegetable oils and animal fats are dietary source of lipids that play critical and multiple roles in biological function. Triacylglycerols (TAGs) are the principal component of oils and fats with significant difference in profile among different oils and fats. TAG profiling is essential for nutritional evaluation, quality control and assurance of safety in oils and fats. However, analysis of TAGs is a challenging task because of the complicated composition of TAGs and their similar physicochemical properties in oils and fats. The rapid development of mass spectrometry (MS) technology in recent years makes it possible to analyze the composition, content and structure of TAGs in the study of the physical, chemical and nutritional properties of oils, fats and related products. This review described the research advancement based on MS for profiling of TAGs in oil, fat and their applications in food. The application of MS, including direct infusion strategies, and its combination with chromatography, gas chromatography-MS (GC-MS) and liquid chromatography-MS (LC-MS), in the analysis of TAGs were reviewed. The advantages and disadvantages of these analytical methods with relevant applications for TAGs analysis in food were also described. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  11. Lipopolysaccharide-induced innate immune factors in the bottlenose dolphin (Tursiops truncatus) detected in expression sequence tag analysis.

    PubMed

    Ohishi, Kazue; Shishido, Reiko; Iwata, Yasunao; Saitoh, Masafumi; Takenaka, Ryota; Ohtsu, Dai; Okutsu, Kenji; Maruyama, Tadashi

    2011-11-01

    EST analysis based on the megaclone-megasorting method was performed using leukocytes from the bottlenose dolphin (Tursiops truncatus) with or without LPS stimulation. A total of 849 upregulated and 384 downregulated EST clones were sequenced, annotated, and functionally classified. Ferritin heavy peptide I was the most abundant upregulated transcript, suggesting that LPS stimulation induced high production of reactive oxygen species, which were sequestered in ferritin. Among the immune factors, the transcripts coding for an IL-1Ra, homologs to bovine serum amyloid A3, and canine intercellular adhesion molecule-1 were highly expressed. Markedly downregulated transcripts of immune factors were those for homologs of calcium-binding proteins belonging to the S100 family, S100A12, S100A8, and S100A6. Time-course experiments on the expression of some immune factors including IL-1Ra suggested that these factors interact and control cetacean innate immunity. © 2011 The Societies and Blackwell Publishing Asia Pty Ltd.

  12. New technology and resources for cryptococcal research

    PubMed Central

    Zhang, Nannan; Park, Yoon-Dong; Williamson, Peter R.

    2014-01-01

    Rapid advances in molecular biology and genome sequencing have enabled the generation of new technology and resources for cryptococcal research. RNAi-mediated specific gene knock down has become routine and more efficient by utilizing modified shRNA plasmids and convergent promoter RNAi constructs. This system was recently applied in a high-throughput screen to identify genes involved in host-pathogen interactions. Gene deletion efficiencies have also been improved by increasing rates of homologous recombination through a number of approaches, including a combination of double-joint PCR with split-marker transformation, the use of dominant selectable markers and the introduction of Cre-Loxp systems into Cryptococcus. Moreover, visualization of cryptococcal proteins has become more facile using fusions with codon-optimized fluorescent tags, such as green or red fluorescent proteins or, mCherry. Using recent genome-wide analytical tools, new transcriptional factors and regulatory proteins have been identified in novel virulence-related signaling pathways by employing microarray analysis, RNA-sequencing and proteomic analysis. PMID:25460849

  13. In silico search, characterization and validation of new EST-SSR markers in the genus Prunus.

    PubMed

    Sorkheh, Karim; Prudencio, Angela S; Ghebinejad, Azim; Dehkordi, Mehrana Kohei; Erogul, Deniz; Rubio, Manuel; Martínez-Gómez, Pedro

    2016-07-07

    Simple sequence repeats (SSRs) are defined as sequence repeat units between 1 and 6 bp that occur in both coding and non-coding regions abundant in eukaryotic genomes, which may affect the expression of genes. In this study, expressed sequence tags (ESTs) of eight Prunus species were analyzed for in silico mining of EST-SSRs, protein annotation, and open reading frames (ORFs), and the identification of codon repetitions. A total of 316 SSRs were identified using MISA software. Dinucleotide SSR motifs (26.31 %) were found to be the most abundant type of repeats, followed by tri- (14.58 %), tetra- (0.53 %), and penta- (0.27 %) nucleotide motifs. An attempt was made to design primer pairs for 316 identified SSRs but these were successful for only 175 SSR sequences. The positions of SSRs with respect to ORFs were detected, and annotation of sequences containing SSRs was performed to assign function to each sequence. SSRs were also characterized (in terms of position in the reference genome and associated gene) using the two available Prunus reference genomes (mei and peach). Finally, 38 SSR markers were validated across peach, almond, plum, and apricot genotypes. This validation showed a higher transferability level of EST-SSR developed in P. mume (mei) in comparison with the rest of species analyzed. Findings will aid analysis of functionally important molecular markers and facilitate the analysis of genetic diversity.

  14. Tandem SUMO fusion vectors for improving soluble protein expression and purification.

    PubMed

    Guerrero, Fernando; Ciragan, Annika; Iwaï, Hideo

    2015-12-01

    Availability of highly purified proteins in quantity is crucial for detailed biochemical and structural investigations. Fusion tags are versatile tools to facilitate efficient protein purification and to improve soluble overexpression of proteins. Various purification and fusion tags have been widely used for overexpression in Escherichia coli. However, these tags might interfere with biological functions and/or structural investigations of the protein of interest. Therefore, an additional purification step to remove fusion tags by proteolytic digestion might be required. Here, we describe a set of new vectors in which yeast SUMO (SMT3) was used as the highly specific recognition sequence of ubiquitin-like protease 1, together with other commonly used solubility enhancing proteins, such as glutathione S-transferase, maltose binding protein, thioredoxin and trigger factor for optimizing soluble expression of protein of interest. This tandem SUMO (T-SUMO) fusion system was tested for soluble expression of the C-terminal domain of TonB from different organisms and for the antiviral protein scytovirin. Copyright © 2015 Elsevier Inc. All rights reserved.

  15. Optimized Design and Synthesis of Cell Permeable Biarsenical Cyanine Probe for Imaging Tagged Cytosolic Bacterial Proteins

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Fu, Na; Xiong, Yijia; Squier, Thomas C.

    2013-01-21

    To optimize cellular delivery and specific labeling of tagged cytosolic proteins by biarsenical fluorescent probes build around a cyanine dye scaffold, we have systematically varied the polarity of the hydrophobic tails (i.e., 4-5 methylene groups appended by a sulfonate or methoxy ester moiety) and arsenic capping reagent (ethanedithiol versus benzenedithiol). Targeted labeling of the cytosolic proteins SlyD and the alpha subunit of RNA polymerase engineered with a tetracysteine tagging sequences demonstrate the utility of the newly synthesized probes for live-cell visualization, albeit with varying efficiencies and background intensities. Optimal routine labeling and visualization is apparent using the ethanedithiol capping reagentmore » with the uncharged methoxy ester functionalized acyl chains. These measurements demonstrate the general utility of this class of photostable and highly fluorescent biarsenical reagents based on the cyanine scaffold for in vivo targeting of tagged cellular proteins for live cell measurements of protein dynamics.« less

  16. Completely monodisperse, highly repetitive proteins for bioconjugate capillary electrophoresis: Development and characterization

    PubMed Central

    Lin, Jennifer S.; Albrecht, Jennifer Coyne; Meagher, Robert J.; Wang, Xiaoxiao; Barron, Annelise E.

    2011-01-01

    Protein-based polymers are increasingly being used in biomaterial applications due to their ease of customization and potential monodispersity. These advantages make protein polymers excellent candidates for bioanalytical applications. Here we describe improved methods for producing drag-tags for Free-Solution Conjugate Electrophoresis (FSCE). FSCE utilizes a pure, monodisperse recombinant protein, tethered end-on to a ssDNA molecule, to enable DNA size separation in aqueous buffer. FSCE also provides a highly sensitive method to evaluate the polydispersity of a protein drag-tag and thus its suitability for bioanalytical uses. This method is able to detect slight differences in drag-tag charge or mass. We have devised an improved cloning, expression, and purification strategy that enables us to generate, for the first time, a truly monodisperse 20 kDa protein polymer and a nearly monodisperse 38 kDa protein. These newly produced proteins can be used as drag-tags to enable longer read DNA sequencing by free-solution microchannel electrophoresis. PMID:21553840

  17. Association of ESR1 gene tagging SNPs with breast cancer risk

    PubMed Central

    Dunning, Alison M.; Healey, Catherine S.; Baynes, Caroline; Maia, Ana-Teresa; Scollen, Serena; Vega, Ana; Rodríguez, Raquel; Barbosa-Morais, Nuno L.; Ponder, Bruce A.J.; Low, Yen-Ling; Bingham, Sheila; Haiman, Christopher A.; Le Marchand, Loic; Broeks, Annegien; Schmidt, Marjanka K.; Hopper, John; Southey, Melissa; Beckmann, Matthias W.; Fasching, Peter A.; Peto, Julian; Johnson, Nichola; Bojesen, Stig E.; Nordestgaard, Børge; Milne, Roger L.; Benitez, Javier; Hamann, Ute; Ko, Yon; Schmutzler, Rita K.; Burwinkel, Barbara; Schürmann, Peter; Dörk, Thilo; Heikkinen, Tuomas; Nevanlinna, Heli; Lindblom, Annika; Margolin, Sara; Mannermaa, Arto; Kosma, Veli-Matti; Chen, Xiaoqing; Spurdle, Amanda; Change-Claude, Jenny; Flesch-Janys, Dieter; Couch, Fergus J.; Olson, Janet E.; Severi, Gianluca; Baglietto, Laura; Børresen-Dale, Anne-Lise; Kristensen, Vessela; Hunter, David J.; Hankinson, Susan E.; Devilee, Peter; Vreeswijk, Maaike; Lissowska, Jolanta; Brinton, Louise; Liu, Jianjun; Hall, Per; Kang, Daehee; Yoo, Keun-Young; Shen, Chen-Yang; Yu, Jyh-Cherng; Anton-Culver, Hoda; Ziogoas, Argyrios; Sigurdson, Alice; Struewing, Jeff; Easton, Douglas F.; Garcia-Closas, Montserrat; Humphreys, Manjeet K.; Morrison, Jonathan; Pharoah, Paul D.P.; Pooley, Karen A.; Chenevix-Trench, Georgia

    2009-01-01

    We have conducted a three-stage, comprehensive single nucleotide polymorphism (SNP)-tagging association study of ESR1 gene variants (SNPs) in more than 55 000 breast cancer cases and controls from studies within the Breast Cancer Association Consortium (BCAC). No large risks or highly significant associations were revealed. SNP rs3020314, tagging a region of ESR1 intron 4, is associated with an increase in breast cancer susceptibility with a dominant mode of action in European populations. Carriers of the c-allele have an odds ratio (OR) of 1.05 [95% Confidence Intervals (CI) 1.02–1.09] relative to t-allele homozygotes, P = 0.004. There is significant heterogeneity between studies, P = 0.002. The increased risk appears largely confined to oestrogen receptor-positive tumour risk. The region tagged by SNP rs3020314 contains sequence that is more highly conserved across mammalian species than the rest of intron 4, and it may subtly alter the ratio of two mRNA splice forms. PMID:19126777

  18. Sequence Analysis of Mitochondrial Genome of Toxascaris leonina from a South China Tiger.

    PubMed

    Li, Kangxin; Yang, Fang; Abdullahi, A Y; Song, Meiran; Shi, Xianli; Wang, Minwei; Fu, Yeqi; Pan, Weida; Shan, Fang; Chen, Wu; Li, Guoqing

    2016-12-01

    Toxascaris leonina is a common parasitic nematode of wild mammals and has significant impacts on the protection of rare wild animals. To analyze population genetic characteristics of T. leonina from South China tiger, its mitochondrial (mt) genome was sequenced. Its complete circular mt genome was 14,277 bp in length, including 12 protein-coding genes, 22 tRNA genes, 2 rRNA genes, and 2 non-coding regions. The nucleotide composition was biased toward A and T. The most common start codon and stop codon were TTG and TAG, and 4 genes ended with an incomplete stop codon. There were 13 intergenic regions ranging 1 to 10 bp in size. Phylogenetically, T. leonina from a South China tiger was close to canine T. leonina . This study reports for the first time a complete mt genome sequence of T. leonina from the South China tiger, and provides a scientific basis for studying the genetic diversity of nematodes between different hosts.

  19. A proposed model for the flowering signaling pathway of sugarcane under photoperiodic control.

    PubMed

    Coelho, C P; Costa Netto, A P; Colasanti, J; Chalfun-Júnior, A

    2013-04-25

    Molecular analysis of floral induction in Arabidopsis has identified several flowering time genes related to 4 response networks defined by the autonomous, gibberellin, photoperiod, and vernalization pathways. Although grass flowering processes include ancestral functions shared by both mono- and dicots, they have developed their own mechanisms to transmit floral induction signals. Despite its high production capacity and its important role in biofuel production, almost no information is available about the flowering process in sugarcane. We searched the Sugarcane Expressed Sequence Tags database to look for elements of the flowering signaling pathway under photoperiodic control. Sequences showing significant similarity to flowering time genes of other species were clustered, annotated, and analyzed for conserved domains. Multiple alignments comparing the sequences found in the sugarcane database and those from other species were performed and their phylogenetic relationship assessed using the MEGA 4.0 software. Electronic Northerns were run with Cluster and TreeView programs, allowing us to identify putative members of the photoperiod-controlled flowering pathway of sugarcane.

  20. Bioinformatics and expressional analysis of cDNA clones from floral buds

    NASA Astrophysics Data System (ADS)

    Pawełkowicz, Magdalena Ewa; Skarzyńska, Agnieszka; Cebula, Justyna; Hincha, Dirck; ZiÄ bska, Karolina; PlÄ der, Wojciech; Przybecki, Zbigniew

    2017-08-01

    The application of genomic approaches may serve as an initial step in understanding the complexity of biochemical network and cellular processes responsible for regulation and execution of many developmental tasks. The molecular mechanism of sex expression in cucumber is still not elucidated. A study of differential expression was conducted to identify genes involved in sex determination and floral organ morphogenesis. Herein, we present generation of expression sequence tags (EST) obtained by differential hybridization (DH) and subtraction technique (cDNA-DSC) and their characteristic features such as molecular function, involvement in biology processes, expression and mapping position on the genome.

  1. Leveraging algal omics to reveal potential targets for augmenting TAG accumulation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Arora, Neha; Pienkos, Philip T.; Pruthi, Vikas

    Ongoing global efforts to commercialize microalgal biofuels have expedited the use of multi-omics techniques to gain insights into lipid biosynthetic pathways. Functional genomics analyses have recently been employed to complement existing sequence-level omics studies, shedding light on the dynamics of lipid synthesis and its interplay with other cellular metabolic pathways, thus revealing possible targets for metabolic engineering. Here, we review the current status of algal omics studies to reveal potential targets to augment TAG accumulation in various microalgae. Here, this review specifically aims to examine and catalog systems level data related to stress-induced TAG accumulation in oleaginous microalgae and informmore » future metabolic engineering strategies to develop strains with enhanced bioproductivity, which could pave a path for sustainable green energy.« less

  2. Leveraging Algal Omics to Reveal Potential Targets for Augmenting TAG Accumulation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Guarnieri, Michael T; Pienkos, Philip T; Arora, Neha

    2018-04-18

    Ongoing global efforts to commercialize microalgal biofuels have expedited the use of multi-omics techniques to gain insights into lipid biosynthetic pathways. Functional genomics analyses have recently been employed to complement existing sequence-level omics studies, shedding light on the dynamics of lipid synthesis and its interplay with other cellular metabolic pathways, thus revealing possible targets for metabolic engineering. Here, we review the current status of algal omics studies to reveal potential targets to augment TAG accumulation in various microalgae. This review specifically aims to examine and catalog systems level data related to stress-induced TAG accumulation in oleaginous microalgae and inform futuremore » metabolic engineering strategies to develop strains with enhanced bioproductivity, which could pave a path for sustainable green energy.« less

  3. Distinguishing Aspartic and Isoaspartic Acids in Peptides by Several Mass Spectrometric Fragmentation Methods

    NASA Astrophysics Data System (ADS)

    DeGraan-Weber, Nick; Zhang, Jun; Reilly, James P.

    2016-12-01

    Six ion fragmentation techniques that can distinguish aspartic acid from its isomer, isoaspartic acid, were compared. MALDI post-source decay (PSD), MALDI 157 nm photodissociation, tris(2,4,6-trimethoxyphenyl)phosphonium bromide (TMPP) charge tagging in PSD and photodissociation, ESI collision-induced dissociation (CID), electron transfer dissociation (ETD), and free-radical initiated peptide sequencing (FRIPS) with CID were applied to peptides containing either aspartic or isoaspartic acid. Diagnostic ions, such as the y-46 and b+H2O, are present in PSD, photodissociation, and charge tagging. c•+57 and z-57 ions are observed in ETD and FRIPS experiments. For some molecules, aspartic and isoaspartic acid yield ion fragments with significantly different intensities. ETD and charge tagging appear to be most effective at distinguishing these residues.

  4. Expression, purification, and DNA-binding activity of the Herbaspirillum seropedicae RecX protein.

    PubMed

    Galvão, Carolina W; Pedrosa, Fábio O; Souza, Emanuel M; Yates, M Geoffrey; Chubatsu, Leda S; Steffens, Maria Berenice R

    2004-06-01

    The Herbaspirillum seropedicae RecX protein participates in the SOS response: a process in which the RecA protein plays a central role. The RecX protein of the H. seropedicae, fused to a His-tag sequence (RecX His-tagged), was over-expressed in Escherichia coli and purified by metal-affinity chromatography to yield a highly purified and active protein. DNA band-shift assays showed that the RecX His-tagged protein bound to both circular and linear double-stranded DNA and also to circular single-stranded DNA. The apparent affinity of RecX for DNA decreased in the presence of Mg(2+) ions. The ability of RecX to bind DNA may be relevant to its function in the SOS response.

  5. Leveraging algal omics to reveal potential targets for augmenting TAG accumulation

    DOE PAGES

    Arora, Neha; Pienkos, Philip T.; Pruthi, Vikas; ...

    2018-04-18

    Ongoing global efforts to commercialize microalgal biofuels have expedited the use of multi-omics techniques to gain insights into lipid biosynthetic pathways. Functional genomics analyses have recently been employed to complement existing sequence-level omics studies, shedding light on the dynamics of lipid synthesis and its interplay with other cellular metabolic pathways, thus revealing possible targets for metabolic engineering. Here, we review the current status of algal omics studies to reveal potential targets to augment TAG accumulation in various microalgae. Here, this review specifically aims to examine and catalog systems level data related to stress-induced TAG accumulation in oleaginous microalgae and informmore » future metabolic engineering strategies to develop strains with enhanced bioproductivity, which could pave a path for sustainable green energy.« less

  6. Leveraging algal omics to reveal potential targets for augmenting TAG accumulation.

    PubMed

    Arora, Neha; Pienkos, Philip T; Pruthi, Vikas; Poluri, Krishna Mohan; Guarnieri, Michael T

    2018-04-18

    Ongoing global efforts to commercialize microalgal biofuels have expedited the use of multi-omics techniques to gain insights into lipid biosynthetic pathways. Functional genomics analyses have recently been employed to complement existing sequence-level omics studies, shedding light on the dynamics of lipid synthesis and its interplay with other cellular metabolic pathways, thus revealing possible targets for metabolic engineering. Here, we review the current status of algal omics studies to reveal potential targets to augment TAG accumulation in various microalgae. This review specifically aims to examine and catalog systems level data related to stress-induced TAG accumulation in oleaginous microalgae and inform future metabolic engineering strategies to develop strains with enhanced bioproductivity, which could pave a path for sustainable green energy. Copyright © 2018. Published by Elsevier Inc.

  7. Transcriptomic analysis of Ruditapes philippinarum hemocytes reveals cytoskeleton disruption after in vitro Vibrio tapetis challenge.

    PubMed

    Brulle, Franck; Jeffroy, Fanny; Madec, Stéphanie; Nicolas, Jean-Louis; Paillard, Christine

    2012-10-01

    The Manila clam, Ruditapes philippinarum, is an economically-important, commercial shellfish; harvests are diminished in some European waters by a pathogenic bacterium, Vibrio tapetis, that causes Brown Ring disease. To identify molecular characteristics associated with susceptibility or resistance to Brown Ring disease, Suppression Subtractive Hybridization (SSH) analyzes were performed to construct cDNA libraries enriched in up- or down-regulated transcripts from clam immune cells, hemocytes, after a 3-h in vitro challenge with cultured V. tapetis. Nine hundred and ninety eight sequences from the two libraries were sequenced, and an in silico analysis identified 235 unique genes. BLAST and "Gene ontology" classification analyzes revealed that 60.4% of the Expressed Sequence Tags (ESTs) have high similarities with genes involved in various physiological functions, such as immunity, apoptosis and cytoskeleton organization; whereas, 39.6% remain unidentified. From the 235 unique genes, we selected 22 candidates based upon physiological function and redundancy in the libraries. Then, Real-Time PCR analysis identified 3 genes related to cytoskeleton organization showing significant variation in expression attributable to V. tapetis exposure. Disruption in regulation of these genes is consistent with the etiologic agent of Brown Ring disease in Manila clams. Copyright © 2012 Elsevier Ltd. All rights reserved.

  8. Method for rapid base sequencing in DNA and RNA with two base labeling

    DOEpatents

    Jett, J.H.; Keller, R.A.; Martin, J.C.; Posner, R.G.; Marrone, B.L.; Hammond, M.L.; Simpson, D.J.

    1995-04-11

    A method is described for rapid-base sequencing in DNA and RNA with two-base labeling and employing fluorescent detection of single molecules at two wavelengths. Bases modified to accept fluorescent labels are used to replicate a single DNA or RNA strand to be sequenced. The bases are then sequentially cleaved from the replicated strand, excited with a chosen spectrum of electromagnetic radiation, and the fluorescence from individual, tagged bases detected in the order of cleavage from the strand. 4 figures.

  9. Method for rapid base sequencing in DNA and RNA with two base labeling

    DOEpatents

    Jett, James H.; Keller, Richard A.; Martin, John C.; Posner, Richard G.; Marrone, Babetta L.; Hammond, Mark L.; Simpson, Daniel J.

    1995-01-01

    Method for rapid-base sequencing in DNA and RNA with two-base labeling and employing fluorescent detection of single molecules at two wavelengths. Bases modified to accept fluorescent labels are used to replicate a single DNA or RNA strand to be sequenced. The bases are then sequentially cleaved from the replicated strand, excited with a chosen spectrum of electromagnetic radiation, and the fluorescence from individual, tagged bases detected in the order of cleavage from the strand.

  10. Isolation of viral ribonucleoprotein complexes from infected cells by tandem affinity purification.

    PubMed

    Mayer, Daniel; Baginsky, Sacha; Schwemmle, Martin

    2005-11-01

    The biochemical purification and analysis of viral ribonucleoprotein complexes (RNPs) of negative-strand RNA viruses is hampered by the lack of suitable tags that facilitate specific enrichment of these complexes. We therefore tested whether fusion of the tandem-affinity-purification (TAP) tag to the main component of viral RNPs, the nucleoprotein, might allow the isolation of these RNPs from cells. We constitutively expressed TAP-tagged nucleoprotein of Borna disease virus (BDV) in cells persistently infected with this virus. The TAP-tagged bait was efficiently incorporated into viral RNPs, did not interfere with BDV replication and was also packaged into viral particles. Native purification of the tagged protein complexes from BDV-infected cells by two consecutive affinity columns resulted in the isolation of several viral proteins, which were identified by MS analysis as the matrix protein, the two forms of the nucleoprotein and the phosphoprotein. In addition to the viral proteins, RT-PCR analysis revealed the presence of viral genomic RNA. Introduction of further protease cleavage sites within the TAP-tag significantly increased the purification yield. These results demonstrate that purification of TAP-tagged viral RNPs is possible and efficient, and may therefore provide new avenues for biochemical and functional studies of these complexes.

  11. In vivo expression and purification of aptamer-tagged small RNA regulators

    PubMed Central

    Said, Nelly; Rieder, Renate; Hurwitz, Robert; Deckert, Jochen; Urlaub, Henning; Vogel, Jörg

    2009-01-01

    Small non-coding RNAs (sRNAs) are an emerging class of post-transcriptional regulators of bacterial gene expression. To study sRNAs and their potential protein interaction partners, it is desirable to purify sRNAs from cells in their native form. Here, we used RNA-based affinity chromatography to purify sRNAs following their expression as aptamer-tagged variants in vivo. To this end, we developed a family of plasmids to express sRNAs with any of three widely used aptamer sequences (MS2, boxB, eIF4A), and systematically tested how the aptamer tagging impacted on intracellular accumulation and target regulation of the Salmonella GcvB, InvR or RybB sRNAs. In addition, we successfully tagged the chromosomal rybB gene with MS2 to observe that RybB-MS2 is fully functional as an envelope stress-induced repressor of ompN mRNA following induction of sigmaE. We further demonstrate that the common sRNA-binding protein, Hfq, co-purifies with MS2-tagged sRNAs of Salmonella. The presented affinity purification strategy may facilitate the isolation of in vivo assembled sRNA–protein complexes in a wide range of bacteria. PMID:19726584

  12. Synthesis of Fe3O4@nickel-silicate core-shell nanoparticles for His-tagged enzyme immobilizing agents

    NASA Astrophysics Data System (ADS)

    Shin, Moo-Kwang; Kang, Byunghoon; Yoon, Nam-Kyung; Kim, Myeong-Hoon; Ki, Jisun; Han, Seungmin; Ahn, Jung-Oh; Haam, Seungjoo

    2016-12-01

    Immobilizing enzymes on artificially fabricated carriers for their efficient use and easy removal from reactants has attracted enormous interest for decades. Specifically, binding platforms using inorganic nanoparticles have been widely explored because of the benefits of their large surface area, easy surface modification, and high stability in various pH and temperatures. Herein, we fabricated Fe3O4 encapsulated ‘sea-urchin’ shaped nickel-silicate nanoparticles with a facile synthetic route. The enzymes were then rapidly and easily immobilized with poly-histidine tags (His-tags) and nickel ion affinity. Porous nickel silicate covered nanoparticles achieved a high immobilization capacity (85 μg mg-1) of His-tagged tobacco etch virus (TEV) protease. To investigate immobilized TEV protease enzymatic activity, we analyzed the cleaved quantity of maltose binding protein-exendin-fused immunoglobulin fusion protein, which connected with the TEV protease-specific cleavage peptide sequence. Moreover, TEV protease immobilized nanocomplexes conveniently removed and recollected from the reactant by applying an external magnetic field, maintained their enzymatic activity after reuse. Therefore, our newly developed nanoplatform for His-tagged enzyme immobilization provides advantageous features for biotechnological industries including recombinant protein processing.

  13. Allergenic characterization of a novel allergen, homologous to chymotrypsin, from german cockroach.

    PubMed

    Jeong, Kyoung Yong; Son, Mina; Lee, Jae Hyun; Hong, Chein Soo; Park, Jung Won

    2015-05-01

    Cockroach feces are known to be rich in IgE-reactive components. Various protease allergens were identified by proteomic analysis of German cockroach fecal extract in a previous study. In this study, we characterized a novel allergen, a chymotrypsin-like serine protease. A cDNA sequence homologous to chymotrypsin was obtained by analysis of German cockroach expressed sequence tag (EST) clones. The recombinant chymotrypsins from the German cockroach and house dust mite (Der f 6) were expressed in Escherichia coli using the pEXP5NT/TOPO vector system, and their allergenicity was investigated by ELISA. The deduced amino acid sequence of German cockroach chymotrypsin showed 32.7 to 43.1% identity with mite group 3 (trypsin) and group 6 (chymotrypsin) allergens. Sera from 8 of 28 German cockroach allergy subjects (28.6%) showed IgE binding to the recombinant protein. IgE binding to the recombinant cockroach chymotrypsin was inhibited by house dust mite chymotrypsin Der f 6, while it minimally inhibited the German cockroach whole body extract. A novel allergen homologous to chymotrypsin was identified from the German cockroach and was cross-reactive with Der f 6.

  14. Mathematical Development and Computational Analysis of Harmonic Phase-Magnetic Resonance Imaging (HARP-MRI) Based on Bloch Nuclear Magnetic Resonance (NMR) Diffusion Model for Myocardial Motion.

    PubMed

    Dada, Michael O; Jayeoba, Babatunde; Awojoyogbe, Bamidele O; Uno, Uno E; Awe, Oluseyi E

    2017-09-13

    Harmonic Phase-Magnetic Resonance Imaging (HARP-MRI) is a tagged image analysis method that can measure myocardial motion and strain in near real-time and is considered a potential candidate to make magnetic resonance tagging clinically viable. However, analytical expressions of radially tagged transverse magnetization in polar coordinates (which is required to appropriately describe the shape of the heart) have not been explored because the physics required to directly connect myocardial deformation of tagged Nuclear Magnetic Resonance (NMR) transverse magnetization in polar geometry and the appropriate harmonic phase parameters are not yet available. The analytical solution of Bloch NMR diffusion equation in spherical geometry with appropriate spherical wave tagging function is important for proper analysis and monitoring of heart systolic and diastolic deformation with relevant boundary conditions. In this study, we applied Harmonic Phase MRI method to compute the difference between tagged and untagged NMR transverse magnetization based on the Bloch NMR diffusion equation and obtained radial wave tagging function for analysis of myocardial motion. The analytical solution of the Bloch NMR equations and the computational simulation of myocardial motion as developed in this study are intended to significantly improve healthcare for accurate diagnosis, prognosis and treatment of cardiovascular related deceases at the lowest cost because MRI scan is still one of the most expensive anywhere. The analysis is fundamental and significant because all Magnetic Resonance Imaging techniques are based on the Bloch NMR flow equations.

  15. Development and evaluation of 200 novel SNP assays for population genetic studies of westslope cutthroat trout and genetic identification of related taxa

    Treesearch

    N. R. Campbell; S. J. Amish; V. L. Prichard; K. M. McKelvey; M. K. Young; M. K. Schwartz; J. C. Garza; G. Luikart; S. R. Narum

    2012-01-01

    DNA sequence data were collected and screened for single nucleotide polymorphisms (SNPs) in westslope cutthroat trout (Oncorhynchus clarki lewisi) and also for substitutions that could be used to genetically discriminate rainbow trout (O. mykiss) and cutthroat trout, as well as several cutthroat trout subspecies. In total, 260 expressed sequence tag-derived loci were...

  16. Integration of transcriptomic and proteomic data from a single wheat cultivar provides new tools for understanding the roles of individual alpha gliadin proteins in flour quality and celiac disease

    USDA-ARS?s Scientific Manuscript database

    One-hundred-thirty-six expressed sequence tags (ESTs) encoding alpha gliadins from Triticum aestivum cv Butte 86 were identified in public databases and assembled into 19 contigs. Consensus sequences for 12 of the contigs encoded complete alpha gliadin proteins, but only two were identical to protei...

  17. Generation, annotation and analysis of ESTs from Trichoderma harzianum CECT 2413

    PubMed Central

    Vizcaíno, Juan Antonio; González, Francisco Javier; Suárez, M Belén; Redondo, José; Heinrich, Julian; Delgado-Jarana, Jesús; Hermosa, Rosa; Gutiérrez, Santiago; Monte, Enrique; Llobell, Antonio; Rey, Manuel

    2006-01-01

    Background The filamentous fungus Trichoderma harzianum is used as biological control agent of several plant-pathogenic fungi. In order to study the genome of this fungus, a functional genomics project called "TrichoEST" was developed to give insights into genes involved in biological control activities using an approach based on the generation of expressed sequence tags (ESTs). Results Eight different cDNA libraries from T. harzianum strain CECT 2413 were constructed. Different growth conditions involving mainly different nutrient conditions and/or stresses were used. We here present the analysis of the 8,710 ESTs generated. A total of 3,478 unique sequences were identified of which 81.4% had sequence similarity with GenBank entries, using the BLASTX algorithm. Using the Gene Ontology hierarchy, we performed the annotation of 51.1% of the unique sequences and compared its distribution among the gene libraries. Additionally, the InterProScan algorithm was used in order to further characterize the sequences. The identification of the putatively secreted proteins was also carried out. Later, based on the EST abundance, we examined the highly expressed genes and a hydrophobin was identified as the gene expressed at the highest level. We compared our collection of ESTs with the previous collections obtained from Trichoderma species and we also compared our sequence set with different complete eukaryotic genomes from several animals, plants and fungi. Accordingly, the presence of similar sequences in different kingdoms was also studied. Conclusion This EST collection and its annotation provide a significant resource for basic and applied research on T. harzianum, a fungus with a high biotechnological interest. PMID:16872539

  18. Application of selected ion monitoring to the analysis of triacylglycerols in olive oil by high temperature-gas chromatography/mass spectrometry.

    PubMed

    Ruiz-Samblás, C; González-Casado, A; Cuadros-Rodríguez, L; García, F P Rodríguez

    2010-06-30

    The analysis of the triacylglycerol (TAG) composition of oils is a very challenging task, since the TAGs have very similar physico-chemical properties. In this work, a high temperature-gas chromatographic method coupled to electron ionization-mass spectrometry (HT-GC/EI-MS), in the Selected Ion Monitoring (SIM) mode, method was developed for the analysis of TAGs in the olive oil; this is a method suitable for routine analysis. This method was developed using commercially available standard TAGs. The TAGs studied were separated according to their equivalent carbon number and degree of unsaturation. The peak assignment was carried out by locating the characteristic fragment ions having the same retention time on the SIM profile such as [RCO+74](+) and [RCO+128](+) ions, due to the fatty acyl residues on sn-1, sn-2 and sn-3 positions of the TAG molecule and the [M-OCOR](+) ions corresponding to the acyl ions. The developed method was very useful to eliminate the interferences that appeared in the mass spectrum since electron ionization can prevent satisfactory interpretation of spectra. Copyright 2010 Elsevier B.V. All rights reserved.

  19. Analysis of JC virus DNA replication using a quantitative and high-throughput assay

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Shin, Jong; Phelan, Paul J.; Chhum, Panharith

    2014-11-15

    Progressive Multifocal Leukoencephalopathy (PML) is caused by lytic replication of JC virus (JCV) in specific cells of the central nervous system. Like other polyomaviruses, JCV encodes a large T-antigen helicase needed for replication of the viral DNA. Here, we report the development of a luciferase-based, quantitative and high-throughput assay of JCV DNA replication in C33A cells, which, unlike the glial cell lines Hs 683 and U87, accumulate high levels of nuclear T-ag needed for robust replication. Using this assay, we investigated the requirement for different domains of T-ag, and for specific sequences within and flanking the viral origin, in JCVmore » DNA replication. Beyond providing validation of the assay, these studies revealed an important stimulatory role of the transcription factor NF1 in JCV DNA replication. Finally, we show that the assay can be used for inhibitor testing, highlighting its value for the identification of antiviral drugs targeting JCV DNA replication. - Highlights: • Development of a high-throughput screening assay for JCV DNA replication using C33A cells. • Evidence that T-ag fails to accumulate in the nuclei of established glioma cell lines. • Evidence that NF-1 directly promotes JCV DNA replication in C33A cells. • Proof-of-concept that the HTS assay can be used to identify pharmacological inhibitor of JCV DNA replication.« less

  20. Use of continuous/contiguous stacking hybridization as a diagnostic tool

    DOEpatents

    Mirzabekov, Andrei Darievich; Kirillov, Eugene Vladislavovich; Parinov, Sergei Valeryevich; Barski, Victor Evgenievich; Dubiley, Svetlana Alekseevna

    2002-01-01

    A method for detecting disease-associated alleles in patient genetic material is provided whereby a first group of oligonucleotide molecules, synthesized to compliment base sequences of the disease associated alleles is immobilized on a predetermined position on a substrate, and then contacted with patient genetic material to form duplexes. The duplexes are then contacted with a second group of oligonucleotide molecules which are synthesized to extend the predetermined length of the oligonucleotide molecules of the first group, and where each of the oligonucleotide molecules of the second group are tagged and either incorporate universal bases or a mixture of guanine, cytosine, thymine, and adenine, or complementary nucleotide strands that are tagged with a different fluorochrome which radiates light at a predetermined wavelength. The treated substrate is then washed and the light patterns radiating therefrom are compared with predetermined light patterns of various diseases that were prepared on identical substrates. A method is also provided for determining the length of a repeat sequence in DNA or RNA, and also for determining the base sequence of unknown DNA or RNA.

Top