Science.gov

Sample records for expression sequence tags

  1. Expressed sequence tags: an overview.

    PubMed

    Parkinson, John; Blaxter, Mark

    2009-01-01

    Expressed sequence tags (ESTs) are fragments of mRNA sequences derived through single sequencing reactions performed on randomly selected clones from cDNA libraries. To date, over 45 million ESTs have been generated from over 1400 different species of eukaryotes. For the most part, EST projects are used to either complement existing genome projects or serve as low-cost alternatives for purposes of gene discovery. However, with improvements in accuracy and coverage, they are beginning to find application in fields such as phylogenetics, transcript profiling and proteomics. This volume provides practical details on the generation and analysis of ESTs. Chapters are presented which cover creation of cDNA libraries; generation and processing of sequence data; bioinformatics analysis of ESTs; and their application to phylogenetics and transcript profiling. PMID:19277571

  2. Expressed sequence tags: analysis and annotation.

    PubMed

    Parkinson, John; Blaxter, Mark

    2004-01-01

    Expressed sequence tags (ESTs) present a special set of problems for bioinformatic analysis. They are partial and error-prone, and large datasets can have significant internal redundancy. To facilitate analysis of small EST datasets from in-house projects, we present an integrated "pipeline" of tools that take EST data from sequence trace to database submission. These tools also can be used to provide clustering of ESTs into putative genes and to annotate these genes with preliminary sequence similarity searches. The systems are written to use the public-domain LINUX environment and other openly available analytical tools. PMID:15153624

  3. Complementary DNA sequencing: Expressed sequence tags and human genome project

    SciTech Connect

    Adams, M.D.; Kelley, J.M.; Gocayne, J.D.; Dubnick, M.; Wu, A.; Olde, B.; Moreno, R.F.; Kerlavage, A.R.; McCombie, W.R.; Venter, J.C. ); Polymeropoulos, M.H.; Hong Xiao; Merril, C.R. )

    1991-06-21

    Automated partial DNA sequencing was conducted on more than 600 randomly selected human brain complementary DNA (cDNA) clones to generate expressed sequence tags (ESTs). ESTs have applications in the discovery of new human genes, mapping of the human genome, and identification of coding regions in genomic sequences. Of the sequences generated, 337 represent new genes, including 48 with significant similarity to genes from other organisms, such as a yeast RNA polymerase II subunit; Drosophila kinesin, Notch, and Enhancer of split; and a murine tyrosine kinase receptor. Forty-six ESTs were mapped to chromosomes after amplification by the polymerase chain reaction. This fast approach to cDNA characterization will facilitate the tagging of most human genes in a few years at a fraction of the cost of complete genomic sequencing, provide new genetic markers, and serve as a resource in diverse biological research fields.

  4. Gene Discovery through Expressed Sequence Tag Sequencing in Trypanosoma cruzi

    PubMed Central

    Verdun, Ramiro E.; Di Paolo, Nelson; Urmenyi, Turan P.; Rondinelli, Edson; Frasch, Alberto C. C.; Sanchez, Daniel O.

    1998-01-01

    Analysis of expressed sequence tags (ESTs) constitutes a useful approach for gene identification that, in the case of human pathogens, might result in the identification of new targets for chemotherapy and vaccine development. As part of the Trypanosoma cruzi genome project, we have partially sequenced the 5′ ends of 1,949 clones to generate ESTs. The clones were randomly selected from a normalized CL Brener epimastigote cDNA library. A total of 14.6% of the clones were homologous to previously identified T. cruzi genes, while 18.4% had significant matches to genes from other organisms in the database. A total of 67% of the ESTs had no matches in the database, and thus, some of them might be T. cruzi-specific genes. Functional groups of those sequences with matches in the database were constructed according to their putative biological functions. The two largest categories were protein synthesis (23.3%) and cell surface molecules (10.8%). The information reported in this paper should be useful for researchers in the field to analyze genes and proteins of their own interest. PMID:9784549

  5. Expressed sequence tag analysis in tef (Eragrostis tef (Zucc) Trotter).

    PubMed

    Yu, Ju-Kyung; Sun, Qi; Rota, Mauricio La; Edwards, Hugh; Tefera, Hailu; Sorrells, Mark E

    2006-04-01

    Tef (Eragrostis tef (Zucc.) Trotter) is the most important cereal crop in Ethiopia; however, there is very little DNA sequence information available for this species. Expressed sequence tags (ESTs) were generated from 4 cDNA libraries: seedling leaf, seedling root, and inflorescence of E. tef and seedling leaf of Eragrostis pilosa, a wild relative of E. tef. Clustering of 3603 sequences produced 530 clusters and 1890 singletons, resulting in 2420 tef unigenes. Approximately 3/4 of tef unigenes matched protein or nucleotide sequences in public databases. Annotation of unigenes associated 68% of the putative tef genes with gene ontology categories. Identification of the translated unigenes for conserved protein domains revealed 389 protein family domains (Pfam), the most frequent of which was protein kinase. A total of 170 ESTs containing simple sequence repeats (EST-SSRs) were identified and 80 EST-SSR markers were developed. In addition, 19 single-nucleotide polymorphism (SNP) and (or) insertion-deletion (indel) and 34 intron fragment length polymorphism (IFLP) markers were developed. The EST database and molecular markers generated in this study will be valuable resources for further tef genetic research. PMID:16699556

  6. A blackberry (Rubus L.) expressed sequence tag library for the development of simple sequence repeat markers

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A blackberry (Rubus L.) expressed sequence tag (EST) library was produced for developing simple sequence repeat (SSR) markers from the tetraploid blackberry cultivar, Merton Thornless, the source of the thornless trait in commercial cultivars. RNA was extracted from young expanding leaves and used f...

  7. Expressed sequence tags from the plant trypanosomatid Phytomonas serpens.

    PubMed

    Pappas, Georgios J; Benabdellah, Karim; Zingales, Bianca; González, Antonio

    2005-08-01

    We have generated 2190 expressed sequence tags (ESTs) from a cDNA library of the plant trypanosomatid Phytomonas serpens. Upon processing and clustering the set of 1893 accepted sequences was reduced to 697 clusters consisting of 452 singletons and 245 contigs. Functional categories were assigned based on BLAST searches against a database of the eukaryotic orthologous groups of proteins (KOG). Thirty six percent of the generated sequences showed no hits against the KOG database and 39.6% presented similarity to the KOG classes corresponding to translation, ribosomal structure and biogenesis. The most populated cluster contained 45 ESTs homologous to members of the glucose transporter family. This fact can be immediately correlated to the reported Phytomonas dependence on anaerobic glycolytic ATP production due to the lack of cytochrome-mediated respiratory chain. In this context, not only a number of enzymes of the glycolytic pathway were identified but also of the Krebs cycle as well as specific components of the respiratory chain. The data here reported, including a few hundred unique sequences and the description of tandemly repeated motifs and putative transcript stability motifs at untranslated mRNA ends, represent an initial approach to overcome the lack of information on the molecular biology of this organism. PMID:15869816

  8. Analysis of the dermatophyte Trichophyton rubrum expressed sequence tags

    PubMed Central

    Wang, Lingling; Ma, Li; Leng, Wenchuan; Liu, Tao; Yu, Lu; Yang, Jian; Yang, Li; Zhang, Wenliang; Zhang, Qian; Dong, Jie; Xue, Ying; Zhu, Yafang; Xu, Xingye; Wan, Zhe; Ding, Guohui; Yu, Fudong; Tu, Kang; Li, Yixue; Li, Ruoyu; Shen, Yan; Jin, Qi

    2006-01-01

    Background Dermatophytes are the primary causative agent of dermatophytoses, a disease that affects billions of individuals worldwide. Trichophyton rubrum is the most common of the superficial fungi. Although T. rubrum is a recognized pathogen for humans, little is known about how its transcriptional pattern is related to development of the fungus and establishment of disease. It is therefore necessary to identify genes whose expression is relevant to growth, metabolism and virulence of T. rubrum. Results We generated 10 cDNA libraries covering nearly the entire growth phase and used them to isolate 11,085 unique expressed sequence tags (ESTs), including 3,816 contigs and 7,269 singletons. Comparisons with the GenBank non-redundant (NR) protein database revealed putative functions or matched homologs from other organisms for 7,764 (70%) of the ESTs. The remaining 3,321 (30%) of ESTs were only weakly similar or not similar to known sequences, suggesting that these ESTs represent novel genes. Conclusion The present data provide a comprehensive view of fungal physiological processes including metabolism, sexual and asexual growth cycles, signal transduction and pathogenic mechanisms. PMID:17032460

  9. Patterns of gene expression in microarrays and expressed sequence tags from normal and cataractous lenses.

    PubMed

    Sousounis, Konstantinos; Tsonis, Panagiotis A

    2012-01-01

    In this contribution, we have examined the patterns of gene expression in normal and cataractous lenses as presented in five different papers using microarrays and expressed sequence tags. The purpose was to evaluate unique and common patterns of gene expression during development, aging and cataracts. PMID:23244575

  10. Analysis of expressed sequence tags from the Ulva prolifera (Chlorophyta)

    NASA Astrophysics Data System (ADS)

    Niu, Jianfeng; Hu, Haiyan; Hu, Songnian; Wang, Guangce; Peng, Guang; Sun, Song

    2010-01-01

    In 2008, a green tide broke out before the sailing competition of the 29th Olympic Games in Qingdao. The causative species was determined to be Enteromorpha prolifera ( Ulva prolifera O. F. Müller), a familiar green macroalga along the coastline of China. Rapid accumulation of a large biomass of floating U. prolifera prompted research on different aspects of this species. In this study, we constructed a nonnormalized cDNA library from the thalli of U. prolifera and acquired 10 072 high-quality expressed sequence tags (ESTs). These ESTs were assembled into 3 519 nonredundant gene groups, including 1 446 clusters and 2 073 singletons. After annotation with the nr database, a large number of genes were found to be related with chloroplast and ribosomal protein, GO functional classification showed 1 418 ESTs participated in photosynthesis and 1 359 ESTs were responsible for the generation of precursor metabolites and energy. In addition, rather comprehensive carbon fixation pathways were found in U. prolifera using KEGG. Some stress-related and signal transduction-related genes were also found in this study. All the evidences displayed that U. prolifera had substance and energy foundation for the intense photosynthesis and the rapid proliferation. Phylogenetic analysis of cytochrome c oxidase subunit I revealed that this green-tide causative species is most closely affiliated to Pseudendoclonium akinetum (Ulvophyceae).

  11. Identification of human chromosome 22 transcribed sequences with ORF expressed sequence tags

    PubMed Central

    de Souza, Sandro J.; Camargo, Anamaria A.; Briones, Marcelo R. S.; Costa, Fernando F.; Nagai, Maria Aparecida; Verjovski-Almeida, Sergio; Zago, Marco A.; Andrade, Luis Eduardo C.; Carrer, Helaine; El-Dorry, Hamza F. A.; Espreafico, Enilza M.; Habr-Gama, Angelita; Giannella-Neto, Daniel; Goldman, Gustavo H.; Gruber, Arthur; Hackel, Christine; Kimura, Edna T.; Maciel, Rui M. B.; Marie, Suely K. N.; Martins, Elizabeth A. L.; Nóbrega, Marina P.; Paçó-Larson, Maria Luisa; Pardini, Maria Inês M. C.; Pereira, Gonçalo G.; Pesquero, João Bosco; Rodrigues, Vanderlei; Rogatto, Silvia R.; da Silva, Ismael D. C. G.; Sogayar, Mari C.; de Fátima Sonati, Maria; Tajara, Eloiza H.; Valentini, Sandro R.; Acencio, Marcio; Alberto, Fernando L.; Amaral, Maria Elisabete J.; Aneas, Ivy; Bengtson, Mário Henrique; Carraro, Dirce M.; Carvalho, Alex F.; Carvalho, Lúcia Helena; Cerutti, Janete M.; Corrêa, Maria Lucia C.; Costa, Maria Cristina R.; Curcio, Cyntia; Gushiken, Tsieko; Ho, Paulo L.; Kimura, Elza; Leite, Luciana C. C.; Maia, Gustavo; Majumder, Paromita; Marins, Mozart; Matsukuma, Adriana; Melo, Analy S. A.; Mestriner, Carlos Alberto; Miracca, Elisabete C.; Miranda, Daniela C.; Nascimento, Ana Lucia T. O.; Nóbrega, Francisco G.; Ojopi, Élida P. B.; Pandolfi, José Rodrigo C.; Pessoa, Luciana Gilbert; Rahal, Paula; Rainho, Claudia A.; da Ro's, Nancy; de Sá, Renata G.; Sales, Magaly M.; da Silva, Neusa P.; Silva, Tereza C.; da Silva, Wilson; Simão, Daniel F.; Sousa, Josane F.; Stecconi, Daniella; Tsukumo, Fernando; Valente, Valéria; Zalcberg, Heloisa; Brentani, Ricardo R.; Reis, Luis F. L.; Dias-Neto, Emmanuel; Simpson, Andrew J. G.

    2000-01-01

    Transcribed sequences in the human genome can be identified with confidence only by alignment with sequences derived from cDNAs synthesized from naturally occurring mRNAs. We constructed a set of 250,000 cDNAs that represent partial expressed gene sequences and that are biased toward the central coding regions of the resulting transcripts. They are termed ORF expressed sequence tags (ORESTES). The 250,000 ORESTES were assembled into 81,429 contigs. Of these, 1,181 (1.45%) were found to match sequences in chromosome 22 with at least one ORESTES contig for 162 (65.6%) of the 247 known genes, for 67 (44.6%) of the 150 related genes, and for 45 of the 148 (30.4%) EST-predicted genes on this chromosome. Using a set of stringent criteria to validate our sequences, we identified a further 219 previously unannotated transcribed sequences on chromosome 22. Of these, 171 were in fact also defined by EST or full length cDNA sequences available in GenBank but not utilized in the initial annotation of the first human chromosome sequence. Thus despite representing less than 15% of all expressed human sequences in the public databases at the time of the present analysis, ORESTES sequences defined 48 transcribed sequences on chromosome 22 not defined by other sequences. All of the transcribed sequences defined by ORESTES coincided with DNA regions predicted as encoding exons by genscan. (http://genes.mit.edu/GENSCAN.html). PMID:11070084

  12. Peanut (Arachis hypogaea) expressed sequence tag (EST) project: Progress and application.

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Millions of expressed sequence tag (EST) sequences from several hundred plant species have been deposited in public EST databases. Many plant ESTs have been sequenced as an alternative to whole genome sequences, including peanut because of the genome size and complexity. The US peanut research commu...

  13. Peanut (Arachis hypogaea) Expressed Sequence Tag Project: Progress and Application

    PubMed Central

    Feng, Suping; Wang, Xingjun; Zhang, Xinyou; Dang, Phat M.; Holbrook, C. Corley; Culbreath, Albert K.; Wu, Yaoting; Guo, Baozhu

    2012-01-01

    Many plant ESTs have been sequenced as an alternative to whole genome sequences, including peanut because of the genome size and complexity. The US peanut research community had the historic 2004 Atlanta Genomics Workshop and named the EST project as a main priority. As of August 2011, the peanut research community had deposited 252,832 ESTs in the public NCBI EST database, and this resource has been providing the community valuable tools and core foundations for various genome-scale experiments before the whole genome sequencing project. These EST resources have been used for marker development, gene cloning, microarray gene expression and genetic map construction. Certainly, the peanut EST sequence resources have been shown to have a wide range of applications and accomplished its essential role at the time of need. Then the EST project contributes to the second historic event, the Peanut Genome Project 2010 Inaugural Meeting also held in Atlanta where it was decided to sequence the entire peanut genome. After the completion of peanut whole genome sequencing, ESTs or transcriptome will continue to play an important role to fill in knowledge gaps, to identify particular genes and to explore gene function. PMID:22745594

  14. Gene expression profile in the anterior regeneration of the earthworm using expressed sequence tags.

    PubMed

    Cho, Sung-Jin; Lee, Myung Sik; Tak, Eun Sik; Lee, Eun; Koh, Ki Seok; Ahn, Chi Hyun; Park, Soon Cheol

    2009-01-01

    In order to gain insight into the gene expression profiles associated with anterior regeneration of the earthworm, Perionyx excavatus, we analyzed 1,159 expressed sequence tags (ESTs) derived from cDNA library early anterior regenerated tissue. Among the 1,159 ESTs analyzed, 622 (53.7%) ESTs showed significant similarity to known genes and represented 338 genes, of which 233 ESTs were singletons and 105 ESTs manifested as two or more ESTs. While 663 ESTs (57.2%) were sequenced only once, 308 ESTs (26.6%) appeared 2 to 5 times, and 188 ESTs (16.2%) were sequenced more than 5 times. A total of 803 genes were categorized into 15 groups according to their biological functions. Among 1,159 ESTs sequenced, we found several gene encoding signaling molecules, such as Notch and Distal-less. The ESTs used in this study should provide a resource for future research in earthworm regeneration. PMID:19129665

  15. Development of peanut EST (expressed sequence tag)-based genomic resources and tools

    Technology Transfer Automated Retrieval System (TEKTRAN)

    U.S. Peanut Genome Initiative (PGI) has widely recognized the need for peanut genome tools and resources development for mitigating peanut allergens and food safety. Genomics such as Expressed Sequence Tag (EST), microarray technologies, and whole genome sequencing provides robotic tools for profili...

  16. Analysis of expressed sequence tags (ESTs) from Agrostis species obtained using sequence related amplified polymorphism.

    PubMed

    Dinler, Gizem; Budak, Hikmet

    2008-10-01

    Bentgrass (Agrostis spp.), a genus of the Poaceae family, consists of more than 200 species and is mainly used in athletic fields and golf courses. Creeping bentgrass (A. stolonifera L.) is the most commonly used species in maintaining golf courses, followed by colonial bentgrass (A. capillaris L.) and velvet bentgrass (A. canina L.). The presence and nature of sequence related amplified polymorphism (SRAP) at the cDNA level were investigated. We isolated 80 unique cDNA fragment bands from these species using 56 SRAP primer combinations. Sequence analysis of cDNA clones and analysis of putative translation products revealed that some encoded amino acid sequences were similar to proteins involved in DNA synthesis, transcription, and signal transduction. The cytosolic glyceraldehyde-3-phosphate dehydrogenase (GAPDH) gene (GenBank accession no. EB812822) was also identified from velvet bentgrass, and the corresponding protein sequence is further analyzed due to its critical role in many cellular processes. The partial peptide sequence obtained was 112 amino acids long, presenting a high degree of homology to parts of the N-terminal and C-terminal regions of cytosolic phosphorylating GAPDH (GapC). The existence of common expressed sequence tags (ESTs) revealed by a minimum evolutionary dendrogram among the Agrostis ESTs indicated the usefulness of SRAP for comparative genome analysis of transcribed genes in the grass species. PMID:18726683

  17. A score system for quality evaluation of RNA sequence tags: an improvement for gene expression profiling

    PubMed Central

    Pinheiro, Daniel G; Galante, Pedro AF; de Souza, Sandro J; Zago, Marco A; Silva, Wilson A

    2009-01-01

    Background High-throughput molecular approaches for gene expression profiling, such as Serial Analysis of Gene Expression (SAGE), Massively Parallel Signature Sequencing (MPSS) or Sequencing-by-Synthesis (SBS) represent powerful techniques that provide global transcription profiles of different cell types through sequencing of short fragments of transcripts, denominated sequence tags. These techniques have improved our understanding about the relationships between these expression profiles and cellular phenotypes. Despite this, more reliable datasets are still necessary. In this work, we present a web-based tool named S3T: Score System for Sequence Tags, to index sequenced tags in accordance with their reliability. This is made through a series of evaluations based on a defined rule set. S3T allows the identification/selection of tags, considered more reliable for further gene expression analysis. Results This methodology was applied to a public SAGE dataset. In order to compare data before and after filtering, a hierarchical clustering analysis was performed in samples from the same type of tissue, in distinct biological conditions, using these two datasets. Our results provide evidences suggesting that it is possible to find more congruous clusters after using S3T scoring system. Conclusion These results substantiate the proposed application to generate more reliable data. This is a significant contribution for determination of global gene expression profiles. The library analysis with S3T is freely available at . S3T source code and datasets can also be downloaded from the aforementioned website. PMID:19500384

  18. Comparative mapping of expressed sequence tags containing microsatellites in rainbow trout (Oncorhynchus mykiss)

    PubMed Central

    Rexroad, Caird E; Rodriguez, Maria F; Coulibaly, Issa; Gharbi, Karim; Danzmann, Roy G; DeKoning, Jenefer; Phillips, Ruth; Palti, Yniv

    2005-01-01

    Background Comparative genomics, through the integration of genetic maps from species of interest with whole genome sequences of other species, will facilitate the identification of genes affecting phenotypes of interest. The development of microsatellite markers from expressed sequence tags will serve to increase marker densities on current salmonid genetic maps and initiate in silico comparative maps with species whose genomes have been fully sequenced. Results Eighty-nine polymorphic microsatellite markers were generated for rainbow trout of which at least 74 amplify in other salmonids. Fifty-five have been associated with functional annotation and 30 were mapped on existing genetic maps. Homologous sequences were identified for 20 of the EST containing microsatellites to identify comparative assignments within the tetraodon, mouse, and/or human genomes. Conclusion The addition of microsatellite markers constructed from expressed sequence tag data will facilitate the development of high-density genetic maps for rainbow trout and comparative maps with other salmonids and better studied species. PMID:15836796

  19. Mining and comparison of haplotype-based expressed sequence tag single nucleotide polymorphisms among citrus cultivars

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In this paper, haplotype-based SNPs were mined out of publicly available citrus expressed sequence tags (ESTs) from different citrus cultivars (genotypes) individually and collectively for comparison. There were a total of 567,297 ESTs belonging to 27 cultivars in varying numbers and consequentially...

  20. Seventy microsatellite markers from Persea americana Miller (avocado) expressed sequence tags

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Expressed sequence tags (ESTs) for Persea americana Mill. were investigated to expand upon the number of informative microsatellite markers available for avocado. Seventy informative loci were discovered using twenty-four P. americana var. americana Mill. accessions. The number of alleles detected r...

  1. Comparative gene expression in the symbiotic and aposymbiotic Aiptasia pulchella by expressed sequence tag analysis.

    PubMed

    Kuo, Jimmy; Chen, Ming-Chyuan; Lin, Chorng-Horng; Fang, Lee-Shing

    2004-05-21

    Intracellular symbiotic relationships are prevalent between cnidarians, such as corals and sea anemones, and the photosynthetic dinoflagellate symbionts. However, there is little understanding about how the genes express when the symbiotic relationship is set up. To characterize genes involved in this association, the endosymbiosis between sea anemone, Aiptasia pulchella, and dinoflagellate zooxanthellae, Symbiodinium spp., was employed as a model. Two complementary DNA (cDNA) libraries were constructed from RNA isolated from symbiotic and aposymbiotic A. pulchella. Using single-pass sequencing of cDNA clones, a total of 870 expressed sequence tags (ESTs) clones were generated from the two libraries: 474 from symbiotic animal and 396 from aposymbiotic animal. The initial ESTs consisted of 143 clusters and 231 singletons. A BLASTX search revealed that 147 unique genes had similarities with protein sequences available from databases; 120 of these clones were categorized according to their putative function. However, many ESTs could not assign functionally. The putative roles of some of the identified genes relative to endosymbiosis were discussed. This is the first report of the use of EST analysis to examine the gene expression in symbiotic and aposymbiotic states of the cnidarians. The systematic analysis of EST from this study provides a useful database for future investigations of the molecular mechanisms involved in algal-cnidarian symbiosis. PMID:15110770

  2. Development of expressed sequence tag and expressed sequence tag–simple sequence repeat marker resources for Musa acuminata

    PubMed Central

    Passos, Marco A. N.; de Oliveira Cruz, Viviane; Emediato, Flavia L.; de Camargo Teixeira, Cristiane; Souza, Manoel T.; Matsumoto, Takashi; Rennó Azevedo, Vânia C.; Ferreira, Claudia F.; Amorim, Edson P.; de Alencar Figueiredo, Lucio Flavio; Martins, Natalia F.; de Jesus Barbosa Cavalcante, Maria; Baurens, Franc-Christophe; da Silva, Orzenil Bonfim; Pappas, Georgios J.; Pignolet, Luc; Abadie, Catherine; Ciampi, Ana Y.; Piffanelli, Pietro; Miller, Robert N. G.

    2012-01-01

    Background and aims Banana (Musa acuminata) is a crop contributing to global food security. Many varieties lack resistance to biotic stresses, due to sterility and narrow genetic background. The objective of this study was to develop an expressed sequence tag (EST) database of transcripts expressed during compatible and incompatible banana–Mycosphaerella fijiensis (Mf) interactions. Black leaf streak disease (BLSD), caused by Mf, is a destructive disease of banana. Microsatellite markers were developed as a resource for crop improvement. Methodology cDNA libraries were constructed from in vitro-infected leaves from BLSD-resistant M. acuminata ssp. burmaniccoides Calcutta 4 (MAC4) and susceptible M. acuminata cv. Cavendish Grande Naine (MACV). Clones were 5′-end Sanger sequenced, ESTs assembled with TGICL and unigenes annotated using BLAST, Blast2GO and InterProScan. Mreps was used to screen for simple sequence repeats (SSRs), with markers evaluated for polymorphism using 20 diploid (AA) M. acuminata accessions contrasting in resistance to Mycosphaerella leaf spot diseases. Principal results A total of 9333 high-quality ESTs were obtained for MAC4 and 3964 for MACV, which assembled into 3995 unigenes. Of these, 2592 displayed homology to genes encoding proteins with known or putative function, and 266 to genes encoding proteins with unknown function. Gene ontology (GO) classification identified 543 GO terms, 2300 unigenes were assigned to EuKaryotic orthologous group categories and 312 mapped to Kyoto Encyclopedia of Genes and Genomes pathways. A total of 624 SSR loci were identified, with trinucleotide repeat motifs the most abundant in MAC4 (54.1 %) and MACV (57.6 %). Polymorphism across M. acuminata accessions was observed with 75 markers. Alleles per polymorphic locus ranged from 2 to 8, totalling 289. The polymorphism information content ranged from 0.08 to 0.81. Conclusions This EST collection offers a resource for studying functional genes, including

  3. [Isolation and expression of novel expressed sequence tags (ESTs) from ovarian follicles of Shaoxing ducks].

    PubMed

    Shu, Gang; Chen, Jie; Ni, Ying-Dong; Zhou, Yu-Chuan; Zhao, Ru-Qian

    2004-10-01

    Three expressed sequence tags ( ESTs), SXDF0201 (271 bp), SXDF0202 (200 bp) and SXDF0203 (173 bp), were isolated from ovarian follicles of Shaoxing ducks by using silver staining mRNA differential display. GenBank/BLAST analysis revealed that SXDF0201 was not homologous to any of the published sequences from all species, indicating that it was a novel EST and was then registered in GenBank (GenBank Accession No.: CB072629), while SXDF0202 and SXDF0203 were found to be highly homologous to seven known chicken ESTs and chicken mRNA for gizzard smooth muscle myosin heavy chain. 5'-RACE was employed to extend the SXDF0201 to 544 bp which was confirmed as novel in BLAST search. The temporal and spatial expression of SXDF0201 and SXDF0202 were also investigated with semi-quantitative RT-PCR. The result showed that: both SXDF0201 and SXDF0202 were found to be expressed in hypothalamus, pituitary, muscle, liver, and fat tissues of Shaoxing ducks; SXDF0201 was expressed significantly higher in ovaries of 30-day-old Shaoxing ducks compared with that of 60-day-old (P < 0.05) and 90-day-old (P = 0.015), but the expression of SXDF0202 showed no difference throughout the ovarian development; granulose layers expressed higher SXDF0201 than theca layers in almost all hierarchical follicles, the expression of SXDF0202 in granulose layers increased along with follicular maturation (P < 0.01) from Fw to F3 follicles, but decreased dramatically to the lowest in F1 follicles (P < 0.01). In theca layers, the highest expression of SXDF0202 was found in Fw follicles (P < 0.01). PMID:15552044

  4. Initiation of a Sarcocystis neurona expressed sequence tag (EST) sequencing project: a preliminary report.

    PubMed

    Howe, D K

    2001-02-26

    To accelerate genetic and molecular characterization of Sarcocystis neurona, the primary causative agent of equine protozoal myeloencephalitis (EPM), a sequencing project has been initiated that will generate approximately 7000-8000 expressed sequence tags (ESTs) from this apicomplexan parasite. Poly(A)(+) RNA was isolated from culture-derived S. neurona merozoites, and a cDNA library was constructed in a unidirectional lambda phage cloning vector. Sixty phage clones were randomly picked from the library, and the cDNA inserts were amplified from these clones using the T3 and T7 primers that flank the multi-cloning site of the lambda vector. This analysis demonstrated that 100% (60/60) of the clones selected from this library contained recombinant cDNA inserts ranging in size from 0.4 to 4.0 kilobases (kb) with an average size of 1.23kb. Single-pass sequencing from the 5' end of the 60 amplified cDNAs produced high-quality nucleotide sequence from 53 of the clones. Comparison of these ESTs to the current gene databases revealed significant matches for 10 of the ESTs, six of which are similar to sequences from other Apicomplexa (i.e., Toxoplasma gondii). Importantly, none of the ESTs were of obvious mammalian origin, thus indicating that the cDNAs in this library were derived primarily from parasite mRNA and not from mRNA of the bovine turbinate host cells. Collectively, these data indicate that the described cDNA library will provide an excellent substrate for generating a portion of the ESTs that are planned from S. neurona. This sequencing project will greatly hasten gene discovery for this protozoan pathogen thereby enhancing efforts towards the development of improved diagnostics, treatments, and preventatives for EPM. In addition, the S. neurona ESTs will represent a significant contribution to the extensive database of sequences from the Apicomplexa. Comparative analyses of these apicomplexan sequences will likely offer a multitude of important information

  5. Analysis of early hepatic stage schistosomula gene expression by subtractive expressed sequence tags library.

    PubMed

    Wang, Xinzhi; Gobert, Geoffrey N; Feng, XinGang; Fu, Zhiqiang; Jin, Yamei; Peng, Jinbiao; Lin, Jiaojiao

    2009-07-01

    Schistosome parasites require a complex lifecycle requiring two hosts and aquatic phases of development. The schistosomula is a key phase of parasite development within the mammalian host, however relatively little is understood about the molecular processes underlying this stage. In this study 5723 subtractive expressed sequence tags (ESTs) were randomly selected from a 7 day hepatic schistosomula enriched library constructed using suppression subtractive hybridization method. Sequence analysis of these ESTs identified 1762 unique genes (contigs). Among them, 989 contigs were annotated with known genes, 311 contigs were homologous to established genes, 101 contigs were similar to established genes, 72 contigs were weakly similar to established genes and 289 sequences did not match any published sequences. Genes identified related to metabolism, cellular development, immune evasion and host-parasite interactions were identified as enriched in the hepatic schistosomula stage. The future identification of poorly annotated but stage-specific genes may potentially represent new drugs or vaccine targets, applicable for the future controlling of schistosomiasis. PMID:19428674

  6. Alternative splicing and expression profile analysis of expressed sequence tags in domestic pig.

    PubMed

    Zhang, Liang; Tao, Lin; Ye, Lin; He, Ling; Zhu, Yuan-Zhong; Zhu, Yue-Dong; Zhou, Yan

    2007-02-01

    Domestic pig (Sus scrofa domestica) is one of the most important mammals to humans. Alternative splicing is a cellular mechanism in eukaryotes that greatly increases the diversity of gene products. Expression sequence tags (ESTs) have been widely used for gene discovery, expression profile analysis, and alternative splicing detection. In this study, a total of 712,905 ESTs extracted from 101 different non-normalized EST libraries of the domestic pig were analyzed. These EST libraries cover the nervous system, digestive system, immune system, and meat production related tissues from embryo, newborn, and adult pigs, making contributions to the analysis of alternative splicing variants as well as expression profiles in various stages of tissues. A modified approach was designed to cluster and assemble large EST datasets, aiming to detect alternative splicing together with EST abundance of each splicing variant. Much efforts were made to classify alternative splicing into different types and apply different filters to each type to get more reliable results. Finally, a total of 1,223 genes with average 2.8 splicing variants were detected among 16,540 unique genes. The overview of expression profiles would change when we take alternative splicing into account. PMID:17572361

  7. Comprehensive analyses of prostate gene expression: convergence of expressed sequence tag databases, transcript profiling and proteomics.

    PubMed

    Nelson, P S; Han, D; Rochon, Y; Corthals, G L; Lin, B; Monson, A; Nguyen, V; Franza, B R; Plymate, S R; Aebersold, R; Hood, L

    2000-05-01

    Several methods have been developed for the comprehensive analysis of gene expression in complex biological systems. Generally these procedures assess either a portion of the cellular transcriptome or a portion of the cellular proteome. Each approach has distinct conceptual and methodological advantages and disadvantages. We have investigated the application of both methods to characterize the gene expression pathway mediated by androgens and the androgen receptor in prostate cancer cells. This pathway is of critical importance for the development and progression of prostate cancer. Of clinical importance, modulation of androgens remains the mainstay of treatment for patients with advanced disease. To facilitate global gene expression studies we have first sought to define the prostate transcriptome by assembling and annotating prostate-derived expressed sequence tags (ESTs). A total of 55000 prostate ESTs were assembled into a set of 15953 clusters putatively representing 15953 distinct transcripts. These clusters were used to construct cDNA microarrays suitable for examining the androgen-response pathway at the level of transcription. The expression of 20 genes was found to be induced by androgens. This cohort included known androgen-regulated genes such as prostate-specific antigen (PSA) and several novel complementary DNAs (cDNAs). Protein expression profiles of androgen-stimulated prostate cancer cells were generated by two-dimensional electrophoresis (2-DE). Mass spectrometric analysis of androgen-regulated proteins in these cells identified the metastasis-suppressor gene NDKA/nm23, a finding that may explain a marked reduction in metastatic potential when these cells express a functional androgen receptor pathway. PMID:10870968

  8. Evaluation of anonymous and expressed sequence tag derived polymorphic microsatellite markers in the tobacco budworm Heliothis virescens (Lepidoptera: noctuidae)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Polymorphic genetic markers were identified and characterized using a partial genomic library of Heliothis virescens enriched for simple sequence repeats (SSR) and nucleotide sequences of expressed sequence tags (EST). Nucleotide sequences of 192 clones from the partial genomic library yielded 147 u...

  9. Large-scale detection and application of expressed sequence tag single nucleotide polymorphisms in Nicotiana.

    PubMed

    Wang, Y; Zhou, D; Wang, S; Yang, L

    2015-01-01

    Single nucleotide polymorphisms (SNPs) are widespread in the Nicotiana genome. Using an alignment and variation detection method, we developed 20,607,973 SNPs, based on the expressed sequence tag sequences of 10 Nicotiana species. The replacement rate was much higher than the transversion rate in the SNPs, and SNPs widely exist in the Nicotiana. In vitro verification indicated that all of the SNPs were high quality and accurate. Evolutionary relationships between 15 varieties were investigated by polymerase chain reaction with a special primer; the specific 302 locus of these sequence results clearly indicated the origin of Zhongyan 100. A database of Nicotiana SNPs (NSNP) was developed to store and search for SNPs in Nicotiana. NSNP is a tool for researchers to develop SNP markers of sequence data. PMID:26214460

  10. Expressed sequence tags of Chinese cabbage flower bud cDNA.

    PubMed Central

    Lim, C O; Kim, H Y; Kim, M G; Lee, S I; Chung, W S; Park, S H; Hwang, I; Cho, M J

    1996-01-01

    We randomly selected and partially sequenced cDNA clones from a library of Chinese cabbage (Brassica campestris L. ssp. pekinensis) flower bud cDNAs. Out of 1216 expressed sequence tags (ESTs), 904 cDNA clones were unique or nonredundant. Five hundred eighty-eight clones (48.4%) had sequence homology to functionally defined genes at the peptide level. Only 5 clones encoded known flower-specific proteins. Among the cDNAs with no similarity to known protein sequences (628), 184 clones had significant similarity to nucleotide sequences registered in the databases. Among these 184 clones, 142 exhibited similarities at the nucleotide level only with plant ESTs. Also, sequence similarities were evident between these 142 ESTs and their matching ESTs when compared using the deduced amino acid sequences. Therefore, it is possible that the anonymous ESTs encode plant-specific ubiquitous proteins. Our extensive EST analysis of genes expressed in floral organs not only contributes to the understanding of the dynamics of genome expression patterns in floral organs but also adds data to the repertoire of all genomic genes. PMID:8787028

  11. Identification of antimicrobial peptides from teleosts and anurans in expressed sequence tag databases using conserved signal sequences.

    PubMed

    Tessera, Valentina; Guida, Filomena; Juretić, Davor; Tossi, Alessandro

    2012-03-01

    The problem of multidrug resistance requires the efficient and accurate identification of new classes of antimicrobial agents. Endogenous antimicrobial peptides produced by most organisms are a promising source of such molecules. We have exploited the high conservation of signal sequences in teleost and anuran antimicrobial peptides to search cDNA (expressed sequence tag) databases for likely candidates. Subject sequences were then analysed for the presence of potential antimicrobial peptides based on physicochemical properties (amphipathic helical structure, cationicity) and use of the D-descriptor model to predict the therapeutic index (relation between the minimum inhibitory concentration and the concentration giving 50% haemolysis). This analysis also suggested mutations to probe the role of the primary structure in determining potency and selectivity. Selected sequences were chemically synthesized and the antimicrobial activity of the peptides was confirmed. In particular, a short (21-residue) sequence, likely of sticklefish origin, showed potent activity and it was possible to tune the spectrum of action and/or selectivity by combining three directed mutations. Membrane permeabilization studies on both bacterial and host cells indicate that the mode of action was prevalently membranolytic. This method opens up the possibility for more effective searching of the vast and continuously growing expressed sequence tag databases for novel antimicrobial peptides, which are likely abundant, and the efficient identification of the most promising candidates among them. PMID:22188679

  12. Characterization of Expressed Sequence Tags From a Gallus gallus Pineal Gland cDNA Library

    PubMed Central

    Hartman, Stefanie; Touchton, Greg; Wynn, Jessica; Geng, Tuoyu; Chong, Nelson W.

    2005-01-01

    The pineal gland is the circadian oscillator in the chicken, regulating diverse functions ranging from egg laying to feeding. Here, we describe the isolation and characterization of expressed sequence tags (ESTs) isolated from a chicken pineal gland cDNA library. A total of 192 unique sequences were analysed and submitted to GenBank; 6% of the ESTs matched neither GenBank cDNA sequences nor the newly assembled chicken genomic DNA sequence, three ESTs aligned with sequences designated to be on the Z_random, while one matched a W chromosome sequence and could be useful in cataloguing functionally important genes on this sex chromosome. Additionally, single nucleotide polymorphisms (SNPs) were identified and validated in 10 ESTs that showed 98% or higher sequence similarity to known chicken genes. Here, we have described resources that may be useful in comparative and functional genomic analysis of genes expressed in an important organ, the pineal gland, in a model and agriculturally important organism. PMID:18629218

  13. Sixteen Polymorphic Simple Sequence Repeat Markers from Expressed Sequence Tags of the Chinese Mitten Crab Eriocheir sinensis

    PubMed Central

    Gao, Xiang-Gang; Li, Hong-Jun; Li, Yun-Feng; Sui, Li-Jun; Zhu, Bao; Liang, Yu; Liu, Wei-Dong; He, Chong-Bo

    2010-01-01

    The Chinese mitten crab (Eriocheir sinensis) is an economically important aquaculture species in China. In this study, we developed and evaluated simple sequence repeat markers from expressed sequence tags of E. sinensis. Among the 40 wild E. sinensis individuals tested, 16 loci were polymorphic. The number of alleles per locus ranged from two to ten. The observed heterozygosity ranged from 0.0667 to 0.9667, whereas the expected heterozygosity ranged from 0.0661 to 0.9051. These markers have the potential for use in genetic studies of population structure and intraspecific variation in E. sinensis. PMID:21152289

  14. Generation and analysis of expressed sequence tags from the ciliate protozoan parasite Ichthyophthirius multifiliis

    PubMed Central

    Abernathy, Jason W; Xu, Peng; Li, Ping; Xu, De-Hai; Kucuktas, Huseyin; Klesius, Phillip; Arias, Covadonga; Liu, Zhanjiang

    2007-01-01

    Background The ciliate protozoan Ichthyophthirius multifiliis (Ich) is an important parasite of freshwater fish that causes 'white spot disease' leading to significant losses. A genomic resource for large-scale studies of this parasite has been lacking. To study gene expression involved in Ich pathogenesis and virulence, our goal was to generate expressed sequence tags (ESTs) for the development of a powerful microarray platform for the analysis of global gene expression in this species. Here, we initiated a project to sequence and analyze over 10,000 ESTs. Results We sequenced 10,368 EST clones using a normalized cDNA library made from pooled samples of the trophont, tomont, and theront life-cycle stages, and generated 9,769 sequences (94.2% success rate). Post-sequencing processing led to 8,432 high quality sequences. Clustering analysis of these ESTs allowed identification of 4,706 unique sequences containing 976 contigs and 3,730 singletons. These unique sequences represent over two million base pairs (~10% of Plasmodium falciparum genome, a phylogenetically related protozoan). BLASTX searches produced 2,518 significant (E-value < 10-5) hits and further Gene Ontology (GO) analysis annotated 1,008 of these genes. The ESTs were analyzed comparatively against the genomes of the related protozoa Tetrahymena thermophila and P. falciparum, allowing putative identification of additional genes. All the EST sequences were deposited by dbEST in GenBank (GenBank: EG957858–EG966289). Gene discovery and annotations are presented and discussed. Conclusion This set of ESTs represents a significant proportion of the Ich transcriptome, and provides a material basis for the development of microarrays useful for gene expression studies concerning Ich development, pathogenesis, and virulence. PMID:17577414

  15. A known expressed sequence tag, BM742401, is a potent lincRNA inhibiting cancer metastasis.

    PubMed

    Park, Seong-Min; Park, Sung-Joon; Kim, Hee-Jin; Kwon, Oh-Hyung; Kang, Tae-Wook; Sohn, Hyun-Ahm; Kim, Seon-Kyu; Moo Noh, Seung; Song, Kyu-Sang; Jang, Se-Jin; Sung Kim, Yong; Kim, Seon-Young

    2013-01-01

    Long intergenic non-coding RNAs (lincRNAs) have historically been ignored in cancer biology. However, thousands of lincRNAs have been identified in mammals using recently developed genomic tools, including microarray and high-throughput RNA sequencing (RNA-seq). Several of the lincRNAs identified have been well characterized for their functions in carcinogenesis. Here we performed RNA-seq experiments comparing gastric cancer with normal tissues to find differentially expressed transcripts in intergenic regions. By analyzing our own RNA-seq and public microarray data, we identified 31 transcripts, including a known expressed sequence tag, BM742401. BM742401 was downregulated in cancer, and its downregulation was associated with poor survival in gastric cancer patients. Ectopic overexpression of BM742401 inhibited metastasis-related phenotypes and decreased the concentration of extracellular MMP9. These results suggest that BM742401 is a potential lincRNA marker and therapeutic target. PMID:23846333

  16. Transcriptome analysis of Loxosceles laeta (Araneae, Sicariidae) spider venomous gland using expressed sequence tags

    PubMed Central

    Fernandes-Pedrosa, Matheus de F; Junqueira-de-Azevedo, Inácio de LM; Gonçalves-de-Andrade, Rute M; Kobashi, Leonardo S; Almeida, Diego D; Ho, Paulo L; Tambourgi, Denise V

    2008-01-01

    Background The bite of spiders belonging to the genus Loxosceles can induce a variety of clinical symptoms, including dermonecrosis, thrombosis, vascular leakage, haemolysis, and persistent inflammation. In order to examine the transcripts expressed in venom gland of Loxosceles laeta spider and to unveil the potential of its products on cellular structure and functional aspects, we generated 3,008 expressed sequence tags (ESTs) from a cDNA library. Results All ESTs were clustered into 1,357 clusters, of which 16.4% of the total ESTs belong to recognized toxin-coding sequences, being the Sphingomyelinases D the most abundant transcript; 14.5% include "possible toxins", whose transcripts correspond to metalloproteinases, serinoproteinases, hyaluronidases, lipases, C-lectins, cystein peptidases and inhibitors. Thirty three percent of the ESTs are similar to cellular transcripts, being the major part represented by molecules involved in gene and protein expression, reflecting the specialization of this tissue for protein synthesis. In addition, a considerable number of sequences, 25%, has no significant similarity to any known sequence. Conclusion This study provides a first global view of the gene expression scenario of the venom gland of L. laeta described so far, indicating the molecular bases of its venom composition. PMID:18547439

  17. Expressed sequence tags from a NaCl-treated Suaeda salsa cDNA library.

    PubMed

    Zhang, L; Ma, X L; Zhang, Q; Ma, C L; Wang, P P; Sun, Y F; Zhao, Y X; Zhang, H

    2001-04-18

    Past efforts to improve plant tolerance to osmotic stress have had limited success owing to the genetic complexity of stress responses. The first step towards cataloging and categorizing genetically complex abotic stress responses is the rapid discovery of genes by the large-scale partial sequencing of randomly selected cDNA clones or expressed sequence tags (ESTs). Suaeda salsa, which can survive seawater-level salinity, is a favorite halophytic model for salt tolerant research. We constructed a NaCl-treated cDNA library of Suaeda salsa and sequenced 1048 randomly selected clones, out of which 1016 clones produced readable sequences (773 showed homology to previously identified genes, 227 matched unknown protein coding regions, 16 anomalous sequences or sequences of bacterial origin were excluded from further analysis). By sequence analysis we identified 492 unique clones: 315 showed homology to previously identified genes, 177 matched unknown protein coding regions (101 of which have been found before in other organisms and 76 are completely novel). All our EST data are available on the Internet. We believe that our dbEST and the associated DNA materials will be a useful source to scientists engaging in stress-tolerance study. PMID:11313146

  18. Expressed sequence tag analysis of the emu (Dromaius novaehollandiae) pituitary by 454 GS Junior pyrosequencing.

    PubMed

    Kim, Ji Eun; Leung, Frederick C; Jiang, Jingwei; Kwok, Amy H Y; Bennett, Darin C; Cheng, Kimberly M

    2013-01-01

    Emus (Dromaius novaehollandiae) are farmed for their oil for pharmaceutical and cosmetic uses. This emu pituitary expressed sequence tag study was undertaken to identify novel transcripts in the emu pituitary to propel their identification and functional studies. By mapping reads derived from the Roche 454 GS Junior pyrosequencer to 8 reference species (human, mouse, chicken, zebra finch, fruit fly, turkey, round worm, and Carolina anole lizard) from the UniGene database, a total of 81,788 reads (53,312 mapped reads) were obtained and assembled with Reference Sequence (RefSeq). We annotated 6,676 potential emu genes by referencing 7 species (excluding lizard) and identified 1,232 potential genes common among 3 species (human, mouse, and chicken) with complete available reference genomes. Gene Ontology analysis revealed 376 Gene Ontology terms showing, with the highest counts, their involvements in biological processes, metabolism, and cellular components. These potential genes were detected to associate with 20 pathways including mitogen-activated protein kinase, insulin, neurotrophin signaling pathways, and carbohydrate digestion and absorption pathway. We also revealed a panel of tissue-specific genes including regulator of G-protein signaling protein (RGS), glucagon-like peptide receptor (GLPR), and growth hormone-inducible transmembrane protein (GHITM). Additionally, fatty acid binding protein (FABP), fatty acid desaturase (FAS), and stearoyl-coenzyme A desaturase (SCD), key enzyme genes in fat metabolism, were found to be also expressed in emu pituitary. This expressed sequence tag study represents the first step in functional characterization of emu pituitary gene expression and SNP identification for the improvement of fat production in the emu. PMID:23243234

  19. Gene ontology based characterization of expressed sequence tags (ESTs) of Brassica rapa cv. Osome.

    PubMed

    Arasan, Senthil Kumar Thamil; Park, Jong-In; Ahmed, Nasar Uddin; Jung, Hee-Jeong; Lee, In-Ho; Cho, Yong-Gu; Lim, Yong-Pyo; Kang, Kwon-Kyoo; Nou, Ill-Sup

    2013-07-01

    Chinese cabbage (Brassica rapa) is widely recognized for its economic importance and contribution to human nutrition but abiotic and biotic stresses are main obstacle for its quality, nutritional status and production. In this study, 3,429 Express Sequence Tag (EST) sequences were generated from B. rapa cv. Osome cDNA library and the unique transcripts were classified functionally using a gene ontology (GO) hierarchy, Kyoto encyclopedia of genes and genomes (KEGG). KEGG orthology and the structural domain data were obtained from the biological database for stress related genes (SRG). EST datasets provided a wide outlook of functional characterization of B. rapa cv. Osome. In silico analysis revealed % 83 of ESTs to be well annotated towards reeds one dimensional concept. Clustering of ESTs returned 333 contigs and 2,446 singlets, giving a total of 3,284 putative unigene sequences. This dataset contained 1,017 EST sequences functionally annotated to stress responses and from which expression of randomly selected SRGs were analyzed against cold, salt, drought, ABA, water and PEG stresses. Most of the SRGs showed differentially expression against these stresses. Thus, the EST dataset is very important for discovering the potential genes related to stress resistance in Chinese cabbage, and can be of useful resources for genetic engineering of Brassica sp. PMID:23898551

  20. A Comprehensive Approach to Clustering of Expressed Human Gene Sequence: The Sequence Tag Alignment and Consensus Knowledge Base

    PubMed Central

    Miller, Robert T.; Christoffels, Alan G.; Gopalakrishnan, Chella; Burke, John; Ptitsyn, Andrey A.; Broveak, Tania R.; Hide, Winston A.

    1999-01-01

    The expressed human genome is being sequenced and analyzed by disparate groups producing disparate data. The majority of the identified coding portion is in the form of expressed sequence tags (ESTs). The need to discover exonic representation and expression forms of full-length cDNAs for each human gene is frustrated by the partial and variable quality nature of this data delivery. A highly redundant human EST data set has been processed into integrated and unified expressed transcript indices that consist of hierarchically organized human transcript consensi reflecting gene expression forms and genetic polymorphism within an index class. The expression index and its intermediate outputs include cleaned transcript sequence, expression, and alignment information and a higher fidelity subset, SANIGENE. The STACK_PACK clustering system has been applied to dbEST release 121598 (GenBank version 110). Sixty-four percent of 1,313,103 Homo sapiens ESTs are condensed into 143,885 tissue level multiple sequence clusters; linking through clone-ID annotations produces 68,701 total assemblies, such that 81% of the original input set is captured in a STACK multiple sequence or linked cluster. Indexing of alignments by substituent EST accession allows browsing of the data structure and its cross-links to UniGene. STACK metaclusters consolidate a greater number of ESTs by a factor of 1.86 with respect to the corresponding UniGene build. Fidelity comparison with genome reference sequence AC004106 demonstrates consensus expression clusters that reflect significantly lower spurious repeat sequence content and capture alternate splicing within a whole body index cluster and three STACK v.2.3 tissue-level clusters. Statistics of a staggered release whole body index build of STACK v.2.0 are presented. PMID:10568754

  1. Development of polymorphic microsatellite markers based on expressed sequence tags in Populus cathayana (Salicaceae).

    PubMed

    Tian, Z Z; Zhang, F Q; Cai, Z Y; Chen, S L

    2016-01-01

    Populus cathayana occupies a large area within the northern, central, and southwestern regions of China, and is considered to be an important reforestation species in western China. In order to investigate the population genetic structure of this species, 10 polymorphic microsatellite loci were identified based on expressed sequence tags from de novo sequencing on the Illumina HiSeq 2000 platform. All microsatellite primers were tested on 48 P. cathayana individuals from four locations on the Qinghai-Tibet Plateau. The observed heterozygosity ranged from 0.000 to 1.000, and the null-allele frequency ranged from 0.000 to 0.904. These microsatellite markers may be a useful tool in genetic studies on P. cathayana and closely related species. PMID:27525845

  2. Cloning, analysis and functional annotation of expressed sequence tags from the Earthworm Eisenia fetida

    PubMed Central

    Pirooznia, Mehdi; Gong, Ping; Guan, Xin; Inouye, Laura S; Yang, Kuan; Perkins, Edward J; Deng, Youping

    2007-01-01

    Background Eisenia fetida, commonly known as red wiggler or compost worm, belongs to the Lumbricidae family of the Annelida phylum. Little is known about its genome sequence although it has been extensively used as a test organism in terrestrial ecotoxicology. In order to understand its gene expression response to environmental contaminants, we cloned 4032 cDNAs or expressed sequence tags (ESTs) from two E. fetida libraries enriched with genes responsive to ten ordnance related compounds using suppressive subtractive hybridization-PCR. Results A total of 3144 good quality ESTs (GenBank dbEST accession number EH669363–EH672369 and EL515444–EL515580) were obtained from the raw clone sequences after cleaning. Clustering analysis yielded 2231 unique sequences including 448 contigs (from 1361 ESTs) and 1783 singletons. Comparative genomic analysis showed that 743 or 33% of the unique sequences shared high similarity with existing genes in the GenBank nr database. Provisional function annotation assigned 830 Gene Ontology terms to 517 unique sequences based on their homology with the annotated genomes of four model organisms Drosophila melanogaster, Mus musculus, Saccharomyces cerevisiae, and Caenorhabditis elegans. Seven percent of the unique sequences were further mapped to 99 Kyoto Encyclopedia of Genes and Genomes pathways based on their matching Enzyme Commission numbers. All the information is stored and retrievable at a highly performed, web-based and user-friendly relational database called EST model database or ESTMD version 2. Conclusion The ESTMD containing the sequence and annotation information of 4032 E. fetida ESTs is publicly accessible at . PMID:18047730

  3. Studies of a Biochemical Factory: Tomato Trichome Deep Expressed Sequence Tag Sequencing and Proteomics1[W][OA

    PubMed Central

    Schilmiller, Anthony L.; Miner, Dennis P.; Larson, Matthew; McDowell, Eric; Gang, David R.; Wilkerson, Curtis; Last, Robert L.

    2010-01-01

    Shotgun proteomics analysis allows hundreds of proteins to be identified and quantified from a single sample at relatively low cost. Extensive DNA sequence information is a prerequisite for shotgun proteomics, and it is ideal to have sequence for the organism being studied rather than from related species or accessions. While this requirement has limited the set of organisms that are candidates for this approach, next generation sequencing technologies make it feasible to obtain deep DNA sequence coverage from any organism. As part of our studies of specialized (secondary) metabolism in tomato (Solanum lycopersicum) trichomes, 454 sequencing of cDNA was combined with shotgun proteomics analyses to obtain in-depth profiles of genes and proteins expressed in leaf and stem glandular trichomes of 3-week-old plants. The expressed sequence tag and proteomics data sets combined with metabolite analysis led to the discovery and characterization of a sesquiterpene synthase that produces β-caryophyllene and α-humulene from E,E-farnesyl diphosphate in trichomes of leaf but not of stem. This analysis demonstrates the utility of combining high-throughput cDNA sequencing with proteomics experiments in a target tissue. These data can be used for dissection of other biochemical processes in these specialized epidermal cells. PMID:20431087

  4. Transcriptome analysis in the midgut of the earthworm (Eisenia andrei) using expressed sequence tags.

    PubMed

    Lee, Myung Sik; Cho, Sung Jin; Tak, Eun Sik; Lee, Jong Ae; Cho, Hyun Ju; Park, Bum Joon; Shin, Chuog; Kim, Dae Kyong; Park, Soon Cheol

    2005-03-25

    In order to gain insight into the expression profiles of the earthworm midgut, we analyzed 1106 expressed sequence tags (ESTs) derived from the earthworm midgut cDNA library. Among the 1106 ESTs analyzed, 557 (50.4%) ESTs showed significant similarity to known genes and represented 229 unique genes of which 166 ESTs were singletons and 63 ESTs manifest as two or more ESTs. While 552 ESTs (49.9%) were sequenced only once, 230 ESTs (20.8%) appeared two to five times and 324 ESTs (29.3%) were sequenced more than five times. Considering this redundancy of expression, it is likely that the gene expression profile of the earthworm midgut would be polarized. The expression of globin-related proteins, including ferritin and linker chain, and fibrinolytic enzymes appeared to account for 10.1% and 4.7% of the total ESTs analyzed in this study, respectively. This suggests that the prime functions of the midgut in the earthworm would be associated with protein hydrolysis as well as globin formation. Among the recognized protein-coding genes, the gene category involved in protein synthesis appeared to be the largest one accounting for 15.6% of the expression in the midgut, followed by gene categories associated with energy (11.2%), homeostasis (10.8%), metabolism (3.6%), cytoskeleton (2.5%), and protein fate (1.4%). With regard to functional aspects, the most abundantly expressed genes were associated with respiratory pigment (10.1%), cellular respiration (8.6%), and fibrin hydrolysis (4.7%). In addition, we were able to identify novel ESTs in the earthworm, which were related to the innate immune system, including destabilase, a possible antagonist of transglutaminase. PMID:15708003

  5. Expressed sequence tag analysis of functional genes associated with adventitious rooting in Liriodendron hybrids.

    PubMed

    Zhong, Y D; Sun, X Y; Liu, E Y; Li, Y Q; Gao, Z; Yu, F X

    2016-01-01

    Liriodendron hybrids (Liriodendron chinense x L. tulipifera) are important landscaping and afforestation hardwood trees. To date, little genomic research on adventitious rooting has been reported in these hybrids, as well as in the genus Liriodendron. In the present study, we used adventitious roots to construct the first cDNA library for Liriodendron hybrids. A total of 5176 expressed sequence tags (ESTs) were generated and clustered into 2921 unigenes. Among these unigenes, 2547 had significant homology to the non-redundant protein database representing a wide variety of putative functions. Homologs of these genes regulated many aspects of adventitious rooting, including those for auxin signal transduction and root hair development. Results of quantitative real-time polymerase chain reaction showed that AUX1, IRE, and FB1 were highly expressed in adventitious roots and the expression of AUX1, ARF1, NAC1, RHD1, and IRE increased during the development of adventitious roots. Additionally, 181 simple sequence repeats were identified from 166 ESTs and more than 91.16% of these were dinucleotide and trinucleotide repeats. To the best of our knowledge, the present study reports the identification of the genes associated with adventitious rooting in the genus Liriodendron for the first time and provides a valuable resource for future genomic studies. Expression analysis of selected genes could allow us to identify regulatory genes that may be essential for adventitious rooting. PMID:27420958

  6. Genome-wide analysis of immune system genes by expressed sequence Tag profiling.

    PubMed

    Giallourakis, Cosmas C; Benita, Yair; Molinie, Benoit; Cao, Zhifang; Despo, Orion; Pratt, Henry E; Zukerberg, Lawrence R; Daly, Mark J; Rioux, John D; Xavier, Ramnik J

    2013-06-01

    Profiling studies of mRNA and microRNA, particularly microarray-based studies, have been extensively used to create compendia of genes that are preferentially expressed in the immune system. In some instances, functional studies have been subsequently pursued. Recent efforts such as the Encyclopedia of DNA Elements have demonstrated the benefit of coupling RNA sequencing analysis with information from expressed sequence tags (ESTs) for transcriptomic analysis. However, the full characterization and identification of transcripts that function as modulators of human immune responses remains incomplete. In this study, we demonstrate that an integrated analysis of human ESTs provides a robust platform to identify the immune transcriptome. Beyond recovering a reference set of immune-enriched genes and providing large-scale cross-validation of previous microarray studies, we discovered hundreds of novel genes preferentially expressed in the immune system, including noncoding RNAs. As a result, we have established the Immunogene database, representing an integrated EST road map of gene expression in human immune cells, which can be used to further investigate the function of coding and noncoding genes in the immune system. Using this approach, we have uncovered a unique metabolic gene signature of human macrophages and identified PRDM15 as a novel overexpressed gene in human lymphomas. Thus, we demonstrate the utility of EST profiling as a basis for further deconstruction of physiologic and pathologic immune processes. PMID:23616578

  7. Mapping of Heterologous Expressed Sequence Tags as an Alternative to Microarrays for Study of Defense Responses in Plants

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In this study, we used publicly available EST (expressed sequence tags) database derived from four different plant species infected with a variety of pathogens, to generate an expression profile of orthologous genes involved in defense response of a model organism, Arabidopsis thaliana. Computer-ass...

  8. Development of expressed sequence tag-simple sequence repeat markers for Chrysanthemum morifolium and closely related species.

    PubMed

    Liu, H; Zhang, Q X; Sun, M; Pan, H T; Kong, Z X

    2015-01-01

    With the development of chrysanthemum breeding in recent years, an increasing number of wild species in genera related to Chrysanthemum were introduced to extend the genetic resources and facilitate the genetic improvement of chrysanthemums via hybridization. However, few simple sequence repeat (SSR) markers are available for marker-assisted breeding and population genetic studies of chrysanthemum and closely related species. Expressed sequence tags (ESTs) in public databases and cross-species transferable markers are considered to be a cost-effective means for developing sequence-based markers. In this study, 25 EST-SSRs were successfully developed from Chrysanthemum EST sequences for Chrysanthemum morifolium and closely related species. In total, 4164 unigene sequences were assembled from 7180 ESTs of chrysanthemum in GenBank, which were subsequently used to screen for the presence of microsatellites with the SSRIT software. The screening criteria were 8, 5, 4, and 3 repeating units for di-, tri-, tetra-, and penta- and higher-order nucleotides, respectively. Moreover, 310 SSR loci from 296 sequences were identified, and 198 primer pairs for SSR amplification were designed with the Primer Premier 5.0 software, of which 25 SSR loci showed polymorphic amplification in 52 species and varieties belonging to Chrysanthemum, Ajania, and Opisthopappus. The application of EST-SSR markers to the identification of intergeneric hybrids between Chrysanthemum and Ajania was demonstrated. Therefore, EST-SSRs can be developed for species that lack gene sequences or ESTs by utilizing ESTs of closely related species. PMID:26214436

  9. Identification of genes related to Parkinson's disease using expressed sequence tags.

    PubMed

    Kim, Jeong-Min; Lee, Kyu-Hwa; Jeon, Yeo-Jin; Oh, Jung-Hwa; Jeong, So-Young; Song, In-Sung; Kim, Jin-Man; Lee, Dong-Seok; Kim, Nam-Soon

    2006-12-31

    In a search for novel target genes related to Parkinson's disease (PD), two full-length cDNA libraries were constructed from a human normal substantia nigra (SN) and a PD patient's SN. An analysis of the gene expression profiles between them was done using the expressed sequence tags (ESTs) frequency. Data for the differently expressed genes were verified by quantitative real-time RT-PCR, immunohistochemical analysis and a cell death assay. Among the 76 genes identified with a significant difference (P > 0.9), 21 upregulated genes and 13 downregulated genes were confirmed to be differentially expressed in human PD tissues and/or in an MPTP-treated mice model by quantitative real-time RT-PCR. Among those genes, an immunohistochemical analysis using an MPTP mice model for alpha-tubulin including TUBA3 and TUBA6 showed that the protein levels are downregulated, as well as the RNA levels. In addition, MBP, PBP and GNAS were confirmed to accelerate cell death activity, whereas SPP1 and TUBA3 to retard this process. Using an analysis of ESTs frequency, it was possible to identify a large number of genes related to human PD. These new genes, MBP, PBP, GNAS, SPP1 and TUBA3 in particular, represent potential biomarkers for PD and could serve as useful targets for elucidating the molecular mechanisms associated with PD. PMID:17213182

  10. Pyrosequence analysis of expressed sequence tags for Manduca sexta hemolymph proteins involved in immune responses.

    PubMed

    Zou, Zhen; Najar, Fares; Wang, Yang; Roe, Bruce; Jiang, Haobo

    2008-06-01

    The tobacco hornworm Manduca sexta is widely used as a model organism to investigate the biochemical basis of insect physiological processes but little transcriptome information is available. To get a broad view of the larval hemolymph proteins, particularly those related to immunity, we synthesized and sequenced cDNA fragments from a mixture of eight total RNA samples: fat body and hemocytes from larvae injected with killed bacteria, fat body, hemocytes, integument and trachea from naïve larvae, and fat body and hemocytes from wandering larvae. Using massively parallel pyrosequencing, we obtained 95,458 M. sexta expressed sequence tags (ESTs) at an average size of 185bp per read. A majority of the sequences (69,429 reads) could be assembled into 7231 contigs with an average size of 300bp, 1178 of which had significant similarity with Drosophila genes from various functional groups. Only approximately 8% (606) of the contigs matched known M. sexta cDNA sequences, representing 186 of the 375 unique NCBI entries. The remaining 6625 contigs represented newly discovered cDNA segments from this well studied biochemical model insect. A search of the 7231 contigs using Tribolium castaneum, Drosophila melanogaster, and Bombyx mori immunity-related sequences revealed 424 cDNA contigs with significant similarity (E-value <1 x 10(-5)). These included 218 previously unknown M. sexta sequences coding for putative defense molecules such as pattern recognition receptors, serine proteinases, serpins, Spätzle, Toll-like receptors, intracellular signaling molecules, and antimicrobial peptides. PMID:18510979

  11. Mining for single nucleotide polymorphisms and insertions / deletions in expressed sequence tag libraries of oil palm.

    PubMed

    Riju, Aykkal; Chandrasekar, Arumugam; Arunachalam, Vadivel

    2007-01-01

    The oil palm is a tropical oil bearing tree. Recently EST-derived SNPs and SSRs are a free by-product of the currently expanding EST (Expressed Sequence Tag) data bases. The development of high-throughput methods for the detection of SNPs (Single Nucleotide Polymorphism) and small indels (insertion / deletion) has led to a revolution in their use as molecular markers. Available (5452) Oil palm EST sequences were mined from dbEST of NCBI. CAP3 program was used to assemble EST sequences into contigs. Candidate SNPs and Indel polymorphisms were detected using the perl script auto_snip version 1.0 which has used 576 ESTs for detecting SNPs and Indel sites. We found 1180 SNP sites and 137 indel polymorphisms with frequency 1.36 SNPs / 100 bp. Among the six tissues from which the EST libraries had been generated, mesocarp had high frequency of 2.91 SNPs and indels per 100 bp whereas the zygotic embryos had lowest frequency of 0.15 per 100 bp. We also used the Shannon index to analyze the proportion of ten possible types of SNP/indels. ESTs from tissues of normal apex showed highest values of Shannon index (0.60) whereas abnormal apex had least value (0.02). The present report deals the use of Shannon index for comparing SNP/ indel frequencies mined from ESTlibraries and also confirm that the frequency of SNP occurrence in oil palm to use them as markers for genetic studies. PMID:21670789

  12. Functional annotation of an expressed sequence tag library from Haliotis diversicolor and analysis of its plant-like sequences.

    PubMed

    Jiang, Jing-Zhe; Zhang, Wei; Guo, Zhi-Xun; Cai, Chen-Chen; Su, You-Lu; Wang, Rui-Xuan; Wang, Jiang-Yong

    2011-09-01

    The small abalone, Haliotis diversicolor, is a widely distributed and cultured species in the subtropical coastal area of China. To identify and classify functional genes of this important species, a normalized expressed sequence tag (EST) library, including 7069 high quality ESTs from the total body of H. diversicolor, was analyzed. A total of 4781 unigenes were assembled and 2991 novel abalone genes were identified. The GC content, codon and amino acid usage of the transcriptome were analyzed. For the accurate annotation of the abalone library, different influencing factors were evaluated. The gene ontology (GO) database provided a higher annotation rate (69.6%), and sequences longer than 800bp were easily subjected to a BLAST search. The taxonomy of the BLAST results showed that lancelet and invertebrates are most closely related to abalone. Sixty-seven identified plant-like genes were further examined by reverse transcription-polymerase chain reaction (RT-PCR) and sequencing, only seven of these were real transcripts in abalone. Phylogenic trees were also constructed to illustrate the positions of two Cystatin sequences and one Calmodulin protein sequence identified in abalone. To perform functional classification, three different databases (GO, KEGG and COG) were used and 60 immune or disease-related unigenes were determined. This work has greatly enlarged the known gene pool of H. diversicolor and will have important implications for future molecular and genetic analyses in this organism. PMID:21867971

  13. Transcriptome analysis of expressed sequence tags from the venom glands of the fish Thalassophryne nattereri.

    PubMed

    Magalhães, G S; Junqueira-de-Azevedo, I L M; Lopes-Ferreira, M; Lorenzini, D M; Ho, P L; Moura-da-Silva, A M

    2006-06-01

    Thalassophryne nattereri (niquim) is a venomous fish found on the northern and northeastern coasts of Brazil. Every year, hundreds of humans are affected by the poison, which causes excruciating local pain, edema, and necrosis, and can lead to permanent disabilities. In experimental models, T. nattereri venom induces edema and nociception, which are correlated to human symptoms and dependent on venom kininogenase activity; myotoxicity; impairment of blood flow; platelet lysis and cytotoxicity on endothelial cells. These effects were observed with minute amounts of venom. To characterize the primary structure of T. nattereri venom toxins, a list of transcripts within the venom gland was made using the expressed sequence tag (EST) strategy. Here we report the analysis of 775 ESTs that were obtained from a directional cDNA library of T. nattereri venom gland. Of these ESTs, 527 (68%) were related to sequences previously described. These were categorized into 10 groups according to their biological functions. Sequences involved in gene and protein expression accounted for 14.3% of the ESTs, reflecting the important role of protein synthesis in this gland. Other groups included proteins engaged in the assembly of disulfide bonds (0.5%), chaperones involved in the folding of nascent proteins (1.4%), and sequences related to clusterin (1.5%), as well as transcripts related to calcium binding proteins (1.0%). We detected a large cluster (1.3%) related to cocaine- and amphetamine-regulated transcript (CART), a peptide involved in the regulation of food intake. Surprisingly, several retrotransposon-like sequences (1.0%) were found in the library. It may be that their presence accounts for some of the variation in venom toxins. The toxin category (18.8%) included natterins (18%), which are a new group of kininogenases recently described by our group, and a group of C-type lectins (0.8%). In addition, a considerable number of sequences (32%) was not related to sequences in the

  14. Sequencing, analysis, and annotation of expressed sequence tags for Camelus dromedarius.

    PubMed

    Al-Swailem, Abdulaziz M; Shehata, Maher M; Abu-Duhier, Faisel M; Al-Yamani, Essam J; Al-Busadah, Khalid A; Al-Arawi, Mohammed S; Al-Khider, Ali Y; Al-Muhaimeed, Abdullah N; Al-Qahtani, Fahad H; Manee, Manee M; Al-Shomrani, Badr M; Al-Qhtani, Saad M; Al-Harthi, Amer S; Akdemir, Kadir C; Inan, Mehmet S; Otu, Hasan H

    2010-01-01

    Despite its economical, cultural, and biological importance, there has not been a large scale sequencing project to date for Camelus dromedarius. With the goal of sequencing complete DNA of the organism, we first established and sequenced camel EST libraries, generating 70,272 reads. Following trimming, chimera check, repeat masking, cluster and assembly, we obtained 23,602 putative gene sequences, out of which over 4,500 potentially novel or fast evolving gene sequences do not carry any homology to other available genomes. Functional annotation of sequences with similarities in nucleotide and protein databases has been obtained using Gene Ontology classification. Comparison to available full length cDNA sequences and Open Reading Frame (ORF) analysis of camel sequences that exhibit homology to known genes show more than 80% of the contigs with an ORF>300 bp and approximately 40% hits extending to the start codons of full length cDNAs suggesting successful characterization of camel genes. Similarity analyses are done separately for different organisms including human, mouse, bovine, and rat. Accompanying web portal, CAGBASE (http://camel.kacst.edu.sa/), hosts a relational database containing annotated EST sequences and analysis tools with possibility to add sequences from public domain. We anticipate our results to provide a home base for genomic studies of camel and other comparative studies enabling a starting point for whole genome sequencing of the organism. PMID:20502665

  15. Sequencing, Analysis, and Annotation of Expressed Sequence Tags for Camelus dromedarius

    PubMed Central

    Al-Swailem, Abdulaziz M.; Shehata, Maher M.; Abu-Duhier, Faisel M.; Al-Yamani, Essam J.; Al-Busadah, Khalid A.; Al-Arawi, Mohammed S.; Al-Khider, Ali Y.; Al-Muhaimeed, Abdullah N.; Al-Qahtani, Fahad H.; Manee, Manee M.; Al-Shomrani, Badr M.; Al-Qhtani, Saad M.; Al-Harthi, Amer S.; Akdemir, Kadir C.; Otu, Hasan H.

    2010-01-01

    Despite its economical, cultural, and biological importance, there has not been a large scale sequencing project to date for Camelus dromedarius. With the goal of sequencing complete DNA of the organism, we first established and sequenced camel EST libraries, generating 70,272 reads. Following trimming, chimera check, repeat masking, cluster and assembly, we obtained 23,602 putative gene sequences, out of which over 4,500 potentially novel or fast evolving gene sequences do not carry any homology to other available genomes. Functional annotation of sequences with similarities in nucleotide and protein databases has been obtained using Gene Ontology classification. Comparison to available full length cDNA sequences and Open Reading Frame (ORF) analysis of camel sequences that exhibit homology to known genes show more than 80% of the contigs with an ORF>300 bp and ∼40% hits extending to the start codons of full length cDNAs suggesting successful characterization of camel genes. Similarity analyses are done separately for different organisms including human, mouse, bovine, and rat. Accompanying web portal, CAGBASE (http://camel.kacst.edu.sa/), hosts a relational database containing annotated EST sequences and analysis tools with possibility to add sequences from public domain. We anticipate our results to provide a home base for genomic studies of camel and other comparative studies enabling a starting point for whole genome sequencing of the organism. PMID:20502665

  16. CoffeebEST: an integrated resource for Coffea spp expressed sequence tags.

    PubMed

    Paschoal, A R; Fernandes, E D M; Silva, J C; Lopes, F M; Pereira, L F P; Domingues, D S

    2014-01-01

    Coffee is one of the most important commodities in the world, and its production relies mainly on two species, Coffea arabica and Coffea canephora. Although there are diverse transcriptome datasets available for coffee trees, few research groups have exploited the potential knowledge contained in these data, especially with respect to fruit and seed development. Here, we present a comparative analysis of the transcriptomes of Coffea arabica and Coffea canephora with a focus on fruit development using publicly available expressed sequence tags (ESTs). Most of the fruit and seed EST data has been obtained from C. canephora. Therefore, we performed a fruit EST analysis of the 5 developmental stages of this species (18, 22, 30, 42, and 46 weeks after flowering) comprising 29,009 sequences. We compared C. canephora fruit ESTs to reference unigenes of C. canephora (7710 contigs and 8955 singletons) and C. arabica (15,656 contigs and 16,351 singletons). Additional analyses included functional annotation based on Gene Onthology, as well as an annotation using PlantCyc, a curated plant protein database. The Coffee Bean EST (CoffeebEST) is a public database available at http://bioinfo-02.cp.utfpr.edu.br/. This database represents an additional resource for the coffee scientific community, offering a user-friendly collection of information for non-specialists in coffee molecular biology to support experimental research on comparative and functional genomics. PMID:25526212

  17. Expressed sequence tag analysis of the erythrocytic stage of Plasmodium berghei.

    PubMed

    Seok, Ji-Woong; Lee, Yong-Seok; Moon, Eun-Kyung; Lee, Jung-Yub; Jha, Bijay Kumar; Kong, Hyun-Hee; Chung, Dong-Il; Hong, Yeonchul

    2011-09-01

    Rodent malaria parasites, such as Plasmodium berghei, are practical and useful model organisms for human malaria research because of their analogies to the human malaria in terms of structure, physiology, and life cycle. Exploiting the available genetic sequence information, we constructed a cDNA library from the erythrocytic stages of P. berghei and analyzed the expressed sequence tag (EST). A total of 10,040 ESTs were generated and assembled into 2,462 clusters. These EST clusters were compared against public protein databases and 48 putative new transcripts, most of which were hypothetical proteins with unknown function, were identified. Genes encoding ribosomal or membrane proteins and purine nucleotide phosphorylases were highly abundant clusters in P. berghei. Protein domain analyses and the Gene Ontology functional categorization revealed translation/protein folding, metabolism, protein degradation, and multiple family of variant antigens to be mainly prevalent. The presently-collected ESTs and its bioinformatic analysis will be useful resources to identify for drug target and vaccine candidates and validate gene predictions of P. berghei. PMID:22072821

  18. Development of polymorphic expressed sequence tag-single sequence repeat markers in the common Chinese cuttlefish, Sepiella maindroni.

    PubMed

    Li, R H; Lu, S K; Zhang, C L; Song, W W; Mu, C K; Wang, C L

    2014-01-01

    The common Chinese cuttlefish (Sepiella maindroni) is one of the popular edible cephalopod consumed across Asia. To facilitate the population genetic investigation of this species, we developed fourteen polymorphic microsatellite makers from expressed sequence tags of S. maindroni. The number of alleles at each locus ranged from 6 to 10 with an average of 7.9 alleles per locus. The ranges of observed and expected heterozygosity were from 0.615 to 0.962 and 0.685 to 0.888, respectively. Four loci were found deviated significantly from Hardy-Weinberg equilibrium. The polymorphism information content ranged from 0.638 to 0.833. These polymorphic microsatellite loci will be helpful for the population genetic, genetic linkage map, and other genetic studies of S. maindroni. PMID:25117305

  19. Isolation of expressed sequence tags of Agaricus bisporus and their assignment to chromosomes.

    PubMed Central

    Sonnenberg, A S; de Groot, P W; Schaap, P J; Baars, J J; Visser, J; Van Griensven, L J

    1996-01-01

    The genome of the cultivated basidiomycete Agaricus bisporus Horst U1 and of its homokaryotic parents has been characterized by using an optimized method of pulsed-field gel electrophoresis. Expressed sequence tags obtained as expressed cDNAs from a primordial tissue-derived cDNA library and a number of previously isolated genes were used to identify the individual chromosomes of the parental lines of Horst U1. The genome consists of 13 chromosomes, and its total size is 31 Mb. For those chromosomes that could not be resolved by contour-clamped homogeneous electric field electrophoresis, the segregation of marker genes was studied in a set of 86 homokaryotic offspring of Horst U1. At least two markers were assigned to each individual chromosome. In this way all individual chromosomes were unequivocally identified. The large size difference observed between the homologous chromosomes IX, harboring the rDNA repeat, was shown to be largely due to a higher copy number of rDNA in parental strain H97 than in parental strain H39. PMID:8953726

  20. Gene cataloging and expression profiling in human gastric cancer cells by expressed sequence tags.

    PubMed

    Kim, Nam-Soon; Hahn, Yoonsoo; Oh, Jung-Hwa; Lee, Ju-Yeon; Oh, Kyung-Jin; Kim, Jeong-Min; Park, Hong-Seog; Kim, Sangsoo; Song, Kyu-Sang; Rho, Seung-Moo; Yoo, Hyang-Sook; Kim, Yong Sung

    2004-06-01

    To understand the molecular mechanism associated with gastric carcinogenesis, we identified genes expressed in gastric cancer cell lines and tissues. Of 97,609 high-quality ESTs sequenced from 36 cDNA libraries, 92,545 were coalesced into 10,418 human Unigene clusters (Build 151). The gene expression profile was produced by counting the cluster frequencies in each library. Although the profiles of highly expressed genes varied greatly from library to library, those genes related to cell structure formation, heat shock proteins, the glycolysis pathway, and the signaling pathway were highly represented in human gastric cancer cell lines and in primary tumors. Conversely, the genes encoding immunoglobulins, ribosomal proteins, and digestive proteins were down-regulated in gastric cancer cell lines and tissues compared to normal tissues. The transcription levels of some of these genes were confirmed by RT-PCR. We found that genes related to cell adhesion, apoptosis, and cytoskeleton formation were particularly up-regulated in the gastric cancer cell lines established from malignant ascites compared to those from primary tumors. This comprehensive molecular profiling of human gastric cancer should be useful for elucidating the genetic events associated with human gastric cancer. PMID:15177556

  1. Development of Simple Sequence Repeat Markers from Expressed Sequence Tags of the Maize Gray Leaf Spot Pathogen, Cercospora Zea-Maydis

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Ten simple sequence repeat markers were developed from expressed sequence tags of Cercospora zeae-maydis, the cause of gray leaf spot of maize (Zea mays). All loci were evaluated on 80 isolates from a local population of C. zeae-maydis and all were highly polymorphic, with 4 to 14 alleles per locus....

  2. Genome-wide characterization and selection of expressed sequence tag simple sequence repeat primers for optimized marker distribution and reliability in peach

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Expressed sequence tag (EST) simple sequence repeats (SSRs) in Prunus were mined, and flanking primers designed and used for genome-wide characterization and selection of primers to optimize marker distribution and reliability. A total of 12,618 contigs were assembled from 84,727 ESTs, along with 34...

  3. Analysis and RT-PCR identification of viral sequences in peanut (Arachis hypogaea L.) expressed sequence tags from different peanut tissues

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Peanut plants grown in the field have been naturally infected with different viruses resulting in economic yield loss in the southeastern US, such as tomato spotted wilt tospovirus (TSWV) in peanuts. The objectives of this study were to investigate peanut sequences of expressed sequence tags (EST) f...

  4. Characterization of Expressed Sequence Tag-Derived Simple Sequence Repeat Markers for Aspergillus flavus: Emphasis on Variability of Isolates from the Southern United States

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Simple Sequence Repeat (SSR) markers were developed from Aspergillus flavus expressed sequence tag (EST) database to conduct an analysis of genetic relationships of Aspergillus isolates from numerous host species and geographical regions, but primarily from the United States. Twenty-nine primers wer...

  5. Expressed sequence tags reveal genetic diversity and putative virulence factors of the pathogenic oomycete Pythium insidiosum.

    PubMed

    Krajaejun, Theerapong; Khositnithikul, Rommanee; Lerksuthirat, Tassanee; Lowhnoo, Tassanee; Rujirawat, Thidarat; Petchthong, Thanom; Yingyong, Wanta; Suriyaphol, Prapat; Smittipat, Nat; Juthayothin, Tada; Phuntumart, Vipaporn; Sullivan, Thomas D

    2011-07-01

    Oomycetes are unique eukaryotic microorganisms that share a mycelial morphology with fungi. Many oomycetes are pathogenic to plants, and a more limited number are pathogenic to animals. Pythium insidiosum is the only oomycete that is capable of infecting both humans and animals, and causes a life-threatening infectious disease, called "pythiosis". In the majority of pythiosis patients life-long handicaps result from the inevitable radical excision of infected organs, and many die from advanced infection. Better understanding P. insidiosum pathogenesis at molecular levels could lead to new forms of treatment. Genetic and genomic information is lacking for P. insidiosum, so we have undertaken an expressed sequence tag (EST) study, and report on the first dataset of 486 ESTs, assembled into 217 unigenes. Of these, 144 had significant sequence similarity with known genes, including 47 with ribosomal protein homology. Potential virulence factors included genes involved in antioxidation, thermal adaptation, immunomodulation, and iron and sterol binding. Effectors resembling pathogenicity factors of plant-pathogenic oomycetes were also discovered, such as, a CBEL-like protein (possible involvement in host cell adhesion and hemagglutination), a putative RXLR effector (possibly involved in host cell modulation) and elicitin-like (ELL) proteins. Phylogenetic analysis mapped P. insidiosum ELLs to several novel clades of oomycete elicitins (ELIs), and homology modeling predicted that P. insidiosum ELLs should bind sterols. Most of the P. insidiosum ESTs showed homology to sequences in the genome or EST databases of other oomycetes, but one putative gene, with unknown function, was found to be unique to P. insidiosum. The EST dataset reported here represents the first steps in identifying genes of P. insidiosum and beginning transcriptome analysis. This genetic information will facilitate understanding of pathogenic mechanisms of this devastating pathogen. PMID:21724174

  6. Identification and Characterization of Microsatellites in Expressed Sequence Tags and Their Cross Transferability in Different Plants

    PubMed Central

    Haq, Shamshad ul; Jain, Rohit; Sharma, Meenakshi; Kachhwaha, Sumita; Kothari, S. L.

    2014-01-01

    Expressed sequence tags (EST) are potential source for the development of genic microsatellite markers, gene discovery, comparative genomics, and other genomic studies. In the present study, 7630 ESTs were examined from NCBI for SSR identification and characterization. A total of 263 SSRs were identified with an average density of one SSR/4.2 kb (3.4% frequency). Analysis revealed that trinucleotide repeats (47.52%) were most abundant followed by tetranucleotide (19.77%), dinucleotide (19.01%), pentanucleotide (9.12%), and hexanucleotide repeats (4.56%). Functional annotation was done through homology search and gene ontology, and 35 EST-SSRs were selected. Primer pairs were designed for evaluation of cross transferability and polymorphism among 11 plants belonging to five different families. Total 402 alleles were generated at 155 loci with an average of 2.6 alleles/locus and the polymorphic information content (PIC) ranged from 0.15 to 0.92 with an average of 0.75. The cross transferability ranged from 34.84% to 98.06% in different plants, with an average of 67.86%. Thus, the validation study of annotated 35 EST-SSR markers which correspond to particular metabolic activity revealed polymorphism and evolutionary nature in different families of Angiospermic plants. PMID:25389527

  7. Desiccation survival in an Antarctic nematode: molecular analysis using expressed sequenced tags

    PubMed Central

    Adhikari, Bishwo N; Wall, Diana H; Adams, Byron J

    2009-01-01

    Background Nematodes are the dominant soil animals in Antarctic Dry Valleys and are capable of surviving desiccation and freezing in an anhydrobiotic state. Genes induced by desiccation stress have been successfully enumerated in nematodes; however we have little knowledge of gene regulation by Antarctic nematodes which can survive multiple environmental stresses. To address this problem we investigated the genetic responses of a nematode species, Plectus murrayi, that is capable of tolerating Antarctic environmental extremes, in particular desiccation and freezing. In this study, we provide the first insight into the desiccation induced transcriptome of an Antarctic nematode through cDNA library construction and suppressive subtractive hybridization. Results We obtained 2,486 expressed sequence tags (ESTs) from 2,586 clones derived from the cDNA library of desiccated P. murrayi. The 2,486 ESTs formed 1,387 putative unique transcripts of which 523 (38%) had matches in the model-nematode Caenorhabditis elegans, 107 (7%) in nematodes other than C. elegans, 153 (11%) in non-nematode organisms and 605 (44%) had no significant match to any sequences in the current databases. The 1,387 unique transcripts were functionally classified by using Gene Ontology (GO) hierarchy and the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. The results indicate that the transcriptome contains a group of transcripts from diverse functional areas. The subtractive library of desiccated nematodes showed 80 transcripts differentially expressed during desiccation stress, of which 28% were metabolism related, 19% were involved in environmental information processing, 28% involved in genetic information processing and 21% were novel transcripts. Expression profiling of 14 selected genes by quantitative Real-time PCR showed 9 genes significantly up-regulated, 3 down-regulated and 2 continuously expressed in response to desiccation. Conclusion The establishment of a desiccation EST

  8. Expressed sequence tag analysis in Cycas, the most primitive living seed plant

    PubMed Central

    Brenner, Eric D; Stevenson, Dennis W; McCombie, Richard W; Katari, Manpreet S; Rudd, Stephen A; Mayer, Klaus FX; Palenchar, Peter M; Runko, Suzan J; Twigg, Richard W; Dai, Guangwei; Martienssen, Rob A; Benfey, Phillip N; Coruzzi, Gloria M

    2003-01-01

    Background Cycads are ancient seed plants (living fossils) with origins in the Paleozoic. Cycads are sometimes considered a 'missing link' as they exhibit characteristics intermediate between vascular non-seed plants and the more derived seed plants. Cycads have also been implicated as the source of 'Guam's dementia', possibly due to the production of S(+)-beta-methyl-alpha, beta-diaminopropionic acid (BMAA), which is an agonist of animal glutamate receptors. Results A total of 4,200 expressed sequence tags (ESTs) were created from Cycas rumphii and clustered into 2,458 contigs, of which 1,764 had low-stringency BLAST similarity to other plant genes. Among those cycad contigs with similarity to plant genes, 1,718 cycad 'hits' are to angiosperms, 1,310 match genes in gymnosperms and 734 match lower (non-seed) plants. Forty-six contigs were found that matched only genes in lower plants and gymnosperms. Upon obtaining the complete sequence from the clones of 37/46 contigs, 14 still matched only gymnosperms. Among those cycad contigs common to higher plants, ESTs were discovered that correspond to those involved in development and signaling in present-day flowering plants. We purified a cycad EST for a glutamate receptor (GLR)-like gene, as well as ESTs potentially involved in the synthesis of the GLR agonist BMAA. Conclusions Analysis of cycad ESTs has uncovered conserved and potentially novel genes. Furthermore, the presence of a glutamate receptor agonist, as well as a glutamate receptor-like gene in cycads, supports the hypothesis that such neuroactive plant products are not merely herbivore deterrents but may also serve a role in plant signaling. PMID:14659015

  9. Identification of candidates for cyclotide biosynthesis and cyclisation by expressed sequence tag analysis of Oldenlandia affinis

    PubMed Central

    2010-01-01

    Background Cyclotides are a family of circular peptides that exhibit a range of biological activities, including anti-bacterial, cytotoxic, anti-HIV activities, and are proposed to function in plant defence. Their high stability has motivated their development as scaffolds for the stabilisation of peptide drugs. Oldenlandia affinis is a member of the Rubiaceae (coffee) family from which 18 cyclotides have been sequenced to date, but the details of their processing from precursor proteins have only begun to be elucidated. To increase the speed at which genes involved in cyclotide biosynthesis and processing are being discovered, an expressed sequence tag (EST) project was initiated to survey the transcript profile of O. affinis and to propose some future directions of research on in vivo protein cyclisation. Results Using flow cytometry the holoploid genome size (1C-value) of O. affinis was estimated to be 4,210 - 4,284 Mbp, one of the largest genomes of the Rubiaceae family. High-quality ESTs were identified, 1,117 in total, from leaf cDNAs and assembled into 502 contigs, comprising 202 consensus sequences and 300 singletons. ESTs encoding the cyclotide precursors for kalata B1 (Oak1) and kalata B2 (Oak4) were among the 20 most abundant ESTs. In total, 31 ESTs encoded cyclotide precursors, representing a distinct commitment of 2.8% of the O. affinis transcriptome to cyclotide biosynthesis. The high expression levels of cyclotide precursor transcripts are consistent with the abundance of mature cyclic peptides in O. affinis. A new cyclotide precursor named Oak5 was isolated and represents the first cDNA for the bracelet class of cyclotides in O. affinis. Clones encoding enzymes potentially involved in processing cyclotides were also identified and include enzymes involved in oxidative folding and proteolytic processing. Conclusion The EST library generated in this study provides a valuable resource for the study of the cyclisation of plant peptides. Further analysis

  10. Comprehensive analysis of expressed sequence tags from cultivated and wild radish (Raphanus spp.)

    PubMed Central

    2013-01-01

    Background Radish (Raphanus sativus L., 2n = 2× = 18) is an economically important vegetable crop worldwide. A large collection of radish expressed sequence tags (ESTs) has been generated but remains largely uncharacterized. Results In this study, approximately 315,000 ESTs derived from 22 Raphanus cDNA libraries from 18 different genotypes were analyzed, for the purpose of gene and marker discovery and to evaluate large-scale genome duplication and phylogenetic relationships among Raphanus spp. The ESTs were assembled into 85,083 unigenes, of which 90%, 65%, 89% and 89% had homologous sequences in the GenBank nr, SwissProt, TrEMBL and Arabidopsis protein databases, respectively. A total of 66,194 (78%) could be assigned at least one gene ontology (GO) term. Comparative analysis identified 5,595 gene families unique to radish that were significantly enriched with genes related to small molecule metabolism, as well as 12,899 specific to the Brassicaceae that were enriched with genes related to seed oil body biogenesis and responses to phytohormones. The analysis further indicated that the divergence of radish and Brassica rapa occurred approximately 8.9-14.9 million years ago (MYA), following a whole-genome duplication event (12.8-21.4 MYA) in their common ancestor. An additional whole-genome duplication event in radish occurred at 5.1-8.4 MYA, after its divergence from B. rapa. A total of 13,570 simple sequence repeats (SSRs) and 28,758 high-quality single nucleotide polymorphisms (SNPs) were also identified. Using a subset of SNPs, the phylogenetic relationships of eight different accessions of Raphanus was inferred. Conclusion Comprehensive analysis of radish ESTs provided new insights into radish genome evolution and the phylogenetic relationships of different radish accessions. Moreover, the radish EST sequences and the associated SSR and SNP markers described in this study represent a valuable resource for radish functional genomics studies and

  11. Chromosome-specific physical localisation of expressed sequence tag loci in Corchorus olitorius L.

    PubMed

    Joshi, A; Das, S K; Samanta, P; Paria, P; Sen, S K; Basu, A

    2014-11-01

    Jute (Corchorus spp.), as a natural fibre-producing species, ranks next only to cotton. Inadequate understanding of its genetic architecture is a major lacuna for genetic improvement of this crop in terms of yield and quality. Establishment of a physical map provides a genomic tool that helps in positional cloning of valuable genes. In this report, an attempt was initiated to study association and localisation of single copy expressed sequence tag (EST) loci in the genome of Corchorus olitorius. The chromosome-specific association of EST was determined based on the appearance of an extra signal for a single copy cDNA probe in mitotic interphase nuclei of specific trisomic(s) for fluorescence in situ hybridisation, and validated using a cDNA fragment of the 26S rRNA gene (600 bp) as molecular probe. The probe exhibited three signals in meiotic interphase nuclei of trisomic 5, instead of two as observed in diploids and other trisomics, indicating its association with chromosome 5. Subsequent hybridisation of the same probe on the pachytene chromosomes of diploids confirmed that 26S rRNA occupies the terminal end of the short arm of chromosome 5 in C. olitorius. Subsequently, chromosome-specific association of 63 single copy EST and their physical localisation were determined on chromosomes 2, 4, 5 and 7. The study describes chromosome-specific physical localisation of genes in jute. The approach used here could be a step towards construction of genome-wide physical maps for any recalcitrant plant species like jute. PMID:24628982

  12. Expressed sequence tags from Atta laevigata and identification of candidate genes for the control of pest leaf-cutting ants

    PubMed Central

    2011-01-01

    Background Leafcutters are the highest evolved within Neotropical ants in the tribe Attini and model systems for studying caste formation, labor division and symbiosis with microorganisms. Some species of leafcutters are agricultural pests controlled by chemicals which affect other animals and accumulate in the environment. Aiming to provide genetic basis for the study of leafcutters and for the development of more specific and environmentally friendly methods for the control of pest leafcutters, we generated expressed sequence tag data from Atta laevigata, one of the pest ants with broad geographic distribution in South America. Results The analysis of the expressed sequence tags allowed us to characterize 2,006 unique sequences in Atta laevigata. Sixteen of these genes had a high number of transcripts and are likely positively selected for high level of gene expression, being responsible for three basic biological functions: energy conservation through redox reactions in mitochondria; cytoskeleton and muscle structuring; regulation of gene expression and metabolism. Based on leafcutters lifestyle and reports of genes involved in key processes of other social insects, we identified 146 sequences potential targets for controlling pest leafcutters. The targets are responsible for antixenobiosis, development and longevity, immunity, resistance to pathogens, pheromone function, cell signaling, behavior, polysaccharide metabolism and arginine kynase activity. Conclusion The generation and analysis of expressed sequence tags from Atta laevigata have provided important genetic basis for future studies on the biology of leaf-cutting ants and may contribute to the development of a more specific and environmentally friendly method for the control of agricultural pest leafcutters. PMID:21682882

  13. Regional localisation of 19 brain expressed sequence tags to human chromosome 11 using PCR amplification of somatic cell hybrid DNAs.

    PubMed

    Slorach, E M; Polymeropoulos, M H; Evans, K L; Seawright, A; Fletcher, J M; Porteous, D J; Brookes, A J

    1995-01-01

    Expressed sequence tags (ESTs) provide an efficient route to the identification of genes involved in normal development and in disease. PCR amplification of somatic cell hybrid DNAs was used to localise 22 brain-derived ESTs to subregions of human chromosome 11. Problems encountered with the standardised PCR conditions were overcome by optimising the annealing temperatures and the use of "touchdown" PCR. Amplification of the correct target sequence allowed the mapping of 19 ESTs, 8 to the short arm and 11 to the long arm of chromosome 11. No definitive localisation could be determined for the three remaining ESTs. PMID:7736794

  14. Analyses of Expressed Sequence Tags from the Maize Foliar Pathogen Cercospora Zeae-Maydis Identifing Novel Genes expressed during Vegetative, Infectious, & Reproductive Growth

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The fungus Cercospora zeae-maydis is an aggressive foliar pathogen of maize that causes substantial yield losses annually throughout the western hemisphere. To learn more about the molecular regulation of pathogenesis in C. zeae-maydis, we generated a collection of expressed sequence tags (ESTs) and...

  15. Identification of novel highly expressed genes in pancreatic ductal adenocarcinomas through a bioinformatics analysis of expressed sequence tags.

    PubMed

    Cao, Dengfeng; Hustinx, Steven R; Sui, Guoping; Bala, P; Sato, Norihiro; Martin, Sean; Maitra, Anirban; Murphy, Kathleen M; Cameron, John L; Yeo, Charles J; Kern, Scott E; Goggins, Michael; Pandey, Akhilesh; Hruban, Ralph H

    2004-11-01

    In most microarray experiments, a significant fraction of the differentially expressed mRNAs identified correspond to expressed sequence tags (ESTs) and are generally discarded from further analyses. We used careful bioinformatics analyses to characterize those ESTs that were found to be highly overexpressed in a series of pancreatic adenocarcinomas. cDNA was prepared from 60 non-neoplastic samples (normal pancreas [n = 20], normal colon [n = 10], or normal duodenal mucosal [n = 30]) and from 64 pancreatic cancers (resected cancers [n = 50] or cancer cell lines [n = 14]) and hybridized to the complete Affymetrix Human Genome U133 GeneChip(R) set (arrays U133A and B) for simultaneous analysis of 45,000 fragments corresponding to 33,000 known genes and 6,000 ESTs. The GeneExpress(R) software system Fold Change Analysis Tool was used and 60 ESTs were identified that were expressed at levels at least 3-fold greater in the pancreatic cancers as compared to normal tissues. Searches against the human genomic sequence and comparative genomic analysis of human and mouse genomes was carried out using basic local alignment search tools (BLAST), BLASTN, and BLASTX, for identifying protein coding genes corresponding to the ESTs. Subsequently, in order to pick the most relevant candidate genes for a more detailed analysis, we looked for domains/motifs in the open reading frames using SMART and Pfam programs. We were able to definitively map 43 of the 60 ESTs to known or novel genes, and 15 of the ESTs could be localized in close proximity to a gene in the human genome although we were unable to establish that the EST was indeed derived from those genes. The differential expression of a subset of genes was confirmed at the protein level by immunohistochemical labeling of tissue microarrays (inhibin beta A [INHBA] and CD29) and/or at the transcript level by RT-PCR (INHBA, AKAP12, ELK3, FOXQ1, EIF5A2, and EFNA5). We conclude that bioinformatics tools can be used to characterize

  16. Generation and analysis of expressed sequence tags from Trypanosoma cruzi trypomastigote and amastigote cDNA libraries.

    PubMed

    Agüero, Fernán; Abdellah, Karim Ben; Tekiel, Valeria; Sánchez, Daniel O; González, Antonio

    2004-08-01

    We have generated 2771 expressed sequence tags (ESTs) from two cDNA libraries of Trypanosoma cruzi CL-Brener. The libraries were constructed from trypomastigote and amastigotes, using a spliced leader primer to synthesize the cDNA second strand, thus selecting for full-length cDNAs. Since the libraries were not normalized nor pre-screened, we compared the representation of transcripts between the two using a statistical test and identify a subset of transcripts that show apparent differential representation. A non-redundant set of 1619 reconstructed transcripts was generated by sequence clustering. This dataset was used to perform similarity searches against protein and nucleotide databases. Based on these searches, 339 sequences could be assigned a putative identity. One thousand one-hundred and sixteen sequences in the non-redundant clustered dataset (68.8%) are new expression tags, not represented in the T. cruzi epimastigote ESTs that are in the public databases. Additional information is provided online at http://genoma.unsam.edu.ar/projects/tram. To the best of our knowledge these are the first ESTs reported for the life cycle stages of T. cruzi that occur in the vertebrate host. PMID:15478800

  17. Expressed sequence tags of the peanut pod nematode Ditylenchus africanus: the first transcriptome analysis of an Anguinid nematode

    PubMed Central

    Haegeman, Annelies; Jacob, Joachim; Vanholme, Bartel; Kyndt, Tina; Mitreva, Makedonka; Gheysen, Godelieve

    2009-01-01

    In this study, 4847 expressed sequenced tags (ESTs) from mixed stages of the migratory plant-parasitic nematode Ditylenchus africanus (peanut pod nematode) were investigated. It is the first molecular survey of a nematode which belongs to the family of the Anguinidae (order Rhabditida, superfamily Sphaerularioidea). The sequences were clustered into 2596 unigenes, of which 43% did not show any homology to known protein, nucleotide, nematode EST or plant-parasitic nematode genome sequences. Gene ontology mapping revealed that most putative proteins are involved in developmental and reproductive processes. In addition unigenes involved in oxidative stress as well as in anhydrobiosis, such as LEA (late embryogenesis abundant protein) and trehalose-6-phosphate synthase were identified. Other tags showed homology to genes previously described as being involved in parasitism (expansin, SEC-2, calreticulin, 14-3-3b and various allergen proteins). In situ hybridization revealed that the expression of a putative expansin and a venom allergen protein was restricted to the gland cell area of the nematode, being in agreement with their presumed role in parasitism. Furthermore, 7 putative novel candidate parasitism genes were identified based on the prediction of a signal peptide in the corresponding protein sequence and homologous ESTs exclusively in parasitic nematodes. These genes are interesting for further research and functional characterization. Finally, 34 unigenes were retained as good target candidates for future RNAi experiments, because of their nematode specific nature and observed lethal phenotypes of Caenorhabditis elegans homologs. PMID:19383517

  18. Gene discovery and expression profile analysis through sequencing of expressed sequence tags from different developmental stages of the chytridiomycete Blastocladiella emersonii.

    PubMed

    Ribichich, Karina F; Salem-Izacc, Silvia M; Georg, Raphaela C; Vêncio, Ricardo Z N; Navarro, Luci D; Gomes, Suely L

    2005-02-01

    Blastocladiella emersonii is an aquatic fungus of the chytridiomycete class which diverged early from the fungal lineage and is notable for the morphogenetic processes which occur during its life cycle. Its particular taxonomic position makes this fungus an interesting system to be considered when investigating phylogenetic relationships and studying the biology of lower fungi. To contribute to the understanding of the complexity of the B. emersonii genome, we present here a survey of expressed sequence tags (ESTs) from various stages of the fungal development. Nearly 20,000 cDNA clones from 10 different libraries were partially sequenced from their 5' end, yielding 16,984 high-quality ESTs. These ESTs were assembled into 4,873 putative transcripts, of which 48% presented no matches with existing sequences in public databases. As a result of Gene Ontology (GO) project annotation, 1,680 ESTs (35%) were classified into biological processes of the GO structure, with transcription and RNA processing, protein biosynthesis, and transport as prevalent processes. We also report full-length sequences, useful for construction of molecular phylogenies, and several ESTs that showed high similarity with known proteins, some of which were not previously described in fungi. Furthermore, we analyzed the expression profile (digital Northern analysis) of each transcript throughout the life cycle of the fungus using Bayesian statistics. The in silico approach was validated by Northern blot analysis with good agreement between the two methodologies. PMID:15701807

  19. Adult midgut expressed sequence tags from the tsetse fly Glossina morsitans morsitans and expression analysis of putative immune response genes

    PubMed Central

    Lehane, M J; Aksoy, S; Gibson, W; Kerhornou, A; Berriman, M; Hamilton, J; Soares, M B; Bonaldo, M F; Lehane, S; Hall, N

    2003-01-01

    Background Tsetse flies transmit African trypanosomiasis leading to half a million cases annually. Trypanosomiasis in animals (nagana) remains a massive brake on African agricultural development. While trypanosome biology is widely studied, knowledge of tsetse flies is very limited, particularly at the molecular level. This is a serious impediment to investigations of tsetse-trypanosome interactions. We have undertaken an expressed sequence tag (EST) project on the adult tsetse midgut, the major organ system for establishment and early development of trypanosomes. Results A total of 21,427 ESTs were produced from the midgut of adult Glossina morsitans morsitans and grouped into 8,876 clusters or singletons potentially representing unique genes. Putative functions were ascribed to 4,035 of these by homology. Of these, a remarkable 3,884 had their most significant matches in the Drosophila protein database. We selected 68 genes with putative immune-related functions, macroarrayed them and determined their expression profiles following bacterial or trypanosome challenge. In both infections many genes are downregulated, suggesting a malaise response in the midgut. Trypanosome and bacterial challenge result in upregulation of different genes, suggesting that different recognition pathways are involved in the two responses. The most notable block of genes upregulated in response to trypanosome challenge are a series of Toll and Imd genes and a series of genes involved in oxidative stress responses. Conclusions The project increases the number of known Glossina genes by two orders of magnitude. Identification of putative immunity genes and their preliminary characterization provides a resource for the experimental dissection of tsetse-trypanosome interactions. PMID:14519198

  20. Analysis of expressed sequence tags from Musa acuminata ssp. burmannicoides, var. Calcutta 4 (AA) leaves submitted to temperature stresses.

    PubMed

    Santos, C M R; Martins, N F; Hörberg, H M; de Almeida, E R P; Coelho, M C F; Togawa, R C; da Silva, F R; Caetano, A R; Miller, R N G; Souza, M T

    2005-05-01

    In order to discover genes expressed in leaves of Musa acuminata ssp. burmannicoides var. Calcutta 4 (AA), from plants submitted to temperature stress, we produced and characterized two full-length enriched cDNA libraries. Total RNA from plants subjected to temperatures ranging from 5 degrees C to 25 degrees C and from 25 degrees C to 45 degrees C was used to produce a COLD and a HOT cDNA library, respectively. We sequenced 1,440 clones from each library. Following quality analysis and vector trimming, we assembled 2,286 sequences from both libraries into 1,019 putative transcripts, consisting of 217 clusters and 802 singletons, which we denoted Musa acuminata assembled expressed sequence tagged (EST) sequences (MaAES). Of these MaAES, 22.87% showed no matches with existing sequences in public databases. A global analysis of the MaAES data set indicated that 10% of the sequenced cDNAs are present in both cDNA libraries, while 42% and 48% are present only in the COLD or in the HOT libraries, respectively. Annotation of the MaAES data set categorized them into 22 functional classes. Of the 2,286 high-quality sequences, 715 (31.28%) originated from full-length cDNA clones and resulted in a set of 149 genes. PMID:15841358

  1. Generation and Analysis of Expressed Sequence Tags from Olea europaea L.

    PubMed Central

    Ozdemir Ozgenturk, Nehir; Oruç, Fatma; Sezerman, Ugur; Kuçukural, Alper; Vural Korkut, Senay; Toksoz, Feriha; Un, Cemal

    2010-01-01

    Olive (Olea europaea L.) is an important source of edible oil which was originated in Near-East region. In this study, two cDNA libraries were constructed from young olive leaves and immature olive fruits for generation of ESTs to discover the novel genes and search the function of unknown genes of olive. The randomly selected 3840 colonies were sequenced for EST collection from both libraries. Readable 2228 sequences for olive leaf and 1506 sequences for olive fruit were assembled into 205 and 69 contigs, respectively, whereas 2478 were singletons. Putative functions of all 2752 differentially expressed unique sequences were designated by gene homology based on BLAST and annotated using BLAST2GO. While 1339 ESTs show no homology to the database, 2024 ESTs have homology (under 80%) with hypothetical proteins, putative proteins, expressed proteins, and unknown proteins in NCBI-GenBank. 635 EST's unique genes sequence have been identified by over 80% homology to known function in other species which were not previously described in Olea family. Only 3.1% of total EST's was shown similarity with olive database existing in NCBI. This generated EST's data and consensus sequences were submitted to NCBI as valuable source for functional genome studies of olive. PMID:21197085

  2. Immune gene discovery by expressed sequence tag (EST) analysis of hemocytes in the ridgetail white prawn Exopalaemon carinicauda

    PubMed Central

    Duan, Yafei; Liu, Ping; Li, Jitao; Li, Jian; Chen, Ping

    2013-01-01

    The ridgetail white prawn Exopalaemon carinicauda is one of the most important commercial species in eastern China. However, little information of immune genes in E. carinicauda has been reported. To identify distinctive genes associated with immunity, an expressed sequence tag (EST) library was constructed from hemocytes of E. carinicauda. A total of 3411 clones were sequenced, yielding 2853 ESTs and the average sequence length is 436 bp. The cluster and assembly analysis yielded 1053 unique sequences including 329 contigs and 724 singletons. Blast analysis identified 593 (56.3%) of the unique sequences as orthologs of genes from other organisms (E-value < 1e-5). Based on the COG and Gene Ontology (GO), 593 unique sequences were classified. Through comparison with previous studies, 153 genes assembled from 367 ESTs have been identified as possibly involved in defense or immune functions. These genes are categorized into seven categories according to their putative functions in shrimp immune system: antimicrobial peptides, prophenoloxidase activating system, antioxidant defense systems, chaperone proteins, clottable proteins, pattern recognition receptors and other immune-related genes. According to EST abundance, the major immune-related genes were thioredoxin (141, 4.94% of all ESTs) and calmodulin (14, 0.49% of all ESTs). The EST sequences of E. carinicauda hemocytes provide important information of the immune system and lay the groundwork for development of molecular markers related to disease resistance in prawn species. PMID:23092732

  3. Analysis of expressed sequence tags from the anamorphic basidiomycetous yeast, Pseudozyma antarctica, which produces glycolipid biosurfactants, mannosylerythritol lipids.

    PubMed

    Morita, Tomotake; Konishi, Masaaki; Fukuoka, Tokuma; Imura, Tomohiro; Kitamoto, Dai

    2006-07-15

    Pseudozyma antarctica T-34 secretes a large amount of biosurfactants (BS), mannosylerythritol lipids (MEL), from different carbon sources such as hydrocarbons and vegetable oils. The detailed biosynthetic pathway of MEL remained unknown due to lack of genetic information on the anamorphic basidiomycetous yeasts, including the genus Pseudozyma. Here, in order to obtain genetic information on P. antarctica T-34, we constructed a cDNA library from yeast cells producing MEL from soybean oil and identified the genes expressed through the creation of an expressed sequence tags (EST) library. We generated 398 ESTs, assembled into 146 contiguous sequences. Based upon a BLAST search similarity cut-off of Esequences in the protein database; 60.3% of all contiguous sequences shared significant identities to hypothetical protein of Ustilago maydis, which is a smut fungus and BS producer. Based on the gene expression study using real-time reverse transcriptase-PCR, the predicted genes, such as mannosyltranferase and acyltransferase, were demonstrated to be highly involved in MEL biosynthesis in soybean oil-grown cells. PMID:16845679

  4. Characterization of expressed sequence tags from a full-length enriched cDNA library of Cryptomeria japonica male strobili

    PubMed Central

    Futamura, Norihiro; Totoki, Yasushi; Toyoda, Atsushi; Igasaki, Tomohiro; Nanjo, Tokihiko; Seki, Motoaki; Sakaki, Yoshiyuki; Mari, Adriano; Shinozaki, Kazuo; Shinohara, Kenji

    2008-01-01

    Background Cryptomeria japonica D. Don is one of the most commercially important conifers in Japan. However, the allergic disease caused by its pollen is a severe public health problem in Japan. Since large-scale analysis of expressed sequence tags (ESTs) in the male strobili of C. japonica should help us to clarify the overall expression of genes during the process of pollen development, we constructed a full-length enriched cDNA library that was derived from male strobili at various developmental stages. Results We obtained 36,011 expressed sequence tags (ESTs) from either one or both ends of 19,437 clones derived from the cDNA library of C. japonica male strobili at various developmental stages. The 19,437 cDNA clones corresponded to 10,463 transcripts. Approximately 80% of the transcripts resembled ESTs from Pinus and Picea, while approximately 75% had homologs in Arabidopsis. An analysis of homologies between ESTs from C. japonica male strobili and known pollen allergens in the Allergome Database revealed that products of 180 transcripts exhibited significant homology. Approximately 2% of the transcripts appeared to encode transcription factors. We identified twelve genes for MADS-box proteins among these transcription factors. The twelve MADS-box genes were classified as DEF/GLO/GGM13-, AG-, AGL6-, TM3- and TM8-like MIKCC genes and type I MADS-box genes. Conclusion Our full-length enriched cDNA library derived from C. japonica male strobili provides information on expression of genes during the development of male reproductive organs. We provided potential allergens in C. japonica. We also provided new information about transcription factors including MADS-box genes expressed in male strobili of C. japonica. Large-scale gene discovery using full-length cDNAs is a valuable tool for studies of gymnosperm species. PMID:18691438

  5. Expressed sequence tag analysis of khat (Catha edulis) provides a putative molecular biochemical basis for the biosynthesis of phenylpropylamino alkaloids.

    PubMed

    Hagel, Jillian M; Krizevski, Raz; Kilpatrick, Korey; Sitrit, Yaron; Marsolais, Frédéric; Lewinsohn, Efraim; Facchini, Peter J

    2011-10-01

    Khat (Catha edulis Forsk.) is a flowering perennial shrub cultivated for its neurostimulant properties resulting mainly from the occurrence of (S)-cathinone in young leaves. The biosynthesis of (S)-cathinone and the related phenylpropylamino alkaloids (1S,2S)-cathine and (1R,2S)-norephedrine is not well characterized in plants. We prepared a cDNA library from young khat leaves and sequenced 4,896 random clones, generating an expressed sequence tag (EST) library of 3,293 unigenes. Putative functions were assigned to > 98% of the ESTs, providing a key resource for gene discovery. Candidates potentially involved at various stages of phenylpropylamino alkaloid biosynthesis from L-phenylalanine to (1S,2S)-cathine were identified. PMID:22215969

  6. Expressed sequence tag analysis of khat (Catha edulis) provides a putative molecular biochemical basis for the biosynthesis of phenylpropylamino alkaloids

    PubMed Central

    Hagel, Jillian M.; Krizevski, Raz; Kilpatrick, Korey; Sitrit, Yaron; Marsolais, Frédéric; Lewinsohn, Efraim; Facchini, Peter J.

    2011-01-01

    Khat (Catha edulis Forsk.) is a flowering perennial shrub cultivated for its neurostimulant properties resulting mainly from the occurrence of (S)-cathinone in young leaves. The biosynthesis of (S)-cathinone and the related phenylpropylamino alkaloids (1S,2S)-cathine and (1R,2S)-norephedrine is not well characterized in plants. We prepared a cDNA library from young khat leaves and sequenced 4,896 random clones, generating an expressed sequence tag (EST) library of 3,293 unigenes. Putative functions were assigned to > 98% of the ESTs, providing a key resource for gene discovery. Candidates potentially involved at various stages of phenylpropylamino alkaloid biosynthesis from L-phenylalanine to (1S,2S)-cathine were identified. PMID:22215969

  7. Generation of expressed sequence tags under cadmium stress for gene discovery and development of molecular markers in chickpea.

    PubMed

    Gaur, Rashmi; Bhatia, Sabhyata; Gupta, Meetu

    2014-07-01

    Chickpea is the world's third most important legume crop and belongs to Fabaceae family but suffered from severe yield loss due to various biotic and abiotic stresses. Development of modern genomic tools such as molecular markers and identification of resistant genes associated with these stresses facilitate improvement in chickpea breeding towards abiotic stress tolerance. In this study, 1597 high-quality expressed sequence tags (ESTs) were generated from a cDNA library of variety Pusa 1105 root tissue after cadmium (Cd) treatment. Assembly of ESTs resulted in a total of 914 unigenes of which putative homology was obtained for 38.8 % of unigenes after BLASTX search. In terms of species distribution, majority of sequences found similarity with Medicago truncatula followed by Glycine max, Vitis vinifera and Populus trichocarpa and Pisum sativum sequences. Functional annotation was assigned using Blast2Go, and the Gene Ontology (GO) terms were categorized into biological process, molecular function and cellular component. Approximately 10.83 % of unigenes were assigned at least one GO term. Moreover, in the distribution of transcripts into various biological pathways, 20 of the annotated transcripts were assigned to ten pathways in KEGG database. A majority of the genes were found to be involved in sulphur and nitrogen metabolism. In the quantitative real-time PCR analysis, five of the transcription factors and three of the transporter genes were found to be highly expressed after Cd treatment. Besides, the utility of ESTs was demonstrated by exploiting them for the development of 83 genic molecular markers including EST-simple sequence repeats and intron targeted polymorphism that would assist in tagging of genes related to metal stress for future prospects. PMID:24414095

  8. Development of expressed sequence tag resources for Vanda Mimi Palmer and data mining for EST-SSR.

    PubMed

    Teh, Seow-Ling; Chan, Wai-Sun; Abdullah, Janna Ong; Namasivayam, Parameswari

    2011-08-01

    Vanda Mimi Palmer (VMP) is a highly sought as fragrant-orchid hybrid in Malaysia. It is economically important in cosmetic and beauty industries and also a famous potted ornamental plant. To date, no work on fragrance-related genes of vandaceous orchids has been reported from other research groups although the analysis of floral fragrance or volatiles have been extensively studied. An expressed sequence tag (EST) resource was developed for VMP principally to mine any potential fragrance-related expressed sequence tag-simple sequence repeat (EST-SSR) for future development as markers in the identification of fragrant vandaceous orchids endemic to Malaysia. Clustering, annotation and assembling of the ESTs identified 1,196 unigenes which defined 966 singletons and 230 contigs. The VMP dbEST was functionally classified by gene ontology (GO) into three groups: molecular functions (51.2%), cellular components (16.4%) and biological processes (24.6%) while the remaining 7.8% showed no hits with GO identifier. A total of 112 EST-SSR (9.4%) was mined on which at least five units of di-, tri-, tetra-, penta-, or hexa-nucleotide repeats were predicted. The di-nucleotide motif repeats appeared to be the most frequent repeats among the detected SSRs with the AT/TA types as the most abundant among the dimerics, while AAG/TTC, AGA/TCT-type were the most frequent trimerics. The mined EST-SSR is believed to be useful in the development of EST-SSR markers that is applicable in the screening and characterization of fragrance-related transcripts in closely related species. PMID:21116862

  9. Identification of Anhydrobiosis-related Genes from an Expressed Sequence Tag Database in the Cryptobiotic Midge Polypedilum vanderplanki (Diptera; Chironomidae)*

    PubMed Central

    Cornette, Richard; Kanamori, Yasushi; Watanabe, Masahiko; Nakahara, Yuichi; Gusev, Oleg; Mitsumasu, Kanako; Kadono-Okuda, Keiko; Shimomura, Michihiko; Mita, Kazuei; Kikawada, Takahiro; Okuda, Takashi

    2010-01-01

    Some organisms are able to survive the loss of almost all their body water content, entering a latent state known as anhydrobiosis. The sleeping chironomid (Polypedilum vanderplanki) lives in the semi-arid regions of Africa, and its larvae can survive desiccation in an anhydrobiotic form during the dry season. To unveil the molecular mechanisms of this resistance to desiccation, an anhydrobiosis-related Expressed Sequence Tag (EST) database was obtained from the sequences of three cDNA libraries constructed from P. vanderplanki larvae after 0, 12, and 36 h of desiccation. The database contained 15,056 ESTs distributed into 4,807 UniGene clusters. ESTs were classified according to gene ontology categories, and putative expression patterns were deduced for all clusters on the basis of the number of clones in each library; expression patterns were confirmed by real-time PCR for selected genes. Among up-regulated genes, antioxidants, late embryogenesis abundant (LEA) proteins, and heat shock proteins (Hsps) were identified as important groups for anhydrobiosis. Genes related to trehalose metabolism and various transporters were also strongly induced by desiccation. Those results suggest that the oxidative stress response plays a central role in successful anhydrobiosis. Similarly, protein denaturation and aggregation may be prevented by marked up-regulation of Hsps and the anhydrobiosis-specific LEA proteins. A third major feature is the predicted increase in trehalose synthesis and in the expression of various transporter proteins allowing the distribution of trehalose and other solutes to all tissues. PMID:20833722

  10. An expressed sequence tag database of T-cell-enriched activated chicken splenocytes: sequence analysis of 5251 clones.

    PubMed

    Tirunagaru, V G; Sofer, L; Cui, J; Burnside, J

    2000-06-01

    The cDNA and gene sequences of many mammalian cytokines and their receptors are known. However, corresponding information on avian cytokines is limited due to the lack of cross-species activity at the functional level or strong homology at the molecular level. To improve the efficiency of identifying cytokines and novel chicken genes, a directionally cloned cDNA library from T-cell-enriched activated chicken splenocytes was constructed, and the partial sequence of 5251 clones was obtained. Sequence clustering indicates that 2357 (42%) of the clones are present as a single copy, and 2961 are distinct clones, demonstrating the high level of complexity of this library. Comparisons of the sequence data with known DNA sequences in GenBank indicate that approximately 25% of the clones match known chicken genes, 39% have similarity to known genes in other species, and 11% had no match to any sequence in the database. Several previously uncharacterized chicken cytokines and their receptors were present in our library. This collection provides a useful database for cataloging genes expressed in T cells and a valuable resource for future investigations of gene expression in avian immunology. A chicken EST Web site (http://udgenome. ags.udel. edu/chickest/chick.htm) has been created to provide access to the data, and a set of unique sequences has been deposited with GenBank (Accession Nos. AI979741-AI982511). Our new Web site (http://www. chickest.udel.edu) will be active as of March 3, 2000, and will also provide keyword-searching capabilities for BLASTX and BLASTN hits of all our clones. PMID:10860659

  11. Expressed sequence tags from the red imported fire ant, Solenopsis invicta: annotation and utilization for discovery of viruses.

    PubMed

    Valles, Steven M; Strong, Charles A; Hunter, Wayne B; Dang, Phat M; Pereira, Roberto M; Oi, David H; Williams, David F

    2008-09-01

    An expression library was created and 2304 clones sequenced from a monogyne colony of Solenopsis invicta. The primary intention of the project was to utilize homologous gene identification to facilitate discovery of viruses infecting this ant pest that could potentially be used in pest management. Additional genes were identified from the ant host and associated pathogens that serve as an important resource for studying these organisms. After assembly and removal of mitochondrial and poor quality sequences, 1054 unique sequences were yielded and deposited into the GenBank database under Accession Nos. EH412746 through EH413799. At least nine expressed sequence tags (ESTs) were identified as possessing microsatellite motifs and 15 ESTs exhibited significant homology with microsporidian genes. These sequences most likely originated from Thelohania solenopsae, a well-characterized microsporidian that infects S. invicta. Six ESTs exhibited significant homology with single-stranded RNA viruses (3B4, 3F6, 11F1, 12G12, 14D5, and 24C10). Subsequent analysis of these putative viral ESTs revealed that 3B4 was most likely a ribosomal gene of S. invicta, 11F1 was a single-stranded RNA (ssRNA) virus contaminant introduced into the colony from the cricket food source, 12G12 appeared to be a plant-infecting tenuivirus also introduced into the colony as a field contaminant, and 3F6, 14D5, and 24C10 were all from a unique ssRNA virus found to infect S. invicta. The sequencing project illustrates the utility of this method for discovery of viruses and pathogens that may otherwise go undiscovered. PMID:18329665

  12. Construction of a Lotus japonicus late nodulin expressed sequence tag library and identification of novel nodule-specific genes.

    PubMed Central

    Szczyglowski, K; Hamburger, D; Kapranov, P; de Bruijn, F J

    1997-01-01

    A range of novel expressed sequence tags (ESTs) associated with late developmental events during nodule organogenesis in the legume Lotus japonicus were identified using mRNA differential display; 110 differentially displayed polymerase chain reaction products were cloned and analyzed. Of 88 unique cDNAs obtained, 22 shared significant homology to DNA/protein sequences in the respective databases. This group comprises, among others, a nodule-specific homolog of protein phosphatase 2C, a peptide transporter protein, and a nodule-specific form of cytochrome P450. RNA gel-blot analysis of 16 differentially displayed ESTs confirmed their nodule-specific expression pattern. The kinetics of mRNA accumulation of the majority of the ESTs analyzed were found to resemble the expression pattern observed for the L. japonicus leghemoglobin gene. These results indicate that the newly isolated molecular markers correspond to genes induced during late developmental stages of L. japonicus nodule organogenesis and provide important, novel tools for the study of nodulation. PMID:9276951

  13. Integration of Expressed Sequence Tag Data Flanking Predicted RNA Secondary Structures Facilitates Novel Non-Coding RNA Discovery

    PubMed Central

    Krzyzanowski, Paul M.; Price, Feodor D.; Muro, Enrique M.; Rudnicki, Michael A.; Andrade-Navarro, Miguel A.

    2011-01-01

    Many computational methods have been used to predict novel non-coding RNAs (ncRNAs), but none, to our knowledge, have explicitly investigated the impact of integrating existing cDNA-based Expressed Sequence Tag (EST) data that flank structural RNA predictions. To determine whether flanking EST data can assist in microRNA (miRNA) prediction, we identified genomic sites encoding putative miRNAs by combining functional RNA predictions with flanking ESTs data in a model consistent with miRNAs undergoing cleavage during maturation. In both human and mouse genomes, we observed that the inclusion of flanking ESTs adjacent to and not overlapping predicted miRNAs significantly improved the performance of various methods of miRNA prediction, including direct high-throughput sequencing of small RNA libraries. We analyzed the expression of hundreds of miRNAs predicted to be expressed during myogenic differentiation using a customized microarray and identified several known and predicted myogenic miRNA hairpins. Our results indicate that integrating ESTs flanking structural RNA predictions improves the quality of cleaved miRNA predictions and suggest that this strategy can be used to predict other non-coding RNAs undergoing cleavage during maturation. PMID:21698286

  14. Integration of expressed sequence tag data flanking predicted RNA secondary structures facilitates novel non-coding RNA discovery.

    PubMed

    Krzyzanowski, Paul M; Price, Feodor D; Muro, Enrique M; Rudnicki, Michael A; Andrade-Navarro, Miguel A

    2011-01-01

    Many computational methods have been used to predict novel non-coding RNAs (ncRNAs), but none, to our knowledge, have explicitly investigated the impact of integrating existing cDNA-based Expressed Sequence Tag (EST) data that flank structural RNA predictions. To determine whether flanking EST data can assist in microRNA (miRNA) prediction, we identified genomic sites encoding putative miRNAs by combining functional RNA predictions with flanking ESTs data in a model consistent with miRNAs undergoing cleavage during maturation. In both human and mouse genomes, we observed that the inclusion of flanking ESTs adjacent to and not overlapping predicted miRNAs significantly improved the performance of various methods of miRNA prediction, including direct high-throughput sequencing of small RNA libraries. We analyzed the expression of hundreds of miRNAs predicted to be expressed during myogenic differentiation using a customized microarray and identified several known and predicted myogenic miRNA hairpins. Our results indicate that integrating ESTs flanking structural RNA predictions improves the quality of cleaved miRNA predictions and suggest that this strategy can be used to predict other non-coding RNAs undergoing cleavage during maturation. PMID:21698286

  15. Development of expressed sequence tag-simple sequence repeat markers for genetic characterization and population structure analysis of Praxelis clematidea (Asteraceae).

    PubMed

    Wang, Q Z; Huang, M; Downie, S R; Chen, Z X

    2016-01-01

    Invasive plants tend to spread aggressively in new habitats and an understanding of their genetic diversity and population structure is useful for their management. In this study, expressed sequence tag-simple sequence repeat (EST-SSR) markers were developed for the invasive plant species Praxelis clematidea (Asteraceae) from 5548 Stevia rebaudiana (Asteraceae) expressed sequence tags (ESTs). A total of 133 microsatellite-containing ESTs (2.4%) were identified, of which 56 (42.1%) were hexanucleotide repeat motifs and 50 (37.6%) were trinucleotide repeat motifs. Of the 24 primer pairs designed from these 133 ESTs, 7 (29.2%) resulted in significant polymorphisms. The number of alleles per locus ranged from 5 to 9. The relatively high genetic diversity (H = 0.2667, I = 0.4212, and P = 100%) of P. clematidea was related to high gene flow (Nm = 1.4996) among populations. The coefficient of population differentiation (GST = 0.2500) indicated that most genetic variation occurred within populations. A Mantel test suggested that there was significant correlation between genetic distance and geographical distribution (r = 0.3192, P = 0.012). These results further support the transferability of EST-SSR markers between closely related genera of the same family. PMID:27323082

  16. SSH Analysis of Endosperm Transcripts and Characterization of Heat Stress Regulated Expressed Sequence Tags in Bread Wheat

    PubMed Central

    Goswami, Suneha; Kumar, Ranjeet R.; Dubey, Kavita; Singh, Jyoti P.; Tiwari, Sachidanand; Kumar, Ashok; Smita, Shuchi; Mishra, Dwijesh C.; Kumar, Sanjeev; Grover, Monendra; Padaria, Jasdeep C.; Kala, Yugal K.; Singh, Gyanendra P.; Pathak, Himanshu; Chinnusamy, Viswanathan; Rai, Anil; Praveen, Shelly; Rai, Raj D.

    2016-01-01

    Heat stress is one of the major problems in agriculturally important cereal crops, especially wheat. Here, we have constructed a subtracted cDNA library from the endosperm of HS-treated (42°C for 2 h) wheat cv. HD2985 by suppression subtractive hybridization (SSH). We identified ~550 recombinant clones ranging from 200 to 500 bp with an average size of 300 bp. Sanger's sequencing was performed with 205 positive clones to generate the differentially expressed sequence tags (ESTs). Most of the ESTs were observed to be localized on the long arm of chromosome 2A and associated with heat stress tolerance and metabolic pathways. Identified ESTs were BLAST search using Ensemble, TriFLD, and TIGR databases and the predicted CDS were translated and aligned with the protein sequences available in pfam and InterProScan 5 databases to predict the differentially expressed proteins (DEPs). We observed eight different types of post-translational modifications (PTMs) in the DEPs corresponds to the cloned ESTs-147 sites with phosphorylation, 21 sites with sumoylation, 237 with palmitoylation, 96 sites with S-nitrosylation, 3066 calpain cleavage sites, and 103 tyrosine nitration sites, predicted to sense the heat stress and regulate the expression of stress genes. Twelve DEPs were observed to have transmembrane helixes (TMH) in their structure, predicted to play the role of sensors of HS. Quantitative Real-Time PCR of randomly selected ESTs showed very high relative expression of HSP17 under HS; up-regulation was observed more in wheat cv. HD2985 (thermotolerant), as compared to HD2329 (thermosusceptible) during grain-filling. The abundance of transcripts was further validated through northern blot analysis. The ESTs and their corresponding DEPs can be used as molecular marker for screening or targeted precision breeding program. PTMs identified in the DEPs can be used to elucidate the thermotolerance mechanism of wheat—a novel step toward the development of

  17. SSH Analysis of Endosperm Transcripts and Characterization of Heat Stress Regulated Expressed Sequence Tags in Bread Wheat.

    PubMed

    Goswami, Suneha; Kumar, Ranjeet R; Dubey, Kavita; Singh, Jyoti P; Tiwari, Sachidanand; Kumar, Ashok; Smita, Shuchi; Mishra, Dwijesh C; Kumar, Sanjeev; Grover, Monendra; Padaria, Jasdeep C; Kala, Yugal K; Singh, Gyanendra P; Pathak, Himanshu; Chinnusamy, Viswanathan; Rai, Anil; Praveen, Shelly; Rai, Raj D

    2016-01-01

    Heat stress is one of the major problems in agriculturally important cereal crops, especially wheat. Here, we have constructed a subtracted cDNA library from the endosperm of HS-treated (42°C for 2 h) wheat cv. HD2985 by suppression subtractive hybridization (SSH). We identified ~550 recombinant clones ranging from 200 to 500 bp with an average size of 300 bp. Sanger's sequencing was performed with 205 positive clones to generate the differentially expressed sequence tags (ESTs). Most of the ESTs were observed to be localized on the long arm of chromosome 2A and associated with heat stress tolerance and metabolic pathways. Identified ESTs were BLAST search using Ensemble, TriFLD, and TIGR databases and the predicted CDS were translated and aligned with the protein sequences available in pfam and InterProScan 5 databases to predict the differentially expressed proteins (DEPs). We observed eight different types of post-translational modifications (PTMs) in the DEPs corresponds to the cloned ESTs-147 sites with phosphorylation, 21 sites with sumoylation, 237 with palmitoylation, 96 sites with S-nitrosylation, 3066 calpain cleavage sites, and 103 tyrosine nitration sites, predicted to sense the heat stress and regulate the expression of stress genes. Twelve DEPs were observed to have transmembrane helixes (TMH) in their structure, predicted to play the role of sensors of HS. Quantitative Real-Time PCR of randomly selected ESTs showed very high relative expression of HSP17 under HS; up-regulation was observed more in wheat cv. HD2985 (thermotolerant), as compared to HD2329 (thermosusceptible) during grain-filling. The abundance of transcripts was further validated through northern blot analysis. The ESTs and their corresponding DEPs can be used as molecular marker for screening or targeted precision breeding program. PTMs identified in the DEPs can be used to elucidate the thermotolerance mechanism of wheat-a novel step toward the development of "climate-smart" wheat

  18. Generation of expressed sequence tags of random root cDNA clones of Brassica napus by single-run partial sequencing.

    PubMed Central

    Park, Y S; Kwak, J M; Kwon, O Y; Kim, Y S; Lee, D S; Cho, M J; Lee, H H; Nam, H G

    1993-01-01

    Two hundred thirty-seven expressed sequence tags (ESTs) of Brassica napus were generated by single-run partial sequencing of 197 random root cDNA clones. A computer search of these root ESTs revealed that 21 ESTs show significant similarity to the protein-coding sequences in the existing data bases, including five stress- or defense-related genes and four clones related to the genes from other kingdoms. Northern blot analysis of the 10 data base-matched cDNA clones revealed that many of the clones are expressed most abundantly in root but less abundantly in other organs. However, two clones were highly root specific. The results show that generation of the root ESTs by partial sequencing of random cDNA clones along with the expression analysis is an efficient approach to isolate genes that are functional in plant root in a large scale. We also discuss the results of the examination of cDNA libraries and sequencing methods suitable for this approach. PMID:8029332

  19. A High-Throughput Data Mining of Single Nucleotide Polymorphisms in Coffea Species Expressed Sequence Tags Suggests Differential Homeologous Gene Expression in the Allotetraploid Coffea arabica1[W

    PubMed Central

    Vidal, Ramon Oliveira; Mondego, Jorge Maurício Costa; Pot, David; Ambrósio, Alinne Batista; Andrade, Alan Carvalho; Pereira, Luiz Filipe Protasio; Colombo, Carlos Augusto; Vieira, Luiz Gonzaga Esteves; Carazzolle, Marcelo Falsarella; Pereira, Gonçalo Amarante Guimarães

    2010-01-01

    Polyploidization constitutes a common mode of evolution in flowering plants. This event provides the raw material for the divergence of function in homeologous genes, leading to phenotypic novelty that can contribute to the success of polyploids in nature or their selection for use in agriculture. Mounting evidence underlined the existence of homeologous expression biases in polyploid genomes; however, strategies to analyze such transcriptome regulation remained scarce. Important factors regarding homeologous expression biases remain to be explored, such as whether this phenomenon influences specific genes, how paralogs are affected by genome doubling, and what is the importance of the variability of homeologous expression bias to genotype differences. This study reports the expressed sequence tag assembly of the allopolyploid Coffea arabica and one of its direct ancestors, Coffea canephora. The assembly was used for the discovery of single nucleotide polymorphisms through the identification of high-quality discrepancies in overlapped expressed sequence tags and for gene expression information indirectly estimated by the transcript redundancy. Sequence diversity profiles were evaluated within C. arabica (Ca) and C. canephora (Cc) and used to deduce the transcript contribution of the Coffea eugenioides (Ce) ancestor. The assignment of the C. arabica haplotypes to the C. canephora (CaCc) or C. eugenioides (CaCe) ancestral genomes allowed us to analyze gene expression contributions of each subgenome in C. arabica. In silico data were validated by the quantitative polymerase chain reaction and allele-specific combination TaqMAMA-based method. The presence of differential expression of C. arabica homeologous genes and its implications in coffee gene expression, ontology, and physiology are discussed. PMID:20864545

  20. A high-throughput data mining of single nucleotide polymorphisms in Coffea species expressed sequence tags suggests differential homeologous gene expression in the allotetraploid Coffea arabica.

    PubMed

    Vidal, Ramon Oliveira; Mondego, Jorge Maurício Costa; Pot, David; Ambrósio, Alinne Batista; Andrade, Alan Carvalho; Pereira, Luiz Filipe Protasio; Colombo, Carlos Augusto; Vieira, Luiz Gonzaga Esteves; Carazzolle, Marcelo Falsarella; Pereira, Gonçalo Amarante Guimarães

    2010-11-01

    Polyploidization constitutes a common mode of evolution in flowering plants. This event provides the raw material for the divergence of function in homeologous genes, leading to phenotypic novelty that can contribute to the success of polyploids in nature or their selection for use in agriculture. Mounting evidence underlined the existence of homeologous expression biases in polyploid genomes; however, strategies to analyze such transcriptome regulation remained scarce. Important factors regarding homeologous expression biases remain to be explored, such as whether this phenomenon influences specific genes, how paralogs are affected by genome doubling, and what is the importance of the variability of homeologous expression bias to genotype differences. This study reports the expressed sequence tag assembly of the allopolyploid Coffea arabica and one of its direct ancestors, Coffea canephora. The assembly was used for the discovery of single nucleotide polymorphisms through the identification of high-quality discrepancies in overlapped expressed sequence tags and for gene expression information indirectly estimated by the transcript redundancy. Sequence diversity profiles were evaluated within C. arabica (Ca) and C. canephora (Cc) and used to deduce the transcript contribution of the Coffea eugenioides (Ce) ancestor. The assignment of the C. arabica haplotypes to the C. canephora (CaCc) or C. eugenioides (CaCe) ancestral genomes allowed us to analyze gene expression contributions of each subgenome in C. arabica. In silico data were validated by the quantitative polymerase chain reaction and allele-specific combination TaqMAMA-based method. The presence of differential expression of C. arabica homeologous genes and its implications in coffee gene expression, ontology, and physiology are discussed. PMID:20864545

  1. RefNetBuilder: a platform for construction of integrated reference gene regulatory networks from expressed sequence tags

    PubMed Central

    2011-01-01

    Background Gene Regulatory Networks (GRNs) provide integrated views of gene interactions that control biological processes. Many public databases contain biological interactions extracted from experimentally validated literature reports, but most furnish only information for a few genetic model organisms. In order to provide a bioinformatic tool for researchers who work with non-model organisms, we developed RefNetBuilder, a new platform that allows construction of putative reference pathways or GRNs from expressed sequence tags (ESTs). Results RefNetBuilder was designed to have the flexibility to extract and archive pathway or GRN information from public databases such as the Kyoto Encyclopedia of Genes and Genomes (KEGG). It features sequence alignment tools such as BLAST to allow mapping ESTs to pathways and GRNs in model organisms. A scoring algorithm was incorporated to rank and select the best match for each query EST. We validated RefNetBuilder using DNA sequences of Caenorhabditis elegans, a model organism having manually curated KEGG pathways. Using the earthworm Eisenia fetida as an example, we demonstrated the functionalities and features of RefNetBuilder. Conclusions The RefNetBuilder provides a standalone application for building reference GRNs for non-model organisms on a number of operating system platforms with standard desktop computer hardware. As a new bioinformatic tool aimed for constructing putative GRNs for non-model organisms that have only ESTs available, RefNetBuilder is especially useful to explore pathway- or network-related information in these organisms. PMID:22166047

  2. In silico identification of conserved microRNAs and their target transcripts from expressed sequence tags of three earthworm species.

    PubMed

    Gong, Ping; Xie, Fuliang; Zhang, Baohong; Perkins, Edward J

    2010-12-01

    MicroRNAs are a recently identified class of small regulatory RNAs that target more than 30% protein-coding genes. Elevating evidence shows that miRNAs play a critical role in many biological processes, including developmental timing, tissue differentiation, and response to chemical exposure. In this study, we applied a computational approach to analyze expressed sequence tags, and identified 32 miRNAs belonging to 22 miRNA families, in three earthworm species Eisenia fetida, Eisenia andrei, and Lumbricus rubellus. These newly identified earthworm miRNAs possess a difference of 2-4 nucleotides from their homologous counterparts in Caenorhabditis elegans. They also share similar features with other known animal miRNAs, for instance, the nucleotide U being dominant in both mature and pre-miRNA sequences, particularly in the first position of mature miRNA sequences at the 5' end. The newly identified earthworm miRNAs putatively regulate mRNA genes that are involved in many important biological processes and pathways related to development, growth, locomotion, and reproduction as well as response to stresses, particularly oxidative stress. Future efforts will focus on experimental validation of their presence and target mRNA genes to further elucidate their biological functions in earthworms. PMID:21030313

  3. Analysis of expressed sequence tags from the blue-green sharpshooter, Graphocephala atropunctata

    Technology Transfer Automated Retrieval System (TEKTRAN)

    We used a metagenomic approach and identified and sequenced 6,836 genetic sequences isolated from adult blue-green sharpshooters, BGSS, Graphocephala atropunctata. These results provided over 70% of the mitochondrial genome sequence which is being completed. The BGSS is endemic to southern Californ...

  4. Protein identities from 'Graphocephala atropunctata' expressed sequence tags: Expanding leafhopper vector biology

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Heat shock proteins and 44 protein sequences from the blue-green sharpshooter, BGSS, were produced and identified. The sequences were submitted and published under accession numbers: DQ445499-DQ445542, in the National Center for Biotechnology Information, NCBI, Public Database. The blue-green sharps...

  5. Ribosomal proteins and expressed sequence tags from Lysiphlebus testaceipes(Hymenoptera: Aphidiidae)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A dataset containing 101 putative ribosomal protein (RP) sequences is provided for the aphid parasitoid, Lysiphlebus testaceipes. These data were obtained as a subset from a cDNA library constructed from adult L. testaceipes, and represent one of the largest complete sets of cytoplasmic RP sequence...

  6. Transcriptome analysis of the phytopathogenic fungus Rhizoctonia solani AG1-IB 7/3/14 applying high-throughput sequencing of expressed sequence tags (ESTs).

    PubMed

    Wibberg, Daniel; Jelonek, Lukas; Rupp, Oliver; Kröber, Magdalena; Goesmann, Alexander; Grosch, Rita; Pühler, Alfred; Schlüter, Andreas

    2014-01-01

    Rhizoctonia solani is a soil-borne plant pathogenic fungus of the phylum Basidiomycota. It affects a wide range of agriculturally important crops and hence is responsible for economically relevant crop losses. Transcriptome analysis of the bottom rot pathogen R. solani AG1-1B (isolate 7/3/14) by applying high-throughput sequencing and bioinformatics methods addressing Expressed Sequence Tag (EST) data interpretation provided new insights in expressed genes of this fungus. Two normalized cDNA libraries representing different cultivation conditions of the fungus were sequenced on the 454 FLX (Roche) system. Subsequent to cDNA sequence assembly and quality control, ESTs were analysed applying advanced bioinformatics methods. More than 14 000 transcript isoforms originating from approximately 10 000 predictable R. solani AG1-IB 7/3/14 genes are represented in each dataset. Comparative analyses revealed several differentially expressed genes depending on the growth conditions applied. Determinants with predicted functions in recognition processes between the fungus and the host plant were identified. Moreover, many R. solani AG1-IB ESTs were predicted to encode putative cellulose, pectin, and lignin degrading enzymes. Furthermore, genes playing a possible role in mitogen-activated protein (MAP) kinase cascades, 4-aminobutyric acid (GABA) metabolism, melanin synthesis, plant defence antagonism, phytotoxin, and mycotoxin synthesis were detected. PMID:25209639

  7. Analysis of expressed sequence tags (ESTs) from a normalized cDNA library and isolation of EST simple sequence repeats from the invasive cotton mealybug Phenacoccus solenopsis.

    PubMed

    Li, Hui; Lang, Kun-Ling; Fu, Hai-Bin; Shen, Chang-Peng; Wan, Fang-Hao; Chu, Dong

    2015-12-01

    The cotton mealybug, Phenacoccus solenopsis Tinsley, is a serious and invasive pest. At present, genetic resources for studying P. solenopsis are limited, and this negatively affects genetic research on the organism and, consequently, translational work to improve management of this pest. In the present study, expressed sequence tags (ESTs) were analyzed from a normalized complementary DNA library of P. solenopsis. In addition, EST-derived microsatellite loci (also known as simple sequence repeats or SSRs) were isolated and characterized. A total of 1107 high-quality ESTs were acquired from the library. Clustering and assembly analysis resulted in 785 unigenes, which were classified functionally into 23 categories according to the Gene Ontology database. Seven EST-based SSR markers were developed in this study and are expected to be useful in characterizing how this invasive species was introduced, as well as providing insights into its genetic microevolution. PMID:25380551

  8. Expressed Sequence Tags Analysis and Design of Simple Sequence Repeats Markers from a Full-Length cDNA Library in Perilla frutescens (L.)

    PubMed Central

    Seong, Eun Soo; Yoo, Ji Hye; Choi, Jae Hoo; Kim, Chang Heum; Jeon, Mi Ran; Kang, Byeong Ju; Lee, Jae Geun; Choi, Seon Kang; Ghimire, Bimal Kumar; Yu, Chang Yeon

    2015-01-01

    Perilla frutescens is valuable as a medicinal plant as well as a natural medicine and functional food. However, comparative genomics analyses of P. frutescens are limited due to a lack of gene annotations and characterization. A full-length cDNA library from P. frutescens leaves was constructed to identify functional gene clusters and probable EST-SSR markers via analysis of 1,056 expressed sequence tags. Unigene assembly was performed using basic local alignment search tool (BLAST) homology searches and annotated Gene Ontology (GO). A total of 18 simple sequence repeats (SSRs) were designed as primer pairs. This study is the first to report comparative genomics and EST-SSR markers from P. frutescens will help gene discovery and provide an important source for functional genomics and molecular genetic research in this interesting medicinal plant. PMID:26664999

  9. Identification and characterization of 43 microsatellite markers derived from expressed sequence tags of the sea cucumber ( Apostichopus japonicus)

    NASA Astrophysics Data System (ADS)

    Jiang, Qun; Li, Qi; Yu, Hong; Kong, Lingfeng

    2011-06-01

    The sea cucumber Apostichopus japonicus is a commercially and ecologically important species in China. A total of 3056 potential unigenes were generated after assembling 7597 A. japonicus expressed sequence tags (ESTs) downloaded from Gen-Bank. Two hundred and fifty microsatellite-containing ESTs (8.18%) and 299 simple sequence repeats (SSRs) were detected. The average density of SSRs was 1 per 7.403 kb of EST after redundancy elimination. Di-nucleotide repeat motifs appeared to be the most abundant type with a percentage of 69.90%. Of the 126 primer pairs designed, 90 amplified the expected products and 43 showed polymorphism in 30 individuals tested. The number of alleles per locus ranged from 2 to 26 with an average of 7.0 alleles, and the observed and expected heterozygosities varied from 0.067 to 1.000 and from 0.066 to 0.959, respectively. These new EST-derived microsatellite markers would provide sufficient polymorphism for population genetic studies and genome mapping of this sea cucumber species.

  10. Development and characterization of new single nucleotide polymorphism markers from expressed sequence tags in common carp (Cyprinus carpio).

    PubMed

    Zhu, Chuankun; Cheng, Lei; Tong, Jingou; Yu, Xiaomu

    2012-01-01

    The common carp (Cyprinus carpio) is an important aquaculture fish worldwide but only limited single nucleotide polymorphism (SNP) markers are characterized from expressed sequence tags (ESTs) in this species. In this study, 1487 putative SNPs were bioinformatically mined from 14,066 online ESTs mainly from the European common carp, with the occurrence rate of about one SNP every 173 bp. One hundred and twenty-one of these SNPs were selected for validation using PCR fragment sequencing, and 48 out of 81 primers could amplify the expected fragments in the Chinese common carp genome. Only 26 (21.5%) putative SNPs were validated, however, 508 new SNPs and 68 indels were identified. The ratios of transitions to transversions were 1.77 for exon SNPs and 1.05 for intron SNPs. All the 23 SNPs selected for population tests were polymorphic, with the observed heterozygosity (Ho) ranging from 0.053 to 0.526 (mean 0.262), polymorphism information content (PIC) from 0.095 to 0.357 (mean 0.246), and 21 SNPs were in Hardy-Weinberg equilibrium. These results suggest that different common carp populations with geographic isolation have significant genetic variation at the SNP level, and these new EST-SNP markers are readily available for genetics and breeding studies in common carp. PMID:22837697

  11. Gene expression profiling of coelomic cells and discovery of immune-related genes in the earthworm, Eisenia andrei, using expressed sequence tags.

    PubMed

    Tak, Eun Sik; Cho, Sung-Jin; Park, Soon Cheol

    2015-01-01

    The coelomic cells of the earthworm consist of leukocytes, chlorogocytes, and coelomocytes, which play an important role in innate immunity reactions. To gain insight into the expression profiles of coelomic cells of the earthworm, Eisenia andrei, we analyzed 1151 expressed sequence tags (ESTs) derived from the cDNA library of the coelomic cells. Among the 1151 ESTs analyzed, 493 ESTs (42.8%) showed a significant similarity to known genes and represented 164 unique genes, of which 93 ESTs were singletons and 71 ESTs manifested as two or more ESTs. From the 164 unique genes sequenced, we found 24 immune-related and cell defense genes. Furthermore, real-time PCR analysis showed that levels of lysenin-related proteins mRNA in coelomic cells of E. andrei were upregulated after the injection of Bacillus subtilis bacteria. This EST data-set would provide a valuable resource for future researches of earthworm immune system. PMID:25496401

  12. Comparative analysis and functional annotation of a large expressed sequence tag collection of apple

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A total of 34 apple cDNA libraries were constructed from root, leaf, bud, shoot, flower, and fruit tissues, at varying developmental stages and/or under biotic or abiotic stress conditions, and of several genotypes. From these libraries, 190,425 clones were partially sequenced from the 5’ end and 4...

  13. Analysis and functional annotation of expressed sequence tags from the Asian longhorned beetle, Anoplophora glabripennis

    Technology Transfer Automated Retrieval System (TEKTRAN)

    We identified 600 genetic sequences of which ~380 were uniquely identified to the Asian longhorned beetle (ALB), Anoplophora glabripennis, (Coleoptera) which is one of the most serious invasive forest insect pests discovered in North America in recent years. Despite the substantial impact of this p...

  14. Protein identities - Graphocephala atropunctata expressed sequenced tags: expanding leafhopper vector biology

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A small heat shock protein was isolated and sequenced from the Blue-green sharpshooter, BGSS, Graphocephala atropunctata (Signoret) (Hemiptera: Cicadellidae). The BGSS has been the native vector of Pierce’s disease in vineyards in California for nearly a century. The importance of this vector spec...

  15. Expressed sequence tags (ESTs) from immune tissues of turbot (Scophthalmus maximus) challenged with pathogens

    PubMed Central

    Pardo, Belén G; Fernández, Carlos; Millán, Adrián; Bouza, Carmen; Vázquez-López, Araceli; Vera, Manuel; Alvarez-Dios, José A; Calaza, Manuel; Gómez-Tato, Antonio; Vázquez, María; Cabaleiro, Santiago; Magariños, Beatriz; Lemos, Manuel L; Leiro, José M; Martínez, Paulino

    2008-01-01

    Background The turbot (Scophthalmus maximus; Scophthalmidae; Pleuronectiformes) is a flatfish species of great relevance for marine aquaculture in Europe. In contrast to other cultured flatfish, very few genomic resources are available in this species. Aeromonas salmonicida and Philasterides dicentrarchi are two pathogens that affect turbot culture causing serious economic losses to the turbot industry. Little is known about the molecular mechanisms for disease resistance and host-pathogen interactions in this species. In this work, thousands of ESTs for functional genomic studies and potential markers linked to ESTs for mapping (microsatellites and single nucleotide polymorphisms (SNPs)) are provided. This information enabled us to obtain a preliminary view of regulated genes in response to these pathogens and it constitutes the basis for subsequent and more accurate microarray analysis. Results A total of 12584 cDNAs partially sequenced from three different cDNA libraries of turbot (Scophthalmus maximus) infected with Aeromonas salmonicida, Philasterides dicentrarchi and from healthy fish were analyzed. Three immune-relevant tissues (liver, spleen and head kidney) were sampled at several time points in the infection process for library construction. The sequences were processed into 9256 high-quality sequences, which constituted the source for the turbot EST database. Clustering and assembly of these sequences, revealed 3482 different putative transcripts, 1073 contigs and 2409 singletons. BLAST searches with public databases detected significant similarity (e-value ≤ 1e-5) in 1766 (50.7%) sequences and 816 of them (23.4%) could be functionally annotated. Two hundred three of these genes (24.9%), encoding for defence/immune-related proteins, were mostly identified for the first time in turbot. Some ESTs showed significant differences in the number of transcripts when comparing the three libraries, suggesting regulation in response to these pathogens. A total of

  16. Rediscovering Medicinal Plants' Potential with OMICS: Microsatellite Survey in Expressed Sequence Tags of Eleven Traditional Plants with Potent Antidiabetic Properties

    PubMed Central

    Sahu, Jagajjit; Sen, Priyabrata; Choudhury, Manabendra Dutta; Dehury, Budheswar; Barooah, Madhumita; Modi, Mahendra Kumar

    2014-01-01

    Abstract Herbal medicines and traditionally used medicinal plants present an untapped potential for novel molecular target discovery using systems science and OMICS biotechnology driven strategies. Since up to 40% of the world's poor people have no access to government health services, traditional and folk medicines are often the only therapeutics available to them. In this vein, North East (NE) India is recognized for its rich bioresources. As part of the Indo-Burma hotspot, it is regarded as an epicenter of biodiversity for several plants having myriad traditional uses, including medicinal use. However, the improvement of these valuable bioresources through molecular breeding strategies, for example, using genic microsatellites or Simple Sequence Repeats (SSRs) or Expressed Sequence Tags (ESTs)-derived SSRs has not been fully utilized in large scale to date. In this study, we identified a total of 47,700 microsatellites from 109,609 ESTs of 11 medicinal plants (pineapple, papaya, noyontara, bitter orange, bermuda brass, ratalu, barbados nut, mango, mulberry, lotus, and guduchi) having proven antidiabetic properties. A total of 58,159 primer pairs were designed for the non-redundant 8060 SSR-positive ESTs and putative functions were assigned to 4483 unique contigs. Among the identified microsatellites, excluding mononucleotide repeats, di-/trinucleotides are predominant, among which repeat motifs of AG/CT and AAG/CTT were most abundant. Similarity search of SSR containing ESTs and antidiabetic gene sequences revealed 11 microsatellites linked to antidiabetic genes in five plants. GO term enrichment analysis revealed a total of 80 enriched GO terms widely distributed in 53 biological processes, 17 molecular functions, and 10 cellular components associated with the 11 markers. The present study therefore provides concrete insights into the frequency and distribution of SSRs in important medicinal resources. The microsatellite markers reported here markedly add to

  17. Transcriptomic analysis of the venom gland of the red-headed krait (Bungarus flaviceps) using expressed sequence tags

    PubMed Central

    2010-01-01

    Background The Red-headed krait (Bungarus flaviceps, Squamata: Serpentes: Elapidae) is a medically important venomous snake that inhabits South-East Asia. Although the venoms of most species of the snake genus Bungarus have been well characterized, a detailed compositional analysis of B. flaviceps is currently lacking. Results Here, we have sequenced 845 expressed sequence tags (ESTs) from the venom gland of a B. flaviceps. Of the transcripts, 74.8% were putative toxins; 20.6% were cellular; and 4.6% were unknown. The main venom protein families identified were three-finger toxins (3FTxs), Kunitz-type serine protease inhibitors (including chain B of β-bungarotoxin), phospholipase A2 (including chain A of β-bungarotoxin), natriuretic peptide (NP), CRISPs, and C-type lectin. Conclusion The 3FTxs were found to be the major component of the venom (39%). We found eight groups of unique 3FTxs and most of them were different from the well-characterized 3FTxs. We found three groups of Kunitz-type serine protease inhibitors (SPIs); one group was comparable to the classical SPIs and the other two groups to chain B of β-bungarotoxins (with or without the extra cysteine) based on sequence identity. The latter group may be functional equivalents of dendrotoxins in Bungarus venoms. The natriuretic peptide (NP) found is the first NP for any Asian elapid, and distantly related to Australian elapid NPs. Our study identifies several unique toxins in B. flaviceps venom, which may help in understanding the evolution of venom toxins and the pathophysiological symptoms induced after envenomation. PMID:20350308

  18. Differentially displayed expressed sequence tags in Melipona scutellaris (Hymenoptera, Apidae, Meliponini) development.

    PubMed

    Santana, Flávia A; Nunes, Francis M F; Vieira, Carlos U; Machado, Maria Alice M S; Kerr, Warwick E; Silva, Wilson A; Bonetti, Ana Maria

    2006-03-01

    We have compared gene expression, using the Differential Display Reverse Transcriptase-Polymerase Chain Reaction (DDRT-PCR) technique, by means of mRNA profile in Melipona scutellaris during ontogenetic postembryonic development, in adult worker, and in both Natural and Juvenile Hormone III-induced adult queen. Six, out of the nine ESTs described here, presented differentially expressed in the phases L1 or L2, or even in both of them, suggesting that key mechanisms to the development of Melipona scutellaris are regulated in these stages. The combination HT11G-AP05 revealed in L1 and L2 a product which matches to thioredoxin reductase protein domain in the Clostridium sporogenes, an important protein during cellular oxidoreduction processes. This study represents the first molecular evidence of differential gene expression profiles toward a description of the genetic developmental traits in the genus Melipona. PMID:16532208

  19. Generation and analysis of expressed sequence tags from the ciliate protozoan parasite Ichthyophthirius multifiliis

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The ciliate protozoan Ichthyophthirius multifiliis (Ich) is an important parasite of freshwater fish that causes 'white spot disease' leading to significant losses. A genomic resource for large-scale studies of this parasite has been lacking. To study gene expression involved in Ich pathogenesis and...

  20. Construction and characterization of subtractive stage-specific expressed sequence tag (EST) libraries of the pinewood nematode Bursaphelenchus xylophilus.

    PubMed

    Kang, Jae Soon; Lee, Hyoungseok; Moon, Il Sung; Lee, Yi; Koh, Young Ho; Je, Yeon Ho; Lim, Kook-Jin; Lee, Si Hyeock

    2009-07-01

    To establish expressed sequence tag databases of the two life stages (the dispersal and propagative stages) of pinewood nematode Bursaphelenchus xylophilus, subtractive EST libraries that were specific to the dispersal 4th larval stage (D4S) and the pine-grown propagative mixed (PGPS) stage were constructed by suppressed subtractive hybridization, and annotated by BLASTx and Gene Ontology (GO). A total of 1112 (57.7%) contigs from the D4S-cDNA library and 1215 (46.7%) contigs from the PGPS-specific cDNA libraries had matched BLASTx hits (E

  1. Not All Sequence Tags Are Created Equal: Designing and Validating Sequence Identification Tags Robust to Indels

    PubMed Central

    Faircloth, Brant C.; Glenn, Travis C.

    2012-01-01

    Ligating adapters with unique synthetic oligonucleotide sequences (sequence tags) onto individual DNA samples before massively parallel sequencing is a popular and efficient way to obtain sequence data from many individual samples. Tag sequences should be numerous and sufficiently different to ensure sequencing, replication, and oligonucleotide synthesis errors do not cause tags to be unrecoverable or confused. However, many design approaches only protect against substitution errors during sequencing and extant tag sets contain too few tag sequences. We developed an open-source software package to validate sequence tags for conformance to two distance metrics and design sequence tags robust to indel and substitution errors. We use this software package to evaluate several commercial and non-commercial sequence tag sets, design several large sets (maxcount = 7,198) of edit metric sequence tags having different lengths and degrees of error correction, and integrate a subset of these edit metric tags to polymerase chain reaction (PCR) primers and sequencing adapters. We validate a subset of these edit metric tagged PCR primers and sequencing adapters by sequencing on several platforms and subsequent comparison to commercially available alternatives. We find that several commonly used sets of sequence tags or design methodologies used to produce sequence tags do not meet the minimum expectations of their underlying distance metric, and we find that PCR primers and sequencing adapters incorporating edit metric sequence tags designed by our software package perform as well as their commercial counterparts. We suggest that researchers evaluate sequence tags prior to use or evaluate tags that they have been using. The sequence tag sets we design improve on extant sets because they are large, valid across the set, and robust to the suite of substitution, insertion, and deletion errors affecting massively parallel sequencing workflows on all currently used platforms

  2. Identification of key genes involved in root development of tomato using expressed sequence tag analysis.

    PubMed

    Kalidhasan, N; Joshi, Deepti; Bhatt, Tarun Kumar; Gupta, Aditya Kumar

    2015-10-01

    Root system of plants are actually fascinating structures, not only critical for plant development, but also important for storage and conduction. Due to its agronomic importance, identification of genes involved in root development has been a subject of intense study. Tomato is the one of the most consumed vegetables in the world. Tomato has been used as model system for dicot plants because of its small genome, well-established transformation techniques and well-constructed physical map. The present study is targeted to identify of root specific genes expressed temporally and also gene(s) involved in lateral root and profuse root development. A total of 890 ESTs were identified from five EST libraries constructed using SSH approach which included temporal gene regulation (early and late) and genes involved in morphogenetic traits (lateral and profuse rooting). One hundred sixty-one unique ESTs identified from various libraries were categorized based on their putative functions and deposited in NCBI-dbEST database. In addition, 36 ESTs were selected for validation of their expression by RT-PCR. The present findings will help in shedding light to the unexplored developmental process of root growth in tomato and plant in general. PMID:26600676

  3. Mining expressed sequence tags of rapeseed (Brassica napus L.) to predict the drought responsive regulatory network.

    PubMed

    Shamloo-Dashtpagerdi, Roohollah; Razi, Hooman; Ebrahimie, Esmaeil

    2015-07-01

    It is of great significance to understand the regulatory mechanisms by which plants deal with drought stress. Two EST libraries derived from rapeseed (Brassica napus) leaves in non-stressed and drought stress conditions were analyzed in order to obtain the transcriptomic landscape of drought-exposed B. napus plants, and also to identify and characterize significant drought responsive regulatory genes and microRNAs. The functional ontology analysis revealed a substantial shift in the B. napus transcriptome to govern cellular drought responsiveness via different stress-activated mechanisms. The activity of transcription factor and protein kinase modules generally increased in response to drought stress. The 26 regulatory genes consisting of 17 transcription factor genes, eight protein kinase genes and one protein phosphatase gene were identified showing significant alterations in their expressions in response to drought stress. We also found the six microRNAs which were differentially expressed during drought stress supporting the involvement of a post-transcriptional level of regulation for B. napus drought response. The drought responsive regulatory network shed light on the significance of some regulatory components involved in biosynthesis and signaling of various plant hormones (abscisic acid, auxin and brassinosteroids), ubiquitin proteasome system, and signaling through Reactive Oxygen Species (ROS). Our findings suggested a complex and multi-level regulatory system modulating response to drought stress in B. napus. PMID:26261397

  4. A new view of insect-crustacean relationships II. Inferences from expressed sequence tags and comparisons with neural cladistics.

    PubMed

    Andrew, David R

    2011-05-01

    The enormous diversity of Arthropoda has complicated attempts by systematists to deduce the history of this group in terms of phylogenetic relationships and phenotypic change. Traditional hypotheses regarding the relationships of the major arthropod groups (Chelicerata, Myriapoda, Crustacea, and Hexapoda) focus on suites of morphological characters, whereas phylogenomics relies on large amounts of molecular sequence data to infer evolutionary relationships. The present discussion is based on expressed sequence tags (ESTs) that provide large numbers of short molecular sequences and so provide an abundant source of sequence data for phylogenetic inference. This study presents well-supported phylogenies of diverse arthropod and metazoan outgroup taxa obtained from publicly-available databases. An in-house bioinformatics pipeline has been used to compile and align conserved orthologs from each taxon for maximum likelihood inferences. This approach resolves many currently accepted hypotheses regarding internal relationships between the major groups of Arthropoda, including monophyletic Hexapoda, Tetraconata (Crustacea + Hexapoda), Myriapoda, and Chelicerata sensu lato (Pycnogonida + Euchelicerata). "Crustacea" is a paraphyletic group with some taxa more closely related to the monophyletic Hexapoda. These results support studies that have utilized more restricted EST data for phylogenetic inference, yet they differ in important regards from recently published phylogenies employing nuclear protein-coding sequences. The present results do not, however, depart from other phylogenies that resolve Branchiopoda as the crustacean sister group of Hexapoda. Like other molecular phylogenies, EST-derived phylogenies alone are unable to resolve morphological convergences or evolved reversals and thus omit what may be crucial events in the history of life. For example, molecular data are unable to resolve whether a Hexapod-Branchiopod sister relationship infers a branchiopod

  5. Construction of cDNA library and preliminary analysis of expressed sequence tags from Siberian tiger

    PubMed Central

    Liu, Chang-Qing; Lu, Tao-Feng; Feng, Bao-Gang; Liu, Dan; Guan, Wei-Jun; Ma, Yue-Hui

    2010-01-01

    In this study we successfully constructed a full-length cDNA library from Siberian tiger, Panthera tigris altaica, the most well-known wild Animal. Total RNA was extracted from cultured Siberian tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.30×106 pfu/ml and 1.62×109 pfu/ml respectively. The proportion of recombinants from unamplified library was 90.5% and average length of exogenous inserts was 1.13 kb. A total of 282 individual ESTs with sizes ranging from 328 to 1,142bps were then analyzed the BLASTX score revealed that 53.9% of the sequences were classified as strong match, 38.6% as nominal and 7.4% as weak match. 28.0% of them were found to be related to enzyme/catalytic protein, 20.9% ESTs to metabolism, 13.1% ESTs to transport, 12.1% ESTs to signal transducer/cell communication, 9.9% ESTs to structure protein, 3.9% ESTs to immunity protein/defense metabolism, 3.2% ESTs to cell cycle, and 8.9 ESTs classified as novel genes. These results demonstrated that the reliability and representativeness of the cDNA library attained to the requirements of a standard cDNA library. This library provided a useful platform for the functional genomic research of Siberian tigers. PMID:20941376

  6. Computational identification of microRNAs and their targets in Catharanthus roseus expressed sequence tags

    PubMed Central

    Pani, Alok; Mahapatra, Rajani Kanta

    2013-01-01

    No study has been performed on identifying microRNAs (miRNAs) and their targets in the medicinal plant, Catharanthus roseus. In the present study, using the comparative genomics approach, we have predicted two potential C. roseus miRNAs. Furthermore, twelve potential mRNA targets were identified in C. roseus genome based on the characteristics that miRNAs exhibit perfect or nearly perfect complementarity with their targeted mRNA sequences. Among them many of the targets were predicted to encode enzymes that regulate the biosynthesis of terpenoid indole alkaloids (TIA). In addition, most of the predicted targets were the gene coding for transcription factors which are mainly involved in cell growth and development, signaling and metabolism. This is the first in silico study to indicate that miRNA target gene encoding enzymes involved in vinblastine and vincristine biosynthesis, which may help to understand the miRNA-mediated regulation of TIA alkaloid biosynthesis in C. roseus. PMID:26484050

  7. Computational identification of microRNAs and their targets in Catharanthus roseus expressed sequence tags.

    PubMed

    Pani, Alok; Mahapatra, Rajani Kanta

    2013-12-01

    No study has been performed on identifying microRNAs (miRNAs) and their targets in the medicinal plant, Catharanthus roseus. In the present study, using the comparative genomics approach, we have predicted two potential C. roseus miRNAs. Furthermore, twelve potential mRNA targets were identified in C. roseus genome based on the characteristics that miRNAs exhibit perfect or nearly perfect complementarity with their targeted mRNA sequences. Among them many of the targets were predicted to encode enzymes that regulate the biosynthesis of terpenoid indole alkaloids (TIA). In addition, most of the predicted targets were the gene coding for transcription factors which are mainly involved in cell growth and development, signaling and metabolism. This is the first in silico study to indicate that miRNA target gene encoding enzymes involved in vinblastine and vincristine biosynthesis, which may help to understand the miRNA-mediated regulation of TIA alkaloid biosynthesis in C. roseus. PMID:26484050

  8. Exploring the host parasitism of the migratory plant-parasitic nematode Ditylenchus destuctor by expressed sequence tags analysis.

    PubMed

    Peng, Huan; Gao, Bing-li; Kong, Ling-an; Yu, Qing; Huang, Wen-kun; He, Xu-feng; Long, Hai-bo; Peng, De-liang

    2013-01-01

    The potato rot nematode, Ditylenchus destructor, is a very destructive nematode pest on many agriculturally important crops worldwide, but the molecular characterization of its parasitism of plant has been limited. The effectors involved in nematode parasitism of plant for several sedentary endo-parasitic nematodes such as Heterodera glycines, Globodera rostochiensis and Meloidogyne incognita have been identified and extensively studied over the past two decades. Ditylenchus destructor, as a migratory plant parasitic nematode, has different feeding behavior, life cycle and host response. Comparing the transcriptome and parasitome among different types of plant-parasitic nematodes is the way to understand more fully the parasitic mechanism of plant nematodes. We undertook the approach of sequencing expressed sequence tags (ESTs) derived from a mixed stage cDNA library of D. destructor. This is the first study of D. destructor ESTs. A total of 9800 ESTs were grouped into 5008 clusters including 3606 singletons and 1402 multi-member contigs, representing a catalog of D. destructor genes. Implementing a bioinformatics' workflow, we found 1391 clusters have no match in the available gene database; 31 clusters only have similarities to genes identified from D. africanus, the most closely related species to D. destructor; 1991 clusters were annotated using Gene Ontology (GO); 1550 clusters were assigned enzyme commission (EC) numbers; and 1211 clusters were mapped to 181 KEGG biochemical pathways. 22 ESTs had similarities to reported nematode effectors. Interestedly, most of the effectors identified in this study are involved in host cell wall degradation or modification, such as 1,4-beta-glucanse, 1,3-beta-glucanse, pectate lyase, chitinases and expansin, or host defense suppression such as calreticulin, annexin and venom allergen-like protein. This result implies that the migratory plant-parasitic nematode D. destructor secrets similar effectors to those of sedentary

  9. Exploring the Host Parasitism of the Migratory Plant-Parasitic Nematode Ditylenchus destuctor by Expressed Sequence Tags Analysis

    PubMed Central

    Peng, Huan; Gao, Bing-li; Kong, Ling-an; Yu, Qing; Huang, Wen-kun; He, Xu-feng; Long, Hai-bo; Peng, De-liang

    2013-01-01

    The potato rot nematode, Ditylenchus destructor, is a very destructive nematode pest on many agriculturally important crops worldwide, but the molecular characterization of its parasitism of plant has been limited. The effectors involved in nematode parasitism of plant for several sedentary endo-parasitic nematodes such as Heterodera glycines, Globodera rostochiensis and Meloidogyne incognita have been identified and extensively studied over the past two decades. Ditylenchus destructor, as a migratory plant parasitic nematode, has different feeding behavior, life cycle and host response. Comparing the transcriptome and parasitome among different types of plant-parasitic nematodes is the way to understand more fully the parasitic mechanism of plant nematodes. We undertook the approach of sequencing expressed sequence tags (ESTs) derived from a mixed stage cDNA library of D. destructor. This is the first study of D. destructor ESTs. A total of 9800 ESTs were grouped into 5008 clusters including 3606 singletons and 1402 multi-member contigs, representing a catalog of D. destructor genes. Implementing a bioinformatics' workflow, we found 1391 clusters have no match in the available gene database; 31 clusters only have similarities to genes identified from D. africanus, the most closely related species to D. destructor; 1991 clusters were annotated using Gene Ontology (GO); 1550 clusters were assigned enzyme commission (EC) numbers; and 1211 clusters were mapped to 181 KEGG biochemical pathways. 22 ESTs had similarities to reported nematode effectors. Interestedly, most of the effectors identified in this study are involved in host cell wall degradation or modification, such as 1,4-beta-glucanse, 1,3-beta-glucanse, pectate lyase, chitinases and expansin, or host defense suppression such as calreticulin, annexin and venom allergen-like protein. This result implies that the migratory plant-parasitic nematode D. destructor secrets similar effectors to those of sedentary

  10. Chasing migration genes: a brain expressed sequence tag resource for summer and migratory monarch butterflies (Danaus plexippus).

    PubMed

    Zhu, Haisun; Casselman, Amy; Reppert, Steven M

    2008-01-01

    North American monarch butterflies (Danaus plexippus) undergo a spectacular fall migration. In contrast to summer butterflies, migrants are juvenile hormone (JH) deficient, which leads to reproductive diapause and increased longevity. Migrants also utilize time-compensated sun compass orientation to help them navigate to their overwintering grounds. Here, we describe a brain expressed sequence tag (EST) resource to identify genes involved in migratory behaviors. A brain EST library was constructed from summer and migrating butterflies. Of 9,484 unique sequences, 6068 had positive hits with the non-redundant protein database; the EST database likely represents approximately 52% of the gene-encoding potential of the monarch genome. The brain transcriptome was cataloged using Gene Ontology and compared to Drosophila. Monarch genes were well represented, including those implicated in behavior. Three genes involved in increased JH activity (allatotropin, juvenile hormone acid methyltransfersase, and takeout) were upregulated in summer butterflies, compared to migrants. The locomotion-relevant turtle gene was marginally upregulated in migrants, while the foraging and single-minded genes were not differentially regulated. Many of the genes important for the monarch circadian clock mechanism (involved in sun compass orientation) were in the EST resource, including the newly identified cryptochrome 2. The EST database also revealed a novel Na+/K+ ATPase allele predicted to be more resistant to the toxic effects of milkweed than that reported previously. Potential genetic markers were identified from 3,486 EST contigs and included 1599 double-hit single nucleotide polymorphisms (SNPs) and 98 microsatellite polymorphisms. These data provide a template of the brain transcriptome for the monarch butterfly. Our "snap-shot" analysis of the differential regulation of candidate genes between summer and migratory butterflies suggests that unbiased, comprehensive transcriptional

  11. Micro- and minisatellite-expressed sequence tag (EST) markers discriminate between populations of Rhipicephalus appendiculatus.

    PubMed

    Kanduma, Esther G; Mwacharo, Joram M; Sunter, Jack D; Nzuki, Inosters; Mwaura, Stephen; Kinyanjui, Peter W; Kibe, Michael; Heyne, Heloise; Hanotte, Olivier; Skilton, Robert A; Bishop, Richard P

    2012-06-01

    Biological differences, including vector competence for the protozoan parasite Theileria parva have been reported among populations of Rhipicephalus appendiculatus (Acari: Ixodidae) from different geographic regions. However, the genetic diversity and population structure of this important tick vector remain unknown due to the absence of appropriate genetic markers. Here, we describe the development and evaluation of a panel of EST micro- and minisatellite markers to characterize the genetic diversity within and between populations of R. appendiculatus and other rhipicephaline species. Sixty-six micro- and minisatellite markers were identified through analysis of the R. appendiculatus Gene Index (RaGI) EST database and selected bacterial artificial chromosome (BAC) sequences. These were used to genotype 979 individual ticks from 10 field populations, 10 laboratory-bred stocks, and 5 additional Rhipicephalus species. Twenty-nine markers were polymorphic and therefore informative for genetic studies while 6 were monomorphic. Primers designed from the remaining 31 loci did not reliably generate amplicons. The 29 polymorphic markers discriminated populations of R. appendiculatus and also 4 other Rhipicephalus species, but not R. zambeziensis. The percentage Principal Component Analysis (PCA) implemented using Multiple Co-inertia Analysis (MCoA) clustered populations of R. appendiculatus into 2 groups. Individual markers however differed in their ability to generate the reference typology using the MCoA approach. This indicates that different panels of markers may be required for different applications. The 29 informative polymorphic micro- and minisatellite markers are the first available tools for the analysis of the phylogeography and population genetics of R. appendiculatus. PMID:22789728

  12. Analysis of expressed sequence tags from Uromyces appendiculatus hyphae and haustoria and their comparison to sequences from other rust fungi

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Two separate cDNA libraries were prepared for RNA extracted from bean rust (Uromyces appendiculatus) hyphae and haustoria isolated from infected leaves bean leaves (Phaseolus vulgaris cv Pint 111) between 2 and 8 dpi. Approximately 13,000 clones were sequenced from both ends and the sequences assem...

  13. Expressed sequence tags from the red imported fire ant, Solenopsis invicta: Annotation and utilization for discovery of viruses

    Technology Transfer Automated Retrieval System (TEKTRAN)

    An expression library was created and 2,300 clones sequenced from a monogyne colony of Solenopsis invicta with the primary intention of discovering viruses infecting this ant pest. After assembly and removal of mitochondrial and poor quality sequences, 1,054 unique sequences were yielded and deposi...

  14. Diversity analysis in Cannabis sativa based on large-scale development of expressed sequence tag-derived simple sequence repeat markers.

    PubMed

    Gao, Chunsheng; Xin, Pengfei; Cheng, Chaohua; Tang, Qing; Chen, Ping; Wang, Changbiao; Zang, Gonggu; Zhao, Lining

    2014-01-01

    Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SSRs were identified from 3,624 ESTs. From there, a total of 3,442 complementary primer pairs were designed as SSR markers. Among these markers, trinucleotide repeat motifs (50.99%) were the most abundant, followed by hexanucleotide (25.13%), dinucleotide (16.34%), tetranucloetide (3.8%), and pentanucleotide (3.74%) repeat motifs, respectively. The AAG/CTT trinucleotide repeat (17.96%) was the most abundant motif detected in the SSRs. One hundred and seventeen EST-SSR markers were randomly selected to evaluate primer quality in 24 cannabis varieties. Among these 117 markers, 108 (92.31%) were successfully amplified and 87 (74.36%) were polymorphic. Forty-five polymorphic primer pairs were selected to evaluate genetic diversity and relatedness among the 115 cannabis genotypes. The results showed that 115 varieties could be divided into 4 groups primarily based on geography: Northern China, Europe, Central China, and Southern China. Moreover, the coefficient of similarity when comparing cannabis from Northern China with the European group cannabis was higher than that when comparing with cannabis from the other two groups, owing to a similar climate. This study outlines the first large-scale development of SSR markers for cannabis. These data may serve as a foundation for the development of genetic linkage, quantitative trait loci mapping, and marker-assisted breeding of cannabis. PMID:25329551

  15. Diversity Analysis in Cannabis sativa Based on Large-Scale Development of Expressed Sequence Tag-Derived Simple Sequence Repeat Markers

    PubMed Central

    Cheng, Chaohua; Tang, Qing; Chen, Ping; Wang, Changbiao; Zang, Gonggu; Zhao, Lining

    2014-01-01

    Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SSRs were identified from 3,624 ESTs. From there, a total of 3,442 complementary primer pairs were designed as SSR markers. Among these markers, trinucleotide repeat motifs (50.99%) were the most abundant, followed by hexanucleotide (25.13%), dinucleotide (16.34%), tetranucloetide (3.8%), and pentanucleotide (3.74%) repeat motifs, respectively. The AAG/CTT trinucleotide repeat (17.96%) was the most abundant motif detected in the SSRs. One hundred and seventeen EST-SSR markers were randomly selected to evaluate primer quality in 24 cannabis varieties. Among these 117 markers, 108 (92.31%) were successfully amplified and 87 (74.36%) were polymorphic. Forty-five polymorphic primer pairs were selected to evaluate genetic diversity and relatedness among the 115 cannabis genotypes. The results showed that 115 varieties could be divided into 4 groups primarily based on geography: Northern China, Europe, Central China, and Southern China. Moreover, the coefficient of similarity when comparing cannabis from Northern China with the European group cannabis was higher than that when comparing with cannabis from the other two groups, owing to a similar climate. This study outlines the first large-scale development of SSR markers for cannabis. These data may serve as a foundation for the development of genetic linkage, quantitative trait loci mapping, and marker-assisted breeding of cannabis. PMID:25329551

  16. Generation and Analysis of Expressed Sequence Tags from Chimonanthus praecox (Wintersweet) Flowers for Discovering Stress-Responsive and Floral Development-Related Genes

    PubMed Central

    Sui, Shunzhao; Luo, Jianghui; Ma, Jing; Zhu, Qinlong; Lei, Xinghua; Li, Mingyang

    2012-01-01

    A complementary DNA library was constructed from the flowers of Chimonanthus praecox, an ornamental perennial shrub blossoming in winter in China. Eight hundred sixty-seven high-quality expressed sequence tag sequences with an average read length of 673.8 bp were acquired. A nonredundant set of 479 unigenes, including 94 contigs and 385 singletons, was identified after the expressed sequence tags were clustered and assembled. BLAST analysis against the nonredundant protein database and nonredundant nucleotide database revealed that 405 unigenes shared significant homology with known genes. The homologous unigenes were categorized according to Gene Ontology hierarchies (biological, cellular, and molecular). By BLAST analysis and Gene Ontology annotation, 95 unigenes involved in stress and defense and 19 unigenes related to floral development were identified based on existing knowledge. Twelve genes, of which 9 were annotated as “cold response,” were examined by real-time RT-PCR to understand the changes in expression patterns under cold stress and to validate the findings. Fourteen genes, including 11 genes related to floral development, were also detected by real-time RT-PCR to validate the expression patterns in the blooming process and in different tissues. This study provides a useful basis for the genomic analysis of C. praecox. PMID:22536115

  17. Expressed sequence-tag analysis of ovaries of Brachiaria brizantha reveals genes associated with the early steps of embryo sac differentiation of apomictic plants.

    PubMed

    Silveira, Erica Duarte; Guimarães, Larissa Arrais; de Alencar Dusi, Diva Maria; da Silva, Felipe Rodrigues; Martins, Natália Florencio; do Carmo Costa, Marcos Mota; Alves-Ferreira, Márcio; de Campos Carneiro, Vera Tavares

    2012-02-01

    In apomixis, asexual mode of plant reproduction through seeds, an unreduced megagametophyte is formed due to circumvented or altered meiosis. The embryo develops autonomously from the unreduced egg cell, independently of fertilization. Brachiaria is a genus of tropical forage grasses that reproduces sexually or by apomixis. A limited number of studies have reported the sequencing of apomixis-related genes and a few Brachiaria sequences have been deposited at genebank databases. This work shows sequencing and expression analyses of expressed sequence-tags (ESTs) of Brachiaria genus and points to transcripts from ovaries with preferential expression at megasporogenesis in apomictic plants. From the 11 differentially expressed sequences from immature ovaries of sexual and apomictic Brachiaria brizantha obtained from macroarray analysis, 9 were preferentially detected in ovaries of apomicts, as confirmed by RT-qPCR. A putative involvement in early steps of Panicum-type embryo sac differentiation of four sequences from B. brizantha ovaries: BbrizHelic, BbrizRan, BbrizSec13 and BbrizSti1 is suggested. Two of these, BbrizSti1 and BbrizHelic, with similarity to a gene coding to stress induced protein and a helicase, respectively, are preferentially expressed in the early stages of apomictic ovaries development, especially in the nucellus, in a stage previous to the differentiation of aposporous initials, as verified by in situ hybridization. PMID:22068439

  18. Development and Characterization of 1,827 Expressed Sequence Tag-Derived Simple Sequence Repeat Markers for Ramie (Boehmeria nivea L. Gaud)

    PubMed Central

    Liu, Touming; Zhu, Siyuan; Fu, Lili; Tang, Qingming; Yu, Yongting; Chen, Ping; Luan, Mingbao; Wang, Changbiao; Tang, Shouwei

    2013-01-01

    Ramie (Boehmeria nivea L. Gaud) is one of the most important natural fiber crops, and improvement of fiber yield and quality is the main goal in efforts to breed superior cultivars. However, efforts aimed at enhancing the understanding of ramie genetics and developing more effective breeding strategies have been hampered by the shortage of simple sequence repeat (SSR) markers. In our previous study, we had assembled de novo 43,990 expressed sequence tags (ESTs). In the present study, we searched these previously assembled ESTs for SSRs and identified 1,685 ESTs (3.83%) containing 1,878 SSRs. Next, we designed 1,827 primer pairs complementary to regions flanking these SSRs, and these regions were designated as SSR markers. Among these markers, dinucleotide and trinucleotide repeat motifs were the most abundant types (36.4% and 36.3%, respectively), whereas tetranucleotide, pentanucleotide, and hexanucleotide motifs represented <10% of the markers. The motif AG/CT was the most abundant, accounting for 28.74% of the markers. One hundred EST-SSR markers (97 SSRs located in genes encoding transcription factors and 3 SSRs in genes encoding cellulose synthases) were amplified using polymerase chain reaction for detecting 24 ramie varieties. Of these 100 markers, 98 markers were successfully amplified and 81 markers were polymorphic, with 2–6 alleles among the 24 varieties. Analysis of the genetic diversity of all 24 varieties revealed similarity coefficients that ranged from 0.51 to 0.80. The EST-SSRs developed in this study represent the first large-scale development of SSR markers for ramie. These SSR markers could be used for development of genetic and physical maps, quantitative trait loci mapping, genetic diversity studies, association mapping, and cultivar fingerprinting. PMID:23565230

  19. Identification of genes expressed in human CD34+ hematopoietic stem/progenitor cells by expressed sequence tags and efficient full-length cDNA cloning

    PubMed Central

    Mao, Mao; Fu, Gang; Wu, Ji-Sheng; Zhang, Qing-Hua; Zhou, Jun; Kan, Li-Xin; Huang, Qiu-Hua; He, Kai-Li; Gu, Bai-Wei; Han, Ze-Guang; Shen, Yu; Gu, Jian; Yu, Ya-Ping; Xu, Shu-Hua; Wang, Ya-Xin; Chen, Sai-Juan; Chen, Zhu

    1998-01-01

    Hematopoietic stem/progenitor cells (HSPCs) possess the potentials of self-renewal, proliferation, and differentiation toward different lineages of blood cells. These cells not only play a primordial role in hematopoietic development but also have important clinical application. Characterization of the gene expression profile in CD34+ HSPCs may lead to a better understanding of the regulation of normal and pathological hematopoiesis. In the present work, genes expressed in human umbilical cord blood CD34+ cells were catalogued by partially sequencing a large amount of cDNA clones [or expressed sequence tags (ESTs)] and analyzing these sequences with the tools of bioinformatics. Among 9,866 ESTs thus obtained, 4,697 (47.6%) showed identity to known genes in the GenBank database, 2,603 (26.4%) matched to the ESTs previously deposited in a public domain database, 1,415 (14.3%) were previously undescribed ESTs, and the remaining 1,151 (11.7%) were mitochondrial DNA, ribosomal RNA, or repetitive (Alu or L1) sequences. Integration of ESTs of known genes generated a profile including 855 genes that could be divided into different categories according to their functions. Some (8.2%) of the genes in this profile were considered related to early hematopoiesis. The possible function of ESTs corresponding to so far unknown genes were approached by means of homology and functional motif searches. Moreover, attempts were made to generate libraries enriched for full-length cDNAs, to better explore the genes in HSPCs. Nearly 60% of the cDNA clones of mRNA under 2 kb in our libraries had 5′ ends upstream of the first ATG codon of the ORF. With this satisfactory result, we have developed an efficient working system that allowed fast sequencing of 32 full-length cDNAs, 16 of them being mapped to the chromosomes with radiation hybrid panels. This work may lay a basis for the further research on the molecular network of hematopoietic regulation. PMID:9653160

  20. Computational exploration of microRNAs from expressed sequence tags of Humulus lupulus, target predictions and expression analysis.

    PubMed

    Mishra, Ajay Kumar; Duraisamy, Ganesh Selvaraj; Týcová, Anna; Matoušek, Jaroslav

    2015-12-01

    Among computationally predicted and experimentally validated plant miRNAs, several are conserved across species boundaries in the plant kingdom. In this study, a combined experimental-in silico computational based approach was adopted for the identification and characterization of miRNAs in Humulus lupulus (hop), which is widely cultivated for use by the brewing industry and apart from, used as a medicinal herb. A total of 22 miRNAs belonging to 17 miRNA families were identified in hop following comparative computational approach and EST-based homology search according to a series of filtering criteria. Selected miRNAs were validated by end-point PCR and quantitative reverse transcription-polymerase chain reaction (qRT-PCR), confirmed the existence of conserved miRNAs in hop. Based on the characteristic that miRNAs exhibit perfect or nearly perfect complementarity with their targeted mRNA sequences, a total of 47 potential miRNA targets were identified in hop. Strikingly, the majority of predicted targets were belong to transcriptional factors which could regulate hop growth and development, including leaf, root and even cone development. Moreover, the identified miRNAs may also be involved in other cellular and metabolic processes, such as stress response, signal transduction, and other physiological processes. The cis-regulatory elements relevant to biotic and abiotic stress, plant hormone response, flavonoid biosynthesis were identified in the promoter regions of those miRNA genes. Overall, findings from this study will accelerate the way for further researches of miRNAs, their functions in hop and shows a path for the prediction and analysis of miRNAs to those species whose genomes are not available. PMID:26476128

  1. Mining an Ostrinia nubilalis Midgut Expressed Sequence Tag (EST) Library for Candidate Genes and Single Nucleotide Polymorphisms (SNPs)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    European corn borer, Ostrinia nubilalis, larvae feed upon many plant hosts and are a major target for genetically-engineered corn expressing Bacillus thuringiensis (Bt) toxins. DNA sequencing of a non-normalized O. nubilalis larval midgut cDNA library (ARS-CICGRU ONmgEST) identified 535 unique sequ...

  2. A molecular analysis of desiccation tolerance mechanisms in the anhydrobiotic nematode Panagrolaimus superbus using expressed sequenced tags

    PubMed Central

    2012-01-01

    Background Some organisms can survive extreme desiccation by entering into a state of suspended animation known as anhydrobiosis. Panagrolaimus superbus is a free-living anhydrobiotic nematode that can survive rapid environmental desiccation. The mechanisms that P. superbus uses to combat the potentially lethal effects of cellular dehydration may include the constitutive and inducible expression of protective molecules, along with behavioural and/or morphological adaptations that slow the rate of cellular water loss. In addition, inducible repair and revival programmes may also be required for successful rehydration and recovery from anhydrobiosis. Results To identify constitutively expressed candidate anhydrobiotic genes we obtained 9,216 ESTs from an unstressed mixed stage population of P. superbus. We derived 4,009 unigenes from these ESTs. These unigene annotations and sequences can be accessed at http://www.nematodes.org/nembase4/species_info.php?species=PSC. We manually annotated a set of 187 constitutively expressed candidate anhydrobiotic genes from P. superbus. Notable among those is a putative lineage expansion of the lea (late embryogenesis abundant) gene family. The most abundantly expressed sequence was a member of the nematode specific sxp/ral-2 family that is highly expressed in parasitic nematodes and secreted onto the surface of the nematodes' cuticles. There were 2,059 novel unigenes (51.7% of the total), 149 of which are predicted to encode intrinsically disordered proteins lacking a fixed tertiary structure. One unigene may encode an exo-β-1,3-glucanase (GHF5 family), most similar to a sequence from Phytophthora infestans. GHF5 enzymes have been reported from several species of plant parasitic nematodes, with horizontal gene transfer (HGT) from bacteria proposed to explain their evolutionary origin. This P. superbus sequence represents another possible HGT event within the Nematoda. The expression of five of the 19 putative stress response

  3. Annotated Expressed Sequence Tags (ESTs) from pre-smolt Atlantic salmon (Salmo salar) in a searchable data resource

    PubMed Central

    Adzhubei, Alexei A; Vlasova, Anna V; Hagen-Larsen, Heidi; Ruden, Torgeir A; Laerdahl, Jon K; Høyheim, Bjørn

    2007-01-01

    Background To identify as many different transcripts/genes in the Atlantic salmon genome as possible, it is crucial to acquire good cDNA libraries from different tissues and developmental stages, their relevant sequences (ESTs or full length sequences) and attempt to predict function. Such libraries allow identification of a large number of different transcripts and can provide valuable information on genes expressed in a particular tissue at a specific developmental stage. This data is important in constructing a microarray chip, identifying SNPs in coding regions, and for future identification of genes in the whole genome sequence. An important factor that determines the usefulness of generated data for biologists is efficient data access. Public searchable databases play a crucial role in providing such service. Description Twenty-three Atlantic salmon cDNA libraries were constructed from 15 tissues, yielding nearly 155,000 clones. From these libraries 58,109 ESTs were generated, of which 57,212 were used for contig assembly. Following deletion of mitochondrial sequences 55,118 EST sequences were submitted to GenBank. In all, 20,019 unique sequences, consisting of 6,424 contigs and 13,595 singlets, were generated. The Norwegian Salmon Genome Project Database has been constructed and annotation performed by the annotation transfer approach. Annotation was successful for 50.3% (10,075) of the sequences and 6,113 sequences (30.5%) were annotated with Gene Ontology terms for molecular function, biological process and cellular component. Conclusion We describe the construction of cDNA libraries from juvenile/pre-smolt Atlantic salmon (Salmo salar), EST sequencing, clustering, and annotation by assigning putative function to the transcripts. These sequences represents 97% of all sequences submitted to GenBank from the pre-smoltification stage. The data has been grouped into datasets according to its source and type of annotation. Various data query options are offered

  4. Existence of microsatellites in expressed sequence tags of common carp ( Cyprinus carpio L.) available in GenBank dbEST database

    NASA Astrophysics Data System (ADS)

    Jingjie, Hu; Xiaolong, Wang; Xiaoli, Hu; Zhenmin, Bao

    2006-01-01

    Common carp expressed sequence tags (ESTs) were analyzed for the existence of microsatellites, or simple sequence repeats (SSRs). In the NCBI dbEST database, a total of 10612 sequences were registered before December 31, 2004. A complete search of 2-6 nucleotide microsatellites resulted in the identification of 513 SSR-containing ESTs, accounting for 4.8% of the total. Cluster analysis indicated that 73 sequences of SSR-containing ESTs fell into 27 groups and the remaining 440 ESTs were indenpendent. A total of 467 unique SSR-containing ESTs were identified. These EST-SSRs contained a variety of simple sequence types, and di- and tri-nucleotide repeats were the most abundant, accounting for 42.1% and 27.9% of the whole, respectively. Of the dinucleotide repeats, CA/TG was the most abundant, followed by GA/TC. BLASTx search showed that 38.1% of the SSR loci could be associated with genes or proteins of known or unknown function. BLASTx searches of SSR-containing ESTs also showed high frequencies (98/179) of hits on zebrafish sequences.

  5. Existence of microsatellites in expressed sequence tags of common carp ( Cyprinus carpio L.) available in GenBank dbEST database

    NASA Astrophysics Data System (ADS)

    Hu, Jingjie; Wang, Xiaolong; Hu, Xiaoli; Bao, Zhenmin

    2006-01-01

    Common carp expressed sequence tags (ESTs) were analyzed for the existence of microsatellites, or simple sequence repeats (SSRs). In the NCBI dbEST database, a total of 10612 sequences were registered before December 31, 2004. A complete search of 2 6 nucleotide microsatellites resulted in the identification of 513 SSR-containing ESTs, accounting for 4.8% of the total. Cluster analysis indicated that 73 sequences of SSR-containing ESTs fell into 27 groups and the remaining 440 ESTs were indenpendent. A total of 467 unique SSR-containing ESTs were identified. These EST-SSRs contained a variety of simple sequence types, and di- and tri-nucleotide repeats were the most abundant, accounting for 42.1% and 27.9% of the whole, respectively. Of the dinucleotide repeats, CA/TG was the most abundant, followed by GA/TC. BLASTx search showed that 38.1% of the SSR loci could be associated with genes or proteins of known or unknown function. BLASTx searches of SSR-containing ESTs also showed high frequencies (98/179) of hits on zebrafish sequences.

  6. Generation and Analysis of Expressed Sequence Tags (ESTs) from Halophyte Atriplex canescens to Explore Salt-Responsive Related Genes

    PubMed Central

    Li, Jingtao; Sun, Xinhua; Yu, Gang; Jia, Chengguo; Liu, Jinliang; Pan, Hongyu

    2014-01-01

    Little information is available on gene expression profiling of halophyte A. canescens. To elucidate the molecular mechanism for stress tolerance in A. canescens, a full-length complementary DNA library was generated from A. canescens exposed to 400 mM NaCl, and provided 343 high-quality ESTs. In an evaluation of 343 valid EST sequences in the cDNA library, 197 unigenes were assembled, among which 190 unigenes (83.1% ESTs) were identified according to their significant similarities with proteins of known functions. All the 343 EST sequences have been deposited in the dbEST GenBank under accession numbers JZ535802 to JZ536144. According to Arabidopsis MIPS functional category and GO classifications, we identified 193 unigenes of the 311 annotations EST, representing 72 non-redundant unigenes sharing similarities with genes related to the defense response. The sets of ESTs obtained provide a rich genetic resource and 17 up-regulated genes related to salt stress resistance were identified by qRT-PCR. Six of these genes may contribute crucially to earlier and later stage salt stress resistance. Additionally, among the 343 unigenes sequences, 22 simple sequence repeats (SSRs) were also identified contributing to the study of A. canescens resources. PMID:24960361

  7. Identification and isolation of full-length cDNA sequences by sequencing and analysis of expressed sequence tags from guarana (Paullinia cupana).

    PubMed

    Figueirêdo, L C; Faria-Campos, A C; Astolfi-Filho, S; Azevedo, J L

    2011-01-01

    The current intense production of biological data, generated by sequencing techniques, has created an ever-growing volume of unanalyzed data. We reevaluated data produced by the guarana (Paullinia cupana) transcriptome sequencing project to identify cDNA clones with complete coding sequences (full-length clones) and complete sequences of genes of biotechnological interest, contributing to the knowledge of biological characteristics of this organism. We analyzed 15,490 ESTs of guarana in search of clones with complete coding regions. A total of 12,402 sequences were analyzed using BLAST, and 4697 full-length clones were identified, responsible for the production of 2297 different proteins. Eighty-four clones were identified as full-length for N-methyltransferase and 18 were sequenced in both directions to obtain the complete genome sequence, and confirm the search made in silico for full-length clones. Phylogenetic analyses were made with the complete genome sequences of three clones, which showed only 0.017% dissimilarity; these are phylogenetically close to the caffeine synthase of Theobroma cacao. The search for full-length clones allowed the identification of numerous clones that had the complete coding region, demonstrating this to be an efficient and useful tool in the process of biological data mining. The sequencing of the complete coding region of identified full-length clones corroborated the data from the in silico search, strengthening its efficiency and utility. PMID:21732283

  8. Analysis of expression sequence tags from a full-length-enriched cDNA library of developing sesame seeds (Sesamum indicum)

    PubMed Central

    2011-01-01

    Background Sesame (Sesamum indicum) is one of the most important oilseed crops with high oil contents and rich nutrient value. However, genetic improvement efforts in sesame could not get benefit from molecular biology technology due to poor DNA and RNA sequence resources. In this study, we carried out a large scale of expressed sequence tags (ESTs) sequencing from developing sesame seeds and further conducted analysis on seed storage products-related genes. Results A normalized and full-length enriched cDNA library from 5 ~ 30 days old immature seeds was constructed and randomly sequenced, leading to generation of 41,248 expressed sequence tags (ESTs) which then formed 4,713 contigs and 27,708 singletons with 44.9% uniESTs being putative full-length open reading frames. Approximately 26,091 of all these uniESTs have significant matches to the counterparts in Nr database of GenBank, and 21,628 of them were assigned to one or more Gene ontology (GO) terms. Homologous genes involved in oil biosynthesis were identified including some conservative transcription factors regulating oil biosynthesis such as LEAFY COTYLEDON1 (LEC1), PICKLE (PKL), WRINKLED1 (WRI1) and majority of them were found for the first time in sesame seeds. One hundred and 17 ESTs were identified possibly involved in biosynthesis of sesame lignans, sesamin and sesamolin. In total, 9,347 putative functional genes from developing seeds were identified, which accounts for one third of total genes in the sesame genome. Further analysis of the uniESTs identified 1,949 non-redundant simple sequence repeats (SSRs). Conclusions This study has provided an overview of genes expressed during sesame seed development. This collection of sesame full-length cDNAs covered a wide variety of genes in seeds, in particular, candidate genes involved in biosynthesis of sesame oils and lignans. These EST sequences enriched with full length will contribute to comparative genomic studies on sesame and other oilseed plants

  9. Expressed Sequence Tags for Bovine Muscle Satellite Cells, Myotube Formed-Cells and Adipocyte-Like Cells

    PubMed Central

    Pokharel, Smritee; Malik, Adeel; Tareq, K. M. A.; Roouf Bhat, Abdul; Park, Hee-Bok; Lee, Yong Seok; Kim, SangHoon; Yang, Bohsuk; Young Chung, Ki; Choi, Inho

    2013-01-01

    Background Muscle satellite cells (MSCs) represent a devoted stem cell population that is responsible for postnatal muscle growth and skeletal muscle regeneration. An important characteristic of MSCs is that they encompass multi potential mesenchymal stem cell activity and are able to differentiate into myocytes and adipocytes. To achieve a global view of the genes differentially expressed in MSCs, myotube formed-cells (MFCs) and adipocyte-like cells (ALCs), we performed large-scale EST sequencing of normalized cDNA libraries developed from bovine MSCs. Results A total of 24,192 clones were assembled into 3,333 clusters, 5,517 singletons and 3,842contigs. Functional annotation of these unigenes revealed that a large portion of the differentially expressed genes are involved in cellular and signaling processes. Database for Annotation, Visualization and Integrated Discovery (DAVID) functional analysis of three subsets of highly expressed gene lists (MSC233, MFC258, and ALC248) highlighted some common and unique biological processes among MSC, MFC and ALC. Additionally, genes that may be specific to MSC, MFC and ALC are reported here, and the role of dimethylarginine dimethylaminohydrolase2 (DDAH2) during myogenesis and hemoglobin subunit alpha2 (HBA2) during transdifferentiation in C2C12 were assayed as a case study. DDAH2 was up-regulated during myognesis and knockdown of DDAH2 by siRNA significantly decreased myogenin (MYOG) expression corresponding with the slight change in cell morphology. In contrast, HBA2 was up-regulated during ALC formation and resulted in decreased intracellular lipid accumulation and CD36 mRNA expression upon knockdown assay. Conclusion In this study, a large number of EST sequences were generated from the MSC, MFC and ALC. Overall, the collection of ESTs generated in this study provides a starting point for the identification of novel genes involved in MFC and ALC formation, which in turn offers a fundamental resource to enable better

  10. Porcine transcriptome analysis based on 97 non-normalized cDNA libraries and assembly of 1,021,891 expressed sequence tags

    PubMed Central

    Gorodkin, Jan; Cirera, Susanna; Hedegaard, Jakob; Gilchrist, Michael J; Panitz, Frank; Jørgensen, Claus; Scheibye-Knudsen, Karsten; Arvin, Troels; Lumholdt, Steen; Sawera, Milena; Green, Trine; Nielsen, Bente J; Havgaard, Jakob H; Rosenkilde, Carina; Wang, Jun; Li, Heng; Li, Ruiqiang; Liu, Bin; Hu, Songnian; Dong, Wei; Li, Wei; Yu, Jun; Wang, Jian; Stærfeldt, Hans-Henrik; Wernersson, Rasmus; Madsen, Lone B; Thomsen, Bo; Hornshøj, Henrik; Bujie, Zhan; Wang, Xuegang; Wang, Xuefei; Bolund, Lars; Brunak, Søren; Yang, Huanming; Bendixen, Christian; Fredholm, Merete

    2007-01-01

    Background Knowledge of the structure of gene expression is essential for mammalian transcriptomics research. We analyzed a collection of more than one million porcine expressed sequence tags (ESTs), of which two-thirds were generated in the Sino-Danish Pig Genome Project and one-third are from public databases. The Sino-Danish ESTs were generated from one normalized and 97 non-normalized cDNA libraries representing 35 different tissues and three developmental stages. Results Using the Distiller package, the ESTs were assembled to roughly 48,000 contigs and 73,000 singletons, of which approximately 25% have a high confidence match to UniProt. Approximately 6,000 new porcine gene clusters were identified. Expression analysis based on the non-normalized libraries resulted in the following findings. The distribution of cluster sizes is scaling invariant. Brain and testes are among the tissues with the greatest number of different expressed genes, whereas tissues with more specialized function, such as developing liver, have fewer expressed genes. There are at least 65 high confidence housekeeping gene candidates and 876 cDNA library-specific gene candidates. We identified differential expression of genes between different tissues, in particular brain/spinal cord, and found patterns of correlation between genes that share expression in pairs of libraries. Finally, there was remarkable agreement in expression between specialized tissues according to Gene Ontology categories. Conclusion This EST collection, the largest to date in pig, represents an essential resource for annotation, comparative genomics, assembly of the pig genome sequence, and further porcine transcription studies. PMID:17407547

  11. Analysis of expressed sequence tags from a single wheat cultivar facilitates interpretation of tandem mass spectrometry data and discrimination of gamma gliadin proteins that may play different functional roles in flour

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The complement of gamma gliadin genes expressed in the wheat cultivar Butte 86 was evaluated by analyzing publicly available expressed sequence tag (EST) data. Eleven contigs were assembled from 153 Butte 86 ESTs. Nine of the contigs encoded full-length proteins and four of the proteins contained an...

  12. Sequence tagging reveals unexpected modifications in toxicoproteomics

    PubMed Central

    Dasari, Surendra; Chambers, Matthew C.; Codreanu, Simona G.; Liebler, Daniel C.; Collins, Ben C.; Pennington, Stephen R.; Gallagher, William M.; Tabb, David L.

    2010-01-01

    Toxicoproteomic samples are rich in posttranslational modifications (PTMs) of proteins. Identifying these modifications via standard database searching can incur significant performance penalties. Here we describe the latest developments in TagRecon, an algorithm that leverages inferred sequence tags to identify modified peptides in toxicoproteomic data sets. TagRecon identifies known modifications more effectively than the MyriMatch database search engine. TagRecon outperformed state of the art software in recognizing unanticipated modifications from LTQ, Orbitrap, and QTOF data sets. We developed user-friendly software for detecting persistent mass shifts from samples. We follow a three-step strategy for detecting unanticipated PTMs in samples. First, we identify the proteins present in the sample with a standard database search. Next, identified proteins are interrogated for unexpected PTMs with a sequence tag-based search. Finally, additional evidence is gathered for the detected mass shifts with a refinement search. Application of this technology on toxicoproteomic data sets revealed unintended cross-reactions between proteins and sample processing reagents. Twenty five proteins in rat liver showed signs of oxidative stress when exposed to potentially toxic drugs. These results demonstrate the value of mining toxicoproteomic data sets for modifications. PMID:21214251

  13. Development of expressed sequence tag-based microsatellite markers for the critically endangered Isoëtes sinensis (Isoetaceae) based on transcriptome analysis.

    PubMed

    Gichira, A W; Long, Z C; Wang, Q F; Chen, J M; Liao, K

    2016-01-01

    Isoëtes sinensis is a critically endangered quillwort. To facilitate studies on the conservation genetics of this species, we developed expressed sequence tag-simple sequence repeat (EST-SSR) markers. A total of 50,063 unigenes were predicted by transcriptome sequencing, 5294 (10.6%) of which significantly matched 3011 Gene Ontology annotations and 2363 were assigned to Kyoto Encyclopedia of Genes and Genomes metabolic pathways. Most of these (2297) were involved in metabolism. A total of 1982 SSR motifs were identified, with trinucleotides being the dominant repeat motif, and 1438 (72.6%) SSR primers were designed. Eighteen randomly selected primer pairs were used to genotype 24 I. sinensis accessions, which confirmed the suitability of these novel markers for molecular studies of I. sinensis. The heterozygosity index value ranged between 0.0799 and 0.9106, while the Shannon-Wiener diversity index value ranged between 0.1732 and 2.5589. The EST-SSRs reported in this study are linked to genic sequences, and are therefore ideal for investigating the evolutionary history of I. sinensis. These markers, together with the large EST dataset generated in this study, will greatly facilitate conservation genetic studies of I. sinensis. PMID:27525847

  14. Gene identification in black cohosh (Actaea racemosa L.): expressed sequence tag profiling and genetic screening yields candidate genes for production of bioactive secondary metabolites.

    PubMed

    Spiering, Martin J; Urban, Lori A; Nuss, Donald L; Gopalan, Vivek; Stoltzfus, Arlin; Eisenstein, Edward

    2011-04-01

    Black cohosh (Actaea racemosa L., syn. Cimicifuga racemosa, Nutt., Ranunculaceae) is a popular herb used for relieving menopausal discomforts. A variety of secondary metabolites, including triterpenoids, phenolic dimers, and serotonin derivatives have been associated with its biological activity, but the genes and metabolic pathways as well as the tissue distribution of their production in this plant are unknown. A gene discovery effort was initiated in A. racemosa by partial sequencing of cDNA libraries constructed from young leaf, rhizome, and root tissues. In total, 2,066 expressed sequence tags (ESTs) were assembled into 1,590 unique genes (unigenes). Most of the unigenes were predicted to encode primary metabolism genes, but about 70 were identified as putative secondary metabolism genes. Several of these candidates were analyzed further and full-length cDNA and genomic sequences for a putative 2,3 oxidosqualene cyclase (CAS1) and two BAHD-type acyltransferases (ACT1 and HCT1) were obtained. Homology-based PCR screening for the central gene in plant serotonin biosynthesis, tryptophan decarboxylase (TDC), identified two TDC-related sequences in A. racemosa. CAS1, ACT1, and HCT1 were expressed in most plant tissues, whereas expression of TDC genes was detected only sporadically in immature flower heads and some very young leaf tissues. The cDNA libraries described and assorted genes identified provide initial insight into gene content and diversity in black cohosh, and provide tools and resources for detailed investigations of secondary metabolite genes and enzymes in this important medicinal plant. PMID:21188383

  15. Generation and analysis of expressed sequence tags (ESTs) of Camelina sativa to mine drought stress-responsive genes.

    PubMed

    Kanth, Bashistha Kumar; Kumari, Shipra; Choi, Seo Hee; Ha, Hye-Jeong; Lee, Geung-Joo

    2015-11-01

    Camelina sativa is an oil-producing crop belonging to the family of Brassicaceae. Due to exceptionally high content of omega fatty acid, it is commercially grown around the world as edible oil, biofuel, and animal feed. A commonly referred 'false flax' or gold-of-pleasure Camelina sativa has been interested as one of biofuel feedstocks. The species can grow on marginal land due to its superior drought tolerance with low requirement of agricultural inputs. This crop has been unexploited due to very limited transcriptomic and genomic data. Use of gene-specific molecular markers is an important strategy for new cultivar development in breeding program. In this study, Illumina paired-end sequencing technology and bioinformatics tools were used to obtain expression profiling of genes responding to drought stress in Camelina sativa BN14. A total of more than 60,000 loci were assembled, corresponding to approximately 275 K transcripts. When the species was exposed to 10 kPa drought stress, 100 kPa drought stress, and rehydrated conditions, a total of 107, 2,989, and 982 genes, respectively, were up-regulated, while 146, 3,659, and 1189 genes, respectively, were down-regulated compared to control condition. Some unknown genes were found to be highly expressed under drought conditions, together with some already reported gene families such as senescence-associated genes, CAP160, and LEA under 100 kPa soil water condition, cysteine protease, 2OG, Fe(II)-dependent oxygenase, and RAD-like 1 under rehydrated condition. These genes will be further validated and mapped to determine their function and loci. This EST library will be favorably applied to develop gene-specific molecular markers and discover genes responsible for drought tolerance in Camelina species. PMID:26410535

  16. Generation and analysis of a 29,745 unique Expressed Sequence Tags from the Pacific oyster (Crassostrea gigas) assembled into a publicly accessible database: the GigasDatabase

    PubMed Central

    2009-01-01

    Background Although bivalves are among the most-studied marine organisms because of their ecological role and economic importance, very little information is available on the genome sequences of oyster species. This report documents three large-scale cDNA sequencing projects for the Pacific oyster Crassostrea gigas initiated to provide a large number of expressed sequence tags that were subsequently compiled in a publicly accessible database. This resource allowed for the identification of a large number of transcripts and provides valuable information for ongoing investigations of tissue-specific and stimulus-dependant gene expression patterns. These data are crucial for constructing comprehensive DNA microarrays, identifying single nucleotide polymorphisms and microsatellites in coding regions, and for identifying genes when the entire genome sequence of C. gigas becomes available. Description In the present paper, we report the production of 40,845 high-quality ESTs that identify 29,745 unique transcribed sequences consisting of 7,940 contigs and 21,805 singletons. All of these new sequences, together with existing public sequence data, have been compiled into a publicly-available Website http://public-contigbrowser.sigenae.org:9090/Crassostrea_gigas/index.html. Approximately 43% of the unique ESTs had significant matches against the SwissProt database and 27% were annotated using Gene Ontology terms. In addition, we identified a total of 208 in silico microsatellites from the ESTs, with 173 having sufficient flanking sequence for primer design. We also identified a total of 7,530 putative in silico, single-nucleotide polymorphisms using existing and newly-generated EST resources for the Pacific oyster. Conclusion A publicly-available database has been populated with 29,745 unique sequences for the Pacific oyster Crassostrea gigas. The database provides many tools to search cleaned and assembled ESTs. The user may input and submit several filters, such as

  17. Computational identification of conserved microRNAs and their targets from expression sequence tags of blueberry (Vaccinium corybosum)

    PubMed Central

    Li, Xuyan; Hou, Yanming; Zhang, Li; Zhang, Wenhao; Quan, Chen; Cui, Yuhai; Bian, Shaomin

    2014-01-01

    MicroRNAs (miRNAs) are a class of endogenous, approximately 21nt in length, non-coding RNA, which mediate the expression of target genes primarily at post-transcriptional levels. miRNAs play critical roles in almost all plant cellular and metabolic processes. Although numerous miRNAs have been identified in the plant kingdom, the miRNAs in blueberry, which is an economically important small fruit crop, still remain totally unknown. In this study, we reported a computational identification of miRNAs and their targets in blueberry. By conducting an EST-based comparative genomics approach, 9 potential vco-miRNAs were discovered from 22,402 blueberry ESTs according to a series of filtering criteria, designated as vco-miR156–5p, vco-miR156–3p, vco-miR1436, vco-miR1522, vco-miR4495, vco-miR5120, vco-miR5658, vco-miR5783, and vco-miR5986. Based on sequence complementarity between miRNA and its target transcript, 34 target ESTs from blueberry and 70 targets from other species were identified for the vco-miRNAs. The targets were found to be involved in transcription, RNA splicing and binding, DNA duplication, signal transduction, transport and trafficking, stress response, as well as synthesis and metabolic process. These findings will greatly contribute to future research in regard to functions and regulatory mechanisms of blueberry miRNAs. PMID:25763692

  18. Expressed sequence tags from larval gut of the European corn borer (Ostrinia nubilalis): Exploring candidate genes potentially involved in Bacillus thuringiensis toxicity and resistance

    PubMed Central

    Khajuria, Chitvan; Zhu, Yu Cheng; Chen, Ming-Shun; Buschman, Lawrent L; Higgins, Randall A; Yao, Jianxiu; Crespo, Andre LB; Siegfried, Blair D; Muthukrishnan, Subbaratnam; Zhu, Kun Yan

    2009-01-01

    Background Lepidoptera represents more than 160,000 insect species which include some of the most devastating pests of crops, forests, and stored products. However, the genomic information on lepidopteran insects is very limited. Only a few studies have focused on developing expressed sequence tag (EST) libraries from the guts of lepidopteran larvae. Knowledge of the genes that are expressed in the insect gut are crucial for understanding basic physiology of food digestion, their interactions with Bacillus thuringiensis (Bt) toxins, and for discovering new targets for novel toxins for use in pest management. This study analyzed the ESTs generated from the larval gut of the European corn borer (ECB, Ostrinia nubilalis), one of the most destructive pests of corn in North America and the western world. Our goals were to establish an ECB larval gut-specific EST database as a genomic resource for future research and to explore candidate genes potentially involved in insect-Bt interactions and Bt resistance in ECB. Results We constructed two cDNA libraries from the guts of the fifth-instar larvae of ECB and sequenced a total of 15,000 ESTs from these libraries. A total of 12,519 ESTs (83.4%) appeared to be high quality with an average length of 656 bp. These ESTs represented 2,895 unique sequences, including 1,738 singletons and 1,157 contigs. Among the unique sequences, 62.7% encoded putative proteins that shared significant sequence similarities (E-value ≤ 10-3)with the sequences available in GenBank. Our EST analysis revealed 52 candidate genes that potentially have roles in Bt toxicity and resistance. These genes encode 18 trypsin-like proteases, 18 chymotrypsin-like proteases, 13 aminopeptidases, 2 alkaline phosphatases and 1 cadherin-like protein. Comparisons of expression profiles of 41 selected candidate genes between Cry1Ab-susceptible and resistant strains of ECB by RT-PCR showed apparently decreased expressions in 2 trypsin-like and 2 chymotrypsin

  19. Identification and functional characterization of effectors in expressed sequence tags from various life cycle stages of the potato cyst nematode Globodera pallida.

    PubMed

    Jones, John T; Kumar, Amar; Pylypenko, Liliya A; Thirugnanasambandam, Amarnath; Castelli, Lydia; Chapman, Sean; Cock, Peter J A; Grenier, Eric; Lilley, Catherine J; Phillips, Mark S; Blok, Vivian C

    2009-11-01

    In this article, we describe the analysis of over 9000 expressed sequence tags (ESTs) from cDNA libraries obtained from various life cycle stages of Globodera pallida. We have identified over 50 G. pallida effectors from this dataset using bioinformatics analysis, by screening clones in order to identify secreted proteins up-regulated after the onset of parasitism and using in situ hybridization to confirm the expression in pharyngeal gland cells. A substantial gene family encoding G. pallida SPRYSEC proteins has been identified. The expression of these genes is restricted to the dorsal pharyngeal gland cell. Different members of the SPRYSEC family of proteins from G. pallida show different subcellular localization patterns in plants, with some localized to the cytoplasm and others to the nucleus and nucleolus. Differences in subcellular localization may reflect diverse functional roles for each individual protein or, more likely, variety in the compartmentalization of plant proteins targeted by the nematode. Our data are therefore consistent with the suggestion that the SPRYSEC proteins suppress host defences, as suggested previously, and that they achieve this through interaction with a range of host targets. PMID:19849787

  20. An evaluation for cross-species proteomics research by publicly available expressed sequence tag database search using tandem mass spectral data.

    PubMed

    Huang, Mei; Chen, Tong; Chan, ZhuLong

    2006-01-01

    With 1383 tandem mass spectra derived from 120 individual protein spots separated by the two-dimensional (2-D) gel electrophoresis of protein samples from three different species, comparative analyses were performed by searching the Expressed Sequence Tag (EST) database (DB) and the NCBI non-redundant (nr) DB of green plants, respectively, which uses the Mascot search engine to establish a statistical basis. It was confirmed that the former could identify more peptides manually validated by de novo sequencing (DNS) from fewer species in more closely phylogenetic relationships than the latter in a statistically significant manner. Our data demonstrated that correct peptide identifications were given low Mascot scores (e.g. 6-14) and incorrect peptide identifications were given high Mascot scores (e.g. 68-83). Our data also showed that the current evaluation approaches to protein assignments are unsatisfactory because a few 'false-positive' proteins are recognized and several 'false-negative' proteins are rescued by manual validation. PMID:16941525

  1. Functional categorization of unique expressed sequence tags obtained from the yeast-like growth phase of the elm pathogen Ophiostoma novo-ulmi

    PubMed Central

    2011-01-01

    Background The highly aggressive pathogenic fungus Ophiostoma novo-ulmi continues to be a serious threat to the American elm (Ulmus americana) in North America. Extensive studies have been conducted in North America to understand the mechanisms of virulence of this introduced pathogen and its evolving population structure, with a view to identifying potential strategies for the control of Dutch elm disease. As part of a larger study to examine the genomes of economically important Ophiostoma spp. and the genetic basis of virulence, we have constructed an expressed sequence tag (EST) library using total RNA extracted from the yeast-like growth phase of O. novo-ulmi (isolate H327). Results A total of 4,386 readable EST sequences were annotated by determining their closest matches to known or theoretical sequences in public databases by BLASTX analysis. Searches matched 2,093 sequences to entries found in Genbank, including 1,761 matches with known proteins and 332 matches with unknown (hypothetical/predicted) proteins. Known proteins included a collection of 880 unique transcripts which were categorized to obtain a functional profile of the transcriptome and to evaluate physiological function. These assignments yielded 20 primary functional categories (FunCat), the largest including Metabolism (FunCat 01, 20.28% of total), Sub-cellular localization (70, 10.23%), Protein synthesis (12, 10.14%), Transcription (11, 8.27%), Biogenesis of cellular components (42, 8.15%), Cellular transport, facilitation and routes (20, 6.08%), Classification unresolved (98, 5.80%), Cell rescue, defence and virulence (32, 5.31%) and the unclassified category, or known sequences of unknown metabolic function (99, 7.5%). A list of specific transcripts of interest was compiled to initiate an evaluation of their impact upon strain virulence in subsequent studies. Conclusions This is the first large-scale study of the O. novo-ulmi transcriptome. The expression profile obtained from the yeast

  2. Generation and analysis of expressed sequence tags(ESTs) for marker development in yam (Dioscores alata L.)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A total of 44,757 EST sequences , 1705 EST-SSR and 104 SNP markers were generated from the cDNA libraries of the resistant and susceptible genotypes. We have developed a comprehensive annotated transcriptome data set in yam to enrich the EST information in public databases. These EST resources prov...

  3. Global Transcriptome Analysis of the Tentacle of the Jellyfish Cyanea capillata Using Deep Sequencing and Expressed Sequence Tags: Insight into the Toxin- and Degenerative Disease-Related Transcripts

    PubMed Central

    Liu, Dan; Wang, Qianqian; Ruan, Zengliang; He, Qian; Zhang, Liming

    2015-01-01

    Background Jellyfish contain diverse toxins and other bioactive components. However, large-scale identification of novel toxins and bioactive components from jellyfish has been hampered by the low efficiency of traditional isolation and purification methods. Results We performed de novo transcriptome sequencing of the tentacle tissue of the jellyfish Cyanea capillata. A total of 51,304,108 reads were obtained and assembled into 50,536 unigenes. Of these, 21,357 unigenes had homologues in public databases, but the remaining unigenes had no significant matches due to the limited sequence information available and species-specific novel sequences. Functional annotation of the unigenes also revealed general gene expression profile characteristics in the tentacle of C. capillata. A primary goal of this study was to identify putative toxin transcripts. As expected, we screened many transcripts encoding proteins similar to several well-known toxin families including phospholipases, metalloproteases, serine proteases and serine protease inhibitors. In addition, some transcripts also resembled molecules with potential toxic activities, including cnidarian CfTX-like toxins with hemolytic activity, plancitoxin-1, venom toxin-like peptide-6, histamine-releasing factor, neprilysin, dipeptidyl peptidase 4, vascular endothelial growth factor A, angiotensin-converting enzyme-like and endothelin-converting enzyme 1-like proteins. Most of these molecules have not been previously reported in jellyfish. Interestingly, we also characterized a number of transcripts with similarities to proteins relevant to several degenerative diseases, including Huntington’s, Alzheimer’s and Parkinson’s diseases. This is the first description of degenerative disease-associated genes in jellyfish. Conclusion We obtained a well-categorized and annotated transcriptome of C. capillata tentacle that will be an important and valuable resource for further understanding of jellyfish at the molecular

  4. EGENES: Transcriptome-Based Plant Database of Genes with Metabolic Pathway Information and Expressed Sequence Tag Indices in KEGG1[C][W][OA

    PubMed Central

    Masoudi-Nejad, Ali; Goto, Susumu; Jauregui, Ruy; Ito, Masumi; Kawashima, Shuichi; Moriya, Yuki; Endo, Takashi R.; Kanehisa, Minoru

    2007-01-01

    EGENES is a knowledge-based database for efficient analysis of plant expressed sequence tags (ESTs) that was recently added to the KEGG suite of databases. It links plant genomic information with higher order functional information in a single database. It also provides gene indices for each genome. The genomic information in EGENES is a collection of EST contigs constructed from assembly of ESTs. Due to the extremely large genomes of plant species, the bulk collection of data such as ESTs is a quick way to capture a complete repertoire of genes expressed in an organism. Using ESTs for reconstructing metabolic pathways is a new expansion in KEGG and provides researchers with a new resource for species in which only EST sequences are available. Functional annotation in EGENES is a process of linking a set of genes/transcripts in each genome with a network of interacting molecules in the cell. EGENES is a multispecies, integrated resource consisting of genomic, chemical, and network information containing a complete set of building blocks (genes and molecules) and wiring diagrams (biological pathways) to represent cellular functions. Using EGENES, genome-based pathway annotation and EST-based annotation can now be compared and mutually validated. The ultimate goals of EGENES will be to: bring new plant species into KEGG by clustering and annotating ESTs; abstract knowledge and principles from large-scale plant EST data; and improve computational prediction of systems of higher complexity. EGENES will be updated at least once a year. EGENES is publicly available and is accessible by the following link or by KEGG's navigation system (http://www.genome.jp/kegg-bin/create_kegg_menu?category=plants_egenes). PMID:17468225

  5. A consensus linkage map for sugi (Cryptomeria japonica) from two pedigrees, based on microsatellites and expressed sequence tags.

    PubMed Central

    Tani, Naoki; Takahashi, Tomokazu; Iwata, Hiroyoshi; Mukai, Yuzuru; Ujino-Ihara, Tokuko; Matsumoto, Asako; Yoshimura, Kensuke; Yoshimaru, Hiroshi; Murai, Masafumi; Nagasaka, Kazutoshi; Tsumura, Yoshihiko

    2003-01-01

    A consensus map for sugi (Cryptomeria japonica) was constructed by integrating linkage data from two unrelated third-generation pedigrees, one derived from a full-sib cross and the other by self-pollination of F1 individuals. The progeny segregation data of the first pedigree were derived from cleaved amplified polymorphic sequences, microsatellites, restriction fragment length polymorphisms, and single nucleotide polymorphisms. The data of the second pedigree were derived from cleaved amplified polymorphic sequences, isozyme markers, morphological traits, random amplified polymorphic DNA markers, and restriction fragment length polymorphisms. Linkage analyses were done for the first pedigree with JoinMap 3.0, using its parameter set for progeny derived by cross-pollination, and for the second pedigree with the parameter set for progeny derived from selfing of F1 individuals. The 11 chromosomes of C. japonica are represented in the consensus map. A total of 438 markers were assigned to 11 large linkage groups, 1 small linkage group, and 1 nonintegrated linkage group from the second pedigree; their total length was 1372.2 cM. On average, the consensus map showed 1 marker every 3.0 cM. PCR-based codominant DNA markers such as cleaved amplified polymorphic sequences and microsatellite markers were distributed in all linkage groups and occupied about half of mapped loci. These markers are very useful for integration of different linkage maps, QTL mapping, and comparative mapping for evolutional study, especially for species with a large genome size such as conifers. PMID:14668402

  6. Development of Microsatellite Markers Derived from Expressed Sequence Tags of Polyporales for Genetic Diversity Analysis of Endangered Polyporus umbellatus.

    PubMed

    Zhang, Yuejin; Chen, Yuanyuan; Wang, Ruihong; Zeng, Ailin; Deyholos, Michael K; Shu, Jia; Guo, Hongbo

    2015-01-01

    A large scale of EST sequences of Polyporales was screened in this investigation in order to identify EST-SSR markers for various applications. The distribution of EST sequences and SSRs in five families of Polyporales was analyzed, respectively. Mononucleotide was the most abundant type, followed by trinucleotide. Among five families, Ganodermataceae occupied the most SSR markers, followed by Coriolaceae. Functional prediction of SSR marker-containing EST sequences in Ganoderma lucidum obtained three main groups, namely, cellular component, biological process, and molecular function. Thirty EST-SSR primers were designed to evaluate the genetic diversity of 13 natural Polyporus umbellatus accessions. Twenty one EST-SSRs were polymorphic with average PIC value of 0.33 and transferability rate of 71%. These 13 P. umbellatus accessions showed relatively high genetic diversity. The expected heterozygosity, Nei's gene diversity, and Shannon information index were 0.41, 0.39, and 0.57, respectively. Both UPGMA dendrogram and principal coordinate analysis (PCA) showed the same cluster result that divided the 13 accessions into three or four groups. PMID:26146636

  7. Expressed sequence tag analysis and development of gene associated markers in a near-isogenic plant system of Eragrostis curvula.

    PubMed

    Cervigni, Gerardo D L; Paniego, Norma; Díaz, Marina; Selva, Juan P; Zappacosta, Diego; Zanazzi, Darío; Landerreche, Iñaki; Martelotto, Luciano; Felitti, Silvina; Pessino, Silvina; Spangenberg, Germán; Echenique, Viviana

    2008-05-01

    Eragrostis curvula (Schrad.) Nees is a forage grass native to the semiarid regions of Southern Africa, which reproduces mainly by pseudogamous diplosporous apomixis. A collection of ESTs was generated from four cDNA libraries, three of them obtained from panicles of near-isogenic lines with different ploidy levels and reproductive modes, and one obtained from 12 days-old plant leaves. A total of 12,295 high-quality ESTs were clustered and assembled, rendering 8,864 unigenes, including 1,490 contigs and 7,394 singletons, with a genome coverage of 22%. A total of 7,029 (79.11%) unigenes were functionally categorized by BLASTX analysis against sequences deposited in public databases, but only 37.80% could be classified according to Gene Ontology. Sequence comparison against the cereals genes indexes (GI) revealed 50% significant hits. A total of 254 EST-SSRs were detected from 219 singletons and 35 from contigs. Di- and tri- motifs were similarly represented with percentages of 38.95 and 40.16%, respectively. In addition, 190 SNPs and Indels were detected in 18 contigs generated from 3 to 4 libraries. The ESTs and the molecular markers obtained in this study will provide valuable resources for a wide range of applications including gene identification, genetic mapping, cultivar identification, analysis of genetic diversity, phenotype mapping and marker assisted selection. PMID:18196464

  8. Analyses of expressed sequence tags in Neurospora reveal rapid evolution of genes associated with the early stages of sexual reproduction in fungi

    PubMed Central

    2012-01-01

    Background The broadly accepted pattern of rapid evolution of reproductive genes is primarily based on studies of animal systems, although several examples of rapidly evolving genes involved in reproduction are found in diverse additional taxa. In fungi, genes involved in mate recognition have been found to evolve rapidly. However, the examples are too few to draw conclusions on a genome scale. Results In this study, we performed microarray hybridizations between RNA from sexual and vegetative tissues of two strains of the heterothallic (self-sterile) filamentous ascomycete Neurospora intermedia, to identify a set of sex-associated genes in this species. We aligned Expressed Sequence Tags (ESTs) from sexual and vegetative tissue of N. intermedia to orthologs from three closely related species: N. crassa, N. discreta and N. tetrasperma. The resulting four-species alignments provided a dataset for molecular evolutionary analyses. Our results confirm a general pattern of rapid evolution of fungal sex-associated genes, compared to control genes with constitutive expression or a high relative expression during vegetative growth. Among the rapidly evolving sex-associated genes, we identified candidates that could be of importance for mating or fruiting-body development. Analyses of five of these candidate genes from additional species of heterothallic Neurospora revealed that three of them evolve under positive selection. Conclusions Taken together, our study represents a novel finding of a genome-wide pattern of rapid evolution of sex-associated genes in the fungal kingdom, and provides a list of candidate genes important for reproductive isolation in Neurospora. PMID:23186325

  9. Identification of salt-induced genes from Salicornia brachiata, an extreme halophyte through expressed sequence tags analysis.

    PubMed

    Jha, Bhavanath; Agarwal, Pradeep K; Reddy, Palakolanu Sudhakar; Lal, Sanjay; Sopory, Sudhir K; Reddy, Malireddy K

    2009-04-01

    Salinity severely affects plant growth and development causing crop loss worldwide. We have isolated a large number of salt-induced genes as well as unknown and hypothetical genes from Salicornia brachiata Roxb. (Amaranthaceae). This is the first description of identification of genes in response to salinity stress in this extreme halophyte plant. Salicornia accumulates salt in its pith and survives even at 2 M NaCl under field conditions. For isolating salt responsive genes, cDNA subtractive hybridization was performed between control and 500 mM NaCl treated plants. Out of the 1200 recombinant clones, 930 sequences were submitted to the NCBI database (GenBank accession: EB484528 to EB485289 and EC906125 to EC906292). 789 ESTs showed matching with different genes in NCBI database. 4.8% ESTs belonged to stress-tolerant gene category and approximately 29% ESTs showed no homology with known functional gene sequences, thus classified as unknown or hypothetical. The detection of a large number of ESTs with unknown putative function in this species makes it an interesting contribution. The 90 unknown and hypothetical genes were selected to study their differential regulation by reverse Northern analysis for identifying their role in salinity tolerance. Interestingly, both up and down regulation at 500 mM NaCl were observed (21 and 10 genes, respectively). Northern analysis of two important salt tolerant genes, ASR1 (Abscisic acid stress ripening gene) and plasma membrane H+ATPase, showed the basal level of transcripts in control condition and an increase with NaCl treatment. ASR1 gene is made full length using 5' RACE and its potential role in imparting salt tolerance is being studied. PMID:19556705

  10. Construction of a cDNA library and preliminary analysis of expressed sequence tags in Piper hainanense.

    PubMed

    Fan, R; Ling, P; Hao, C Y; Li, F P; Huang, L F; Wu, B D; Wu, H S

    2015-01-01

    Black pepper is a perennial climbing vine. It is widely cultivated because its berries can be utilized not only as a spice in food but also for medicinal use. This study aimed to construct a standardized, high-quality cDNA library to facilitated identification of new Piper hainanense transcripts. For this, 262 unigenes were used to generate raw reads. The average length of these 262 unigenes was 774.8 bp. Of these, 94 genes (35.9%) were newly identified, according to the NCBI protein database. Thus, identification of new genes may broaden the molecular knowledge of P. hainanense on the basis of Clusters of Orthologous Groups and Gene Ontology categories. In addition, certain basic genes linked to physiological processes, which can contribute to disease resistance and thereby to the breeding of black pepper. A total of 26 unigenes were found to be SSR markers. Dinucleotide SSR was the main repeat motif, accounting for 61.54%, followed by trinucleotide SSR (23.07%). Eight primer pairs successfully amplified DNA fragments and detected significant amounts of polymorphism among twenty-one piper germplasm. These results present a novel sequence information of P. hainanense, which can serve as the foundation for further genetic research on this species. PMID:26505424

  11. Identification and characterisation of functional expressed sequence tags-derived simple sequence repeat (eSSR) markers for genetic linkage mapping of Schistosoma mansoni juvenile resistance and susceptibility loci in Biomphalaria glabrata

    PubMed Central

    Ittiprasert, Wannaporn; Miller, André; Su, Xin-zhuan; Mu, Jianbing; Bhusudsawang, Ganlayarat; Ukoskit, Kitipat; Knight, Matty

    2013-01-01

    Biomphalaria glabrata susceptibility to Schistosoma mansoni has a strong genetic component, offering the possibility for investigating host–parasite interactions at the molecular level, perhaps leading to novel control approaches. The identification, mapping and molecular characterisation of genes that influence the outcome of parasitic infection in the intermediate snail host is, therefore, seen as fundamental to the control of schistosomiasis. To better understand the evolutionary processes driving disease resistance/susceptibility phenotypes, we previously identified polymorphic random amplification of polymorphic DNA and genomic simple sequence repeats from B. glabrata. In the present study we identified and characterised polymorphic expressed simple sequence repeats markers (Bg-eSSR) from existing B. glabrata expressed sequence tags. Using these markers, and with previously identified genomic simple sequence repeats, genetic linkage mapping for parasite refractory and susceptibility phenotypes, the first known for B. glabrata, was initiated. Data mining of 54,309 expressed sequence tag, produced 660 expressed simple sequence repeats of which dinucleotide motifs (TA)n were the most common (37.88%), followed by trinucleotide (29.55%), mononucleotide (18.64%) and tetranucleotide (10.15%). Penta- and hexanucleotide motifs represented <3% of the Bg-eSSRs identified. While the majority (71%) of Bg-eSSRs were monomorphic between resistant and susceptible snails, several were, however, useful for the construction of a genetic linkage map based on their inheritance in segregating F2 progeny snails derived from crossing juvenile BS-90 and NMRI snails. Polymorphic Bg-eSSRs assorted into six linkage groups at a logarithm of odds score of 3. Interestingly, the heritability of four markers (Prim1_910, Prim1_771, Prim6_1024 and Prim7_823) with juvenile snail resistance were, by t-test, significant (P < 0.05) while an allelic marker, Prim24_524, showed linkage with the

  12. Analysis of expressed sequence tags from Actinidia: applications of a cross species EST database for gene discovery in the areas of flavor, health, color and ripening

    PubMed Central

    Crowhurst, Ross N; Gleave, Andrew P; MacRae, Elspeth A; Ampomah-Dwamena, Charles; Atkinson, Ross G; Beuning, Lesley L; Bulley, Sean M; Chagne, David; Marsh, Ken B; Matich, Adam J; Montefiori, Mirco; Newcomb, Richard D; Schaffer, Robert J; Usadel, Björn; Allan, Andrew C; Boldingh, Helen L; Bowen, Judith H; Davy, Marcus W; Eckloff, Rheinhart; Ferguson, A Ross; Fraser, Lena G; Gera, Emma; Hellens, Roger P; Janssen, Bart J; Klages, Karin; Lo, Kim R; MacDiarmid, Robin M; Nain, Bhawana; McNeilage, Mark A; Rassam, Maysoon; Richardson, Annette C; Rikkerink, Erik HA; Ross, Gavin S; Schröder, Roswitha; Snowden, Kimberley C; Souleyre, Edwige JF; Templeton, Matt D; Walton, Eric F; Wang, Daisy; Wang, Mindy Y; Wang, Yanming Y; Wood, Marion; Wu, Rongmei; Yauk, Yar-Khing; Laing, William A

    2008-01-01

    Background Kiwifruit (Actinidia spp.) are a relatively new, but economically important crop grown in many different parts of the world. Commercial success is driven by the development of new cultivars with novel consumer traits including flavor, appearance, healthful components and convenience. To increase our understanding of the genetic diversity and gene-based control of these key traits in Actinidia, we have produced a collection of 132,577 expressed sequence tags (ESTs). Results The ESTs were derived mainly from four Actinidia species (A. chinensis, A. deliciosa, A. arguta and A. eriantha) and fell into 41,858 non redundant clusters (18,070 tentative consensus sequences and 23,788 EST singletons). Analysis of flavor and fragrance-related gene families (acyltransferases and carboxylesterases) and pathways (terpenoid biosynthesis) is presented in comparison with a chemical analysis of the compounds present in Actinidia including esters, acids, alcohols and terpenes. ESTs are identified for most genes in color pathways controlling chlorophyll degradation and carotenoid biosynthesis. In the health area, data are presented on the ESTs involved in ascorbic acid and quinic acid biosynthesis showing not only that genes for many of the steps in these pathways are represented in the database, but that genes encoding some critical steps are absent. In the convenience area, genes related to different stages of fruit softening are identified. Conclusion This large EST resource will allow researchers to undertake the tremendous challenge of understanding the molecular basis of genetic diversity in the Actinidia genus as well as provide an EST resource for comparative fruit genomics. The various bioinformatics analyses we have undertaken demonstrates the extent of coverage of ESTs for genes encoding different biochemical pathways in Actinidia. PMID:18655731

  13. Ginger and turmeric expressed sequence tags identify signature genes for rhizome identity and development and the biosynthesis of curcuminoids, gingerols and terpenoids

    PubMed Central

    2013-01-01

    Background Ginger (Zingiber officinale) and turmeric (Curcuma longa) accumulate important pharmacologically active metabolites at high levels in their rhizomes. Despite their importance, relatively little is known regarding gene expression in the rhizomes of ginger and turmeric. Results In order to identify rhizome-enriched genes and genes encoding specialized metabolism enzymes and pathway regulators, we evaluated an assembled collection of expressed sequence tags (ESTs) from eight different ginger and turmeric tissues. Comparisons to publicly available sorghum rhizome ESTs revealed a total of 777 gene transcripts expressed in ginger/turmeric and sorghum rhizomes but apparently absent from other tissues. The list of rhizome-specific transcripts was enriched for genes associated with regulation of tissue growth, development, and transcription. In particular, transcripts for ethylene response factors and AUX/IAA proteins appeared to accumulate in patterns mirroring results from previous studies regarding rhizome growth responses to exogenous applications of auxin and ethylene. Thus, these genes may play important roles in defining rhizome growth and development. Additional associations were made for ginger and turmeric rhizome-enriched MADS box transcription factors, their putative rhizome-enriched homologs in sorghum, and rhizomatous QTLs in rice. Additionally, analysis of both primary and specialized metabolism genes indicates that ginger and turmeric rhizomes are primarily devoted to the utilization of leaf supplied sucrose for the production and/or storage of specialized metabolites associated with the phenylpropanoid pathway and putative type III polyketide synthase gene products. This finding reinforces earlier hypotheses predicting roles of this enzyme class in the production of curcuminoids and gingerols. Conclusion A significant set of genes were found to be exclusively or preferentially expressed in the rhizome of ginger and turmeric. Specific

  14. Identification of potential vaccine and drug target candidates by expressed sequence tag analysis and immunoscreening of Onchocerca volvulus larval cDNA libraries.

    PubMed

    Lizotte-Waniewski, M; Tawe, W; Guiliano, D B; Lu, W; Liu, J; Williams, S A; Lustigman, S

    2000-06-01

    The search for appropriate vaccine candidates and drug targets against onchocerciasis has so far been confronted with several limitations due to the unavailability of biological material, appropriate molecular resources, and knowledge of the parasite biology. To identify targets for vaccine or chemotherapy development we have undertaken two approaches. First, cDNA expression libraries were constructed from life cycle stages that are critical for establishment of Onchocerca volvulus infection, the third-stage larvae (L3) and the molting L3. A gene discovery effort was then initiated by random expressed sequence tag analysis of 5,506 cDNA clones. Cluster analyses showed that many of the transcripts were up-regulated and/or stage specific in either one or both of the cDNA libraries when compared to the microfilariae, L2, and both adult stages of the parasite. Homology searches against the GenBank database facilitated the identification of several genes of interest, such as proteinases, proteinase inhibitors, antioxidant or detoxification enzymes, and neurotransmitter receptors, as well as structural and housekeeping genes. Other O. volvulus genes showed homology only to predicted genes from the free-living nematode Caenorhabditis elegans or were entirely novel. Some of the novel proteins contain potential secretory leaders. Secondly, by immunoscreening the molting L3 cDNA library with a pool of human sera from putatively immune individuals, we identified six novel immunogenic proteins that otherwise would not have been identified as potential vaccinogens using the gene discovery effort. This study lays a solid foundation for a better understanding of the biology of O. volvulus as well as for the identification of novel targets for filaricidal agents and/or vaccines against onchocerciasis based on immunological and rational hypothesis-driven research. PMID:10816503

  15. Needles in the EST Haystack: Large-Scale Identification and Analysis of Excretory-Secretory (ES) Proteins in Parasitic Nematodes Using Expressed Sequence Tags (ESTs)

    PubMed Central

    Nagaraj, Shivashankar H.; Gasser, Robin B.; Ranganathan, Shoba

    2008-01-01

    Background Parasitic nematodes of humans, other animals and plants continue to impose a significant public health and economic burden worldwide, due to the diseases they cause. Promising antiparasitic drug and vaccine candidates have been discovered from excreted or secreted (ES) proteins released from the parasite and exposed to the immune system of the host. Mining the entire expressed sequence tag (EST) data available from parasitic nematodes represents an approach to discover such ES targets. Methods and Findings In this study, we predicted, using EST2Secretome, a novel, high-throughput, computational workflow system, 4,710 ES proteins from 452,134 ESTs derived from 39 different species of nematodes, parasitic in animals (including humans) or plants. In total, 2,632, 786, and 1,292 ES proteins were predicted for animal-, human-, and plant-parasitic nematodes. Subsequently, we systematically analysed ES proteins using computational methods. Of these 4,710 proteins, 2,490 (52.8%) had orthologues in Caenorhabditis elegans, whereas 621 (13.8%) appeared to be novel, currently having no significant match to any molecule available in public databases. Of the C. elegans homologues, 267 had strong “loss-of-function” phenotypes by RNA interference (RNAi) in this nematode. We could functionally classify 1,948 (41.3%) sequences using the Gene Ontology (GO) terms, establish pathway associations for 573 (12.2%) sequences using Kyoto Encyclopaedia of Genes and Genomes (KEGG), and identify protein interaction partners for 1,774 (37.6%) molecules. We also mapped 758 (16.1%) proteins to protein domains including the nematode-specific protein family “transthyretin-like” and “chromadorea ALT,” considered as vaccine candidates against filariasis in humans. Conclusions We report the large-scale analysis of ES proteins inferred from EST data for a range of parasitic nematodes. This set of ES proteins provides an inventory of known and novel members of ES proteins as a

  16. Development and Validation of Single Nucleotide Polymorphism (SNP) Markers from an Expressed Sequence Tag (EST) Database in Olive Flounder (Paralichthys olivaceus)

    PubMed Central

    Kim, Jung Eun; Lee, Young Mee; Lee, Jeong-Ho; Noh, Jae Koo; Kim, Hyun Chul; Park, Choul-Ji; Park, Jong-Won; Kim, Kyung-Kil

    2014-01-01

    To successful molecular breeding, identification and functional characterization of breeding related genes and development of molecular breeding techniques using DNA markers are essential. Although the development of a useful marker is difficult in the aspect of time, cost and effort, many markers are being developed to be used in molecular breeding and developed markers have been used in many fields. Single nucleotide polymorphisms (SNPs) markers were widely used for genomic research and breeding, but has hardly been validated for screening functional genes in olive flounder. We identified single nucleotide polymorphisms (SNPs) from expressed sequence tag (EST) database in olive flounder; out of a total 4,327 ESTs, 693 contigs and 514 SNPs were detected in total EST, and these substitutions include 297 transitions and 217 transversions. As a result, 144 SNP markers were developed on the basis of 514 SNP to selection of useful gene region, and then applied to each of eight wild and culture olive flounder (total 16 samples). In our experimental result, only 32 markers had detected polymorphism in sample, also identified 21 transitions and 11 transversions, whereas indel was not detected in polymorphic SNPs. Heterozygosity of wild and cultured olive flounder using the 32 SNP markers is 0.34 and 0.29, respectively. In conclusion, we identified SNP and polymorphism in olive flounder using newly designed marker, it supports that developed markers are suitable for SNP detection and diversity analysis in olive flounder. The outcome of this study can be basic data for researches for immunity gene and characteristic with SNP. PMID:25949198

  17. TagCleaner: Identification and removal of tag sequences from genomic and metagenomic datasets

    PubMed Central

    2010-01-01

    Background Sequencing metagenomes that were pre-amplified with primer-based methods requires the removal of the additional tag sequences from the datasets. The sequenced reads can contain deletions or insertions due to sequencing limitations, and the primer sequence may contain ambiguous bases. Furthermore, the tag sequence may be unavailable or incorrectly reported. Because of the potential for downstream inaccuracies introduced by unwanted sequence contaminations, it is important to use reliable tools for pre-processing sequence data. Results TagCleaner is a web application developed to automatically identify and remove known or unknown tag sequences allowing insertions and deletions in the dataset. TagCleaner is designed to filter the trimmed reads for duplicates, short reads, and reads with high rates of ambiguous sequences. An additional screening for and splitting of fragment-to-fragment concatenations that gave rise to artificial concatenated sequences can increase the quality of the dataset. Users may modify the different filter parameters according to their own preferences. Conclusions TagCleaner is a publicly available web application that is able to automatically detect and efficiently remove tag sequences from metagenomic datasets. It is easily configurable and provides a user-friendly interface. The interactive web interface facilitates export functionality for subsequent data processing, and is available at http://edwards.sdsu.edu/tagcleaner. PMID:20573248

  18. Expressivity tag: a novel tool for increased expression in Escherichia coli.

    PubMed

    Hansted, Jon Gade; Pietikäinen, Laura; Hög, Friederike; Sperling-Petersen, Hans Uffe; Mortensen, Kim Kusk

    2011-09-20

    Protein expression in Escherichia coli is rarely trivial as low expression and insolubility are common problems. In this work we define a fusion partner, which increases expression levels similarly to the distinct function of solubility and affinity tags. This type of fusion tag we term an expressivity tag. Our work is based on earlier observations where 3' deletions of the InfB gene displays strongly increased expression levels. We have constructed progressively shortened fragments of the InfB(1-471) gene and fused gene fragments to a gfp reporter gene. A 5-fold increase in GFP expression was seen for an optimal 21 nucleotide InfB(1-21) sequence compared to gfp independently. We defined the InfB(1-21) sequence as an expressivity tag. The tag was tested for improved expression of two biotechnological important proteins streptavidin and a single chain antibody (scFv). Expression of both streptavidin and scFv(L32) was improved as evaluated by SDS-PAGE. Calculation of folding energies in the translation initiation region gave higher free energies for gfp, L32 and streptavidin when linked to InfB(1-21) than independently. InfB(1-21) did however not improve the codon usage or codon adaptation index. The expressivity tag is an important addition to the box of tools available for optimizing heterologous protein expression. PMID:21801766

  19. Expression of the Arabidopsis transposable element Tag1 is targeted to developing gametophytes.

    PubMed

    Galli, Mary; Theriault, Angie; Liu, Dong; Crawford, Nigel M

    2003-12-01

    The Arabidopsis transposon Tag1 undergoes late excision during vegetative and germinal development in plants containing 35S-Tag1-GUS constructs. To determine if transcriptional regulation can account for the developmental control of Tag1 excision, the transcriptional activity of Tag1 promoter-GUS fusion constructs of various lengths was examined in transgenic plants. All constructs showed expression in the reproductive organs of developing flowers but no expression in leaves. Expression was restricted to developing gametophytes in both male and female lineages. Quantitative RT-PCR analysis confirmed that Tag1 expression predominates in the reproductive organs of flower buds. These results are consistent with late germinal excision of Tag1, but they cannot explain the vegetative excision activity of Tag1 observed with 35S-Tag1-GUS constructs. To resolve this issue, Tag1 excision was reexamined using elements with no adjacent 35S promoter sequences. Tag1 excision in this context is restricted to germinal events with no detectable vegetative excision. If a 35S enhancer sequence is placed next to Tag1, vegetative excision is restored. These results indicate that the intrinsic activity of Tag1 is restricted to germinal excision due to targeted expression of the Tag1 transposase to developing gametophytes and that this activity is altered by the presence of adjacent enhancers or promoters. PMID:14704189

  20. An Expressed Sequence Tag (EST)-enriched genetic map of turbot (Scophthalmus maximus): a useful framework for comparative genomics across model and farmed teleosts

    PubMed Central

    2012-01-01

    Background The turbot (Scophthalmus maximus) is a relevant species in European aquaculture. The small turbot genome provides a source for genomics strategies to use in order to understand the genetic basis of productive traits, particularly those related to sex, growth and pathogen resistance. Genetic maps represent essential genomic screening tools allowing to localize quantitative trait loci (QTL) and to identify candidate genes through comparative mapping. This information is the backbone to develop marker-assisted selection (MAS) programs in aquaculture. Expressed sequenced tag (EST) resources have largely increased in turbot, thus supplying numerous type I markers suitable for extending the previous linkage map, which was mostly based on anonymous loci. The aim of this study was to construct a higher-resolution turbot genetic map using EST-linked markers, which will turn out to be useful for comparative mapping studies. Results A consensus gene-enriched genetic map of the turbot was constructed using 463 SNP and microsatellite markers in nine reference families. This map contains 438 markers, 180 EST-linked, clustered at 24 linkage groups. Linkage and comparative genomics evidences suggested additional linkage group fusions toward the consolidation of turbot map according to karyotype information. The linkage map showed a total length of 1402.7 cM with low average intermarker distance (3.7 cM; ~2 Mb). A global 1.6:1 female-to-male recombination frequency (RF) ratio was observed, although largely variable among linkage groups and chromosome regions. Comparative sequence analysis revealed large macrosyntenic patterns against model teleost genomes, significant hits decreasing from stickleback (54%) to zebrafish (20%). Comparative mapping supported particular chromosome rearrangements within Acanthopterygii and aided to assign unallocated markers to specific turbot linkage groups. Conclusions The new gene-enriched high-resolution turbot map represents a

  1. Degradation of C-terminal tag sequences on domain antibodies purified from E. coli supernatant.

    PubMed

    Lykkemark, Simon; Mandrup, Ole Aalund; Friis, Niels Anton; Kristensen, Peter

    2014-01-01

    Expression of recombinant proteins often takes advantage of peptide tags expressed in fusion to allow easy detection and purification of the expressed proteins. However, as the fusion peptides most often are flexible appendages at the N- or C-terminal, proteolytic cleavage may result in removal of the tag sequence. Here, we evaluated the functionality and stability of 14 different combinations of commonly used tags for purification and detection of recombinant antibody fragments. The tag sequences were inserted in fusion with the c-terminal end of a domain antibody based on the HEL4 scaffold in a phagemid vector. This particular antibody fragment was able to refold on the membrane after blotting, allowing us to detect c-terminal tag breakdown by use of protein A in combination with detection of the tags in the specific constructs. The degradation of the c-terminal tags suggested specific sites to be particularly prone to proteolytic cleavage, leaving some of the tag combinations partially or completely degraded. This specific work illustrates the importance of tag design with regard to recombinant antibody expression in E. coli, but also aids the more general understanding of protein expression. PMID:25426869

  2. Degradation of C-terminal tag sequences on domain antibodies purified from E. coli supernatant

    PubMed Central

    Lykkemark, Simon; Mandrup, Ole Aalund; Friis, Niels Anton; Kristensen, Peter

    2014-01-01

    Expression of recombinant proteins often takes advantage of peptide tags expressed in fusion to allow easy detection and purification of the expressed proteins. However, as the fusion peptides most often are flexible appendages at the N- or C-terminal, proteolytic cleavage may result in removal of the tag sequence. Here, we evaluated the functionality and stability of 14 different combinations of commonly used tags for purification and detection of recombinant antibody fragments. The tag sequences were inserted in fusion with the c-terminal end of a domain antibody based on the HEL4 scaffold in a phagemid vector. This particular antibody fragment was able to refold on the membrane after blotting, allowing us to detect c-terminal tag breakdown by use of protein A in combination with detection of the tags in the specific constructs. The degradation of the c-terminal tags suggested specific sites to be particularly prone to proteolytic cleavage, leaving some of the tag combinations partially or completely degraded. This specific work illustrates the importance of tag design with regard to recombinant antibody expression in E. coli, but also aids the more general understanding of protein expression. PMID:25426869

  3. Batch submission and release of 14,328 EST (expressed sequence tag) nucleotide sequences of developing peanut seeds of R5, R6, and R7 reproduction stages

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Although peanut is an economically and nutritionally important crop in the U.S., it is virtually unexplored at the genomic level. Peanut has a relatively large genome size (2800 Mb/1C). A database of ESTs, the expressed portion of the peanut genome, will facilitate gene discovery, marker development...

  4. Discovery and mapping of a new expressed sequence tag-single nucleotide polymorphism and simple sequence repeat panel for large-scale genetic studies and breeding of Theobroma cacao L.

    PubMed Central

    Allegre, Mathilde; Argout, Xavier; Boccara, Michel; Fouet, Olivier; Roguet, Yolande; Bérard, Aurélie; Thévenin, Jean Marc; Chauveau, Aurélie; Rivallan, Ronan; Clement, Didier; Courtois, Brigitte; Gramacho, Karina; Boland-Augé, Anne; Tahi, Mathias; Umaharan, Pathmanathan; Brunel, Dominique; Lanaud, Claire

    2012-01-01

    Theobroma cacao is an economically important tree of several tropical countries. Its genetic improvement is essential to provide protection against major diseases and improve chocolate quality. We discovered and mapped new expressed sequence tag-single nucleotide polymorphism (EST-SNP) and simple sequence repeat (SSR) markers and constructed a high-density genetic map. By screening 149 650 ESTs, 5246 SNPs were detected in silico, of which 1536 corresponded to genes with a putative function, while 851 had a clear polymorphic pattern across a collection of genetic resources. In addition, 409 new SSR markers were detected on the Criollo genome. Lastly, 681 new EST-SNPs and 163 new SSRs were added to the pre-existing 418 co-dominant markers to construct a large consensus genetic map. This high-density map and the set of new genetic markers identified in this study are a milestone in cocoa genomics and for marker-assisted breeding. The data are available at http://tropgenedb.cirad.fr. PMID:22210604

  5. Expressed sequence tag survey of gene expression in the scab mite Psoroptes ovis--allergens, proteases and free-radical scavengers.

    PubMed

    Kenyon, F; Welsh, M; Parkinson, J; Whitton, C; Blaxter, M L; Knox, D P

    2003-05-01

    Psoroptes ovis, the causative agent of sheep scab, is an important ectoparasitic mite infecting sheep, goats and cattle. Infection is characterized by an extensive dermatitis, scab formation and intense itching. Initial focal lesions spread outwards, coalesce and may extend over the whole body. The host response to infestation has all the characteristics of an immediate-type hypersensitivity reaction but the mite antigens and allergens which initiate this response are almost completely undefined. Here, 507 randomly selected cDNAs derived from a mixed population of P. ovis were sequenced and the resultant nucleotide sequences subjected to Cluster analysis and Blast searches. This analysis yielded 280 clusters of which 49 had > 1 sequence with 24 showing significant Blast X homology to another protein in the databases. There were 231 sequences which appeared on one occasion and 109 of these showed significant Blast X homology to other sequences in the databases. This analysis identified homologues of 9 different types of allergens which have been characterized in other allergic conditions such as responses to house dust mites. It also identified a number of cysteine proteases which may contribute to lesion development as well as several free-radical scavenging enzymes which may protect the mite from host immune effector responses. PMID:12793649

  6. Analysis of expressed sequence tags (ESTs) from avocado seed (Persea americana var. drymifolia) reveals abundant expression of the gene encoding the antimicrobial peptide snakin.

    PubMed

    Guzmán-Rodríguez, Jaquelina J; Ibarra-Laclette, Enrique; Herrera-Estrella, Luis; Ochoa-Zarzosa, Alejandra; Suárez-Rodríguez, Luis María; Rodríguez-Zapata, Luis C; Salgado-Garciglia, Rafael; Jimenez-Moraila, Beatriz; López-Meza, Joel E; López-Gómez, Rodolfo

    2013-09-01

    Avocado is one of the most important fruits in the world. Avocado "native mexicano" (Persea americana var. drymifolia) seeds are widely used in the propagation of this plant and are the primary source of rootstocks globally for a variety of avocado cultivars, such as the Hass avocado. Here, we report the isolation of 5005 ESTs from the 5' ends of P. americana var. drymifolia seed cDNA clones representing 1584 possible unigenes. These avocado seed ESTs were compared with the avocado flower EST library, and we detected several genes that are expressed either in both tissues or only in the seed. The snakin gene, which encodes an element of the innate immune response in plants, was one of those most frequently found among the seed ESTs, and this suggests that it is abundantly expressed in the avocado seed. We expressed the snakin gene in a heterologous system, namely the bovine endothelial cell line BVE-E6E7. Conditioned media from transfected BVE-E6E7 cells showed antimicrobial activity against strains of Escherichia coli and Staphylococcus aureus. This is the first study of the function of the snakin gene in plant seed tissue, and our observations suggest that this gene might play a protective role in the avocado seed. PMID:23811120

  7. Development of peanut expessed sequence tag-based genomic resources and tools

    Technology Transfer Automated Retrieval System (TEKTRAN)

    U.S. Peanut Genome Initiative (PGI) has widely recognized the need for peanut genome tools and resources development for mitigating peanut allergens and food safety. Genomics such as Expressed Sequence Tag (EST), microarray technologies, and whole genome sequencing provides robotic tools for profili...

  8. Construction of a full-length enriched cDNA library and preliminary analysis of expressed sequence tags from Bengal Tiger Panthera tigris tigris.

    PubMed

    Liu, Changqing; Liu, Dan; Guo, Yu; Lu, Taofeng; Li, Xiangchen; Zhang, Minghai; Ma, Jianzhang; Ma, Yuehui; Guan, Weijun

    2013-01-01

    In this study, a full-length enriched cDNA library was successfully constructed from Bengal tiger, Panthera tigris tigris, the most well-known wild Animal. Total RNA was extracted from cultured Bengal tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.28 × 106 pfu/mL and 1.56 × 109 pfu/mL respectively. The percentage of recombinants from unamplified library was 90.2% and average length of exogenous inserts was 0.98 kb. A total of 212 individual ESTs with sizes ranging from 356 to 1108 bps were then analyzed. The BLASTX score revealed that 48.1% of the sequences were classified as a strong match, 45.3% as nominal and 6.6% as a weak match. Among the ESTs with known putative function, 26.4% ESTs were found to be related to all kinds of metabolisms, 19.3% ESTs to information storage and processing, 11.3% ESTs to posttranslational modification, protein turnover, chaperones, 11.3% ESTs to transport, 9.9% ESTs to signal transducer/cell communication, 9.0% ESTs to structure protein, 3.8% ESTs to cell cycle, and only 6.6% ESTs classified as novel genes. By EST sequencing, a full-length gene coding ferritin was identified and characterized. The recombinant plasmid pET32a-TAT-Ferritin was constructed, coded for the TAT-Ferritin fusion protein with two 6× His-tags in N and C-terminal. After BCA assay, the concentration of soluble Trx-TAT-Ferritin recombinant protein was 2.32 ± 0.12 mg/mL. These results demonstrated that the reliability and representativeness of the cDNA library attained to the requirements of a standard cDNA library. This library provided a useful platform for the functional genome and transcriptome research of Bengal tigers. PMID:23708105

  9. Construction of a Full-Length Enriched cDNA Library and Preliminary Analysis of Expressed Sequence Tags from Bengal Tiger Panthera tigris tigris

    PubMed Central

    Liu, Changqing; Liu, Dan; Guo, Yu; Lu, Taofeng; Li, Xiangchen; Zhang, Minghai; Ma, Jianzhang; Ma, Yuehui; Guan, Weijun

    2013-01-01

    In this study, a full-length enriched cDNA library was successfully constructed from Bengal tiger, Panthera tigris tigris, the most well-known wild Animal. Total RNA was extracted from cultured Bengal tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.28 × 106 pfu/mL and 1.56 × 109 pfu/mL respectively. The percentage of recombinants from unamplified library was 90.2% and average length of exogenous inserts was 0.98 kb. A total of 212 individual ESTs with sizes ranging from 356 to 1108 bps were then analyzed. The BLASTX score revealed that 48.1% of the sequences were classified as a strong match, 45.3% as nominal and 6.6% as a weak match. Among the ESTs with known putative function, 26.4% ESTs were found to be related to all kinds of metabolisms, 19.3% ESTs to information storage and processing, 11.3% ESTs to posttranslational modification, protein turnover, chaperones, 11.3% ESTs to transport, 9.9% ESTs to signal transducer/cell communication, 9.0% ESTs to structure protein, 3.8% ESTs to cell cycle, and only 6.6% ESTs classified as novel genes. By EST sequencing, a full-length gene coding ferritin was identified and characterized. The recombinant plasmid pET32a-TAT-Ferritin was constructed, coded for the TAT-Ferritin fusion protein with two 6× His-tags in N and C-terminal. After BCA assay, the concentration of soluble Trx-TAT-Ferritin recombinant protein was 2.32 ± 0.12 mg/mL. These results demonstrated that the reliability and representativeness of the cDNA library attained to the requirements of a standard cDNA library. This library provided a useful platform for the functional genome and transcriptome research of Bengal tigers. PMID:23708105

  10. OligoTag: a program for designing sets of tags for next-generation sequencing of multiplexed samples.

    PubMed

    Coissac, Eric

    2012-01-01

    Next-generation sequencing systems allow high-throughput production of DNA sequence data. But this technology is more adapted for analyzing a small number of samples needing a huge amount of sequences rather than a large number of samples needing a small number of sequences. One solution to this problem is sample multiplexing. To achieve this, one can add a small tag at the extremities of the sequenced DNA molecules. These tags will be identified using bioinformatics tools after the sequencing step to sort sequences among samples. The rules to apply for selecting a good set of tags adapted to each situation are described in this chapter. Depending on the number of samples to tag and on the required quality of assignation, different solutions are possible. The software oligoTag, a part of OBITools that computes these sets of tags, is presented with some example sets of tags. PMID:22665273

  11. Identification of reproduction-related genes and SSR-markers through expressed sequence tags analysis of a monsoon breeding carp rohu, Labeo rohita (Hamilton).

    PubMed

    Sahu, Dinesh K; Panda, Soumya P; Panda, Sujata; Das, Paramananda; Meher, Prem K; Hazra, Rupenangshu K; Peatman, Eric; Liu, Zhanjiang J; Eknath, Ambekar E; Nandi, Samiran

    2013-07-15

    Labeo rohita (Ham.) also called rohu is the most important freshwater aquaculture species on the Indian sub continent. Monsoon dependent breeding restricts its seed production beyond season indicating a strong genetic control about which very limited information is available. Additionally, few genomic resources are publicly available for this species. Here we sought to identify reproduction-relevant genes from normalized cDNA libraries of the brain-pituitary-gonad-liver (BPGL-axis) tissues of adult L. rohita collected during post preparatory phase. 6161 random clones sequenced (Sanger-based) from these libraries produced 4642 (75.34%) high-quality sequences. They were assembled into 3631 (78.22%) unique sequences composed of 709 contigs and 2922 singletons. A total of 182 unique sequences were found to be associated with reproduction-related genes, mainly under the GO term categories of reproduction, neuro-peptide hormone activity, hormone and receptor binding, receptor activity, signal transduction, embryonic development, cell-cell signaling, cell death and anti-apoptosis process. Several important reproduction-related genes reported here for the first time in L. rohita are zona pellucida sperm-binding protein 3, aquaporin-12, spermine oxidase, sperm associated antigen 7, testis expressed 261, progesterone receptor membrane component, Neuropeptide Y and Pro-opiomelanocortin. Quantitative RT-PCR-based analyses of 8 known and 8 unknown transcripts during preparatory and post-spawning phase showed increased expression level of most of the transcripts during preparatory phase (except Neuropeptide Y) in comparison to post-spawning phase indicating possible roles in initiation of gonad maturation. Expression of unknown transcripts was also found in prolific breeder common carp and tilapia, but levels of expression were much higher in seasonal breeder rohu. 3631 unique sequences contained 236 (6.49%) putative microsatellites with the AG (28.16%) repeat as the most

  12. Arabidopsis Genes Involved in Acyl Lipid Metabolism. A 2003 Census of the Candidates, a Study of the Distribution of Expressed Sequence Tags in Organs, and a Web-Based Database1

    PubMed Central

    Beisson, Frédéric; Koo, Abraham J.K.; Ruuska, Sari; Schwender, Jörg; Pollard, Mike; Thelen, Jay J.; Paddock, Troy; Salas, Joaquín J.; Savage, Linda; Milcamps, Anne; Mhaske, Vandana B.; Cho, Younghee; Ohlrogge, John B.

    2003-01-01

    The genome of Arabidopsis has been searched for sequences of genes involved in acyl lipid metabolism. Over 600 encoded proteins have been identified, cataloged, and classified according to predicted function, subcellular location, and alternative splicing. At least one-third of these proteins were previously annotated as “unknown function” or with functions unrelated to acyl lipid metabolism; therefore, this study has improved the annotation of over 200 genes. In particular, annotation of the lipolytic enzyme group (at least 110 members total) has been improved by the critical examination of the biochemical literature and the sequences of the numerous proteins annotated as “lipases.” In addition, expressed sequence tag (EST) data have been surveyed, and more than 3,700 ESTs associated with the genes were cataloged. Statistical analysis of the number of ESTs associated with specific cDNA libraries has allowed calculation of probabilities of differential expression between different organs. More than 130 genes have been identified with a statistical probability > 0.95 of preferential expression in seed, leaf, root, or flower. All the data are available as a Web-based database, the Arabidopsis Lipid Gene database (http://www.plantbiology.msu.edu/lipids/genesurvey/index.htm). The combination of the data of the Lipid Gene Catalog and the EST analysis can be used to gain insights into differential expression of gene family members and sets of pathway-specific genes, which in turn will guide studies to understand specific functions of individual genes. PMID:12805597

  13. Analysis of Expressed Sequence Tags from Chinese Bayberry Fruit (Myrica rubra Sieb. and Zucc.) at Different Ripening Stages and Their Association with Fruit Quality Development

    PubMed Central

    Zhu, Changqing; Feng, Chao; Li, Xian; Xu, Changjie; Sun, Chongde; Chen, Kunsong

    2013-01-01

    A total of 2000 EST sequences were produced from cDNA libraries generated from Chinese bayberry fruit (Myrica rubra Sieb. and Zucc. cv. “Biqi”) at four different ripening stages. After cluster and assembly analysis of the datasets by UniProt, 395 unigenes were identified, and their presumed functions were assigned to 14 putative cellular roles. Furthermore, a sequence BLAST was done for the top ten highly expressed genes in the ESTs, and genes associated with disease/defense and anthocyanin accumulation were analyzed. Gene-encoding elements associated with ethylene biosynthesis and signal transductions, in addition to other senescence-regulating proteins, as well as those associated with quality formation during fruit ripening, were also identified. Their possible roles were subsequently discussed. PMID:23377019

  14. Multiple tag labeling method for DNA sequencing

    DOEpatents

    Mathies, R.A.; Huang, X.C.; Quesada, M.A.

    1995-07-25

    A DNA sequencing method is described which uses single lane or channel electrophoresis. Sequencing fragments are separated in the lane and detected using a laser-excited, confocal fluorescence scanner. Each set of DNA sequencing fragments is separated in the same lane and then distinguished using a binary coding scheme employing only two different fluorescent labels. Also described is a method of using radioisotope labels. 5 figs.

  15. Multiple tag labeling method for DNA sequencing

    DOEpatents

    Mathies, Richard A.; Huang, Xiaohua C.; Quesada, Mark A.

    1995-01-01

    A DNA sequencing method described which uses single lane or channel electrophoresis. Sequencing fragments are separated in said lane and detected using a laser-excited, confocal fluorescence scanner. Each set of DNA sequencing fragments is separated in the same lane and then distinguished using a binary coding scheme employing only two different fluorescent labels. Also described is a method of using radio-isotope labels.

  16. Assembly of 500,000 inter-specific catfish expressed sequence tags and large scale gene-associated marker development for whole genome association studies

    SciTech Connect

    Catfish Genome Consortium; Wang, Shaolin; Peatman, Eric; Abernathy, Jason; Waldbieser, Geoff; Lindquist, Erika; Richardson, Paul; Lucas, Susan; Wang, Mei; Li, Ping; Thimmapuram, Jyothi; Liu, Lei; Vullaganti, Deepika; Kucuktas, Huseyin; Murdock, Christopher; Small, Brian C; Wilson, Melanie; Liu, Hong; Jiang, Yanliang; Lee, Yoona; Chen, Fei; Lu, Jianguo; Wang, Wenqi; Xu, Peng; Somridhivej, Benjaporn; Baoprasertkul, Puttharat; Quilang, Jonas; Sha, Zhenxia; Bao, Baolong; Wang, Yaping; Wang, Qun; Takano, Tomokazu; Nandi, Samiran; Liu, Shikai; Wong, Lilian; Kaltenboeck, Ludmilla; Quiniou, Sylvie; Bengten, Eva; Miller, Norman; Trant, John; Rokhsar, Daniel; Liu, Zhanjiang

    2010-03-23

    Background-Through the Community Sequencing Program, a catfish EST sequencing project was carried out through a collaboration between the catfish research community and the Department of Energy's Joint Genome Institute. Prior to this project, only a limited EST resource from catfish was available for the purpose of SNP identification. Results-A total of 438,321 quality ESTs were generated from 8 channel catfish (Ictalurus punctatus) and 4 blue catfish (Ictalurus furcatus) libraries, bringing the number of catfish ESTs to nearly 500,000. Assembly of all catfish ESTs resulted in 45,306 contigs and 66,272 singletons. Over 35percent of the unique sequences had significant similarities to known genes, allowing the identification of 14,776 unique genes in catfish. Over 300,000 putative SNPs have been identified, of which approximately 48,000 are high-quality SNPs identified from contigs with at least four sequences and the minor allele presence of at least two sequences in the contig. The EST resource should be valuable for identification of microsatellites, genome annotation, large-scale expression analysis, and comparative genome analysis. Conclusions-This project generated a large EST resource for catfish that captured the majority of the catfish transcriptome. The parallel analysis of ESTs from two closely related Ictalurid catfishes should also provide powerful means for the evaluation of ancient and recent gene duplications, and for the development of high-density microarrays in catfish. The inter- and intra-specific SNPs identified from all catfish EST dataset assembly will greatly benefit the catfish introgression breeding program and whole genome association studies.

  17. Assembly of 500,000 inter-specific catfish expressed sequence tags and large scale gene-associated marker development for whole genome association studies

    PubMed Central

    2010-01-01

    Background Through the Community Sequencing Program, a catfish EST sequencing project was carried out through a collaboration between the catfish research community and the Department of Energy's Joint Genome Institute. Prior to this project, only a limited EST resource from catfish was available for the purpose of SNP identification. Results A total of 438,321 quality ESTs were generated from 8 channel catfish (Ictalurus punctatus) and 4 blue catfish (Ictalurus furcatus) libraries, bringing the number of catfish ESTs to nearly 500,000. Assembly of all catfish ESTs resulted in 45,306 contigs and 66,272 singletons. Over 35% of the unique sequences had significant similarities to known genes, allowing the identification of 14,776 unique genes in catfish. Over 300,000 putative SNPs have been identified, of which approximately 48,000 are high-quality SNPs identified from contigs with at least four sequences and the minor allele presence of at least two sequences in the contig. The EST resource should be valuable for identification of microsatellites, genome annotation, large-scale expression analysis, and comparative genome analysis. Conclusions This project generated a large EST resource for catfish that captured the majority of the catfish transcriptome. The parallel analysis of ESTs from two closely related Ictalurid catfishes should also provide powerful means for the evaluation of ancient and recent gene duplications, and for the development of high-density microarrays in catfish. The inter- and intra-specific SNPs identified from all catfish EST dataset assembly will greatly benefit the catfish introgression breeding program and whole genome association studies. PMID:20096101

  18. Comparative analyses of genotype dependent expressed sequence tags and stress-responsive transcriptome of chickpea wilt illustrate predicted and unexpected genes and novel regulators of plant immunity

    PubMed Central

    Ashraf, Nasheeman; Ghai, Deepali; Barman, Pranjan; Basu, Swaraj; Gangisetty, Nagaraju; Mandal, Mihir K; Chakraborty, Niranjan; Datta, Asis; Chakraborty, Subhra

    2009-01-01

    Background The ultimate phenome of any organism is modulated by regulated transcription of many genes. Characterization of genetic makeup is thus crucial for understanding the molecular basis of phenotypic diversity, evolution and response to intra- and extra-cellular stimuli. Chickpea is the world's third most important food legume grown in over 40 countries representing all the continents. Despite its importance in plant evolution, role in human nutrition and stress adaptation, very little ESTs and differential transcriptome data is available, let alone genotype-specific gene signatures. Present study focuses on Fusarium wilt responsive gene expression in chickpea. Results We report 6272 gene sequences of immune-response pathway that would provide genotype-dependent spatial information on the presence and relative abundance of each gene. The sequence assembly led to the identification of a CaUnigene set of 2013 transcripts comprising of 973 contigs and 1040 singletons, two-third of which represent new chickpea genes hitherto undiscovered. We identified 209 gene families and 262 genotype-specific SNPs. Further, several novel transcription regulators were identified indicating their possible role in immune response. The transcriptomic analysis revealed 649 non-cannonical genes besides many unexpected candidates with known biochemical functions, which have never been associated with pathostress-responsive transcriptome. Conclusion Our study establishes a comprehensive catalogue of the immune-responsive root transcriptome with insight into their identity and function. The development, detailed analysis of CaEST datasets and global gene expression by microarray provide new insight into the commonality and diversity of organ-specific immune-responsive transcript signatures and their regulated expression shaping the species specificity at genotype level. This is the first report on differential transcriptome of an unsequenced genome during vascular wilt. PMID:19732460

  19. A new idea for simple and rapid monitoring of gene expression: requirement of nucleotide sequences encoding an N-terminal HA tag in the T7 promoter-driven expression in E. coli.

    PubMed

    Moon, Jeong-Mi; Kim, Goo-Young; Rhim, Hyangshuk

    2012-10-01

    Mammalian expression vectors are used to overexpress genes of interest in mammalian cells. High temperature requirement protein A1 (HtrA1), used as a specific target, was expressed from the pHA-M-HtrA1 plasmid in HEK293T cells, inducing cell death. Expression of HtrA1 was driven by the pHA-M-HtrA1 mammalian expression vector in E. coli resulting in growth suppression of E. coli in an HtrA1 serine protease-dependent manner. By using various combinations of promoters, target genes and N-terminal tags, the T7 promoter and N-terminal HA tag in the mammalian expression vector were shown to be responsible for expression of target genes in E. coli. Thus the pHA-M-HtrA1 plasmid can be used as a novel, rapid pre-test system for expression and cytotoxicity of the specific target gene in E. coli before assessing its functions in mammalian cells. PMID:22714269

  20. Analysis of common bean expressed sequence tags identifies sulfur metabolic pathways active in seed and sulfur-rich proteins highly expressed in the absence of phaseolin and major lectins

    PubMed Central

    2011-01-01

    Background A deficiency in phaseolin and phytohemagglutinin is associated with a near doubling of sulfur amino acid content in genetically related lines of common bean (Phaseolus vulgaris), particularly cysteine, elevated by 70%, and methionine, elevated by 10%. This mostly takes place at the expense of an abundant non-protein amino acid, S-methyl-cysteine. The deficiency in phaseolin and phytohemagglutinin is mainly compensated by increased levels of the 11S globulin legumin and residual lectins. Legumin, albumin-2, defensin and albumin-1 were previously identified as contributing to the increased sulfur amino acid content in the mutant line, on the basis of similarity to proteins from other legumes. Results Profiling of free amino acid in developing seeds of the BAT93 reference genotype revealed a biphasic accumulation of gamma-glutamyl-S-methyl-cysteine, the main soluble form of S-methyl-cysteine, with a lag phase occurring during storage protein accumulation. A collection of 30,147 expressed sequence tags (ESTs) was generated from four developmental stages, corresponding to distinct phases of gamma-glutamyl-S-methyl-cysteine accumulation, and covering the transitions to reserve accumulation and dessication. Analysis of gene ontology categories indicated the occurrence of multiple sulfur metabolic pathways, including all enzymatic activities responsible for sulfate assimilation, de novo cysteine and methionine biosynthesis. Integration of genomic and proteomic data enabled the identification and isolation of cDNAs coding for legumin, albumin-2, defensin D1 and albumin-1A and -B induced in the absence of phaseolin and phytohemagglutinin. Their deduced amino acid sequences have a higher content of cysteine than methionine, providing an explanation for the preferential increase of cysteine in the mutant line. Conclusion The EST collection provides a foundation to further investigate sulfur metabolism and the differential accumulation of sulfur amino acids in seed

  1. Expression of Epitope-Tagged Proteins in Mammalian Cells in Culture.

    PubMed

    Bhatt, Jay M; Styers, Melanie L; Sztul, Elizabeth

    2016-01-01

    Before the advent of molecular methods to tag proteins, visualization of proteins within cells required the use of antibodies directed against the protein of interest. Thus, only proteins for which antibodies were available could be visualized. Epitope tagging allows the detection of all proteins with existing sequence information, irrespective of the availability of antibodies directed against them. This technique involves the generation of DNA constructs that express the protein of interest tagged with an epitope that can be recognized by a commercially available antibody. Proteins can be tagged with a wide variety of epitopes using commercially available vectors that allow expression in mammalian cells. Epitope-tagged proteins are easily transfected into mammalian cell lines and, in most cases, tightly mimic the behavior of the endogenous protein. Tagged proteins exogenously expressed in cells provide different types of information depending on the subsequent detection approaches. Using immunofluorescence and immunoelectron microscopy with anti-tag antibodies, relative to known markers of cellular organelles, can provide information on the subcellular localization of the tagged protein and may provide clues regarding the protein's function. Immunofluorescence with anti-tag antibodies can also be utilized to assess the tagged protein's responses to cellular signals and pharmacological treatments. Immunoprecipitations with anti-tag antibodies can recover protein complexes containing the protein of interest, resulting in the identification of interacting proteins. Recovery of tagged proteins on affinity matrices allows their purification for use in biochemical assays. In addition, specialized fluorescent tags, such as the green fluorescent protein (GFP) allow the analysis of cellular dynamics in live cells in real time. PMID:27515071

  2. Characterization of expressed sequence tags (ESTs) of pigeonpea (Cajanus cajan L.) and functional validation of selected genes for abiotic stress tolerance in Arabidopsis thaliana.

    PubMed

    Priyanka, B; Sekhar, K; Sunita, T; Reddy, V D; Rao, Khareedu Venkateswara

    2010-03-01

    Pigeonpea, a major grain legume crop with remarkable drought tolerance traits, has been used for the isolation of stress-responsive genes. Herein, we report generation of ESTs, transcript profiles of selected genes and validation of candidate genes obtained from the subtracted cDNA libraries of pigeonpea plants subjected to PEG/water-deficit stress conditions. Cluster analysis of 124 selected ESTs yielded 75 high-quality ESTs. Homology searches disclosed that 55 ESTs share significant similarity with the known/putative proteins or ESTs available in the databases. These ESTs were characterized and genes relevant to the specific physiological processes were identified. Of the 75 ESTs obtained from the cDNA libraries of drought-stressed plants, 20 ESTs proved to be unique to the pigeonpea. These sequences are envisaged to serve as a potential source of stress-inducible genes of the drought stress-response transcriptome, and hence may be used for deciphering the mechanism of drought tolerance of the pigeonpea. Expression profiles of selected genes revealed increased levels of m-RNA transcripts in pigeonpea plants subjected to different abiotic stresses. Transgenic Arabidopsis lines, expressing Cajanus cajan hybrid-proline-rich protein (CcHyPRP), C. cajan cyclophilin (CcCYP) and C. cajan cold and drought regulatory (CcCDR) genes, exhibited marked tolerance, increased plant biomass and enhanced photosynthetic rates under PEG/NaCl/cold/heat stress conditions. This study represents the first report dealing with the isolation of drought-specific ESTs, transcriptome analysis and functional validation of drought-responsive genes of the pigeonpea. These genes, as such, hold promise for engineering crop plants bestowed with tolerance to major abiotic stresses. PMID:20131066

  3. Multiplexed genotyping with sequence-tagged molecular inversion probes.

    PubMed

    Hardenbol, Paul; Banér, Johan; Jain, Maneesh; Nilsson, Mats; Namsaraev, Eugeni A; Karlin-Neumann, George A; Fakhrai-Rad, Hossein; Ronaghi, Mostafa; Willis, Thomas D; Landegren, Ulf; Davis, Ronald W

    2003-06-01

    We report on the development of molecular inversion probe (MIP) genotyping, an efficient technology for large-scale single nucleotide polymorphism (SNP) analysis. This technique uses MIPs to produce inverted sequences, which undergo a unimolecular rearrangement and are then amplified by PCR using common primers and analyzed using universal sequence tag DNA microarrays, resulting in highly specific genotyping. With this technology, multiplex analysis of more than 1,000 probes in a single tube can be done using standard laboratory equipment. Genotypes are generated with a high call rate (95%) and high accuracy (>99%) as determined by independent sequencing. PMID:12730666

  4. Assembly of 500,000 Inter-Specific Catfish Expressed Sequence Tags and Large Scale Gene-Associated Marker Development for Whole Genome Association Studies

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Through the Community Sequencing Program, a catfish EST sequencing project was carried out through a collaboration between the catfish research community and the Department of Energy's Joint Genome Institute. Prior to this project, only a limited EST resource from catfish was available for the purpo...

  5. Tag jumps illuminated--reducing sequence-to-sample misidentifications in metabarcoding studies.

    PubMed

    Schnell, Ida Baerholm; Bohmann, Kristine; Gilbert, M Thomas P

    2015-11-01

    Metabarcoding of environmental samples on second-generation sequencing platforms has rapidly become a valuable tool for ecological studies. A fundamental assumption of this approach is the reliance on being able to track tagged amplicons back to the samples from which they originated. In this study, we address the problem of sequences in metabarcoding sequencing outputs with false combinations of used tags (tag jumps). Unless these sequences can be identified and excluded from downstream analyses, tag jumps creating sequences with false, but already used tag combinations, can cause incorrect assignment of sequences to samples and artificially inflate diversity. In this study, we document and investigate tag jumping in metabarcoding studies on Illumina sequencing platforms by amplifying mixed-template extracts obtained from bat droppings and leech gut contents with tagged generic arthropod and mammal primers, respectively. We found that an average of 2.6% and 2.1% of sequences had tag combinations, which could be explained by tag jumping in the leech and bat diet study, respectively. We suggest that tag jumping can happen during blunt-ending of pools of tagged amplicons during library build and as a consequence of chimera formation during bulk amplification of tagged amplicons during library index PCR. We argue that tag jumping and contamination between libraries represents a considerable challenge for Illumina-based metabarcoding studies, and suggest measures to avoid false assignment of tag jumping-derived sequences to samples. PMID:25740652

  6. Express Sequence Tag Analysis - Identification of Anseriformes Trypsin Genes from Full-Length cDNA Library of the Duck (Anas platyrhynchos) and Characterization of Their Structure and Function.

    PubMed

    Yu, Haining; Cai, Shasha; Gao, Jiuxiang; Wang, Chen; Qiao, Xue; Wang, Hui; Feng, Lan; Wang, Yipeng

    2016-02-01

    Trypsins are key proteins important in animal protein digestion by breaking down the peptide bonds on the carboxyl side of lysine and arginine residues, hence it has been used widely in various biotechnological processes. In the current study, a full-length cDNA library with capacity of 5·10(5) CFU/ml from the duck (Anas platyrhynchos) was constructed. Using express sequence tag (EST) sequencing, genes coding two trypsins were identified and two full-length trypsin cDNAs were then obtained by rapid-amplification of cDNA end (RACE)-PCR. Using Blast, they were classified into the trypsin I and II subfamilies, but both encoded a signal peptide, an activation peptide, and a 223-a.a. mature protein located in the C-terminus. The two deduced mature proteins were designated as trypsin-IAP and trypsin-IIAP, and their theoretical isoelectric points (pI) and molecular weights (MW) were 7.99/23466.4 Da and 4.65/24066.0 Da, respectively. Molecular characterizations of genes were further performed by detailed bioinformatics analysis. Phylogenetic analysis revealed that trypsin-IIAP has an evolution pattern distinct from trypsin-IAP, suggesting its evolutionary advantage. Then the duck trypsin-IIAP was expressed in an Escherichia coli system, and its kinetic parameters were measured. The three dimensional structures of trypsin-IAP and trypsin-IIAP were predicted by homology modeling, and the conserved residues required for functionality were identified. Two loops controlling the specificity of the trypsin and the substrate-binding pocket represented in the model are almost identical in primary sequences and backbone tertiary structures of the trypsin families. PMID:27260395

  7. Generation and analysis of a large-scale expressed sequence tags from a full-length enriched cDNA library of Siberian tiger (Panthera tigris altaica).

    PubMed

    Guo, Yu; Liu, Changqing; Lu, Taofeng; Liu, Dan; Bai, Chunyu; Li, Xiangchen; Ma, Yuehui; Guan, Weijun

    2014-05-15

    In this study, a full-length enriched cDNA library was successfully constructed from Siberian tiger, the world's most endangered species. The titers of primary and amplified libraries were 1.28×10(6)pfu/mL and 1.59×10(10)pfu/mL respectively. The proportion of recombinants from unamplified library was 91.3% and the average length of exogenous inserts was 1.06kb. A total of 279 individual ESTs with sizes ranging from 316 to 1258bps were then analyzed. Furthermore, 204 unigenes were successfully annotated and involved in 49 functions of the GO classification, cell (175, 85.5%), cellular process (165, 80.9%), and binding (152, 74.5%) are the dominant terms. 198 unigenes were assigned to 156 KEGG pathways, and the pathways with the most representation are metabolic pathways (18, 9.1%). The proportion pattern of each COG subcategory was similar among Panthera tigris altaica, P. tigris tigris and Homo sapiens, and general function prediction only cluster (44, 15.8%) represents the largest group, followed by translation, ribosomal structure and biogenesis (33, 11.8%), replication, recombination and repair (24, 8.6%), and only 7.2% ESTs classified as novel genes. Moreover, the recombinant plasmid pET32a-TAT-COL6A2 was constructed, coded for the Trx-TAT-COL6A2 fusion protein with two 6× His-tags in N and C-terminal. After BCA assay, the concentration of soluble Trx-TAT-COL6A2 recombinant protein was 2.64±0.18mg/mL. This library will provide a useful platform for the functional genome and transcriptome research of for the P. tigris and other felid animals in the future. PMID:24630959

  8. HIV-1 Quasispecies Delineation by Tag Linkage Deep Sequencing

    PubMed Central

    Wu, Nicholas C.; De La Cruz, Justin; Al-Mawsawi, Laith Q.; Olson, C. Anders; Qi, Hangfei; Luan, Harding H.; Nguyen, Nguyen; Du, Yushen; Le, Shuai; Wu, Ting-Ting; Li, Xinmin; Lewis, Martha J.; Yang, Otto O.; Sun, Ren

    2014-01-01

    Trade-offs between throughput, read length, and error rates in high-throughput sequencing limit certain applications such as monitoring viral quasispecies. Here, we describe a molecular-based tag linkage method that allows assemblage of short sequence reads into long DNA fragments. It enables haplotype phasing with high accuracy and sensitivity to interrogate individual viral sequences in a quasispecies. This approach is demonstrated to deduce ∼2000 unique 1.3 kb viral sequences from HIV-1 quasispecies in vivo and after passaging ex vivo with a detection limit of ∼0.005% to ∼0.001%. Reproducibility of the method is validated quantitatively and qualitatively by a technical replicate. This approach can improve monitoring of the genetic architecture and evolution dynamics in any quasispecies population. PMID:24842159

  9. Expressed sequence tags from larval gut of the european corn borer (Ostrinia nubilalis): exploring candidate genes potenially involved in Bacillus thuringiensis toxicity and resistance

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background: Knowledge of the genes that are expressed in the insect gut are crucial for understanding basic physiology of food digestion, their interactions with Bacillus thuringiensis (Bt) toxin and for discovering new targets for novel toxins for use in pest management. This study analyzed the ES...

  10. A sequence-tagged site map of human chromosome 11.

    PubMed

    Smith, M W; Clark, S P; Hutchinson, J S; Wei, Y H; Churukian, A C; Daniels, L B; Diggle, K L; Gen, M W; Romo, A J; Lin, Y

    1993-09-01

    We report the construction of 370 sequence-tagged sites (STSs) that are detectable by PCR amplification under sets of standardized conditions and that have been regionally mapped to human chromosome 11. DNA sequences were determined by sequencing directly from cosmid templates using primers complementary to T3 and T7 promoters present in the cloning vector. Oligonucleotide PCR primers were predicted by computer and tested using a battery of genomic DNAs. Cosmids were regionally localized on chromosome 11 by using fluorescence in situ hybridization or by analyzing a somatic cell hybrid panel. Additional STSs corresponding to known genes and markers on chromosome 11 were also produced under the same series of standardized conditions. The resulting STSs provide uniform coverage of chromosome 11 with an average spacing of 340 kb. The DNA sequence determined for use in STS production corresponds to about 0.1% (116 kb) of chromosome 11 and has been analyzed for the presence of repetitive sequences, similarities to known genes and motifs, and possible exons. Computer analysis of this sequence has identified and therefore mapped at least eight new genes on chromosome 11. PMID:8244387

  11. Identification of genes encoding Schistosoma mansoni antigens using an antigenic sequence tag strategy.

    PubMed

    Zouain, C S; Azevedo, V A; Franco, G R; Pena, S D; Goes, A M

    1998-12-01

    Another approach for the identification of genes that code for antigenic products is described using an antigenic sequence tag (AST) strategy. A Schistosoma mansoni adult worm cDNA library was screened with affinity chromatography-purified immunoglobulins from infected human sera and a mild oxidation treatment with sodium periodate. From 1 or both ends of 30 cDNA clones, 30 ASTs were obtained. Of these, 22 were previously known Sm antigens. One clone had matches with entries for other organisms in the databases and 6 had homology with Sm-expressed sequence tags (EST) entries. These clones, together with another 1 that had no significant database matches, were considered new antigenic genes in S. mansoni. The strategy proved to be efficient for the identification of genes that could be used for immunological studies and evaluation as vaccine candidates. PMID:9920341

  12. CREST--classification resources for environmental sequence tags.

    PubMed

    Lanzén, Anders; Jørgensen, Steffen L; Huson, Daniel H; Gorfer, Markus; Grindhaug, Svenn Helge; Jonassen, Inge; Øvreås, Lise; Urich, Tim

    2012-01-01

    Sequencing of taxonomic or phylogenetic markers is becoming a fast and efficient method for studying environmental microbial communities. This has resulted in a steadily growing collection of marker sequences, most notably of the small-subunit (SSU) ribosomal RNA gene, and an increased understanding of microbial phylogeny, diversity and community composition patterns. However, to utilize these large datasets together with new sequencing technologies, a reliable and flexible system for taxonomic classification is critical. We developed CREST (Classification Resources for Environmental Sequence Tags), a set of resources and tools for generating and utilizing custom taxonomies and reference datasets for classification of environmental sequences. CREST uses an alignment-based classification method with the lowest common ancestor algorithm. It also uses explicit rank similarity criteria to reduce false positives and identify novel taxa. We implemented this method in a web server, a command line tool and the graphical user interfaced program MEGAN. Further, we provide the SSU rRNA reference database and taxonomy SilvaMod, derived from the publicly available SILVA SSURef, for classification of sequences from bacteria, archaea and eukaryotes. Using cross-validation and environmental datasets, we compared the performance of CREST and SilvaMod to the RDP Classifier. We also utilized Greengenes as a reference database, both with CREST and the RDP Classifier. These analyses indicate that CREST performs better than alignment-free methods with higher recall rate (sensitivity) as well as precision, and with the ability to accurately identify most sequences from novel taxa. Classification using SilvaMod performed better than with Greengenes, particularly when applied to environmental sequences. CREST is freely available under a GNU General Public License (v3) from http://apps.cbu.uib.no/crest and http://lcaclassifier.googlecode.com. PMID:23145153

  13. Data presenting a modified bacterial expression vector for expressing and purifying Nus solubility-tagged proteins.

    PubMed

    Gupta, Nidhi; Wu, Heng; Terman, Jonathan R

    2016-09-01

    Bacteria are the predominant source for producing recombinant proteins but while many exogenous proteins are expressed, only a fraction of those are soluble. We have found that a new actin regulatory enzyme Mical is poorly soluble when expressed in bacteria but the use of a Nus fusion protein tag greatly increases its solubility. However, available vectors containing a Nus tag have been engineered in a way that hinders the separation of target proteins from the Nus tag during protein purification. We have now used recombinant DNA approaches to overcome these issues and reengineer a Nus solubility tag-containing bacterial expression vector. The data herein present a modified bacterial expression vector useful for expressing proteins fused to the Nus solubility tag and separating such target proteins from the Nus tag during protein purification. PMID:27547802

  14. Multiplexed metagenome mining using short DNA sequence tags facilitates targeted discovery of epoxyketone proteasome inhibitors

    PubMed Central

    Owen, Jeremy G.; Charlop-Powers, Zachary; Smith, Alexandra G.; Ternei, Melinda A.; Calle, Paula Y.; Reddy, Boojala Vijay B.; Montiel, Daniel; Brady, Sean F.

    2015-01-01

    In molecular evolutionary analyses, short DNA sequences are used to infer phylogenetic relationships among species. Here we apply this principle to the study of bacterial biosynthesis, enabling the targeted isolation of previously unidentified natural products directly from complex metagenomes. Our approach uses short natural product sequence tags derived from conserved biosynthetic motifs to profile biosynthetic diversity in the environment and then guide the recovery of gene clusters from metagenomic libraries. The methodology is conceptually simple, requires only a small investment in sequencing, and is not computationally demanding. To demonstrate the power of this approach to natural product discovery we conducted a computational search for epoxyketone proteasome inhibitors within 185 globally distributed soil metagenomes. This led to the identification of 99 unique epoxyketone sequence tags, falling into 6 phylogenetically distinct clades. Complete gene clusters associated with nine unique tags were recovered from four saturating soil metagenomic libraries. Using heterologous expression methodologies, seven potent epoxyketone proteasome inhibitors (clarepoxcins A–E and landepoxcins A and B) were produced from these pathways, including compounds with different warhead structures and a naturally occurring halohydrin prodrug. This study provides a template for the targeted expansion of bacterially derived natural products using the global metagenome. PMID:25831524

  15. The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome.

    PubMed

    Camargo, A A; Samaia, H P; Dias-Neto, E; Simão, D F; Migotto, I A; Briones, M R; Costa, F F; Nagai, M A; Verjovski-Almeida, S; Zago, M A; Andrade, L E; Carrer, H; El-Dorry, H F; Espreafico, E M; Habr-Gama, A; Giannella-Neto, D; Goldman, G H; Gruber, A; Hackel, C; Kimura, E T; Maciel, R M; Marie, S K; Martins, E A; Nobrega, M P; Paco-Larson, M L; Pardini, M I; Pereira, G G; Pesquero, J B; Rodrigues, V; Rogatto, S R; da Silva, I D; Sogayar, M C; Sonati, M F; Tajara, E H; Valentini, S R; Alberto, F L; Amaral, M E; Aneas, I; Arnaldi, L A; de Assis, A M; Bengtson, M H; Bergamo, N A; Bombonato, V; de Camargo, M E; Canevari, R A; Carraro, D M; Cerutti, J M; Correa, M L; Correa, R F; Costa, M C; Curcio, C; Hokama, P O; Ferreira, A J; Furuzawa, G K; Gushiken, T; Ho, P L; Kimura, E; Krieger, J E; Leite, L C; Majumder, P; Marins, M; Marques, E R; Melo, A S; Melo, M B; Mestriner, C A; Miracca, E C; Miranda, D C; Nascimento, A L; Nobrega, F G; Ojopi, E P; Pandolfi, J R; Pessoa, L G; Prevedel, A C; Rahal, P; Rainho, C A; Reis, E M; Ribeiro, M L; da Ros, N; de Sa, R G; Sales, M M; Sant'anna, S C; dos Santos, M L; da Silva, A M; da Silva, N P; Silva, W A; da Silveira, R A; Sousa, J F; Stecconi, D; Tsukumo, F; Valente, V; Soares, F; Moreira, E S; Nunes, D N; Correa, R G; Zalcberg, H; Carvalho, A F; Reis, L F; Brentani, R R; Simpson, A J; de Souza, S J; Melo, M

    2001-10-01

    Open reading frame expressed sequences tags (ORESTES) differ from conventional ESTs by providing sequence data from the central protein coding portion of transcripts. We generated a total of 696,745 ORESTES sequences from 24 human tissues and used a subset of the data that correspond to a set of 15,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most thoroughly sequenced tissue, the breast, the 130,000 ORESTES generated are derived from transcripts from an estimated 70% of all genes expressed in that tissue, with an equally efficient representation of both highly and poorly expressed genes. In this respect, we find that the capacity of the ORESTES strategy both for gene discovery and shotgun transcript sequence generation significantly exceeds that of conventional ESTs. The distribution of ORESTES is such that many human transcripts are now represented by a scaffold of partial sequences distributed along the length of each gene product. The experimental joining of the scaffold components, by reverse transcription-PCR, represents a direct route to transcript finishing that may represent a useful alternative to full-length cDNA cloning. PMID:11593022

  16. The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome

    PubMed Central

    Camargo, Anamaria A.; Samaia, Helena P. B.; Dias-Neto, Emmanuel; Simão, Daniel F.; Migotto, Italo A.; Briones, Marcelo R. S.; Costa, Fernando F.; Aparecida Nagai, Maria; Verjovski-Almeida, Sergio; Zago, Marco A.; Andrade, Luis Eduardo C.; Carrer, Helaine; El-Dorry, Hamza F. A.; Espreafico, Enilza M.; Habr-Gama, Angelita; Giannella-Neto, Daniel; Goldman, Gustavo H.; Gruber, Arthur; Hackel, Christine; Kimura, Edna T.; Maciel, Rui M. B.; Marie, Suely K. N.; Martins, Elizabeth A. L.; Nóbrega, Marina P.; Paçó-Larson, Maria Luisa; Pardini, Maria Inês M. C.; Pereira, Gonçalo G.; Pesquero, João Bosco; Rodrigues, Vanderlei; Rogatto, Silvia R.; da Silva, Ismael D. C. G.; Sogayar, Mari C.; Sonati, Maria de Fátima; Tajara, Eloiza H.; Valentini, Sandro R.; Alberto, Fernando L.; Amaral, Maria Elisabete J.; Aneas, Ivy; Arnaldi, Liliane A. T.; de Assis, Angela M.; Bengtson, Mário Henrique; Bergamo, Nadia Aparecida; Bombonato, Vanessa; de Camargo, Maria E. R.; Canevari, Renata A.; Carraro, Dirce M.; Cerutti, Janete M.; Corrêa, Maria Lucia C.; Corrêa, Rosana F. R.; Costa, Maria Cristina R.; Curcio, Cyntia; Hokama, Paula O. M.; Ferreira, Ari J. S.; Furuzawa, Gilberto K.; Gushiken, Tsieko; Ho, Paulo L.; Kimura, Elza; Krieger, José E.; Leite, Luciana C. C.; Majumder, Paromita; Marins, Mozart; Marques, Everaldo R.; Melo, Analy S. A.; Melo, Monica; Mestriner, Carlos Alberto; Miracca, Elisabete C.; Miranda, Daniela C.; Nascimento, Ana Lucia T. O.; Nóbrega, Francisco G.; Ojopi, Élida P. B.; Pandolfi, José Rodrigo C.; Pessoa, Luciana G.; Prevedel, Aline C.; Rahal, Paula; Rainho, Claudia A.; Reis, Eduardo M. R.; Ribeiro, Marcelo L.; da Rós, Nancy; de Sá, Renata G.; Sales, Magaly M.; Sant'anna, Simone Cristina; dos Santos, Mariana L.; da Silva, Aline M.; da Silva, Neusa P.; Silva, Wilson A.; da Silveira, Rosana A.; Sousa, Josane F.; Stecconi, Daniella; Tsukumo, Fernando; Valente, Valéria; Soares, Fernando; Moreira, Eloisa S.; Nunes, Diana N.; Correa, Ricardo G.; Zalcberg, Heloisa; Carvalho, Alex F.; Reis, Luis F. L.; Brentani, Ricardo R.; Simpson, Andrew J. G.; de Souza, Sandro J.

    2001-01-01

    Open reading frame expressed sequences tags (ORESTES) differ from conventional ESTs by providing sequence data from the central protein coding portion of transcripts. We generated a total of 696,745 ORESTES sequences from 24 human tissues and used a subset of the data that correspond to a set of 15,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most thoroughly sequenced tissue, the breast, the 130,000 ORESTES generated are derived from transcripts from an estimated 70% of all genes expressed in that tissue, with an equally efficient representation of both highly and poorly expressed genes. In this respect, we find that the capacity of the ORESTES strategy both for gene discovery and shotgun transcript sequence generation significantly exceeds that of conventional ESTs. The distribution of ORESTES is such that many human transcripts are now represented by a scaffold of partial sequences distributed along the length of each gene product. The experimental joining of the scaffold components, by reverse transcription–PCR, represents a direct route to transcript finishing that may represent a useful alternative to full-length cDNA cloning. PMID:11593022

  17. Grouping and identification of sequence tags (GRIST): bioinformatics tools for the NEIBank database.

    PubMed

    Wistow, Graeme; Bernstein, Steven L; Touchman, Jeffrey W; Bouffard, Gerald; Wyatt, M Keith; Peterson, Katherine; Behal, Amita; Gao, James; Buchoff, Patee; Smith, Don

    2002-06-15

    NEIBank is a project to develop and organize genomics and bioinformatics resources for the eye. As part of this effort, tools have been developed for bioinformatics analysis and web based display of data from expressed sequence tag (EST) analyses. EST sequences are identified and formed into groups or clusters representing related transcripts from the same gene. This is carried out by a rules-based procedure called GRIST (GRouping and Identification of Sequence Tags) that uses sequence match parameters derived from BLAST programs. Linked procedures are used to eliminate non-mRNA contaminants. All data are assembled in a relational database and assembled for display as web pages with annotations and links to other informatics resources. Genome projects generate huge amounts of data that need to be classified and organized to become easily accessible to the research community. GRIST provides a useful tool for assembling and displaying the results of EST analyses. The NEIBank web site contains a growing set of pages cataloging the known transcriptional repertoire of eye tissues, derived from new NEIBank cDNA libraries and from eye-related data deposited in the dbEST section of GenBank. PMID:12107414

  18. Perceptual learning of contrast discrimination under roving: the role of semantic sequence in stimulus tagging.

    PubMed

    Cong, Lin-Juan; Zhang, Jun-Yun

    2014-01-01

    Perceptual learning may occur when multiple contrasts are practiced in a fixed, but not in a roving (random), temporal sequence. However, learning may escape roving disruption when each contrast is assigned a letter tag (i.e., A, B, C, D). Because these letter tags carry not only stimulus identity information, but also semantic sequence information, here we investigated whether the semantic sequence information is necessary for learning of tagged contrasts under the roving condition. We found that assigning number tags (i.e., 1, 2, 3, 4), which also contained both identity and semantic sequence information, to four roving contrasts enabled significant learning of discrimination of each contrast, confirming previous data. However, learning became insignificant when the contrast tags were replaced with Greek letters that were familiar to our Chinese observers except their sequence or Chinese characters that carried no sequence information. In addition, assigning orientation tags, which carried no sequence information either, to roving contrasts was ineffective as well because learning occurred only with sequenced but not roving contrasts. These results suggest that semantic sequence information is necessary for stimulus tagging to effectively enable perceptual learning of multiple contrast discrimination under roving. PMID:25368338

  19. Cardiac motion estimation by joint alignment of tagged MRI sequences.

    PubMed

    Oubel, E; De Craene, M; Hero, A O; Pourmorteza, A; Huguet, M; Avegliano, G; Bijnens, B H; Frangi, A F

    2012-01-01

    Image registration has been proposed as an automatic method for recovering cardiac displacement fields from tagged Magnetic Resonance Imaging (tMRI) sequences. Initially performed as a set of pairwise registrations, these techniques have evolved to the use of 3D+t deformation models, requiring metrics of joint image alignment (JA). However, only linear combinations of cost functions defined with respect to the first frame have been used. In this paper, we have applied k-Nearest Neighbors Graphs (kNNG) estimators of the α-entropy (H(α)) to measure the joint similarity between frames, and to combine the information provided by different cardiac views in an unified metric. Experiments performed on six subjects showed a significantly higher accuracy (p<0.05) with respect to a standard pairwise alignment (PA) approach in terms of mean positional error and variance with respect to manually placed landmarks. The developed method was used to study strains in patients with myocardial infarction, showing a consistency between strain, infarction location, and coronary occlusion. This paper also presents an interesting clinical application of graph-based metric estimators, showing their value for solving practical problems found in medical imaging. PMID:22000567

  20. In vivo expression and purification of aptamer-tagged small RNA regulators

    PubMed Central

    Said, Nelly; Rieder, Renate; Hurwitz, Robert; Deckert, Jochen; Urlaub, Henning; Vogel, Jörg

    2009-01-01

    Small non-coding RNAs (sRNAs) are an emerging class of post-transcriptional regulators of bacterial gene expression. To study sRNAs and their potential protein interaction partners, it is desirable to purify sRNAs from cells in their native form. Here, we used RNA-based affinity chromatography to purify sRNAs following their expression as aptamer-tagged variants in vivo. To this end, we developed a family of plasmids to express sRNAs with any of three widely used aptamer sequences (MS2, boxB, eIF4A), and systematically tested how the aptamer tagging impacted on intracellular accumulation and target regulation of the Salmonella GcvB, InvR or RybB sRNAs. In addition, we successfully tagged the chromosomal rybB gene with MS2 to observe that RybB-MS2 is fully functional as an envelope stress-induced repressor of ompN mRNA following induction of sigmaE. We further demonstrate that the common sRNA-binding protein, Hfq, co-purifies with MS2-tagged sRNAs of Salmonella. The presented affinity purification strategy may facilitate the isolation of in vivo assembled sRNA–protein complexes in a wide range of bacteria. PMID:19726584

  1. Genomic Sequence or Signature Tags (GSTs) from the Genome Group at Brookhaven National Laboratory (BNL)

    DOE Data Explorer

    Dunn, John J.; McCorkle, Sean R.; Praissman, Laura A.; Hind, Geoffrey; Van der Lelie, Daniel; Bahou, Wadie F.; Gnatenko, Dmitri V.; Krause, Maureen K.

    Genomic Signature Tags (GSTs) are the products of a method we have developed for identifying and quantitatively analyzing genomic DNAs. The DNA is initially fragmented with a type II restriction enzyme. An oligonucleotide adaptor containing a recognition site for MmeI, a type IIS restriction enzyme, is then used to release 21-bp tags from fixed positions in the DNA relative to the sites recognized by the fragmenting enzyme. These tags are PCR-amplified, purified, concatenated and then cloned and sequenced. The tag sequences and abundances are used to create a high resolution GST sequence profile of the genomic DNA. [Quoted from Genomic Signature Tags (GSTs): A System for Profiling Genomic DNA, Dunn, John J.; McCorkle, Sean R.; Praissman, Laura A.; Hind, Geoffrey; Van der Lelie, Daniel; Bahou, Wadie F.; Gnatenko, Dmitri V.; Krause, Maureen K., Revised 9/13/2002

  2. Generation and Analysis of a Large-Scale Expressed Sequence Tag Database from a Full-Length Enriched cDNA Library of Developing Leaves of Gossypium hirsutum L

    PubMed Central

    Pang, Chaoyou; Fan, Shuli; Song, Meizhen; Yu, Shuxun

    2013-01-01

    Background Cotton (Gossypium hirsutum L.) is one of the world’s most economically-important crops. However, its entire genome has not been sequenced, and limited resources are available in GenBank for understanding the molecular mechanisms underlying leaf development and senescence. Methodology/Principal Findings In this study, 9,874 high-quality ESTs were generated from a normalized, full-length cDNA library derived from pooled RNA isolated from throughout leaf development during the plant blooming stage. After clustering and assembly of these ESTs, 5,191 unique sequences, representative 1,652 contigs and 3,539 singletons, were obtained. The average unique sequence length was 682 bp. Annotation of these unique sequences revealed that 84.4% showed significant homology to sequences in the NCBI non-redundant protein database, and 57.3% had significant hits to known proteins in the Swiss-Prot database. Comparative analysis indicated that our library added 2,400 ESTs and 991 unique sequences to those known for cotton. The unigenes were functionally characterized by gene ontology annotation. We identified 1,339 and 200 unigenes as potential leaf senescence-related genes and transcription factors, respectively. Moreover, nine genes related to leaf senescence and eleven MYB transcription factors were randomly selected for quantitative real-time PCR (qRT-PCR), which revealed that these genes were regulated differentially during senescence. The qRT-PCR for three GhYLSs revealed that these genes express express preferentially in senescent leaves. Conclusions/Significance These EST resources will provide valuable sequence information for gene expression profiling analyses and functional genomics studies to elucidate their roles, as well as for studying the mechanisms of leaf development and senescence in cotton and discovering candidate genes related to important agronomic traits of cotton. These data will also facilitate future whole-genome sequence assembly and annotation

  3. Phylogeny of Saccharina and Laminaria (Laminariaceae, Laminariales, Phaeophyta) in sequence-tagged-site markers

    NASA Astrophysics Data System (ADS)

    Qu, Jieqiong; Zhang, Jing; Wang, Xumin; Chi, Shan; Liu, Cui; Liu, Tao

    2014-01-01

    Laminaria and Saccharina have recently been recognized as two independent clades from the former genus Laminaria. Traditional morphological taxonomy is being challenged by molecular evidence from both nucleus and plastid. Intensive work is in great demand from the perspective of genome colinearity. In this study, 118 sequence-tagged site (STS) markers were screened for phylogenetic analyses, 29 based on genome sequences, while 89 were based on expressed sequence tag (EST) sequences. EST-based STS marker development (29.37%) had an effi ciency twice as high as genome-sequence-based development (9.48%) as a result of high conservation of gene transcripts among the relative species. S. ochotensis, S. religiosa, S. japonica, and L. hyperborea showed great homogeneity in all 118 STS markers. Our result supports the view that the diversifi cation between the genera Saccharina and Laminaria was a more recent event and that Saccharina and Laminaria shared high phylogenetic affi nity. However, when it came to the single nucleotide polymorphism (SNP) level among the 41 SNPs, L. hyperborea owned 29 unique SNPs against 12 within the left three Saccharina species and 12 of the 13 indels were supposedly unique for L. hyperborea, indicated by its high variability. Originating from homologous ancestors, species between the recently diverged genera Laminaria and Saccharina may have taken in enough mutations at the SNP level only, in spite of different evolutionary strategies for better adaptation to the environment. Our study lays a solid foundation from a new perspective, although more accurate phylogenetic analysis is still needed to clarify the evolutionary traces between the genera Saccharina and Laminaria.

  4. C-Terminally fused affinity Strep-tag II is removed by proteolysis from recombinant human erythropoietin expressed in transgenic tobacco plants

    PubMed Central

    Kittur, Farooqahmed S.; Lalgondar, Mallikarjun; Hung, Chiu-Yueh; Sane, David C.

    2014-01-01

    Asialo-erythropoietin (asialo-EPO), a desialylated form of EPO, is a potent tissue-protective agent. Recently, we and others have exploited a low cost plant-based expression system to produce recombinant human asialo-EPO (asialo-rhuEPOP). To facilitate purification from plant extracts, Strep-tag II was engineered at the C-terminus of EPO. Although asialo-rhuEPOP was efficiently expressed in transgenic tobacco plants, affinity purification based on Strep-tag II did not result in the recovery of the protein. In this study, we investigated the stability of Strep-tag II tagged asialo-rhuEPOP expressed in tobacco plants to understand whether this fused tag is cleaved or inaccessible. Sequencing RT-PCR products confirmed that fused DNA sequences encoding Strep-tag II were properly transcribed, and three-dimensional protein structure model revealed that the tag must be fully accessible. However, Western blot analysis of leaf extracts and purified asialo-rhuEPOP revealed that the Strep-tag II was absent on the protein. Additionally, no peptide fragment containing Strep-tag II was identified in the LC-MS/MS analysis of purified protein further supporting that the affinity tag was absent on asialo-rhuEPOP. However, Strep-tag II was detected on asialo-rhuEPOP that was retained in the endoplasmic reticulum, suggesting that the Strep-tag II is removed during protein secretion or extraction. These findings together with recent reports that C-terminally fused Strep-tag II or IgG Fc domain are also removed from EPO in tobacco plants, suggest that its C-terminus may be highly susceptible to proteolysis in tobacco plants. Therefore, direct fusion of purification tags at the C-terminus of EPO should be avoided while expressing it in tobacco plants. PMID:25504272

  5. High-Throughput Tag-Sequencing Analysis of Early Events Induced by Ochratoxin A in HepG-2 Cells.

    PubMed

    Zhang, Yu; Qi, Xiaozhe; Zheng, Juanjuan; Luo, YunBo; Huang, Kunlun; Xu, Wentao

    2016-01-01

    Ochratoxin A (OTA) is produced by fungi of the species Aspergillus and Penicillium. OTA has displayed hepatotoxicity in mammals. Although recent studies have indicated that OTA influences liver function, little is known regarding its impact on differential early liver toxicity. In this study, we report high-throughput tag-sequencing (Tag-seq) analysis of the transcriptome using Solexa Analyzer platform after 4 h of OTA treatment on HepG-2 cells. The analyses of differentially expressed genes revealed the substantial changes. A total of 21,449 genes were identified and quantified, with 2726 displaying significantly altered expression levels. Expression level data were then integrated with a network of gene-gene interactions, and biological pathways to obtain a systems-level view of changes in the transcriptome that occur with OTA resistance. Our data suggest that OTA exposure leads to an imbalance in zinc finger expression and shed light on splicing factor and mitochondrial-based mechanisms. PMID:26377828

  6. Snorkel: an epitope tagging system for measuring the surface expression of membrane proteins.

    PubMed

    Brown, Michael; Stafford, Lewis J; Onisk, Dale; Joaquim, Tony; Tobb, Alhagie; Goldman, Larissa; Fancy, David; Stave, James; Chambers, Ross

    2013-01-01

    Tags are widely used to monitor a protein's expression level, interactions, protein trafficking, and localization. Membrane proteins are often tagged in their extracellular domains to allow discrimination between protein in the plasma membrane from that in internal pools. Multipass membrane proteins offer special challenges for inserting a tag since the extracellular regions are often composed of small loops and thus inserting an epitope tag risks perturbing the structure, function, or location of the membrane protein. We have developed a novel tagging system called snorkel where a transmembrane domain followed by a tag is appended to the cytoplasmic C-terminus of the membrane protein. In this way the tag is displayed extracellularly, but structurally separate from the membrane protein. We have tested the snorkel tag system on a diverse panel of membrane proteins including GPCRs and ion channels and demonstrated that it reliably allows for monitoring of the surface expression. PMID:24023844

  7. TAG Sequence Identification of Genomic Regions Using TAGdb.

    PubMed

    Ruperao, Pradeep

    2016-01-01

    Second-generation sequencing (SGS) technology has enabled the sequencing of genomes and identification of genes. However, large complex plant genomes remain particularly difficult for de novo assembly. Access to the vast quantity of raw sequence data may facilitate discoveries; however the volume of this data makes access difficult. This chapter discusses the Web-based tool TAGdb that enables researchers to identify paired read second-generation DNA sequence data that share identity with a submitted query sequence. The identified reads can be used for PCR amplification of genomic regions to identify genes and promoters without the need for genome assembly. PMID:26519409

  8. Simple amino acid tags improve both expression and secretion of Candida antarctica lipase B in recombinant Escherichia coli.

    PubMed

    Kim, Sun-Ki; Park, Yong-Cheol; Lee, Hyung Ho; Jeon, Seung Taeg; Min, Won-Ki; Seo, Jin-Ho

    2015-02-01

    Escherichia coli is the best-established microbial host strain for production of proteins and chemicals, but has a weakness for not secreting high amounts of active heterologous proteins to the extracellular culture medium, of which origins belong to whether prokaryotes or eukaryotes. In this study, Candida antarctica lipase B (CalB), a popular eukaryotic enzyme which catalyzes a number of biochemical reactions and barely secreted extracellularly, was expressed functionally at a gram scale in culture medium by using a simple amino acid-tag system of E. coli. New fusion tag systems consisting of a pelB signal sequence and various anion amino acid tags facilitated both intracellular expression and extracellular secretion of CalB. Among them, the N-terminal five aspartate tag changed the quaternary structure of the dimeric CalB and allowed production of 1.9 g/L active CalB with 65 U/mL activity in culture medium, which exhibited the same enzymatic properties as the commercial CalB. This PelB-anion amino acid tag-based expression system for CalB can be extended to production of other industrial proteins hardly expressed and exported from E. coli, thereby increasing target protein concentrations and minimizing purification steps. PMID:25182473

  9. Comparison of Sequencing (Barcode Region) and Sequence-Tagged-Site PCR for Blastocystis Subtyping

    PubMed Central

    2013-01-01

    Blastocystis is the most common nonfungal microeukaryote of the human intestinal tract and comprises numerous subtypes (STs), nine of which have been found in humans (ST1 to ST9). While efforts continue to explore the relationship between human health status and subtypes, no consensus regarding subtyping methodology exists. It has been speculated that differences detected in subtype distribution in various cohorts may to some extent reflect different approaches. Blastocystis subtypes have been determined primarily in one of two ways: (i) sequencing of small subunit rRNA gene (SSU-rDNA) PCR products and (ii) PCR with subtype-specific sequence-tagged-site (STS) diagnostic primers. Here, STS primers were evaluated against a panel of samples (n = 58) already subtyped by SSU-rDNA sequencing (barcode region), including subtypes for which STS primers are not available, and a small panel of DNAs from four other eukaryotes often present in feces (n = 18). Although the STS primers appeared to be highly specific, their sensitivity was only moderate, and the results indicated that some infections may go undetected when this method is used. False-negative STS results were not linked exclusively to certain subtypes or alleles, and evidence of substantial genetic variation in STS loci was obtained. Since the majority of DNAs included here were extracted from feces, it is possible that STS primers may generally work better with DNAs extracted from Blastocystis cultures. In conclusion, due to its higher applicability and sensitivity, and since sequence information is useful for other forms of research, SSU-rDNA barcoding is recommended as the method of choice for Blastocystis subtyping. PMID:23115257

  10. Assembly of a gene sequence tag microarray by reversible biotin-streptavidin capture for transcript analysis of Arabidopsis thaliana

    PubMed Central

    Wirta, Valtteri; Holmberg, Anders; Lukacs, Morten; Nilsson, Peter; Hilson, Pierre; Uhlén, Mathias; Bhalerao, Rishikesh P; Lundeberg, Joakim

    2005-01-01

    Background Transcriptional profiling using microarrays has developed into a key molecular tool for the elucidation of gene function and gene regulation. Microarray platforms based on either oligonucleotides or purified amplification products have been utilised in parallel to produce large amounts of data. Irrespective of platform examined, the availability of genome sequence or a large number of representative expressed sequence tags (ESTs) is, however, a pre-requisite for the design and selection of specific and high-quality microarray probes. This is of great importance for organisms, such as Arabidopsis thaliana, with a high number of duplicated genes, as cross-hybridisation signals between evolutionary related genes cannot be distinguished from true signals unless the probes are carefully designed to be specific. Results We present an alternative solid-phase purification strategy suitable for efficient preparation of short, biotinylated and highly specific probes suitable for large-scale expression profiling. Twenty-one thousand Arabidopsis thaliana gene sequence tags were amplified and subsequently purified using the described technology. The use of the arrays is exemplified by analysis of gene expression changes caused by a four-hour indole-3-acetic (auxin) treatment. A total of 270 genes were identified as differentially expressed (120 up-regulated and 150 down-regulated), including several previously known auxin-affected genes, but also several previously uncharacterised genes. Conclusions The described solid-phase procedure can be used to prepare gene sequence tag microarrays based on short and specific amplified probes, facilitating the analysis of more than 21 000 Arabidopsis transcripts. PMID:15689241

  11. New aldehyde tag sequences identified by screening formylglycine generating enzymes in vitro and in vivo.

    PubMed

    Rush, Jason S; Bertozzi, Carolyn R

    2008-09-17

    Formylglycine generating enzyme (FGE) performs a critical posttranslational modification of type I sulfatases, converting cysteine within the motif CxPxR to the aldehyde-bearing residue formylglycine (FGly). This concise motif can be installed within heterologous proteins as a genetically encoded "aldehyde tag" for site-specific labeling with aminooxy- or hydrazide-functionalized probes. In this report, we screened FGEs from M. tuberculosis and S. coelicolor against synthetic peptide libraries and identified new substrate sequences that diverge from the canonical motif. We found that E. coli's FGE-like activity is similarly promiscuous, enabling the use of novel aldehyde tag sequences for in vivo modification of recombinant proteins. PMID:18722427

  12. Expression and purification of recombinant proteins in Escherichia coli tagged with the metal-binding protein CusF.

    PubMed

    Cantu-Bustos, J Enrique; Vargas-Cortez, Teresa; Morones-Ramirez, Jose Ruben; Balderas-Renteria, Isaias; Galbraith, David W; McEvoy, Megan M; Zarate, Xristo

    2016-05-01

    Production of recombinant proteins in Escherichia coli has been improved considerably through the use of fusion proteins, because they increase protein solubility and facilitate purification via affinity chromatography. In this article, we propose the use of CusF as a new fusion partner for expression and purification of recombinant proteins in E. coli. Using a cell-free protein expression system, based on the E. coli S30 extract, Green Fluorescent Protein (GFP) was expressed with a series of different N-terminal tags, immobilized on self-assembled protein microarrays, and its fluorescence quantified. GFP tagged with CusF showed the highest fluorescence intensity, and this was greater than the intensities from corresponding GFP constructs that contained MBP or GST tags. Analysis of protein production in vivo showed that CusF produces large amounts of soluble protein with low levels of inclusion bodies. Furthermore, fusion proteins can be exported to the cellular periplasm, if CusF contains the signal sequence. Taking advantage of its ability to bind copper ions, recombinant proteins can be purified with readily available IMAC resins charged with this metal ion, producing pure proteins after purification and tag removal. We therefore recommend the use of CusF as a viable alternative to MBP or GST as a fusion protein/affinity tag for the production of soluble recombinant proteins in E. coli. PMID:26805756

  13. Characterization of genome-wide ordered sequence-tagged Mycobacterium mutant libraries by Cartesian Pooling-Coordinate Sequencing

    PubMed Central

    Vandewalle, Kristof; Festjens, Nele; Plets, Evelyn; Vuylsteke, Marnik; Saeys, Yvan; Callewaert, Nico

    2015-01-01

    Reverse genetics research approaches require the availability of methods to rapidly generate specific mutants. Alternatively, where these methods are lacking, the construction of pre-characterized libraries of mutants can be extremely valuable. However, this can be complex, expensive and time consuming. Here, we describe a robust, easy to implement parallel sequencing-based method (Cartesian Pooling-Coordinate Sequencing or CP-CSeq) that reports both on the identity as well as on the location of sequence-tagged biological entities in well-plate archived clone collections. We demonstrate this approach using a transposon insertion mutant library of the Mycobacterium bovis BCG vaccine strain, providing the largest resource of mutants in any strain of the M. tuberculosis complex. The method is applicable to any entity for which sequence-tagged identification is possible. PMID:25960123

  14. 3' tag digital gene expression profiling of human brain and universal reference RNA using Illumina Genome Analyzer

    PubMed Central

    2009-01-01

    Background Massive parallel sequencing has the potential to replace microarrays as the method for transcriptome profiling. Currently there are two protocols: full-length RNA sequencing (RNA-SEQ) and 3'-tag digital gene expression (DGE). In this preliminary effort, we evaluated the 3' DGE approach using two reference RNA samples from the MicroArray Quality Control Consortium (MAQC). Results Using Brain RNA sample from multiple runs, we demonstrated that the transcript profiles from 3' DGE were highly reproducible between technical and biological replicates from libraries constructed by the same lab and even by different labs, and between two generations of Illumina's Genome Analyzers. Approximately 65% of all sequence reads mapped to mitochondrial genes, ribosomal RNAs, and canonical transcripts. The expression profiles of brain RNA and universal human reference RNA were compared which demonstrated that DGE was also highly quantitative with excellent correlation of differential expression with quantitative real-time PCR. Furthermore, one lane of 3' DGE sequencing, using the current sequencing chemistry and image processing software, had wider dynamic range for transcriptome profiling and was able to detect lower expressed genes which are normally below the detection threshold of microarrays. Conclusion 3' tag DGE profiling with massive parallel sequencing achieved high sensitivity and reproducibility for transcriptome profiling. Although it lacks the ability of detecting alternative splicing events compared to RNA-SEQ, it is much more affordable and clearly out-performed microarrays (Affymetrix) in detecting lower abundant transcripts. PMID:19917133

  15. Genomic and cDNA sequence tags of the hyperthermophilic archaeon Pyrobaculum aerophilum.

    PubMed Central

    Völkl, P; Markiewicz, P; Baikalov, C; Fitz-Gibbon, S; Stetter, K O; Miller, J H

    1996-01-01

    The hyperthermophilic archaeum, Pyrobaculum aerophilum, grows optimally at 100 degrees C with a doubling time of 180 min. It is a member of the phylogenetically ancient Thermoproteales order, but differs significantly from all other members by its facultatively aerobic metabolism. Due to its simple cultivation requirements and its nearly 100% plating efficiency, it was chosen as a model organism for studying the genome organization of hyperthermophilic ancient archaea. By a G+C content of the DNA of 52 mol%, sequence analysis was easily possible. At least some of the mRNA of P. aerophilum carried poly-A tails facilitating the construction of a cDNA library. 245 sequence tags of a poly-A primed cDNA library and 55 sequence tags from a 1-2 kb Sau3AI-fragment containing genomic library were analyzed and the corresponding amino acid sequences compared with protein sequences from databases. Fourteen percent of the cDNA and >9% of genomic DNA sequence tags revealed significant similarities to proteins in the databases. Matches were obtained to proteins from archaeal, bacterial and eukaryal sources. Some sequences showed greatest similarity to eukaryal rather than to bacterial versions of proteins, other matches were found to proteins which had previously only been found in eukaryotes. PMID:8948626

  16. The HaloTag: Improving Soluble Expression and Applications in Protein Functional Analysis.

    PubMed

    N Peterson, Scott; Kwon, Keehwan

    2012-01-01

    Technological and methodological advances have been critical for the rapidly evolving field of proteomics. The development of fusion tag systems is essential for purification and analysis of recombinant proteins. The HaloTag is a 34 KDa monomeric protein derived from a bacterial haloalkane dehalogenase. The majority of fusion tags in use today utilize a reversible binding interaction with a specific ligand. The HaloTag system is unique in that it forms a covalent linkage to its chloroalkane ligand. This linkage permits attachment of the HaloTag to a variety of functional reporters, which can be used to label and immobilize recombinant proteins. The success rate for HaloTag expression of soluble proteins is very high and comparable to maltose binding protein (MBP) tag. Furthermore, cleavage of the HaloTag does not result in protein insolubility that often is observed with the MBP tag. In the present report, we describe applications of the HaloTag system in our ongoing investigation of protein-protein interactions of the Y. pestis Type 3 secretion system on a custom protein microarray. We also describe the utilization of affinity purification/mass spectroscopy (AP/MS) to evaluate the utility of the Halo Tag system to characterize DNA binding activity and protein specificity. PMID:23115610

  17. The HaloTag: Improving Soluble Expression and Applications in Protein Functional Analysis

    PubMed Central

    N Peterson, Scott; Kwon, Keehwan

    2012-01-01

    Technological and methodological advances have been critical for the rapidly evolving field of proteomics. The development of fusion tag systems is essential for purification and analysis of recombinant proteins. The HaloTag is a 34 KDa monomeric protein derived from a bacterial haloalkane dehalogenase. The majority of fusion tags in use today utilize a reversible binding interaction with a specific ligand. The HaloTag system is unique in that it forms a covalent linkage to its chloroalkane ligand. This linkage permits attachment of the HaloTag to a variety of functional reporters, which can be used to label and immobilize recombinant proteins. The success rate for HaloTag expression of soluble proteins is very high and comparable to maltose binding protein (MBP) tag. Furthermore, cleavage of the HaloTag does not result in protein insolubility that often is observed with the MBP tag. In the present report, we describe applications of the HaloTag system in our ongoing investigation of protein-protein interactions of the Y. pestis Type 3 secretion system on a custom protein microarray. We also describe the utilization of affinity purification/mass spectroscopy (AP/MS) to evaluate the utility of the Halo Tag system to characterize DNA binding activity and protein specificity. PMID:23115610

  18. Improved Sequence Tag Generation Method for Peptide Identification in Tandem Mass Spectrometry

    PubMed Central

    Cao, Xia; Nesvizhskii, Alexey I.

    2013-01-01

    The sequence tag-based peptide identification methods are a promising alternative to the traditional database search approach. However, a more comprehensive analysis, optimization, and comparison with established methods are necessary before these methods can gain widespread use in the proteomics community. Using the InsPecT open source code base (Tanner et al., Anal Chem. 2005, 77:4626–39), we present an improved sequence tag generation method that directly incorporates multi-charged fragment ion peaks present in many tandem mass spectra of higher charge states. We also investigate the performance of sequence tagging under different settings using control datasets generated on five different types of mass spectrometers, as well as using a complex phosphopeptide-enriched sample. We also demonstrate that additional modeling of InsPecT search scores using a semi-parametric approach incorporating the accuracy of the precursor ion mass measurement provides additional improvement in the ability to discriminate between correct and incorrect peptide identifications. The overall superior performance of the sequence tag-based peptide identification method is demonstrated by comparison with a commonly used SEQUEST/PeptideProphet approach. PMID:18785767

  19. Velocity measurement of clay intrusion through a sudden contraction step using a tagging pulse sequence.

    PubMed

    Tsushima, Shohji; Hasegawa, Atsushi; Suekane, Tetsuya; Hirai, Shuichiro; Tanaka, Yoshihiro; Nakasuji, Yoshizumi

    2003-07-01

    Magnetic resonance imaging (MRI) with a spatial tagging sequence was used to measure the velocity distribution of clay that was forced past a sudden contraction. A spatial tagging sequence provided magnetic resonance images of clay that allowed measurement of the velocity distribution in the clay, which can provide profound insights on the deformation process of clay during the intrusion process. The experiments were conducted using a specially-designed vessel that could operate at up to 30 MPa. The vessel offers a rectangle test section with a sudden contraction step that had a ratio of contraction of 2:1. The vessel was installed into a commercial magnetic resonance imaging equipment and then the fluid motion of clay flowing into the narrow contracted channel was quantitatively investigated to examine behaviors of flowing clay as non-Newtonian fluid. MRI results are compared with those obtained by computational fluid dynamics (CFD) calculation. Velocity distributions obtained from each tag displacement did not well agree with those predicted by CFD results near the contraction step where the fluid accelerated rapidly. However, a post-processing on calculation results, in which virtual tag displacement is calculated, gave better agreement with experiment and enabled us to compare MRI results with CFD results. PMID:12915199

  20. Peptides derivatized with bicyclic quaternary ammonium ionization tags. Sequencing via tandem mass spectrometry.

    PubMed

    Setner, Bartosz; Rudowska, Magdalena; Klem, Ewelina; Cebrat, Marek; Szewczuk, Zbigniew

    2014-10-01

    Improving the sensitivity of detection and fragmentation of peptides to provide reliable sequencing of peptides is an important goal of mass spectrometric analysis. Peptides derivatized by bicyclic quaternary ammonium ionization tags: 1-azabicyclo[2.2.2]octane (ABCO) or 1,4-diazabicyclo[2.2.2]octane (DABCO), are characterized by an increased detection sensitivity in electrospray ionization mass spectrometry (ESI-MS) and longer retention times on the reverse-phase (RP) chromatography columns. The improvement of the detection limit was observed even for peptides dissolved in 10 mM NaCl. Collision-induced dissociation tandem mass spectrometry of quaternary ammonium salts derivatives of peptides showed dominant a- and b-type ions, allowing facile sequencing of peptides. The bicyclic ionization tags are stable in collision-induced dissociation experiments, and the resulted fragmentation pattern is not significantly influenced by either acidic or basic amino acid residues in the peptide sequence. Obtained results indicate the general usefulness of the bicyclic quaternary ammonium ionization tags for ESI-MS/MS sequencing of peptides. PMID:25303389

  1. Investigation of TtrD, an expressing recombinant fusion tag, in Escherichia coli.

    PubMed

    Chen, Anqi; Zhang, Li; Gu, Shaohua; Tang, Rong; Xie, Yi; Ji, Chaoneng

    2016-04-01

    Escherichia coli is widely used for expressing recombinant proteins, and several tags have been developed to improve protein solubility. However, expressing and purifying protein from other organisms is not always successful. In this study, we investigated the possibility of using TtrD as an expressing fusion tag in E. coli. Twenty RING finger domain containing human genes were expressed in E. coli grown at 37 °C and 18 °C and tested with four other fusion tags, namely His, SUMO, GST and MBP, for comparison. The results indicated that the soluble expressing ability of the tags was MBP, GST, TtrD, SUMO, and His in descending order. A one-column refolding process was used to purify the expressed proteins in inclusion bodies, and TtrD showed the strongest refolding ability. The results suggested that the TtrD tag enhanced recombinant protein solubility and refolding ability and might be a useful tag for protein expression in E. coli. PMID:26690374

  2. Expression and immune recognition of SV40 Tag in transgenic mice that develop metastatic osteosarcomas.

    PubMed

    Marton, I; Johnson, S E; Damjanov, I; Currier, K S; Sundberg, J P; Knowles, B B

    2000-04-01

    Mature adult mice of the C57BL/6-TgN(Amy1TAg)501Knw transgenic mouse lineage, 501, containing a liver alpha-amylase promoted-SV40 Tag hybrid gene, routinely develop SV40 Tag-induced metastatic osteosarcomas. This form of alpha-amylase was known to be expressed in the liver, salivary glands, pancreas, and fat. Cells in the normal rib adjacent to the periosteum also express alpha-amylase suggesting that transgene expression is correctly targeted to generate osteosarcomas. 501 mice express SV40 Tag in the salivary glands but do not develop abnormalities in these organs by the time of their death from SV40-induced osteosarcomas. Mice of the C57BL/6 strain make a strong and effective anti-tumor immune response to SV40 Tag immunization. However, immunization of 501 mice with SV40 Tag early in life does not alter or prevent SV40 Tag-induced osteosarcomagenesis. 501 mice mount a significantly less effective cytotoxic T-lymphocyte response following SV40 Tag immunization while 501 osteosarcoma-derived cells are fully susceptible to SV40 Tag-specific T-cell lysis. This suggests that partial tolerance, not loss of antigen presentation by tumor cells, characterizes this mouse model of endogenous bone tumor development. To determine whether the immune recognition of endogenous SV40 Tag could influence tumorigenesis, the metastatic potential and time of death from tumor was investigated in CD4-null mutant 501 mice and beta-2 microglobulin-null mutant 501 mice. The size and number of metastases in these strains and longevity of these strains varied. We suggest that components of both the innate and adaptive immune response control tumor appearance and progression. PMID:10951695

  3. An Entry/Gateway® cloning system for general expression of genes with molecular tags in Drosophila melanogaster

    PubMed Central

    Akbari, Omar S; Oliver, Daniel; Eyer, Katie; Pai, Chi-Yun

    2009-01-01

    Background Tagged fusion proteins are priceless tools for monitoring the activities of biomolecules in living cells. However, over-expression of fusion proteins sometimes leads to the unwanted lethality or developmental defects. Therefore, vectors that can express tagged proteins at physiological levels are desirable tools for studying dosage-sensitive proteins. We developed a set of Entry/Gateway® vectors for expressing fluorescent fusion proteins in Drosophila melanogaster. The vectors were used to generate fluorescent CP190 which is a component of the gypsy chromatin insulator. We used the fluorescent CP190 to study the dynamic movement of related chromatin insulators in living cells. Results The Entry/Gateway® system is a timesaving technique for quickly generating expression constructs of tagged fusion proteins. We described in this study an Entry/Gateway® based system, which includes six P-element destination vectors (P-DEST) for expressing tagged proteins (eGFP, mRFP, or myc) in Drosophila melanogaster and a TA-based cloning vector for generating entry clones from unstable DNA sequences. We used the P-DEST vectors to express fluorecent CP190 at tolerable levels. Expression of CP190 using the UAS/Gal4 system, instead, led to either lethality or underdeveloped tissues. The expressed eGFP- or mRFP-tagged CP190 proteins are fully functional and rescued the lethality of the homozygous CP190 mutation. We visualized a wide range of CP190 distribution patterns in living cell nuclei, from thousands of tiny particles to less than ten giant ones, which likely reflects diverse organization of higher-order chromatin structures. We also visualized the fusion of multiple smaller insulator bodies into larger aggregates in living cells, which is likely reflective of the dynamic activities of reorganization of chromatin in living nuclei. Conclusion We have developed an efficient cloning system for expressing dosage-sensitive proteins in Drosophila melanogaster. This system

  4. Maltose-Binding Protein (MBP), a Secretion-Enhancing Tag for Mammalian Protein Expression Systems

    PubMed Central

    Reuten, Raphael; Nikodemus, Denise; Oliveira, Maria B.; Patel, Trushar R.; Brachvogel, Bent; Breloy, Isabelle; Stetefeld, Jörg; Koch, Manuel

    2016-01-01

    Recombinant proteins are commonly expressed in eukaryotic expression systems to ensure the formation of disulfide bridges and proper glycosylation. Although many proteins can be expressed easily, some proteins, sub-domains, and mutant protein versions can cause problems. Here, we investigated expression levels of recombinant extracellular, intracellular as well as transmembrane proteins tethered to different polypeptides in mammalian cell lines. Strikingly, fusion of proteins to the prokaryotic maltose-binding protein (MBP) generally enhanced protein production. MBP fusion proteins consistently exhibited the most robust increase in protein production in comparison to commonly used tags, e.g., the Fc, Glutathione S-transferase (GST), SlyD, and serum albumin (ser alb) tag. Moreover, proteins tethered to MBP revealed reduced numbers of dying cells upon transient transfection. In contrast to the Fc tag, MBP is a stable monomer and does not promote protein aggregation. Therefore, the MBP tag does not induce artificial dimerization of tethered proteins and provides a beneficial fusion tag for binding as well as cell adhesion studies. Using MBP we were able to secret a disease causing laminin β2 mutant protein (congenital nephrotic syndrome), which is normally retained in the endoplasmic reticulum. In summary, this study establishes MBP as a versatile expression tag for protein production in eukaryotic expression systems. PMID:27029048

  5. Maltose-Binding Protein (MBP), a Secretion-Enhancing Tag for Mammalian Protein Expression Systems.

    PubMed

    Reuten, Raphael; Nikodemus, Denise; Oliveira, Maria B; Patel, Trushar R; Brachvogel, Bent; Breloy, Isabelle; Stetefeld, Jörg; Koch, Manuel

    2016-01-01

    Recombinant proteins are commonly expressed in eukaryotic expression systems to ensure the formation of disulfide bridges and proper glycosylation. Although many proteins can be expressed easily, some proteins, sub-domains, and mutant protein versions can cause problems. Here, we investigated expression levels of recombinant extracellular, intracellular as well as transmembrane proteins tethered to different polypeptides in mammalian cell lines. Strikingly, fusion of proteins to the prokaryotic maltose-binding protein (MBP) generally enhanced protein production. MBP fusion proteins consistently exhibited the most robust increase in protein production in comparison to commonly used tags, e.g., the Fc, Glutathione S-transferase (GST), SlyD, and serum albumin (ser alb) tag. Moreover, proteins tethered to MBP revealed reduced numbers of dying cells upon transient transfection. In contrast to the Fc tag, MBP is a stable monomer and does not promote protein aggregation. Therefore, the MBP tag does not induce artificial dimerization of tethered proteins and provides a beneficial fusion tag for binding as well as cell adhesion studies. Using MBP we were able to secret a disease causing laminin β2 mutant protein (congenital nephrotic syndrome), which is normally retained in the endoplasmic reticulum. In summary, this study establishes MBP as a versatile expression tag for protein production in eukaryotic expression systems. PMID:27029048

  6. High Level Expression and Purification of Recombinant Proteins from Escherichia coli with AK-TAG

    PubMed Central

    Luo, Dan; Wen, Caixia; Zhao, Rongchuan; Liu, Xinyu; Liu, Xinxin; Cui, Jingjing; Liang, Joshua G.; Liang, Peng

    2016-01-01

    Adenylate kinase (AK) from Escherichia coli was used as both solubility and affinity tag for recombinant protein production. When fused to the N-terminus of a target protein, an AK fusion protein could be expressed in soluble form and purified to near homogeneity in a single step from Blue-Sepherose via affinity elution with micromolar concentration of P1, P5- di (adenosine—5’) pentaphosphate (Ap5A), a transition-state substrate analog of AK. Unlike any other affinity tags, the level of a recombinant protein expression in soluble form and its yield of recovery during each purification step could be readily assessed by AK enzyme activity in near real time. Coupled to a His-Tag installed at the N-terminus and a thrombin cleavage site at the C terminus of AK, the streamlined method, here we dubbed AK-TAG, could also allow convenient expression and retrieval of a cleaved recombinant protein in high yield and purity via dual affinity purification steps. Thus AK-TAG is a new addition to the arsenal of existing affinity tags for recombinant protein expression and purification, and is particularly useful where soluble expression and high degree of purification are at stake. PMID:27214237

  7. Cloning, Expression, and Purification of Histidine-Tagged Escherichia coli Dihydrodipicolinate Reductase

    PubMed Central

    Trigoso, Yvonne D.; Evans, Russell C.; Karsten, William E.; Chooback, Lilian

    2016-01-01

    The enzyme dihydrodipicolinate reductase (DHDPR) is a component of the lysine biosynthetic pathway in bacteria and higher plants. DHDPR catalyzes the NAD(P)H dependent reduction of 2,3-dihydrodipicolinate to the cyclic imine L-2,3,4,5,-tetrahydropicolinic acid. The dapB gene that encodes dihydrodipicolinate reductase has previously been cloned, but the expression of the enzyme is low and the purification is time consuming. Therefore the E. coli dapB gene was cloned into the pET16b vector to improve the protein expression and simplify the purification. The dapB gene sequence was utilized to design forward and reverse oligonucleotide primers that were used to PCR the gene from Escherichia coli genomic DNA. The primers were designed with NdeI or BamHI restriction sites on the 5’and 3’ terminus respectively. The PCR product was sequenced to confirm the identity of dapB. The gene was cloned into the expression vector pET16b through NdeI and BamHI restriction endonuclease sites. The resulting plasmid containing dapB was transformed into the bacterial strain BL21 (DE3). The transformed cells were utilized to grow and express the histidine-tagged reductase and the protein was purified using Ni-NTA affinity chromatography. SDS/PAGE gel analysis has shown that the protein was 95% pure and has approximate subunit molecular weight of 28 kDa. The protein purification is completed in one day and 3 liters of culture produced approximately 40–50 mgs of protein, an improvement on the previous protein expression and multistep purification. PMID:26815040

  8. Cloning, Expression, and Purification of Histidine-Tagged Escherichia coli Dihydrodipicolinate Reductase.

    PubMed

    Trigoso, Yvonne D; Evans, Russell C; Karsten, William E; Chooback, Lilian

    2016-01-01

    The enzyme dihydrodipicolinate reductase (DHDPR) is a component of the lysine biosynthetic pathway in bacteria and higher plants. DHDPR catalyzes the NAD(P)H dependent reduction of 2,3-dihydrodipicolinate to the cyclic imine L-2,3,4,5,-tetrahydropicolinic acid. The dapB gene that encodes dihydrodipicolinate reductase has previously been cloned, but the expression of the enzyme is low and the purification is time consuming. Therefore the E. coli dapB gene was cloned into the pET16b vector to improve the protein expression and simplify the purification. The dapB gene sequence was utilized to design forward and reverse oligonucleotide primers that were used to PCR the gene from Escherichia coli genomic DNA. The primers were designed with NdeI or BamHI restriction sites on the 5'and 3' terminus respectively. The PCR product was sequenced to confirm the identity of dapB. The gene was cloned into the expression vector pET16b through NdeI and BamHI restriction endonuclease sites. The resulting plasmid containing dapB was transformed into the bacterial strain BL21 (DE3). The transformed cells were utilized to grow and express the histidine-tagged reductase and the protein was purified using Ni-NTA affinity chromatography. SDS/PAGE gel analysis has shown that the protein was 95% pure and has approximate subunit molecular weight of 28 kDa. The protein purification is completed in one day and 3 liters of culture produced approximately 40-50 mgs of protein, an improvement on the previous protein expression and multistep purification. PMID:26815040

  9. Identification of SNP and SSR markers in eggplant using RAD tag sequencing

    PubMed Central

    2011-01-01

    Background The eggplant (Solanum melongena L.) genome is relatively unexplored, especially compared to those of the other major Solanaceae crops tomato and potato. In particular, no SNP markers are publicly available; on the other hand, over 1,000 SSR markers were developed and publicly available. We have combined the recently developed Restriction-site Associated DNA (RAD) approach with Illumina DNA sequencing for rapid and mass discovery of both SNP and SSR markers for eggplant. Results RAD tags were generated from the genomic DNA of a pair of eggplant mapping parents, and sequenced to produce ~17.5 Mb of sequences arrangeable into ~78,000 contigs. The resulting non-redundant genomic sequence dataset consisted of ~45,000 sequences, of which ~29% were putative coding sequences and ~70% were in common between the mapping parents. The shared sequences allowed the discovery of ~10,000 SNPs and nearly 1,000 indels, equivalent to a SNP frequency of 0.8 per Kb and an indel frequency of 0.07 per Kb. Over 2,000 of the SNPs are likely to be mappable via the Illumina GoldenGate assay. A subset of 384 SNPs was used to successfully fingerprint a panel of eggplant germplasm, producing a set of informative diversity data. The RAD sequences also included nearly 2,000 putative SSRs, and primer pairs were designed to amplify 1,155 loci. Conclusion The high throughput sequencing of the RAD tags allowed the discovery of a large number of DNA markers, which will prove useful for extending our current knowledge of the genome organization of eggplant, for assisting in marker-aided selection and for carrying out comparative genomic analyses within the Solanaceae family. PMID:21663628

  10. A Chimeric Affinity Tag for Efficient Expression and Chromatographic Purification of Heterologous Proteins from Plants.

    PubMed

    Sainsbury, Frank; Jutras, Philippe V; Vorster, Juan; Goulet, Marie-Claire; Michaud, Dominique

    2016-01-01

    The use of plants as expression hosts for recombinant proteins is an increasingly attractive option for the production of complex and challenging biopharmaceuticals. Tools are needed at present to marry recent developments in high-yielding gene vectors for heterologous expression with routine protein purification techniques. In this study, we designed the Cysta-tag, a new purification tag for immobilized metal affinity chromatography (IMAC) of plant-made proteins based on the protein-stabilizing fusion partner SlCYS8. We show that the Cysta-tag may be used to readily purify proteins under native conditions, and then be removed enzymatically to isolate the protein of interest. We also show that commonly used protease recognition sites for linking purification tags are differentially stable in leaves of the commonly used expression host Nicotiana benthamiana, with those linkers susceptible to cysteine proteases being less stable then serine protease-cleavable linkers. As an example, we describe a Cysta-tag experimental scheme for the one-step purification of a clinically useful protein, human α1-antitrypsin, transiently expressed in N. benthamiana. With potential applicability to the variety of chromatography formats commercially available for IMAC-based protein purification, the Cysta-tag provides a convenient means for the efficient and cost-effective purification of recombinant proteins from plant tissues. PMID:26913045

  11. A Chimeric Affinity Tag for Efficient Expression and Chromatographic Purification of Heterologous Proteins from Plants

    PubMed Central

    Sainsbury, Frank; Jutras, Philippe V.; Vorster, Juan; Goulet, Marie-Claire; Michaud, Dominique

    2016-01-01

    The use of plants as expression hosts for recombinant proteins is an increasingly attractive option for the production of complex and challenging biopharmaceuticals. Tools are needed at present to marry recent developments in high-yielding gene vectors for heterologous expression with routine protein purification techniques. In this study, we designed the Cysta-tag, a new purification tag for immobilized metal affinity chromatography (IMAC) of plant-made proteins based on the protein-stabilizing fusion partner SlCYS8. We show that the Cysta-tag may be used to readily purify proteins under native conditions, and then be removed enzymatically to isolate the protein of interest. We also show that commonly used protease recognition sites for linking purification tags are differentially stable in leaves of the commonly used expression host Nicotiana benthamiana, with those linkers susceptible to cysteine proteases being less stable then serine protease-cleavable linkers. As an example, we describe a Cysta-tag experimental scheme for the one-step purification of a clinically useful protein, human α1-antitrypsin, transiently expressed in N. benthamiana. With potential applicability to the variety of chromatography formats commercially available for IMAC-based protein purification, the Cysta-tag provides a convenient means for the efficient and cost-effective purification of recombinant proteins from plant tissues. PMID:26913045

  12. Evidence from sequence-tagged-site markers of a recent progenitor-derivative species pair in conifers

    PubMed Central

    Perron, Martin; Perry, Daniel J.; Andalo, Christophe; Bousquet, Jean

    2000-01-01

    Black spruce (Picea mariana [B.S.P.] Mill.) and red spruce (Picea rubens Sarg.) are two conifer species known to hybridize naturally in northeastern North America. We hypothesized that there is a progenitor-derivative relationship between these two taxa and conducted a genetic investigation by using sequence-tagged-site markers of expressed genes. Based on the 26 sequence-tagged-site loci assayed in this study, the unbiased genetic identity between the two taxa was quite high with a value of 0.920. The mean number of polymorphic loci, the mean number of alleles per polymorphic locus, and the average observed heterozygosity were lower in red spruce (P = 35%, AP = 2.1, Ho = 0.069) than in black spruce (P = 54%, AP = 2.9, Ho = 0.103). No unique alleles were found in red spruce, and the observed patterns of allele distribution indicated that the genetic diversity of red spruce was essentially a subset of that found in black spruce. When considered in combination with ecological evidence and simulation results, these observations clearly support the existence of a progenitor-derivative relationship and suggest that the reduced level of genetic diversity in red spruce may result from allopatric speciation through glaciation-induced isolation of a preexisting black spruce population during the Pleistocene era. Our observations signal a need for a thorough reexamination of several conifer species complexes in which natural hybridization is known to occur. PMID:11016967

  13. A physical map of the X chromosome of Drosophila melanogaster: Cosmid contigs and sequence tagged sites

    SciTech Connect

    Madueno, E.; Modolell, J.; Papagiannakis, G.

    1995-04-01

    A physical map of the euchromatic X chromosome of Drosophila melanogaster has been constructed by assembling contiguous arrays of cosmids that were selected by screening a library with DNA isolated from microamplified chromosomal divisions. This map, consisting of 893 cosmids, covers {approximately}64% of the euchromatic part of the chromosome. In addition, 568 sequence tagged sites (STS), in aggregate representing 120 kb of sequenced DNA, were derived from selected cosmids. Most of these STSs, spaced at an average distance of {approximately} 35 kb along the euchromatic region of the chromosome, represent DNA tags that can be used as entry points to the fruitfly genome. Furthermore, 42 genes have been placed on the physical map, either through the hybridization of specific probes to the cosmids or through the fact that they were represented among the STSs. These provide a link between the physical and the genetic maps of D. melanogaster. Nine novel genes have been tentatively identified in Drosophila on the basis of matches between STS sequences and sequences from other species. 32 refs., 3 figs., 4 tabs.

  14. De novo sequencing of unique sequence tags for discovery of post-translational modifications of proteins

    SciTech Connect

    Shen, Yufeng; Tolic, Nikola; Hixson, Kim K.; Purvine, Samuel O.; Anderson, Gordon A.; Smith, Richard D.

    2008-10-15

    De novo sequencing has a promise to discover the protein post-translation modifications; however, such approach is still in their infancy and not widely applied for proteomics practices due to its limited reliability. In this work, we describe a de novo sequencing approach for discovery of protein modifications through identification of the UStags (Anal. Chem. 2008, 80, 1871-1882). The de novo information was obtained from Fourier-transform tandem mass spectrometry for peptides and polypeptides in a yeast lysate, and the de novo sequences obtained were filtered to define a more limited set of UStags. The DNA-predicted database protein sequences were then compared to the UStags, and the differences observed across or in the UStags (i.e., the UStags’ prefix and suffix sequences and the UStags themselves) were used to infer the possible sequence modifications. With this de novo-UStag approach, we uncovered some unexpected variances of yeast protein sequences due to amino acid mutations and/or multiple modifications to the predicted protein sequences. Random matching of the de novo sequences to the predicted sequences were examined with use of two random (false) databases, and ~3% false discovery rates were estimated for the de novo-UStag approach. The factors affecting the reliability (e.g., existence of de novo sequencing noise residues and redundant sequences) and the sensitivity are described. The de novo-UStag complements the UStag method previously reported by enabling discovery of new protein modifications.

  15. Evaluation of Affinity-Tagged Protein Expression Strategies using Local and Global Isotope Ratio Measurements

    SciTech Connect

    Hervey, IV, William Judson; Khalsa-Moyers, Gurusahai K; Lankford, Patricia K; Owens, Elizabeth T; McKeown, Catherine K; Lu, Tse-Yuan S; Foote, Linda J; Morrell-Falvey, Jennifer L; McDonald, W Hayes; Pelletier, Dale A; Hurst, Gregory {Greg} B

    2009-01-01

    Protein enrichments of engineered, affinity-tagged (or bait ) fusion proteins with interaction partners are often laden with background, non-specific proteins, due to interactions that occur in vitro as an artifact of the technique. Furthermore, the in vivo expression of the bait protein may itself affect physiology or metabolism. In this study, intrinsic affinity purification challenges were investigated in a model protein complex, DNA-dependent RNA polymerase (RNAP), encompassing chromosome- and plasmid-encoding strategies for bait proteins in two different microbial species: Escherichia coli and Rhodopseudomonas palustris. Isotope ratio measurements of bait protein expression strains relative to native, wild-type strains were performed by liquid chromatography tandem mass spectrometry (LC-MS-MS) to assess bait protein expression strategies in each species. Authentic interacting proteins of RNAP were successfully discerned from artifactual co-isolating proteins by the isotopic differentiation of interactions as random or targeted (I-DIRT) method (A. J. Tackett et al. J. Proteome Res. 2005, 4 (5), 1752-1756). To investigate broader effects of bait protein production in the bacteria, we compared proteomes from strains harboring a plasmid that encodes an affinity-tagged subunit (RpoA) of the RNAP complex with the corresponding wild-type strains using stable isotope metabolic labeling. The ratio of RpoA abundance in plasmid strains versus wild type was 0.8 for R. palustris and 1.7 for E. coli. While most other proteins showed no appreciable difference, proteins significantly increased in abundance in plasmid-encoded bait-expressing strains of both species included the plasmid encoded antibiotic resistance protein, GenR and proteins involved in amino acid biosynthesis. Together, these local, complex-specific and more global, whole proteome isotopic abundance ratio measurements provided a tool for evaluating both in vivo and in vitro effects of plasmid

  16. Primer and platform effects on 16S rRNA tag sequencing

    SciTech Connect

    Tremblay, Julien; Singh, Kanwar; Fern, Alison; Kirton, Edward S.; He, Shaomei; Woyke, Tanja; Lee, Janey; Chen, Feng; Dangl, Jeffery L.; Tringe, Susannah G.

    2015-08-04

    Sequencing of 16S rRNA gene tags is a popular method for profiling and comparing microbial communities. The protocols and methods used, however, vary considerably with regard to amplification primers, sequencing primers, sequencing technologies; as well as quality filtering and clustering. How results are affected by these choices, and whether data produced with different protocols can be meaningfully compared, is often unknown. Here we compare results obtained using three different amplification primer sets (targeting V4, V6–V8, and V7–V8) and two sequencing technologies (454 pyrosequencing and Illumina MiSeq) using DNA from a mock community containing a known number of species as well as complex environmental samples whose PCR-independent profiles were estimated using shotgun sequencing. We find that paired-end MiSeq reads produce higher quality data and enabled the use of more aggressive quality control parameters over 454, resulting in a higher retention rate of high quality reads for downstream data analysis. While primer choice considerably influences quantitative abundance estimations, sequencing platform has relatively minor effects when matched primers are used. In conclusion, beta diversity metrics are surprisingly robust to both primer and sequencing platform biases.

  17. Primer and platform effects on 16S rRNA tag sequencing

    PubMed Central

    Tremblay, Julien; Singh, Kanwar; Fern, Alison; Kirton, Edward S.; He, Shaomei; Woyke, Tanja; Lee, Janey; Chen, Feng; Dangl, Jeffery L.; Tringe, Susannah G.

    2015-01-01

    Sequencing of 16S rRNA gene tags is a popular method for profiling and comparing microbial communities. The protocols and methods used, however, vary considerably with regard to amplification primers, sequencing primers, sequencing technologies; as well as quality filtering and clustering. How results are affected by these choices, and whether data produced with different protocols can be meaningfully compared, is often unknown. Here we compare results obtained using three different amplification primer sets (targeting V4, V6–V8, and V7–V8) and two sequencing technologies (454 pyrosequencing and Illumina MiSeq) using DNA from a mock community containing a known number of species as well as complex environmental samples whose PCR-independent profiles were estimated using shotgun sequencing. We find that paired-end MiSeq reads produce higher quality data and enabled the use of more aggressive quality control parameters over 454, resulting in a higher retention rate of high quality reads for downstream data analysis. While primer choice considerably influences quantitative abundance estimations, sequencing platform has relatively minor effects when matched primers are used. Beta diversity metrics are surprisingly robust to both primer and sequencing platform biases. PMID:26300854

  18. Primer and platform effects on 16S rRNA tag sequencing

    DOE PAGESBeta

    Tremblay, Julien; Singh, Kanwar; Fern, Alison; Kirton, Edward S.; He, Shaomei; Woyke, Tanja; Lee, Janey; Chen, Feng; Dangl, Jeffery L.; Tringe, Susannah G.

    2015-08-04

    Sequencing of 16S rRNA gene tags is a popular method for profiling and comparing microbial communities. The protocols and methods used, however, vary considerably with regard to amplification primers, sequencing primers, sequencing technologies; as well as quality filtering and clustering. How results are affected by these choices, and whether data produced with different protocols can be meaningfully compared, is often unknown. Here we compare results obtained using three different amplification primer sets (targeting V4, V6–V8, and V7–V8) and two sequencing technologies (454 pyrosequencing and Illumina MiSeq) using DNA from a mock community containing a known number of species as wellmore » as complex environmental samples whose PCR-independent profiles were estimated using shotgun sequencing. We find that paired-end MiSeq reads produce higher quality data and enabled the use of more aggressive quality control parameters over 454, resulting in a higher retention rate of high quality reads for downstream data analysis. While primer choice considerably influences quantitative abundance estimations, sequencing platform has relatively minor effects when matched primers are used. In conclusion, beta diversity metrics are surprisingly robust to both primer and sequencing platform biases.« less

  19. Real-time single-molecule electronic DNA sequencing by synthesis using polymer-tagged nucleotides on a nanopore array

    PubMed Central

    Fuller, Carl W.; Kumar, Shiv; Porel, Mintu; Chien, Minchen; Bibillo, Arek; Stranges, P. Benjamin; Dorwart, Michael; Tao, Chuanjuan; Li, Zengmin; Guo, Wenjing; Shi, Shundi; Korenblum, Daniel; Trans, Andrew; Aguirre, Anne; Liu, Edward; Harada, Eric T.; Pollard, James; Bhat, Ashwini; Cech, Cynthia; Yang, Alexander; Arnold, Cleoma; Palla, Mirkó; Hovis, Jennifer; Chen, Roger; Morozova, Irina; Kalachikov, Sergey; Russo, James J.; Kasianowicz, John J.; Davis, Randy; Roever, Stefan; Church, George M.; Ju, Jingyue

    2016-01-01

    DNA sequencing by synthesis (SBS) offers a robust platform to decipher nucleic acid sequences. Recently, we reported a single-molecule nanopore-based SBS strategy that accurately distinguishes four bases by electronically detecting and differentiating four different polymer tags attached to the 5′-phosphate of the nucleotides during their incorporation into a growing DNA strand catalyzed by DNA polymerase. Further developing this approach, we report here the use of nucleotides tagged at the terminal phosphate with oligonucleotide-based polymers to perform nanopore SBS on an α-hemolysin nanopore array platform. We designed and synthesized several polymer-tagged nucleotides using tags that produce different electrical current blockade levels and verified they are active substrates for DNA polymerase. A highly processive DNA polymerase was conjugated to the nanopore, and the conjugates were complexed with primer/template DNA and inserted into lipid bilayers over individually addressable electrodes of the nanopore chip. When an incoming complementary-tagged nucleotide forms a tight ternary complex with the primer/template and polymerase, the tag enters the pore, and the current blockade level is measured. The levels displayed by the four nucleotides tagged with four different polymers captured in the nanopore in such ternary complexes were clearly distinguishable and sequence-specific, enabling continuous sequence determination during the polymerase reaction. Thus, real-time single-molecule electronic DNA sequencing data with single-base resolution were obtained. The use of these polymer-tagged nucleotides, combined with polymerase tethering to nanopores and multiplexed nanopore sensors, should lead to new high-throughput sequencing methods. PMID:27091962

  20. Real-time single-molecule electronic DNA sequencing by synthesis using polymer-tagged nucleotides on a nanopore array.

    PubMed

    Fuller, Carl W; Kumar, Shiv; Porel, Mintu; Chien, Minchen; Bibillo, Arek; Stranges, P Benjamin; Dorwart, Michael; Tao, Chuanjuan; Li, Zengmin; Guo, Wenjing; Shi, Shundi; Korenblum, Daniel; Trans, Andrew; Aguirre, Anne; Liu, Edward; Harada, Eric T; Pollard, James; Bhat, Ashwini; Cech, Cynthia; Yang, Alexander; Arnold, Cleoma; Palla, Mirkó; Hovis, Jennifer; Chen, Roger; Morozova, Irina; Kalachikov, Sergey; Russo, James J; Kasianowicz, John J; Davis, Randy; Roever, Stefan; Church, George M; Ju, Jingyue

    2016-05-10

    DNA sequencing by synthesis (SBS) offers a robust platform to decipher nucleic acid sequences. Recently, we reported a single-molecule nanopore-based SBS strategy that accurately distinguishes four bases by electronically detecting and differentiating four different polymer tags attached to the 5'-phosphate of the nucleotides during their incorporation into a growing DNA strand catalyzed by DNA polymerase. Further developing this approach, we report here the use of nucleotides tagged at the terminal phosphate with oligonucleotide-based polymers to perform nanopore SBS on an α-hemolysin nanopore array platform. We designed and synthesized several polymer-tagged nucleotides using tags that produce different electrical current blockade levels and verified they are active substrates for DNA polymerase. A highly processive DNA polymerase was conjugated to the nanopore, and the conjugates were complexed with primer/template DNA and inserted into lipid bilayers over individually addressable electrodes of the nanopore chip. When an incoming complementary-tagged nucleotide forms a tight ternary complex with the primer/template and polymerase, the tag enters the pore, and the current blockade level is measured. The levels displayed by the four nucleotides tagged with four different polymers captured in the nanopore in such ternary complexes were clearly distinguishable and sequence-specific, enabling continuous sequence determination during the polymerase reaction. Thus, real-time single-molecule electronic DNA sequencing data with single-base resolution were obtained. The use of these polymer-tagged nucleotides, combined with polymerase tethering to nanopores and multiplexed nanopore sensors, should lead to new high-throughput sequencing methods. PMID:27091962

  1. A protein tagging system for signal amplification in gene expression and fluorescence imaging

    PubMed Central

    Tanenbaum, Marvin E.; Gilbert, Luke A.; Qi, Lei S.; Weissman, Jonathan S.; Vale, Ronald D.

    2014-01-01

    Summary Signals in many biological processes can be amplified by recruiting multiple copies of regulatory proteins to a site of action. Harnessing this principle, we have developed a novel protein scaffold, a repeating peptide array termed SunTag, which can recruit multiple copies of an antibody-fusion protein. We show that the SunTag can recruit up to 24 copies of GFP, thereby enabling long-term imaging of single protein molecules in living cells. We also use the SunTag to create a potent synthetic transcription factor by recruiting multiple copies of a transcriptional activation domain to a nuclease-deficient CRISPR/Cas9 protein and demonstrate strong activation of endogenous gene expression and re-engineered cell behavior with this system. Thus, the SunTag provides a versatile platform for multimerizing proteins on a target protein scaffold and is likely to have many applications in imaging and in controlling biological outputs. PMID:25307933

  2. Myocardial tagging by Cardiovascular Magnetic Resonance: evolution of techniques--pulse sequences, analysis algorithms, and applications

    PubMed Central

    2011-01-01

    Cardiovascular magnetic resonance (CMR) tagging has been established as an essential technique for measuring regional myocardial function. It allows quantification of local intramyocardial motion measures, e.g. strain and strain rate. The invention of CMR tagging came in the late eighties, where the technique allowed for the first time for visualizing transmural myocardial movement without having to implant physical markers. This new idea opened the door for a series of developments and improvements that continue up to the present time. Different tagging techniques are currently available that are more extensive, improved, and sophisticated than they were twenty years ago. Each of these techniques has different versions for improved resolution, signal-to-noise ratio (SNR), scan time, anatomical coverage, three-dimensional capability, and image quality. The tagging techniques covered in this article can be broadly divided into two main categories: 1) Basic techniques, which include magnetization saturation, spatial modulation of magnetization (SPAMM), delay alternating with nutations for tailored excitation (DANTE), and complementary SPAMM (CSPAMM); and 2) Advanced techniques, which include harmonic phase (HARP), displacement encoding with stimulated echoes (DENSE), and strain encoding (SENC). Although most of these techniques were developed by separate groups and evolved from different backgrounds, they are in fact closely related to each other, and they can be interpreted from more than one perspective. Some of these techniques even followed parallel paths of developments, as illustrated in the article. As each technique has its own advantages, some efforts have been made to combine different techniques together for improved image quality or composite information acquisition. In this review, different developments in pulse sequences and related image processing techniques are described along with the necessities that led to their invention, which makes this

  3. De novo sequencing of unique sequence tags for discovery of post-translational modifications of proteins.

    PubMed

    Shen, Yufeng; Tolić, Nikola; Hixson, Kim K; Purvine, Samuel O; Anderson, Gordon A; Smith, Richard D

    2008-10-15

    De novo sequencing is a spectrum analysis approach for mass spectrometry data to discover post-translational modifications in proteins; however, such an approach is still in its infancy and is still not widely applied to proteomic practices due to its limited reliability. In this work, we describe a de novo sequencing approach for the discovery of protein modifications based on identification of the proteome UStags (Shen, Y.; Tolić, N.; Hixson, K. K.; Purvine, S. O.; Pasa-Tolić, L.; Qian, W. J.; Adkins, J. N.; Moore, R. J.; Smith, R. D. Anal. Chem. 2008, 80, 1871-1882). The de novo information was obtained from Fourier-transform tandem mass spectrometry data for peptides and polypeptides from a yeast lysate, and the de novo sequences obtained were selected based on filter levels designed to provide a limited yet high quality subset of UStags. The DNA-predicted database protein sequences were then compared to the UStags, and the differences observed across or in the UStags (i.e., the UStags' prefix and suffix sequences and the UStags themselves) were used to infer possible sequence modifications. With this de novo-UStag approach, we uncovered some unexpected variances within several yeast protein sequences due to amino acid mutations and/or multiple modifications to the predicted protein sequences. To determine false discovery rates, two random (false) databases were independently used for sequence matching, and ~3% false discovery rates were estimated for the de novo-UStag approach. The factors affecting the reliability (e.g., existence of de novo sequencing noise residues and redundant sequences) and the sensitivity of the approach were investigated and described. The combined de novo-UStag approach complements the UStag method previously reported by enabling the discovery of new protein modifications. PMID:18783246

  4. Moving Away from the Reference Genome: Evaluating a Peptide Sequencing Tagging Approach for Single Amino Acid Polymorphism Identifications in the Genus Populus

    SciTech Connect

    Abraham, Paul E; Adams, Rachel M; Tuskan, Gerald A; Hettich, Robert {Bob} L

    2013-01-01

    The genetic diversity across natural populations of the model organism, Populus, is extensive, containing a single nucleotide polymorphism roughly every 200 base pairs. When deviations from the reference genome occur in coding regions, they can impact protein sequences. Rather than relying on a static reference database to profile protein expression, we employed a peptide sequence tagging (PST) approach capable of decoding the plasticity of the Populus proteome. Using shotgun proteomics data from two genotypes of P. trichocarpa, a tag-based approach enabled the detection of 6,653 unexpected sequence variants. Through manual validation, our study investigated how the most abundant chemical modification (methionine oxidation) could masquerade as a sequence variant (AlaSer) when few site-determining ions existed. In fact, precise localization of an oxidation site for peptides with more than one potential placement was indeterminate for 70% of the MS/MS spectra. We demonstrate that additional fragment ions made available by high energy collisional dissociation enhances the robustness of the peptide sequence tagging approach (81% of oxidation events could be exclusively localized to a methionine). We are confident that augmenting fragmentation processes for a PST approach will further improve the identification of single amino acid polymorphism in Populus and potentially other species as well.

  5. EXPRESSED SEQUENCE TAGS (ESTS), FUM CLUSTER TRANSCRIPTION FACTOR AND MICROARRAYS

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Fumonisins are a family of mycotoxins produced by the filamentous fungus Fusarium verticillioides that often contaminates maize around the globe. F. verticillioides is generally an endophyte of maize, but under some conditions, it can cause seedling disease and ear, root, and stalk rots, as well as...

  6. Apple root expressed sequence tags (ESTs) responding to water deficit

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Water availability is frequently the most critical factor limiting plant growth and survival. For crops such as cereals and herbaceous annuals, breeding and/or selecting lines with improved water use efficiency or drought resistance has been a viable option. However, for slow growing perennials an...

  7. Generation and analysis of end sequence database for T-DNA tagging lines in rice.

    PubMed

    An, Suyoung; Park, Sunhee; Jeong, Dong-Hoon; Lee, Dong-Yeon; Kang, Hong-Gyu; Yu, Jung-Hwa; Hur, Junghe; Kim, Sung-Ryul; Kim, Young-Hea; Lee, Miok; Han, Soonki; Kim, Soo-Jin; Yang, Jungwon; Kim, Eunjoo; Wi, Soo Jin; Chung, Hoo Sun; Hong, Jong-Pil; Choe, Vitnary; Lee, Hak-Kyung; Choi, Jung-Hee; Nam, Jongmin; Kim, Seong-Ryong; Park, Phun-Bum; Park, Ky Young; Kim, Woo Taek; Choe, Sunghwa; Lee, Chin-Bum; An, Gynheung

    2003-12-01

    We analyzed 6749 lines tagged by the gene trap vector pGA2707. This resulted in the isolation of 3793 genomic sequences flanking the T-DNA. Among the insertions, 1846 T-DNAs were integrated into genic regions, and 1864 were located in intergenic regions. Frequencies were also higher at the beginning and end of the coding regions and upstream near the ATG start codon. The overall GC content at the insertion sites was close to that measured from the entire rice (Oryza sativa) genome. Functional classification of these 1846 tagged genes showed a distribution similar to that observed for all the genes in the rice chromosomes. This indicates that T-DNA insertion is not biased toward a particular class of genes. There were 764, 327, and 346 T-DNA insertions in chromosomes 1, 4 and 10, respectively. Insertions were not evenly distributed; frequencies were higher at the ends of the chromosomes and lower near the centromere. At certain sites, the frequency was higher than in the surrounding regions. This sequence database will be valuable in identifying knockout mutants for elucidating gene function in rice. This resource is available to the scientific community at http://www.postech.ac.kr/life/pfg/risd. PMID:14630961

  8. Species diagnostic single-nucleotide polymorphism and sequence-tagged site markers for the parasitic WASP Genus Nasonia (Hymenoptera: Ptermalidae)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    We developed, identified and evaluated eight single nucleotide polymorphism (SNP) and three sequence-tagged site (STS) markers in nuclear gene sequences of the wasp genus Nasonia (Hymenoptera). We studied variation of these markers in natural populations of the closely related and regionally sympatr...

  9. Tagging the expressed protein with 6 histidines: rapid cloning of an amplicon with three options.

    PubMed

    Singh, Manika Indrajit; Jain, Vikas

    2013-01-01

    We report the designing of three expression vectors that can be used for rapid cloning of any blunt-end DNA segment. Only a single set of oligonucleotides are required to perform the amplification of the target DNA and its cloning in all three vectors simultaneously. The DNA thus cloned can express a protein either with or without a hexa-histidine tag depending upon the vector used. The expression occurs from T7 promoter when transformed into E. coli BL21(DE3). Two of the three plasmids have been designed to provide the expressed protein with either N- or C-terminus 6 histidine amino acids in tandem. The third plasmid, however, does not add any tag to the expressed protein. The cloning is achieved quickly with the requirement of phosphorylation of PCR product without any restriction digestion. Additionally, the generated clones can be confirmed with a single step PCR reaction carried out from bacterial colonies (generally termed as "colony PCR"). We show the cloning, expression and purification of Green Fluorescent Protein (GFP) as proof-of-concept. Additionally, we also show the cloning and expression of four sigma factors from Mycobacterium tuberculosis further demonstrating the utility of the designed plasmids. We strongly believe that the vectors and the strategy that we have developed will facilitate the rapid cloning and expression of any gene in E. coli BL21(DE3) with or without a hexa-histidine tag. PMID:23691118

  10. Feature expressions: creating and manipulating sequence datasets.

    PubMed

    Fristensky, B

    1993-12-25

    Annotation of features, such as introns, exons and protein coding regions in GenBank/EMBL/DDBJ entries is now standardized through use of the Features Table (FT) language. The essence of the FT language is described by the relation 'expression-->sequence', meaning that each FT expression evaluates to a sequence. For example, the expression M74750:1..50 evaluates to the first 50 bases of the sequence with accession number M74750. Because FT is intrinsic to the database definition, it can serve as a software- and platform-independent lingua franca for sequence manipulation. The XYLEM package makes it possible to create and manipulate sequence datasets using FT expressions. FEATURES is a program that resolves FT expressions into their corresponding sequences. Annotated features can be retrieved either by feature key or by expression. Even unannotated portions of a sequence can be retrieved by user-generated FT expressions. Applications of the FT language include retrieval of subsequences from large sequence entries, generation of chromosome models or artificial DNA constructs, and representation of restriction maps or mutants. PMID:8290362

  11. WEBSAGE: a web tool for visual analysis of differentially expressed human SAGE tags.

    PubMed

    Pylouster, Jean; Sénamaud-Beaufort, Catherine; Saison-Behmoaras, Tula Ester

    2005-07-01

    The serial analysis of gene expression (SAGE) is a powerful method to compare gene expression of mRNA populations. To provide quantitative expression levels on a genome-wide scale, the Cancer Genome Anatomy Project (CGAP) uses SAGE. Over 7 million SAGE tags, from 171 human cell types have been assembled. The growing number of laboratories involved in SAGE research necessitates the use of software that provides statistical analysis of raw data, allowing the rapid visualization and interpretation of results. We have created the first simple tool that performs statistical analysis on SAGE data, identifies the tags differentially expressed and shows the results in a scatter plot. It is freely available and accessible at http://bioserv.rpbs.jussieu.fr/websage/index.php. PMID:15980565

  12. Identification of Simple Sequence Repeat Biomarkers through Cross-Species Comparison in a Tag Cloud Representation

    PubMed Central

    2014-01-01

    Simple sequence repeats (SSRs) are not only applied as genetic markers in evolutionary studies but they also play an important role in gene regulatory activities. Efficient identification of conserved and exclusive SSRs through cross-species comparison is helpful for understanding the evolutionary mechanisms and associations between specific gene groups and SSR motifs. In this paper, we developed an online cross-species comparative system and integrated it with a tag cloud visualization technique for identifying potential SSR biomarkers within fourteen frequently used model species. Ultraconserved or exclusive SSRs among cross-species orthologous genes could be effectively retrieved and displayed through a friendly interface design. Four different types of testing cases were applied to demonstrate and verify the retrieved SSR biomarker candidates. Through statistical analysis and enhanced tag cloud representation on defined functional related genes and cross-species clusters, the proposed system can correctly represent the patterns, loci, colors, and sizes of identified SSRs in accordance with gene functions, pattern qualities, and conserved characteristics among species. PMID:24800246

  13. N-terminal tagging of the dopamine transporter impairs protein expression and trafficking in vivo

    PubMed Central

    Vecchio, Laura M.; Bermejo, M. Kristel; Beerepoot, Pieter; Ramsey, Amy J.

    2014-01-01

    The dopamine transporter (DAT) is the primary protein responsible for the uptake of dopamine from the extracellular space back into presynaptic neurons. As such, it plays an important role in the cessation of dopaminergic neurotransmission and in the maintenance of extracellular dopamine homeostasis. Here, we report the development of a new BAC transgenic mouse line that expresses DAT with an N-terminal HA-epitope (HAD-Tg). In this line, two copies of the HA-DAT BAC are incorporated into the genome, increasing DAT mRNA levels by 47%. Despite the increase in mRNA levels, HAD-Tg mice show no significant increase in the level of DAT protein in the striatum, indicating a defect in protein trafficking or stability. By crossing HAD-Tg mice with DAT knockout mice (DAT-KO), we engineered mice that exclusively express HA-tagged DAT in the absence of endogenous DAT (DAT-KO/HAD-Tg). We show that DAT-KO/HAD-Tg mice express only 8.5% of WT DAT levels in the striatum. Importantly, the HA-tagged DAT that is present in DAT-KO/HAD-Tg mice is functional, as it is able to partially rescue the DAT-KO hyperactive phenotype. Finally, we provide evidence that the HA-tagged DAT is retained in the cell body based on a reduction in the striatum:midbrain protein ratio. These results demonstrate that the presence of the N-terminal tag leads to impaired DAT protein expression in vivo due in part to improper trafficking of the tagged transporter, and highlight the importance of the N-terminus in the transport of DAT to striatal terminals. PMID:24886986

  14. Gene CATCHR--gene cloning and tagging for Caenorhabditis elegans using yeast homologous recombination: a novel approach for the analysis of gene expression.

    PubMed

    Sassi, Holly E; Renihan, Stephanie; Spence, Andrew M; Cooperstock, Ramona L

    2005-01-01

    Expression patterns of gene products provide important insights into gene function. Reporter constructs are frequently used to analyze gene expression in Caenorhabditis elegans, but the sequence context of a given gene is inevitably altered in such constructs. As a result, these transgenes may lack regulatory elements required for proper gene expression. We developed Gene Catchr, a novel method of generating reporter constructs that exploits yeast homologous recombination (YHR) to subclone and tag worm genes while preserving their local sequence context. YHR facilitates the cloning of large genomic regions, allowing the isolation of regulatory sequences in promoters, introns, untranslated regions and flanking DNA. The endogenous regulatory context of a given gene is thus preserved, producing expression patterns that are as accurate as possible. Gene Catchr is flexible: any tag can be inserted at any position without introducing extra sequence. Each step is simple and can be adapted to process multiple genes in parallel. We show that expression patterns derived from Gene Catchr transgenes are consistent with previous reports and also describe novel expression data. Mutant rescue assays demonstrate that Gene Catchr-generated transgenes are functional. Our results validate the use of Gene Catchr as a valuable tool to study spatiotemporal gene expression. PMID:16254074

  15. Expression and purification of human PYY(3-36) in Escherichia coli using a His-tagged small ubiquitin-like modifier fusion.

    PubMed

    Fazen, Christopher H; Kahkoska, Anna R; Doyle, Robert P

    2012-09-01

    Human PYY(3-36) (hPYY3-36) is a 34 amino acid hormone that has received a great deal of attention due to its effects on appetite regulation. hPYY(3-36) was modified at the N-terminus with an octahistidine tag and factor Xa protease sequence along with the small ubiquitin-like modifier (SUMO) tag and expressed in Escherichia coli. The protein was purified from clarified E. coli lysate by immobilized metal affinity chromatography (IMAC) with a yield of 30±7 mg/L of induced culture returned as an average over seven runs, and its identity was confirmed by Western blot and hPYY antibody recognition. The SUMO-tagged hPYY(3-36) was digested with two different proteases to return either His-tagged hPYY(3-36) or unmodified hPYY(3-36): (1) digestion with SUMO protease proceeded at about 50% efficiency yielding His-tagged hPYY(3-36); (2) digestion with factor Xa protease proceeded at greater than 90% efficiency yielding final hPYY(3-36). Products were purified from the digestion mixtures by reverse-phase high-performance liquid chromatography (C(18)) or IMAC, respectively, the identities were confirmed by mass spectrometry and hPYY antibody recognition, and the folded state of His-tagged hPYY(3-36) was investigated by circular dichroism spectroscopy. PMID:22771863

  16. Serial number tagging reveals a prominent sequence preference of retrotransposon integration.

    PubMed

    Chatterjee, Atreyi Ghatak; Esnault, Caroline; Guo, Yabin; Hung, Stevephen; McQueen, Philip G; Levin, Henry L

    2014-07-01

    Transposable elements (TE) have both negative and positive impact on the biology of their host. As a result, a balance is struck between the host and the TE that relies on directing integration to specific genome territories. The extraordinary capacity of DNA sequencing can create ultra dense maps of integration that are being used to study the mechanisms that position integration. Unfortunately, the great increase in the numbers of insertion sites detected comes with the cost of not knowing which positions are rare targets and which sustain high numbers of insertions. To address this problem we developed the serial number system, a TE tagging method that measures the frequency of integration at single nucleotide positions. We sequenced 1 million insertions of retrotransposon Tf1 in the genome of Schizosaccharomyces pombe and obtained the first profile of integration with frequencies for each individual position. Integration levels at individual nucleotides varied over two orders of magnitude and revealed that sequence recognition plays a key role in positioning integration. The serial number system is a general method that can be applied to determine precise integration maps for retroviruses and gene therapy vectors. PMID:24948612

  17. Expression Profiling Using New Generation Sequencing Technologies

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Microarray hybridization technology has become widely used in parallel analysis of gene expression. Recent advances in genome sequencing platforms point to an alternate approach through digital quantitation of sequencing reads produced from cDNA samples. This presentation will compare advantages a...

  18. The Use of Affinity Tags to Overcome Obstacles in Recombinant Protein Expression and Purification.

    PubMed

    Amarasinghe, Chinthaka; Jin, Jian-Ping

    2015-01-01

    Research and industrial demands for recombinant proteins continue to increase over time for their broad applications in structural and functional studies and as therapeutic agents. These applications often require large quantities of recombinant protein at desirable purity, which highlights the importance of developing and improving production approaches that provide high level expression and readily achievable purity of recombinant protein. E. coli is the most widely used host for the expression of a diverse range of proteins at low cost. However, there are common pitfalls that can severely limit the expression of exogenous proteins, such as stability, low solubility and toxicity to the host cell. To overcome these obstacles, one strategy that has found to be promising is the use of affinity tags or carrier peptide to aid in the folding of the target protein, increase solubility, lower toxicity and increase the level of expression. In the meantime, the tags and fusion proteins can be designed to facilitate affinity purification. Since the fusion protein may not exhibit the native conformation of the target protein, various strategies have been developed to remove the tag during or after purification to avoid potential complications in structural and functional studies and to obtain native biological activities. Despite extensive research and rapid development along these lines, there are unsolved problems and imperfect applications. This focused review compares and contrasts various strategies that employ affinity tags to improve bacterial expression and to facilitate purification of recombinant proteins. The pros and cons of the approaches are discussed for more effective applications and new directions of future improvement. PMID:26216265

  19. Repetitive genome elements in a European corn borer, Ostrinia nubilalis, bacterial artificial chromosome library were indicated by bacterial artificial chromosome end sequencing and development of sequence tag site markers: implications for lepidopteran genomic research.

    PubMed

    Coates, Brad S; Sumerford, Douglas V; Hellmich, Richard L; Lewis, Leslie C

    2009-01-01

    The European corn borer, Ostrinia nubilalis, is a serious pest of food, fiber, and biofuel crops in Europe, North America, and Asia and a model system for insect olfaction and speciation. A bacterial artificial chromosome library constructed for O. nubilalis contains 36 864 clones with an estimated average insert size of >or=120 kb and genome coverage of 8.8-fold. Screening OnB1 clones comprising approximately 2.76 genome equivalents determined the physical position of 24 sequence tag site markers, including markers linked to ecologically important and Bacillus thuringiensis toxin resistance traits. OnB1 bacterial artificial chromosome end sequence reads (GenBank dbGSS accessions ET217010 to ET217273) showed homology to annotated genes or expressed sequence tags and identified repetitive genome elements, O. nubilalis miniature subterminal inverted repeat transposable elements (OnMITE01 and OnMITE02), and ezi-like long interspersed nuclear elements. Mobility of OnMITE01 was demonstrated by the presence or absence in O. nubilalis of introns at two different loci. A (GTCT)n tetranucleotide repeat at the 5' ends of OnMITE01 and OnMITE02 are evidence for transposon-mediated movement of lepidopteran microsatellite loci. The number of repetitive elements in lepidopteran genomes will affect genome assembly and marker development. Single-locus sequence tag site markers described here have downstream application for integration within linkage maps and comparative genomic studies. PMID:19132072

  20. The non-coding RNA composition of the mitotic chromosome by 5′-tag sequencing

    PubMed Central

    Meng, Yicong; Yi, Xianfu; Li, Xinhui; Hu, Chuansheng; Wang, Ju; Bai, Ling; Czajkowsky, Daniel M.; Shao, Zhifeng

    2016-01-01

    Mitotic chromosomes are one of the most commonly recognized sub-cellular structures in eukaryotic cells. Yet basic information necessary to understand their structure and assembly, such as their composition, is still lacking. Recent proteomic studies have begun to fill this void, identifying hundreds of RNA-binding proteins bound to mitotic chromosomes. However, by contrast, there are only two RNA species (U3 snRNA and rRNA) that are known to be associated with the mitotic chromosome, suggesting that there are many mitotic chromosome-associated RNAs (mCARs) not yet identified. Here, using a targeted protocol based on 5′-tag sequencing to profile the mammalian mCAR population, we report the identification of 1279 mCARs, the majority of which are ncRNAs, including lncRNAs that exhibit greater conservation across 60 vertebrate species than the entire population of lncRNAs. There is also a significant enrichment of snoRNAs and specific SINE RNAs. Finally, ∼40% of the mCARs are presently unannotated, many of which are as abundant as the annotated mCARs, suggesting that there are also many novel ncRNAs in the mCARs. Overall, the mCARs identified here, together with the previous proteomic and genomic data, constitute the first comprehensive catalogue of the molecular composition of the eukaryotic mitotic chromosomes. PMID:27016738

  1. A new method to identify flanking sequence tags in chlamydomonas using 3’-RACE

    PubMed Central

    2012-01-01

    Background The green alga Chlamydomonas reinhardtii, although a premier model organism in biology, still lacks extensive insertion mutant libraries with well-identified Flanking Sequence Tags (FSTs). Rapid and efficient methods are needed for FST retrieval. Results Here, we present a novel method to identify FSTs in insertional mutants of Chlamydomonas. Transformants can be obtained with a resistance cassette lacking a 3’ untranslated region (UTR), suggesting that the RNA that is produced from the resistance marker terminates in the flanking genome when it encounters a cleavage/polyadenylation signal. We have used a robust 3’-RACE method to specifically amplify such chimeric cDNAs. Out of 38 randomly chosen transformants, 27 (71%) yielded valid FSTs, of which 23 could be unambiguously mapped to the genome. Eighteen of the mutants lie within a predicted gene. All but two of the intragenic insertions occur in the sense orientation with respect to transcription, suggesting a bias against situations of convergent transcription. Among the 14 insertion sites tested by genomic PCR, 12 could be confirmed. Among these are insertions in genes coding for PSBS3 (possibly involved in non-photochemical quenching), the NimA-related protein kinase CNK2, the mono-dehydroascorbate reductase MDAR1, the phosphoglycerate mutase PGM5 etc.. Conclusion We propose that our 3’-RACE FST method can be used to build large scale FST libraries in Chlamydomonas and other transformable organisms. PMID:22735168

  2. The non-coding RNA composition of the mitotic chromosome by 5'-tag sequencing.

    PubMed

    Meng, Yicong; Yi, Xianfu; Li, Xinhui; Hu, Chuansheng; Wang, Ju; Bai, Ling; Czajkowsky, Daniel M; Shao, Zhifeng

    2016-06-01

    Mitotic chromosomes are one of the most commonly recognized sub-cellular structures in eukaryotic cells. Yet basic information necessary to understand their structure and assembly, such as their composition, is still lacking. Recent proteomic studies have begun to fill this void, identifying hundreds of RNA-binding proteins bound to mitotic chromosomes. However, by contrast, there are only two RNA species (U3 snRNA and rRNA) that are known to be associated with the mitotic chromosome, suggesting that there are many mitotic chromosome-associated RNAs (mCARs) not yet identified. Here, using a targeted protocol based on 5'-tag sequencing to profile the mammalian mCAR population, we report the identification of 1279 mCARs, the majority of which are ncRNAs, including lncRNAs that exhibit greater conservation across 60 vertebrate species than the entire population of lncRNAs. There is also a significant enrichment of snoRNAs and specific SINE RNAs. Finally, ∼40% of the mCARs are presently unannotated, many of which are as abundant as the annotated mCARs, suggesting that there are also many novel ncRNAs in the mCARs. Overall, the mCARs identified here, together with the previous proteomic and genomic data, constitute the first comprehensive catalogue of the molecular composition of the eukaryotic mitotic chromosomes. PMID:27016738

  3. pISTil: a pipeline for yeast two-hybrid Interaction Sequence Tags identification and analysis

    PubMed Central

    Pellet, Johann; Meyniel, Laurène; Vidalain, Pierre-Olivier; de Chassey, Benoît; Tafforeau, Lionel; Lotteau, Vincent; Rabourdin-Combe, Chantal; Navratil, Vincent

    2009-01-01

    Background High-throughput screening of protein-protein interactions opens new systems biology perspectives for the comprehensive understanding of cell physiology in normal and pathological conditions. In this context, yeast two-hybrid system appears as a promising approach to efficiently reconstruct protein interaction networks at the proteome-wide scale. This protein interaction screening method generates a large amount of raw sequence data, i.e. the ISTs (Interaction Sequence Tags), which urgently need appropriate tools for their systematic and standardised analysis. Findings We develop pISTil, a bioinformatics pipeline combined with a user-friendly web-interface: (i) to establish a standardised system to analyse and to annotate ISTs generated by two-hybrid technologies with high performance and flexibility and (ii) to provide high-quality protein-protein interaction datasets for systems-level approach. This pipeline has been validated on a large dataset comprising more than 11.000 ISTs. As a case study, a detailed analysis of ISTs obtained from yeast two-hybrid screens of Hepatitis C Virus proteins against human cDNA libraries is also provided. Conclusion We have developed pISTil, an open source pipeline made of a collection of several applications governed by a Perl script. The pISTil pipeline is intended to laboratories, with IT-expertise in system administration, scripting and database management, willing to automatically process large amount of ISTs data for accurate reconstruction of protein interaction networks in a systems biology perspective. pISTil is publicly available for download at . PMID:19874608

  4. Transferability of microsatellite and sequence tagged site markers in Oryza species.

    PubMed

    Brondani, Claudio; Rangel, Paulo Hideo Nakano; Borba, Tereza Cristina Oliveira; Brondani, Rosana Pereira Vianello

    2003-01-01

    The genus Oryza comprises 22 species which are potentially useful as a source of genetic variability that can be introgressed into the worldwide cultivated rice, Oryza sativa. Molecular markers are useful tools for monitoring gene introgressions and for detecting polymorphism among species. In this study, cross-amplification was estimated among 28 accessions of 16 Oryza species, representing the genomes AA, BB, CC, BBCC and CCDD, using 59 microsatellite (OG, OS and RM series) and 15 STS (Sequence Tagged Sites) markers. All markers amplified at least one Oryza species, indicating different levels of transferability across species. Markers based on microsatellite sequences amplified 37 % of the accessions, with an average of 6.58 alleles per locus and an average polymorphism information content (PIC) of 70 %. For STS markers, the amplification level was 53.3 %, and the average number of alleles and PIC values were 1.6 and 10 %, respectively. These Results showed that although the STS markers detected a reduced level of genetic diversity, the transferability was higher, indicating that they can be used for genetic analysis when evaluating less genetically related species of Oryza. Among the microsatellite markers, an analysis of species with an AA genome showed that the OG markers produced the highest level of polymorphic loci (54.6 %), followed by RM markers (48 %). Highly polymorphic and transferable molecular markers in Oryza can be useful for exploiting the genetic resources of this genus, for detecting allelic variants in loci associated with important agronomic traits, and for monitoring alleles introgressed from wild relatives to cultivated rice. PMID:14641482

  5. Sorting of a HaloTag protein that has only a signal peptide sequence into exocrine secretory granules without protein aggregation.

    PubMed

    Fujita-Yoshigaki, Junko; Matsuki-Fukushima, Miwako; Yokoyama, Megumi; Katsumata-Kato, Osamu

    2013-11-15

    The mechanism involved in the sorting and accumulation of secretory cargo proteins, such as amylase, into secretory granules of exocrine cells remains to be solved. To clarify that sorting mechanism, we expressed a reporter protein HaloTag fused with partial sequences of salivary amylase protein in primary cultured parotid acinar cells. We found that a HaloTag protein fused with only the signal peptide sequence (Met(1)-Ala(25)) of amylase, termed SS25H, colocalized well with endogenous amylase, which was confirmed by immunofluorescence microscopy. Percoll-density gradient centrifugation of secretory granule fractions shows that the distributions of amylase and SS25H were similar. These results suggest that SS25H is transported to secretory granules and is not discriminated from endogenous amylase by the machinery that functions to remove proteins other than granule cargo from immature granules. Another reporter protein, DsRed2, that has the same signal peptide sequence also colocalized with amylase, suggesting that the sorting to secretory granules is not dependent on a characteristic of the HaloTag protein. Whereas Blue Native PAGE demonstrates that endogenous amylase forms a high-molecular-weight complex, SS25H does not participate in the complex and does not form self-aggregates. Nevertheless, SS25H was released from cells by the addition of a β-adrenergic agonist, isoproterenol, which also induces amylase secretion. These results indicate that addition of the signal peptide sequence, which is necessary for the translocation in the endoplasmic reticulum, is sufficient for the transportation and storage of cargo proteins in secretory granules of exocrine cells. PMID:24029466

  6. HaloTag is an effective expression and solubilisation fusion partner for a range of fibroblast growth factors

    PubMed Central

    Taylor, Sarah E.; Mao, Xianqing; Wilkinson, Mark C.

    2015-01-01

    The production of recombinant proteins such as the fibroblast growth factors (FGFs) is the key to establishing their function in cell communication. The production of recombinant FGFs in E. coli is limited, however, due to expression and solubility problems. HaloTag has been used as a fusion protein to introduce a genetically-encoded means for chemical conjugation of probes. We have expressed 11 FGF proteins with an N-terminal HaloTag, followed by a tobacco etch virus (TEV) protease cleavage site to allow release of the FGF protein. These were purified by heparin-affinity chromatography, and in some instances by further ion-exchange chromatography. It was found that HaloTag did not adversely affect the expression of FGF1 and FGF10, both of which expressed well as soluble proteins. The N-terminal HaloTag fusion was found to enhance the expression and yield of FGF2, FGF3 and FGF7. Moreover, whereas FGF6, FGF8, FGF16, FGF17, FGF20 and FGF22 were only expressed as insoluble proteins, their N-terminal HaloTag fusion counterparts (Halo-FGFs) were soluble, and could be successfully purified. However, cleavage of Halo-FGF6, -FGF8 and -FGF22 with TEV resulted in aggregation of the FGF protein. Measurement of phosphorylation of p42/44 mitogen-activated protein kinase and of cell growth demonstrated that the HaloTag fusion proteins were biologically active. Thus, HaloTag provides a means to enhance the expression of soluble recombinant proteins, in addition to providing a chemical genetics route for covalent tagging of proteins. PMID:26137434

  7. Complexity of murine cardiomyocyte miRNA biogenesis, sequence variant expression and function.

    PubMed

    Humphreys, David T; Hynes, Carly J; Patel, Hardip R; Wei, Grace H; Cannon, Leah; Fatkin, Diane; Suter, Catherine M; Clancy, Jennifer L; Preiss, Thomas

    2012-01-01

    microRNAs (miRNAs) are critical to heart development and disease. Emerging research indicates that regulated precursor processing can give rise to an unexpected diversity of miRNA variants. We subjected small RNA from murine HL-1 cardiomyocyte cells to next generation sequencing to investigate the relevance of such diversity to cardiac biology. ∼40 million tags were mapped to known miRNA hairpin sequences as deposited in miRBase version 16, calling 403 generic miRNAs as appreciably expressed. Hairpin arm bias broadly agreed with miRBase annotation, although 44 miR* were unexpectedly abundant (>20% of tags); conversely, 33 -5p/-3p annotated hairpins were asymmetrically expressed. Overall, variability was infrequent at the 5' start but common at the 3' end of miRNAs (5.2% and 52.3% of tags, respectively). Nevertheless, 105 miRNAs showed marked 5' isomiR expression (>20% of tags). Among these was miR-133a, a miRNA with important cardiac functions, and we demonstrated differential mRNA targeting by two of its prevalent 5' isomiRs. Analyses of miRNA termini and base-pairing patterns around Drosha and Dicer cleavage regions confirmed the known bias towards uridine at the 5' most position of miRNAs, as well as supporting the thermodynamic asymmetry rule for miRNA strand selection and a role for local structural distortions in fine tuning miRNA processing. We further recorded appreciable expression of 5 novel miR*, 38 extreme variants and 8 antisense miRNAs. Analysis of genome-mapped tags revealed 147 novel candidate miRNAs. In summary, we revealed pronounced sequence diversity among cardiomyocyte miRNAs, knowledge of which will underpin future research into the mechanisms involved in miRNA biogenesis and, importantly, cardiac function, disease and therapy. PMID:22319597

  8. The Arabidopsis Root Transcriptome by Serial Analysis of Gene Expression. Gene Identification Using the Genome Sequence1

    PubMed Central

    Fizames, Cécile; Muños, Stéphane; Cazettes, Céline; Nacry, Philippe; Boucherez, Jossia; Gaymard, Frédéric; Piquemal, David; Delorme, Valérie; Commes, Thérèse; Doumas, Patrick; Cooke, Richard; Marti, Jacques; Sentenac, Hervé; Gojon, Alain

    2004-01-01

    Large-scale identification of genes expressed in roots of the model plant Arabidopsis was performed by serial analysis of gene expression (SAGE), on a total of 144,083 sequenced tags, representing at least 15,964 different mRNAs. For tag to gene assignment, we developed a computational approach based on 26,620 genes annotated from the complete sequence of the genome. The procedure selected warrants the identification of the genes corresponding to the majority of the tags found experimentally, with a high level of reliability, and provides a reference database for SAGE studies in Arabidopsis. This new resource allowed us to characterize the expression of more than 3,000 genes, for which there is no expressed sequence tag (EST) or cDNA in the databases. Moreover, 85% of the tags were specific for one gene. To illustrate this advantage of SAGE for functional genomics, we show that our data allow an unambiguous analysis of most of the individual genes belonging to 12 different ion transporter multigene families. These results indicate that, compared with EST-based tag to gene assignment, the use of the annotated genome sequence greatly improves gene identification in SAGE studies. However, more than 6,000 different tags remained with no gene match, suggesting that a significant proportion of transcripts present in the roots originate from yet unknown or wrongly annotated genes. The root transcriptome characterized in this study markedly differs from those obtained in other organs, and provides a unique resource for investigating the functional specificities of the root system. As an example of the use of SAGE for transcript profiling in Arabidopsis, we report here the identification of 270 genes differentially expressed between roots of plants grown either with NO3- or NH4NO3 as N source. PMID:14730065

  9. Down Syndome: A search for expressed sequences

    SciTech Connect

    Pritchard, M.; Fuentes, J.J.; Bosch, A.

    1994-09-01

    Down Syndrome (DS) is a major cause of congenital heart disease and mental retardation. The most common anomaly is an extra copy of human chromosome 21 (HC21); however, chromosomal studies in rare patients with partial trisomy 21 have defined a minimal region for DS, including human chromosome 21 bands q22.2-q22.3. The study of genes in this chromosomal region will allow the elucidation of the biochemical and molecular bases for several of the distinct phenotypic traits of the syndrome. This information is the key to the design of therapeutic, pharmacological and genetic tools to counter the effects of three copies of chromosome 21 in the cells of DS patients. Towards this goal, we aim to build a transcriptional map of this region and then characterize any genes isolated. We are using two methods to isolate expressed sequences: (1) Alu-splice consensus PCR (2) cDNA hybridizsation selection. We use as starting material, YACs (CEPH/Genethon) from the specified region and cosmid minilibraries constructed from these YACs. Products are subcloned, sequenced and analyzed in the sequence databases. Several homologies with reported expressed sequences have been found and will be discussed. The HC21 origin of these putative expressed sequences is determined and they are then used to initially screen a human fetal brain full-length cDNA library. We have isolated several cDNAs and these are now being analyzed.

  10. Expression and purification of soluble human APRIL in Escherichia coli using ELP-SUMO tag.

    PubMed

    Zhang, Jie; Ma, Lei; Zhang, Shuang Quan

    2014-03-01

    APRIL is a member of the tumor necrosis factor (TNF) family of ligands that mediate tumor cells proliferation as well as survival, depending on the cellular context. In this report, we present a novel method to obtain soluble human APRIL in Escherichia coli using the elastin-like polypeptide and SUMO (ELP-SUMO) tags. The fusion protein with ELP-SUMO tag was expressed in a soluble form at 15°C. After purification based on inverse transition cycling (ITC) method, the purified ELP-SUMO-hAPRIL fusion protein was subsequently cleaved by SUMO protease to release mature hAPRIL. Following affinity chromatography, the target protein was re-purified with high purity. Finally, about 4.8mg recombinant hAPRIL was obtained from 1l bacterial culture with no less than 85% purity. The molecular mass (Mr) of the recombinant hAPRIL was confirmed by MALDI-TOF MS as Mr 16,314. The purified hAPRIL exhibits biological activity on Jurkat cells. It is the first report on soluble production of hAPRIL in E. coli using ELP-SUMO tag. PMID:24412409

  11. Identification of expressed resistance gene-like sequences by data mining in 454-derived transcriptomic sequences of common bean (Phaseolus vulgaris L.)

    PubMed Central

    2012-01-01

    Background Common bean (Phaseolus vulgaris L.) is one of the most important legumes in the world. Several diseases severely reduce bean production and quality; therefore, it is very important to better understand disease resistance in common bean in order to prevent these losses. More than 70 resistance (R) genes which confer resistance against various pathogens have been cloned from diverse plant species. Most R genes share highly conserved domains which facilitates the identification of new candidate R genes from the same species or other species. The goals of this study were to isolate expressed R gene-like sequences (RGLs) from 454-derived transcriptomic sequences and expressed sequence tags (ESTs) of common bean, and to develop RGL-tagged molecular markers. Results A data-mining approach was used to identify tentative P. vulgaris R gene-like sequences from approximately 1.69 million 454-derived sequences and 116,716 ESTs deposited in GenBank. A total of 365 non-redundant sequences were identified and named as common bean (P. vulgaris = Pv) resistance gene-like sequences (PvRGLs). Among the identified PvRGLs, about 60% (218 PvRGLs) were from 454-derived sequences. Reverse transcriptase-polymerase chain reaction (RT-PCR) analysis confirmed that PvRGLs were actually expressed in the leaves of common bean. Upon comparison to P. vulgaris genomic sequences, 105 (28.77%) of the 365 tentative PvRGLs could be integrated into the existing common bean physical map. Based on the syntenic blocks between common bean and soybean, 237 (64.93%) PvRGLs were anchored on the P. vulgaris genetic map and will need to be mapped to determine order. In addition, 11 sequence-tagged-site (STS) and 19 cleaved amplified polymorphic sequence (CAPS) molecular markers were developed for 25 unique PvRGLs. Conclusions In total, 365 PvRGLs were successfully identified from 454-derived transcriptomic sequences and ESTs available in GenBank and about 65% of PvRGLs were integrated into the common

  12. Microbial Diversity in Deep-sea Methane Seep Sediments Presented by SSU rRNA Gene Tag Sequencing

    PubMed Central

    Nunoura, Takuro; Takaki, Yoshihiro; Kazama, Hiromi; Hirai, Miho; Ashi, Juichiro; Imachi, Hiroyuki; Takai, Ken

    2012-01-01

    Microbial community structures in methane seep sediments in the Nankai Trough were analyzed by tag-sequencing analysis for the small subunit (SSU) rRNA gene using a newly developed primer set. The dominant members of Archaea were Deep-sea Hydrothermal Vent Euryarchaeotic Group 6 (DHVEG 6), Marine Group I (MGI) and Deep Sea Archaeal Group (DSAG), and those in Bacteria were Alpha-, Gamma-, Delta- and Epsilonproteobacteria, Chloroflexi, Bacteroidetes, Planctomycetes and Acidobacteria. Diversity and richness were examined by 8,709 and 7,690 tag-sequences from sediments at 5 and 25 cm below the seafloor (cmbsf), respectively. The estimated diversity and richness in the methane seep sediment are as high as those in soil and deep-sea hydrothermal environments, although the tag-sequences obtained in this study were not sufficient to show whole microbial diversity in this analysis. We also compared the diversity and richness of each taxon/division between the sediments from the two depths, and found that the diversity and richness of some taxa/divisions varied significantly along with the depth. PMID:22510646

  13. Regulatory sequences of Arabidopsis drive reporter gene expression in nematode feeding structures.

    PubMed Central

    Barthels, N; van der Lee, F M; Klap, J; Goddijn, O J; Karimi, M; Puzio, P; Grundler, F M; Ohl, S A; Lindsey, K; Robertson, L; Robertson, W M; Van Montagu, M; Gheysen, G; Sijmons, P C

    1997-01-01

    In the quest for plant regulatory sequences capable of driving nematode-triggered effector gene expression in feeding structures, we show that promoter tagging is a valuable tool. A large collection of transgenic Arabidopsis plants was generated. They were transformed with a beta-glucuronidase gene functioning as a promoter tag. Three T-DNA constructs, pGV1047, p delta gusBin19, and pMOG553, were used. Early responses to nematode invasion were of primary interest. Six lines exhibiting beta-glucuronidase activity in syncytia induced by the beet cyst nematode were studied. Reporter gene activation was also identified in galls induced by root knot and ectoparasitic nematodes. Time-course studies revealed that all six tags were differentially activated during the development of the feeding structure. T-DNA-flanking regions responsible for the observed responses after nematode infection were isolated and characterized for promoter activity. PMID:9437858

  14. SAGExplore: a web server for unambiguous tag mapping in serial analysis of gene expression oriented to gene discovery and annotation.

    PubMed

    Norambuena, Tomás; Malig, Rodrigo; Melo, Francisco

    2007-07-01

    We describe a web server for the accurate mapping of experimental tags in serial analysis of gene expression (SAGE). The core of the server relies on a database of genomic virtual tags built by a recently described method that attempts to reduce the amount of ambiguous assignments for those tags that are not unique in the genome. The method provides a complete annotation of potential virtual SAGE tags within a genome, along with an estimation of their confidence for experimental observation that ranks tags that present multiple matches in the genome. The output of the server consists of a table in HTML format that contains links to a graphic representation of the results and to some external servers and databases, facilitating the tasks of analysis of gene expression and gene discovery. Also, a table in tab delimited text format is produced, allowing the user to export the results into custom databases and software for further analysis. The current server version provides the most accurate and complete SAGE tag mapping source that is available for the yeast organism. In the near future, this server will also allow the accurate mapping of experimental SAGE-tags from other model organisms such as human, mouse, frog and fly. The server is freely available on the web at: http://dna.bio.puc.cl/SAGExplore.html. PMID:17626053

  15. Identification of Disulfide Bonds in Protein Proteolytic Degradation Products Using de Novo-Protein Unique Sequence Tags Approach

    SciTech Connect

    Shen, Yufeng; Tolic, Nikola; Purvine, Samuel O.; Smith, Richard D.

    2010-08-01

    Disulfide bonds are a form of posttranslational modification that often determines protein structure(s) and function(s). In this work, we report a mass spectrometry method for identification of disulfides in degradation products of proteins, and specifically endogenous peptides in the human blood plasma peptidome. LC-Fourier transform tandem mass spectrometry (FT MS/MS) was used for acquiring mass spectra that were de novo sequenced and then searched against the IPI human protein database. Through the use of unique sequence tags (UStags) we unambiguously correlated the spectra to specific database proteins. Examination of the UStags’ prefix and/or suffix sequences that contain cysteine(s) in conjunction with sequences of the UStags-specified database proteins is shown to enable the unambigious determination of disulfide bonds. Using this method, we identified the intermolecular and intramolecular disulfides in human blood plasma peptidome peptides that have molecular weights of up to ~10 kDa.

  16. Identification of disulfide bonds in protein proteolytic degradation products using de novo-protein unique sequence tags approach.

    PubMed

    Shen, Yufeng; Tolić, Nikola; Purvine, Samuel O; Smith, Richard D

    2010-08-01

    Disulfide bonds are a form of post-translational modification that often determines protein structure(s) and function(s). In this work, we report a mass spectrometry method for identification of disulfides in degradation products of proteins, specifically endogenous peptides in the human blood plasma peptidome. LC-Fourier transform tandem mass spectrometry (FT MS/MS) was used for acquiring mass spectra that were de novo sequenced and then searched against the IPI human protein database. Through the use of unique sequence tags (UStags), we unambiguously correlated the spectra to specific database proteins. Examination of the UStags' prefix and/or suffix sequences that contain cysteine(s) in conjunction with sequences of the UStags-specified database proteins is shown to enable the unambigious determination of disulfide bonds. Using this method, we identified the intermolecular and intramolecular disulfides in human blood plasma peptidome peptides that have molecular weights of up to approximately 10 kDa. PMID:20590115

  17. Construction of a plasmid coding for green fluorescent protein tagged cathepsin L and data on expression in colorectal carcinoma cells

    PubMed Central

    Tamhane, Tripti; Wolters, Brit K.; Illukkumbura, Rukshala; Maelandsmo, Gunhild M.; Haugen, Mads H.; Brix, Klaudia

    2015-01-01

    The endo-lysosomal cysteine cathepsin L has recently been shown to have moonlighting activities in that its unexpected nuclear localization in colorectal carcinoma cells is involved in cell cycle progression (Tamhane et al., 2015) [1]. Here, we show data on the construction and sequence of a plasmid coding for human cathepsin L tagged with an enhanced green fluorescent protein (phCL-EGFP) in which the fluorescent protein is covalently attached to the C-terminus of the protease. The plasmid was used for transfection of HCT116 colorectal carcinoma cells, while data from non-transfected and pEGFP-N1-transfected cells is also shown. Immunoblotting data of lysates from non-transfected controls and HCT116 cells transfected with pEGFP-N1 and phCL-EGFP, showed stable expression of cathepsin L-enhanced green fluorescent protein chimeras, while endogenous cathepsin L protein amounts exceed those of hCL-EGFP chimeras. An effect of phCL-EGFP expression on proliferation and metabolic states of HCT116 cells at 24 h post-transfection was observed. PMID:26594658

  18. Sequencing degraded DNA from non-destructively sampled museum specimens for RAD-tagging and low-coverage shotgun phylogenetics.

    PubMed

    Tin, Mandy Man-Ying; Economo, Evan Philip; Mikheyev, Alexander Sergeyevich

    2014-01-01

    Ancient and archival DNA samples are valuable resources for the study of diverse historical processes. In particular, museum specimens provide access to biotas distant in time and space, and can provide insights into ecological and evolutionary changes over time. However, archival specimens are difficult to handle; they are often fragile and irreplaceable, and typically contain only short segments of denatured DNA. Here we present a set of tools for processing such samples for state-of-the-art genetic analysis. First, we report a protocol for minimally destructive DNA extraction of insect museum specimens, which produced sequenceable DNA from all of the samples assayed. The 11 specimens analyzed had fragmented DNA, rarely exceeding 100 bp in length, and could not be amplified by conventional PCR targeting the mitochondrial cytochrome oxidase I gene. Our approach made these samples amenable to analysis with commonly used next-generation sequencing-based molecular analytic tools, including RAD-tagging and shotgun genome re-sequencing. First, we used museum ant specimens from three species, each with its own reference genome, for RAD-tag mapping. Were able to use the degraded DNA sequences, which were sequenced in full, to identify duplicate reads and filter them prior to base calling. Second, we re-sequenced six Hawaiian Drosophila species, with millions of years of divergence, but with only a single available reference genome. Despite a shallow coverage of 0.37 ± 0.42 per base, we could recover a sufficient number of overlapping SNPs to fully resolve the species tree, which was consistent with earlier karyotypic studies, and previous molecular studies, at least in the regions of the tree that these studies could resolve. Although developed for use with degraded DNA, all of these techniques are readily applicable to more recent tissue, and are suitable for liquid handling automation. PMID:24828244

  19. Expression and purification of Canis interferon α in Escherichia coli using different tags.

    PubMed

    Yang, Fang; Pan, Yingying; Chen, Yazhou; Tan, Shiming; Jin, Mingfei; Wu, Zirong; Huang, Jing

    2015-11-01

    The potent and broad activity of Canis interferon α (CaIFNα) makes it an attractive candidate for the treatment of many viral diseases of dogs. Here, we fused CaIFNα to three different protein tags: thioredoxin (Trx), glutathione S-transferase (GST), and NusA (Nus), to facilitate its expression and purification in Escherichia coli. The Trx-CaIFNα and GST-CaIFNα fusion proteins formed inclusion bodies, while the Nus-CaIFNα protein was soluble when expressed at low temperatures. Trx-CaIFNα was purified from inclusion bodies and refolded, while Nus-CaIFNα was purified under native conditions. The purity of Trx-CaIFNα and Nus-CaIFNα was greater than 90%, and their yields were 74.8% and 6.5%, respectively. Both Trx-CaIFNα and Nus-CaIFNα had antiviral activity in vitro. Their anti-viral activity was 1.09±0.47×10(14) and 2.25±0.87×10(12) U/mol, respectively, on Madin-Darby canine kidney cells. Both purification methods had advantages and disadvantages. A greater amount of Trx-CaIFNα was obtained, but refolding was required to obtain active protein. In contrast, soluble Nus-CaIFNα did not require refolding, which saved time and materials. However, Nus-CaIFNα, which contained a larger tag, had lower activity than Trx-CaIFNα. In general, we provided two protocols to obtain large amounts of CaIFNα with high antiviral activity. These protocols may promote the clinical development of CaIFNα in treating viral diseases in dog. PMID:26196501

  20. Genome-wide discovery of cis-elements in promoter sequences using gene expression.

    PubMed

    Troukhan, Maxim; Tatarinova, Tatiana; Bouck, John; Flavell, Richard B; Alexandrov, Nickolai N

    2009-04-01

    The availability of complete or nearly complete genome sequences, a large number of 5' expressed sequence tags, and significant public expression data allow for a more accurate identification of cis-elements regulating gene expression. We have implemented a global approach that takes advantage of available expression data, genomic sequences, and transcript information to predict cis-elements associated with specific expression patterns. The key components of our approach are: (1) precise identification of transcription start sites, (2) specific locations of cis-elements relative to the transcription start site, and (3) assessment of statistical significance for all sequence motifs. By applying our method to promoters of Arabidopsis thaliana and Mus musculus, we have identified motifs that affect gene expression under specific environmental conditions or in certain tissues. We also found that the presence of the TATA box is associated with increased variability of gene expression. Strong correlation between our results and experimentally determined motifs shows that the method is capable of predicting new functionally important cis-elements in promoter sequences. PMID:19231992

  1. The generation of knock-in mice expressing fluorescently tagged galanin receptors 1 and 2

    PubMed Central

    Kerr, Niall; Holmes, Fiona E.; Hobson, Sally-Ann; Vanderplank, Penny; Leard, Alan; Balthasar, Nina; Wynick, David

    2015-01-01

    The neuropeptide galanin has diverse roles in the central and peripheral nervous systems, by activating the G protein-coupled receptors Gal1, Gal2 and the less studied Gal3 (GalR1–3 gene products). There is a wealth of data on expression of Gal1–3 at the mRNA level, but not at the protein level due to the lack of specificity of currently available antibodies. Here we report the generation of knock-in mice expressing Gal1 or Gal2 receptor fluorescently tagged at the C-terminus with, respectively, mCherry or hrGFP (humanized Renilla green fluorescent protein). In dorsal root ganglia (DRG) neurons expressing the highest levels of Gal1-mCherry, localization to the somatic cell membrane was detected by live-cell fluorescence and immunohistochemistry, and that fluorescence decreased upon addition of galanin. In spinal cord, abundant Gal1-mCherry immunoreactive processes were detected in the superficial layers of the dorsal horn, and highly expressing intrinsic neurons of the lamina III/IV border showed both somatic cell membrane localization and outward transport of receptor from the cell body, detected as puncta within cell processes. In brain, high levels of Gal1-mCherry immunofluorescence were detected within thalamus, hypothalamus and amygdala, with a high density of nerve endings in the external zone of the median eminence, and regions with lesser immunoreactivity included the dorsal raphe nucleus. Gal2-hrGFP mRNA was detected in DRG, but live-cell fluorescence was at the limits of detection, drawing attention to both the much lower mRNA expression than to Gal1 in mice and the previously unrecognized potential for translational control by upstream open reading frames (uORFs). PMID:26292267

  2. Novel expressed sequences identified in a model of androgen independent prostate cancer

    PubMed Central

    Quayle, Steven N; Hare, Heidi; Delaney, Allen D; Hirst, Martin; Hwang, Dorothy; Schein, Jacqueline E; Jones, Steven JM; Marra, Marco A; Sadar, Marianne D

    2007-01-01

    Background Prostate cancer is the most frequently diagnosed cancer in American men, and few effective treatment options are available to patients who develop hormone-refractory prostate cancer. The molecular changes that occur to allow prostate cells to proliferate in the absence of androgens are not fully understood. Results Subtractive hybridization experiments performed with samples from an in vivo model of hormonal progression identified 25 expressed sequences representing novel human transcripts. Intriguingly, these 25 sequences have small open-reading frames and are not highly conserved through evolution, suggesting many of these novel expressed sequences may be derived from untranslated regions of novel transcripts or from non-coding transcripts. Examination of a large metalibrary of human Serial Analysis of Gene Expression (SAGE) tags demonstrated that only three of these novel sequences had been previously detected. RT-PCR experiments confirmed that the 6 sequences tested were expressed in specific human tissues, as well as in clinical samples of prostate cancer. Further RT-PCR experiments for five of these fragments indicated they originated from large untranslated regions of unannotated transcripts. Conclusion This study underlines the value of using complementary techniques in the annotation of the human genome. The tissue-specific expression of 4 of the 6 clones tested indicates the expression of these novel transcripts is tightly regulated, and future work will determine the possible role(s) these novel transcripts may play in the progression of prostate cancer. PMID:17257419

  3. Transcriptome sequencing and profiling of expressed genes in cambial zone and differentiating xylem of Japanese cedar (Cryptomeria japonica)

    PubMed Central

    2014-01-01

    Background Forest trees have ecological and economic importance, and Japanese cedar has highly valued wood attributes. Thus, studies of molecular aspects of wood formation offer practical information that may be used for screening and forward genetics approaches to improving wood quality. Results After identifying expressed sequence tags in Japanese cedar tissue undergoing xylogenesis, we designed a custom cDNA microarray to compare expression of highly regulated genes throughout a growing season. This led to identification of candidate genes involved both in wood formation and later cessation of growth and dormancy. Based on homology to orthologous protein groups, the genes were assigned to functional classes. A high proportion of sequences fell into functional classes related to posttranscriptional modification and signal transduction, while transcription factors and genes involved in the metabolism of sugars, cell-wall synthesis and lignification, and cold hardiness were among other classes of genes identified as having a potential role in xylem formation and seasonal wood formation. Conclusions We obtained 55,051 unique sequences by next-generation sequencing of a cDNA library prepared from cambial meristem and derivative cells. Previous studies on conifers have identified unique sequences expressed in developing xylem, but this is the first comprehensive study utilizing a collection of expressed sequence tags for expression studies related to xylem formation in Japanese cedar, which belongs to a different lineage than the Pinaceae. Our characterization of these sequences should allow comparative studies of genome evolution and functional genetics of wood species. PMID:24649833

  4. A blackberry (Rubus L.) expressed sequence tag library for the development of simple sequence repeat markers

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background: The recent development of novel repeat-fruiting types of blackberry (Rubus L.) cultivars, combined with a long history of morphological marker-assisted selection for thornlessness by blackberry breeders, has given rise to increased interest in using molecular markers to facilitate blackb...

  5. SNP discovery using Paired-End RAD-tag sequencing on pooled genomic DNA of Sisymbrium austriacum (Brassicaceae).

    PubMed

    Vandepitte, K; Honnay, O; Mergeay, J; Breyne, P; Roldán-Ruiz, I; De Meyer, T

    2013-03-01

    Single nucleotide polymorphisms SNPs are rapidly replacing anonymous markers in population genomic studies, but their use in non model organisms is hampered by the scarcity of cost-effective approaches to uncover genome-wide variation in a comprehensive subset of individuals. The screening of one or only a few individuals induces ascertainment bias. To discover SNPs for a population genomic study of the Pyrenean rocket (Sisymbrium austriacum subsp. chrysanthum), we undertook a pooled RAD-PE (Restriction site Associated DNA Paired-End sequencing) approach. RAD tags were generated from the PstI-digested pooled genomic DNA of 12 individuals sampled across the species distribution range and paired-end sequenced using Illumina technology to produce ~24.5 Mb of sequences, covering ~7% of the specie's genome. Sequences were assembled into ~76 000 contigs with a mean length of 323 bp (N(50)  = 357 bp, sequencing depth = 24x). In all, >15 000 SNPs were called, of which 47% were annotated in putative genic regions based on homology with the Arabidopsis thaliana genome. Gene ontology (GO) slim categorization demonstrated that the identified SNPs covered extant genic variation well. The validation of 300 SNPs on a larger set of individuals using a KASPar assay underpinned the utility of pooled RAD-PE as an inexpensive genome-wide SNP discovery technique (success rate: 87%). In addition to SNPs, we discovered >600 putative SSR markers. PMID:23231662

  6. Expression of GFP-tagged neuronal glutamate transporters in cerebellar Purkinje neurons.

    PubMed

    Meera, Pratap; Dodson, Paul D; Karakossian, Movses H; Otis, Thomas S

    2005-11-01

    Of the five excitatory amino acid transporters (EAATs) identified, two genes are expressed by neurons (EAAT3 and EAAT4) and give rise to transporters confined to neuronal cell bodies and dendrites. At an ultrastructural level, EAAT3 and EAAT4 proteins are clustered at the edges of postsynaptic densities of excitatory synapses. This pattern of localization suggests that postsynaptic EAATs may help to limit spillover of glutamate from excitatory synapses. In an effort to study transporter localization in living neurons and ultimately to manipulate uptake at intact synapses, we have developed viral reagents encoding neuronal EAATs tagged with GFP. We demonstrate that these fusion proteins are capable of Na(+)-dependent glutamate uptake, that they generate ionic conductances indistinguishable from their wild-type counterparts, and that GFP does not alter their glutamate dose-dependence. Two-photon microscopy was used to examine fusion protein expression in Purkinje neurons in acute cerebellar slices. Both EAAT3-GFP and EAAT4-GFP were observed at high levels in the dendritic spines of transfected Purkinje neurons. These findings indicate that functional EAAT fusion proteins can be synthesized and appropriately trafficked to postsynaptic compartments. Furthermore, they validate a powerful system for looking at EAAT function in situ. PMID:16212990

  7. Expression and purification of recombinant cytoplasmic domain of human erythrocyte band 3 with hexahistidine tag or chitin-binding tag in Escherichia coli.

    PubMed

    Ding, Yu; Jiang, Weihua; Su, Yang; Zhou, Hanqing; Zhang, Zhihong

    2004-04-01

    The cytoplasmic domain of erythrocyte band 3 (cdb3) serves as a center of membrane organization in the erythrocytes by its interaction with multiple proteins including ankyrin, protein 4.1, protein 4.2, hemoglobin, and several glycolytic enzymes. In this paper, human cdb3 was cloned into three different expression vectors controlled by T7 polymerase promoter and induced with isopropyl beta-D-thiogalactopyranoside in Escherichia coli. Two of the fusion proteins containing hexahistidine tag in the N-terminal or C-terminal were purified by immobilized metal affinity column chromatography. The third recombinant cdb3 containing the affinity chitin-binding tag was purified using chitin beads followed by specific self-cleavage, which released cdb3 according to the mechanism of protein splicing. The molecular weights of purified recombinant proteins were verified by mass spectrometry. The pH-dependent properties of the intrinsic tryptophan fluorescence of the three kinds of recombinant cdb3 were compared with that of the cdb3 extracted from the erythrocytes, showing that there were no significant differences between them. Using pull-down assay combined with Western blot analysis, the interaction between recombinant cdb3 and protein 4.2 C3 fragment was verified. These demonstrated that the recombinant proteins were both structurally and functionally active. The typical yields of cdb3 purified with hexahistidine tag in the N-terminal, C-terminal, and cleaved from chitin bead were 10.6, 9.6, and 1.5 mg from 1L culture medium, respectively. PMID:15003247

  8. Generation of plasmid vectors expressing FLAG-tagged proteins under the regulation of human elongation factor-1α promoter using Gibson assembly.

    PubMed

    Grozdanov, Petar N; MacDonald, Clinton C

    2015-01-01

    Gibson assembly (GA) cloning offers a rapid, reliable, and flexible alternative to conventional DNA cloning methods. We used GA to create customized plasmids for expression of exogenous genes in mouse embryonic stem cells (mESCs). Expression of exogenous genes under the control of the SV40 or human cytomegalovirus promoters diminishes quickly after transfection into mESCs. A remedy for this diminished expression is to use the human elongation factor-1 alpha (hEF1α) promoter to drive gene expression. Plasmid vectors containing hEF1α are not as widely available as SV40- or CMV-containing plasmids, especially those also containing N-terminal 3xFLAG-tags. The protocol described here is a rapid method to create plasmids expressing FLAG-tagged CstF-64 and CstF-64 mutant under the expressional regulation of the hEF1α promoter. GA uses a blend of DNA exonuclease, DNA polymerase and DNA ligase to make cloning of overlapping ends of DNA fragments possible. Based on the template DNAs we had available, we designed our constructs to be assembled into a single sequence. Our design used four DNA fragments: pcDNA 3.1 vector backbone, hEF1α promoter part 1, hEF1α promoter part 2 (which contained 3xFLAG-tag purchased as a double-stranded synthetic DNA fragment), and either CstF-64 or specific CstF-64 mutant. The sequences of these fragments were uploaded to a primer generation tool to design appropriate PCR primers for generating the DNA fragments. After PCR, DNA fragments were mixed with the vector containing the selective marker and the GA cloning reaction was assembled. Plasmids from individual transformed bacterial colonies were isolated. Initial screen of the plasmids was done by restriction digestion, followed by sequencing. In conclusion, GA allowed us to create customized plasmids for gene expression in 5 days, including construct screens and verification. PMID:25742071

  9. Tail proteins of phage T5: investigation of the effect of the His6-tag position, from expression to crystallisation.

    PubMed

    Noirclerc-Savoye, Marjolaine; Flayhan, Ali; Pereira, Cindy; Gallet, Benoit; Gans, Pierre; Ebel, Christine; Breyton, Cécile

    2015-05-01

    Upon binding to its bacterial host receptor, the tail tip of phage T5 perforates, by an unknown mechanism, the heavily armoured cell wall of the host. This allows the injection of phage DNA into the cytoplasm to hijack the cell machinery and enable the production of new virions. In the perspective of a structural study of the phage tail, we have systematically overproduced eight of the eleven T5 tail proteins, with or without a N- or a C-terminal His6-tag. The widely used Hi6-tag is very convenient to purify recombinant proteins using immobilised-metal affinity chromatography. The presence of a tag however is not always innocuous. We combined automated gene cloning and expression tests to rapidly identify the most promising constructs for proteins of phage T5 tail, and performed biochemical and biophysical characterisation and crystallisation screening on available proteins. Automated small-scale purification was adapted for two highly expressed proteins. We obtained structural information for three of the proteins. We showed that the presence of a His6-tag can have drastic effect on protein expression, solubility, oligomerisation propensity and crystal quality. PMID:25676818

  10. Subcellular localization of Mayven following expression of wild type and mutant EGFP tagged cDNAs

    PubMed Central

    2010-01-01

    Background Process formation by glial cells is crucial to their function. Mayven, an actin binding, multi-domain polypeptide, and member of the BTB-BACK-Kelch family have been shown to be important in oligodendrocyte process extension. To assess the role of Mayven in neural cell process extension we have tracked the subcellular distribution of exogenous Mayven following expression of a rat Mayven -EGFP cDNA in a variety of neural cell backgrounds and specifically in OEC tranfectants following drug treatment to disrupt the integrity of the cytoskeleton. A comparison was made between the subcellular localization following transient transfection of OECs with full-length Mayven cDNA and a series of mutant domain constructs. Results The subcellular location of Mayven in OEC transfectants showed a characteristic distribution with intense foci of staining towards the process tips corresponding to regions of accumulated Mayven overlapping in part with lammelipodial actin and was absent from the filipodia and the outer membrane. This signature pattern was also observed in Schwann cells, Oli-Neu cells, astrocytes and the neuroblastoma cell line B104 transfectants and resembled the exogenous and endogenous Mayven distribution in oligodendrocytes. This contrasted with the localization pattern in non-neural cells. There was a re-localization of Mayven in OEC transfectants following drug treatment to challenge the integrity of the actin cytoskeleton while breakdown of the microtubular component had no discernible impact on the accumulation of Mayven in the process tips. Deletion of the first three amino acids of the SH3 motif of the putative Fyn Kinase binding domain at the amino terminus significantly compromised this signature pattern as did the removal of the last Kelch repeat unit of six unit Kelch domain comprising the carboxyl terminus. In addition, there was a reduction in process length in mutant transfectants. Co-expression studies with a haemagglutinin (HA) tagged wild

  11. Transcriptome annotation using tandem SAGE tags

    PubMed Central

    Rivals, Eric; Boureux, Anthony; Lejeune, Mireille; Ottones, Florence; Pecharromàn Pérez, Oscar; Tarhio, Jorma; Pierrat, Fabien; Ruffle, Florence; Commes, Thérèse; Marti, Jacques

    2007-01-01

    Analysis of several million expressed gene signatures (tags) revealed an increasing number of different sequences, largely exceeding that of annotated genes in mammalian genomes. Serial analysis of gene expression (SAGE) can reveal new Poly(A) RNAs transcribed from previously unrecognized chromosomal regions. However, conventional SAGE tags are too short to identify unambiguously unique sites in large genomes. Here, we design a novel strategy with tags anchored on two different restrictions sites of cDNAs. New transcripts are then tentatively defined by the two SAGE tags in tandem and by the spanning sequence read on the genome between these tagged sites. Having developed a new algorithm to locate these tag-delimited genomic sequences (TDGS), we first validated its capacity to recognize known genes and its ability to reveal new transcripts with two SAGE libraries built in parallel from a single RNA sample. Our algorithm proves fast enough to experiment this strategy at a large scale. We then collected and processed the complete sets of human SAGE tags to predict yet unknown transcripts. A cross-validation with tiling arrays data shows that 47% of these TDGS overlap transcriptional active regions. Our method provides a new and complementary approach for complex transcriptome annotation. PMID:17709346

  12. Whole genome wide expression profiles of Vitis amurensis grape responding to downy mildew by using Solexa sequencing technology

    PubMed Central

    2010-01-01

    Background Downy mildew (DM), caused by pathogen Plasmopara viticola (PV) is the single most damaging disease of grapes (Vitis L.) worldwide. However, the mechanisms of the disease development in grapes are poorly understood. A method for estimating gene expression levels using Solexa sequencing of Type I restriction-endonuclease-generated cDNA fragments was used for deep sequencing the transcriptomes resulting from PV infected leaves of Vitis amurensis Rupr. cv. Zuoshan-1. Our goal is to identify genes that are involved in resistance to grape DM disease. Results Approximately 8.5 million (M) 21-nt cDNA tags were sequenced in the cDNA library derived from PV pathogen-infected leaves, and about 7.5 M were sequenced from the cDNA library constructed from the control leaves. When annotated, a total of 15,249 putative genes were identified from the Solexa sequencing tags for the infection (INF) library and 14,549 for the control (CON) library. Comparative analysis between these two cDNA libraries showed about 0.9% of the unique tags increased by at least five-fold, and about 0.6% of the unique tags decreased more than five-fold in infected leaves, while 98.5% of the unique tags showed less than five-fold difference between the two samples. The expression levels of 12 differentially expressed genes were confirmed by Real-time RT-PCR and the trends observed agreed well with the Solexa expression profiles, although the degree of change was lower in amplitude. After pathway enrichment analysis, a set of significantly enriched pathways were identified for the differentially expressed genes (DEGs), which associated with ribosome structure, photosynthesis, amino acid and sugar metabolism. Conclusions This study presented a series of candidate genes and pathways that may contribute to DM resistance in grapes, and illustrated that the Solexa-based tag-sequencing approach was a powerful tool for gene expression comparison between control and treated samples. PMID:21029438

  13. Myocardial motion estimation in tagged MR sequences by using alphaMI-based non rigid registration.

    PubMed

    Oubel, E; Tobon-Gomez, C; Hero, A O; Frangi, A F

    2005-01-01

    Tagged Magnetic Resonance Imaging (MRI) is currently the reference MR modality for myocardial motion and strain analysis. NMI-based non rigid registration has proven to be an accurate method to retrieve cardiac deformation fields. The use of alphaMI permits higher dimensional features to be implemented in myocardial deformation estimation through image registration. This paper demonstrates that this is feasible with a set of Haar wavelet features of high dimension. While we do not demonstrate performance improvement for this set of features, there is no significant degradation as compared to implementing the registration method with the traditional NMI metric. We use Entropic Spanning Graphs (ESGs) to estimate the alphaMI of the wavelet feature vectors WFVs since this is not possible with histograms. To the best of our knowledge, this is the first time that ESGs are used for non rigid registration. PMID:16685969

  14. Effect of His-Tag on Expression, Purification, and Structure of Zinc Finger Protein, ZNF191(243-368)

    PubMed Central

    Huang, Zhongxian

    2016-01-01

    Zinc finger proteins are associated with hereditary diseases and cancers. To obtain an adequate amount of zinc finger proteins for studying their properties, structure, and functions, many protein expression systems are used. ZNF191(243-368) is a zinc finger protein and can be fused with His-tag to generate fusion proteins such as His6-ZNF191(243-368) and ZNF191(243-368)-His8. The purification of His-tag protein using Ni-NTA resin can overcome the difficulty of ZNF191(243-368) separation caused by inclusion body formation. The influences of His-tag on ZNF191(243-368) properties and structure were investigated using spectrographic techniques and hydrolase experiment. Our findings suggest that insertion of a His-tag at the N-terminal or C-terminal end of ZNF191(243-368) has different effects on the protein. Therefore, an expression system should be considered based on the properties and structure of the protein. Furthermore, the hydrolase activity of ZNF191(243-368)-His8 has provided new insights into the design of biological functional molecules. PMID:27524954

  15. Expression, purification and kinetic characterization of His-tagged glyceraldehyde-3-phosphate dehydrogenase from Trypanosoma cruzi.

    PubMed

    Cheleski, Juliana; Freitas, Renato F; Wiggers, Helton José; Rocha, Josmar R; de Araújo, Ana Paula Ulian; Montanari, Carlos A

    2011-04-01

    Trypanosomes are flagellated protozoa responsible for serious parasitic diseases that have been classified by the World Health Organization as tropical sicknesses of major importance. One important drug target receiving considerable attention is the enzyme glyceraldehyde-3-phosphate dehydrogenase from the protozoan parasite Trypanosoma cruzi, the causative agent of Chagas disease (T. cruzi Glyceraldehyde-3-phosphate dehydrogenase (TcGAPDH); EC 1.2.1.12). TcGAPDH is a key enzyme in the glycolytic pathway of T. cruzi and catalyzes the oxidative phosphorylation of D-glyceraldehyde-3-phosphate (G3P) to 1,3-bisphosphoglycerate (1,3-BPG) coupled to the reduction of oxidized nicotinamide adenine dinucleotide, (NAD(+)) to NADH, the reduced form. Herein, we describe the cloning of the T. cruzi gene for TcGAPDH into the pET-28a(+) vector, its expression as a tagged protein in Escherichia coli, purification and kinetic characterization. The His(6)-tagged TcGAPDH was purified by affinity chromatography. Enzyme activity assays for the recombinant His(6)-TcGAPDH were carried out spectrophotometrically to determine the kinetic parameters. The apparent Michaelis-Menten constant (K(M)(app)) determined for D-glyceraldehyde-3-phosphate and NAD(+) were 352±21 and 272±25 μM, respectively, which were consistent with the values for the untagged enzyme reported in the literature. We have demonstrated by the use of Isothermal Titration Calorimetry (ITC) that this vector modification resulted in activity preserved for a higher period. We also report here the use of response surface methodology (RSM) to determine the region of optimal conditions for enzyme activity. A quadratic model was developed by RSM to describe the enzyme activity in terms of pH and temperature as independent variables. According to the RMS contour plots and variance analysis, the maximum enzyme activity was at 29.1°C and pH 8.6. Above 37°C, the enzyme activity starts to fall, which may be related to previous

  16. Analysis of transcripts expressed by intracellular stages of Eimeria acervulina using expressed sequence tags (ESTs).

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Coccidiosis in chickens is caused by seven species belonging to the genus Eimeria. Even though coccidiosis is a complex disease that can be caused by any combination of these species most of the molecular research concerning chicken Eimeria has been limited to Eimeria tenella. This study describes...

  17. GST-PRIME: a genome-wide primer design software for the generation of gene sequence tags.

    PubMed

    Varotto, C; Richly, E; Salamini, F; Leister, D

    2001-11-01

    The availability of sequenced genomes has generated a need for experimental approaches that allow the simultaneous analysis of large, or even complete, sets of genes. To facilitate such analyses, we have developed GST-PRIME, a software package for retrieving and assembling gene sequences, even from complex genomes, using the NCBI public database, and then designing sets of primer pairs for use in gene amplification. Primers were designed by the program for the direct amplification of gene sequence tags (GSTs) from either genomic DNA or cDNA. Test runs of GST-PRIME on 2000 randomly selected Arabidopsis and Drosophila genes demonstrate that 93 and 88% of resulting GSTs, respectively, fulfilled imposed length criteria. GST-PRIME primer pairs were tested on a set of 1900 Arabidopsis genes coding for chloroplast-targeted proteins: 95% of the primer pairs used in PCRs with genomic DNA generated the correct amplicons. GST-PRIME can thus be reliably used for large-scale or specific amplification of intron-containing genes of multicellular eukaryotes. PMID:11691924

  18. GST-PRIME: a genome-wide primer design software for the generation of gene sequence tags

    PubMed Central

    Varotto, Claudio; Richly, Erik; Salamini, Francesco; Leister, Dario

    2001-01-01

    The availability of sequenced genomes has generated a need for experimental approaches that allow the simultaneous analysis of large, or even complete, sets of genes. To facilitate such analyses, we have developed GST-PRIME, a software package for retrieving and assembling gene sequences, even from complex genomes, using the NCBI public database, and then designing sets of primer pairs for use in gene amplification. Primers were designed by the program for the direct amplification of gene sequence tags (GSTs) from either genomic DNA or cDNA. Test runs of GST-PRIME on 2000 randomly selected Arabidopsis and Drosophila genes demonstrate that 93 and 88% of resulting GSTs, respectively, fulfilled imposed length criteria. GST-PRIME primer pairs were tested on a set of 1900 Arabidopsis genes coding for chloroplast-targeted proteins: 95% of the primer pairs used in PCRs with genomic DNA generated the correct amplicons. GST-PRIME can thus be reliably used for large-scale or specific amplification of intron-containing genes of multicellular eukaryotes. PMID:11691924

  19. Toward a physical map of Drosophila buzzatii. Use of randomly amplified polymorphic dna polymorphisms and sequence-tagged site landmarks.

    PubMed Central

    Laayouni, H; Santos, M; Fontdevila, A

    2000-01-01

    We present a physical map based on RAPD polymorphic fragments and sequence-tagged sites (STSs) for the repleta group species Drosophila buzzatii. One hundred forty-four RAPD markers have been used as probes for in situ hybridization to the polytene chromosomes, and positive results allowing the precise localization of 108 RAPDs were obtained. Of these, 73 behave as effectively unique markers for physical map construction, and in 9 additional cases the probes gave two hybridization signals, each on a different chromosome. Most markers (68%) are located on chromosomes 2 and 4, which partially agree with previous estimates on the distribution of genetic variation over chromosomes. One RAPD maps close to the proximal breakpoint of inversion 2z(3) but is not included within the inverted fragment. However, it was possible to conclude from this RAPD that the distal breakpoint of 2z(3) had previously been wrongly assigned. A total of 39 cytologically mapped RAPDs were converted to STSs and yielded an aggregate sequence of 28,431 bp. Thirty-six RAPDs (25%) did not produce any detectable hybridization signal, and we obtained the DNA sequence from three of them. Further prospects toward obtaining a more developed genetic map than the one currently available for D. buzzatii are discussed. PMID:11102375

  20. The Short ITS2 Sequence Serves as an Efficient Taxonomic Sequence Tag in Comparison with the Full-Length ITS

    PubMed Central

    Han, Jianping; Zhu, Yingjie; Chen, Xiaochen; Liao, Baoshen; Yao, Hui; Song, Jingyuan; Chen, Shilin; Meng, Fanyun

    2013-01-01

    An ideal DNA barcoding region should be short enough to be amplified from degraded DNA. In this paper, we discuss the possibility of using a short nuclear DNA sequence as a barcode to identify a wide range of medicinal plant species. First, the PCR and sequencing success rates of ITS and ITS2 were evaluated based entirely on materials from dry medicinal product and herbarium voucher specimens, including some samples collected back to 90 years ago. The results showed that ITS2 could recover 91% while ITS could recover only 23% efficiency of PCR and sequencing by using one pair of primer. Second, 12861 ITS and ITS2 plant sequences were used to compare the identification efficiency of the two regions. Four identification criteria (BLAST, inter- and intradivergence Wilcoxon signed rank tests, and TaxonDNA) were evaluated. Our results supported the hypothesis that ITS2 can be used as a minibarcode to effectively identify species in a wide variety of specimens and medicinal materials. PMID:23484151

  1. Random Tagging Genotyping by Sequencing (rtGBS), an Unbiased Approach to Locate Restriction Enzyme Sites across the Target Genome

    PubMed Central

    Hilario, Elena; Barron, Lorna; Deng, Cecilia H.; Datson, Paul M.; Davy, Marcus W.; Storey, Roy D.

    2015-01-01

    Genotyping by sequencing (GBS) is a restriction enzyme based targeted approach developed to reduce the genome complexity and discover genetic markers when a priori sequence information is unavailable. Sufficient coverage at each locus is essential to distinguish heterozygous from homozygous sites accurately. The number of GBS samples able to be pooled in one sequencing lane is limited by the number of restriction sites present in the genome and the read depth required at each site per sample for accurate calling of single-nucleotide polymorphisms. Loci bias was observed using a slight modification of the Elshire et al. method: some restriction enzyme sites were represented in higher proportions while others were poorly represented or absent. This bias could be due to the quality of genomic DNA, the endonuclease and ligase reaction efficiency, the distance between restriction sites, the preferential amplification of small library restriction fragments, or bias towards cluster formation of small amplicons during the sequencing process. To overcome these issues, we have developed a GBS method based on randomly tagging genomic DNA (rtGBS). By randomly landing on the genome, we can, with less bias, find restriction sites that are far apart, and undetected by the standard GBS (stdGBS) method. The study comprises two types of biological replicates: six different kiwifruit plants and two independent DNA extractions per plant; and three types of technical replicates: four samples of each DNA extraction, stdGBS vs. rtGBS methods, and two independent library amplifications, each sequenced in separate lanes. A statistically significant unbiased distribution of restriction fragment size by rtGBS showed that this method targeted 49% (39,145) of BamH I sites shared with the reference genome, compared to only 14% (11,513) by stdGBS. PMID:26633193

  2. Molecular Cloning and Functional Expression of a Protein-Serine/Threonine Phosphatase from the Hyperthermophilic Archaeon Pyrodictium abyssi TAG11

    PubMed Central

    Mai, Bianca; Frey, Gerhard; Swanson, Ronald V.; Mathur, Eric J.; Stetter, K. O.

    1998-01-01

    An open reading frame coding for a putative protein-serine/threonine phosphatase was identified in the hyperthermophilic archaeon Pyrodictium abyssi TAG11 and named Py-PP1. Py-PP1 was expressed in Escherichia coli, purified from inclusion bodies, and biochemically characterized. The phosphatase gene is part of an operon which may provide, for the first time, insight into a physiological role for archaeal protein phosphatases in vivo. PMID:9696747

  3. Single & Multiple Stellar Populations in Globular Clusters: Chemical Tagging, Photometric Sequences, and Dynamics

    NASA Astrophysics Data System (ADS)

    Piotto, Giampaolo

    2015-08-01

    The discovery of multiple stellar populations in globular clusters has revolutionized our view of these objects one thought to be simple, single population stellar systems. Different star formation scenarios have been proposed in order to account for the photometric and spectroscopic properties of the different populations hosted by the single cluster, and some of them imply that the original cluster should have been much more massive than it is now, with a significant fraction of the original stars lost into the environment (Galaxy halo or bulge). Because of this, globular clusters become relevant not only as tracers of the general process of galaxy halo formation, but also possible incubators of most (all?) halo stars.In my talk I will briefly summarize the basic observational facts that made the community at large to accept the idea of population multiplicity.I will also present the newest results coming from an extensive, multi-wavelength astrometric and photometric survey, which includes UV data from ACS and WFC3/HST of close to half of the Milky Way globular clusters. The increasing number of spectroscopic surveys of stars in globular clusters, coupled with the capability of (UV) photometry to distinguish different populations has largely increased our capability to trace the basic chemical properties of the many populations within a single cluster.I will present a census of the presence of multiple populations in GCs, their chemical tagging, radial distribution, and kinematics.Possible correlations of multiple populations characterizing quantities with the main cluster parameters will also be presented. Implications on multiple stellar populations formation will be discussed as well as the still open issues.

  4. DNA Sequence Heterogeneity of Campylobacter jejuni CJIE4 Prophages and Expression of Prophage Genes

    PubMed Central

    Clark, Clifford G.; Chong, Patrick M.; McCorrister, Stuart J.; Mabon, Philip; Walker, Matthew; Westmacott, Garrett R.

    2014-01-01

    Campylobacter jejuni carry temperate bacteriophages that can affect the biology or virulence of the host bacterium. Known effects include genomic rearrangements and resistance to DNA transformation. C. jejuni prophage CJIE1 shows sequence variability and variability in the content of morons. Homologs of the CJIE1 prophage enhance both adherence and invasion to cells in culture and increase the expression of a specific subset of bacterial genes. Other C. jejuni temperate phages have so far not been well characterized. In this study we describe investigations into the DNA sequence variability and protein expression in a second prophage, CJIE4. CJIE4 sequences were obtained de novo from DNA sequencing of five C. jejuni isolates, as well as from whole genome sequences submitted to GenBank by other research groups. These CJIE4 DNA sequences were heterogenous, with several different insertions/deletions (indels) in different parts of the prophage genome. Two variants of a 3–4 kb region inserted within CJIE4 had different gene content that distinguished two major conserved CJIE4 prophage families. Additional indels were detected throughout the prophage. Detection of proteins in the five isolates characterized in our laboratory in isobaric Tags for Relative and Absolute Quantitation (iTRAQ) experiments indicated that prophage proteins within each of the two large indel variants were expressed during growth of the bacteria on Mueller Hinton agar plates. These proteins included the extracellular DNase associated with resistance to DNA transformation and prophage repressor proteins. Other proteins associated with known or suspected roles in prophage biology were also expressed from CJIE4, including capsid protein, the phage integrase, and MazF, a type II toxin-antitoxin system protein. Together with the results previously obtained for the CJIE1 prophage these results demonstrate that sequence variability and expression of moron genes are both general properties of temperate

  5. Development of high-density linkage map and tagging leaf spot resistance in pearl millet using genotyping-by-sequencing markers

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Pearl millet is an important forage and grain crop in many parts of the world. Genome mapping studies are a prerequisite for tagging agronomically important traits. Genotyping-by-Sequencing (GBS) markers can be used to build high density linkage maps even in species lacking a reference genome. A re...

  6. Parallel tagged amplicon sequencing of transcriptome-based genetic markers for Triturus newts with the Ion Torrent next-generation sequencing platform

    PubMed Central

    Wielstra, B; Duijm, E; Lagler, P; Lammers, Y; Meilink, W R M; Ziermann, J M; Arntzen, J W

    2014-01-01

    Next-generation sequencing is a fast and cost-effective way to obtain sequence data for nonmodel organisms for many markers and for many individuals. We describe a protocol through which we obtain orthologous markers for the crested newts (Amphibia: Salamandridae: Triturus), suitable for analysis of interspecific hybridization. We use transcriptome data of a single Triturus species and design 96 primer pairs that amplify c. 180 bp fragments positioned in 3-prime untranslated regions. Next, these markers are tested with uniplex PCR for a set of species spanning the taxonomical width of the genus Triturus. The 52 markers that consistently show a single band of expected length at gel electrophoreses for all tested crested newt species are then amplified in five multiplex PCRs (with a plexity of ten or eleven) for 132 individual newts: a set of 84 representing the seven (candidate) species and a set of 48 from a presumed hybrid population. After pooling multiplexes per individual, unique tags are ligated to link amplicons to individuals. Subsequently, individuals are pooled equimolar and sequenced on the Ion Torrent next-generation sequencing platform. A bioinformatics pipeline identifies the alleles and recodes these to a genotypic format. Next, we test the utility of our markers. baps allocates the 84 crested newt individuals representing (candidate) species to their expected (candidate) species, confirming the markers are suitable for species delineation. newhybrids, a hybrid index and hiest confirm the 48 individuals from the presumed hybrid population to be genetically admixed, illustrating the potential of the markers to identify interspecific hybridization. We expect the set of markers we designed to provide a high resolving power for analysis of hybridization in Triturus. PMID:24571307

  7. Impact of Cell Type and Epitope Tagging on Heterologous Expression of G Protein-Coupled Receptor: A Systematic Study on Angiotensin Type II Receptor

    PubMed Central

    Jiang, Lili; Teng, Gladys M. K.; Chan, Elaine Y. M.; Au, Shannon W. N.; Wise, Helen; Lee, Susanna S. T.; Cheung, Wing-Tai

    2012-01-01

    Despite heterologous expression of epitope-tagged GPCR is widely adopted for functional characterization, there is lacking of systematic analysis of the impact of expression host and epitope tag on GPCR expression. Angiotensin type II (AT2) receptor displays agonist-dependent and -independent activities, coupling to a spectrum of signaling molecules. However, consensus has not been reached on the subcellular distributions, signaling cascades and receptor-mediated actions. To examine the contributions of host cell and epitope tag on receptor expression and activity, epitope-tagged AT2 receptor variants were transiently or stably expressed in HEK293, CHO-K1 and PC12 cells. The epitope-tagged AT2 receptor variants were detected both on the cell membrane and in the perinuclear region. In transiently transfected HEK293 cells, Myc-AT2 existed predominantly as monomer. Additionally, a ladder of ubiquitinated AT2 receptor proteins was detected. By contrast, stably expressed epitope-tagged AT2 receptor variants existed as both monomer and high molecular weight complexes, and the latter was enriched in cell surface. Glycosylation promoted cell surface expression of Myc-AT2 but had no effect on AT2-GFP in HEK293 cells. In cells that stably expressed Myc-AT2, serum starvation induced apoptosis in CHO-K1 cells but not in HEK293 or PC12 cells. Instead, HEK293 and PC12 cells stably expressing Myc-AT2 exhibited partial cell cycle arrest with cells accumulating at G1 and S phases, respectively. Taken together, these results suggest that expression levels, subcellular distributions and ligand-independent constitutive activities of AT2 receptor were cell type-dependent while posttranslational processing of nascent AT2 receptor protein was modulated by epitope tag and mode of expression. PMID:23056563

  8. Novel Y-chromosomal microdeletions associated with non-obstructive azoospermia uncovered by high throughput sequencing of sequence-tagged sites (STSs)

    PubMed Central

    Liu, Xiao; Li, Zesong; Su, Zheng; Zhang, Junjie; Li, Honggang; Xie, Jun; Xu, Hanshi; Jiang, Tao; Luo, Liya; Zhang, Ruifang; Zeng, Xiaojing; Xu, Huaiqian; Huang, Yi; Mou, Lisha; Hu, Jingchu; Qian, Weiping; Zeng, Yong; Zhang, Xiuqing; Xiong, Chengliang; Yang, Huanming; Kristiansen, Karsten; Cai, Zhiming; Wang, Jun; Gui, Yaoting

    2016-01-01

    Y-chromosomal microdeletion (YCM) serves as an important genetic factor in non-obstructive azoospermia (NOA). Multiplex polymerase chain reaction (PCR) is routinely used to detect YCMs by tracing sequence-tagged sites (STSs) in the Y chromosome. Here we introduce a novel methodology in which we sequence 1,787 (post-filtering) STSs distributed across the entire male-specific Y chromosome (MSY) in parallel to uncover known and novel YCMs. We validated this approach with 766 Chinese men with NOA and 683 ethnically matched healthy individuals and detected 481 and 98 STSs that were deleted in the NOA and control group, representing a substantial portion of novel YCMs which significantly influenced the functions of spermatogenic genes. The NOA patients tended to carry more and rarer deletions that were enriched in nearby intragenic regions. Haplogroup O2* was revealed to be a protective lineage for NOA, in which the enrichment of b1/b3 deletion in haplogroup C was also observed. In summary, our work provides a new high-resolution portrait of deletions in the Y chromosome. PMID:26907467

  9. Novel Y-chromosomal microdeletions associated with non-obstructive azoospermia uncovered by high throughput sequencing of sequence-tagged sites (STSs).

    PubMed

    Liu, Xiao; Li, Zesong; Su, Zheng; Zhang, Junjie; Li, Honggang; Xie, Jun; Xu, Hanshi; Jiang, Tao; Luo, Liya; Zhang, Ruifang; Zeng, Xiaojing; Xu, Huaiqian; Huang, Yi; Mou, Lisha; Hu, Jingchu; Qian, Weiping; Zeng, Yong; Zhang, Xiuqing; Xiong, Chengliang; Yang, Huanming; Kristiansen, Karsten; Cai, Zhiming; Wang, Jun; Gui, Yaoting

    2016-01-01

    Y-chromosomal microdeletion (YCM) serves as an important genetic factor in non-obstructive azoospermia (NOA). Multiplex polymerase chain reaction (PCR) is routinely used to detect YCMs by tracing sequence-tagged sites (STSs) in the Y chromosome. Here we introduce a novel methodology in which we sequence 1,787 (post-filtering) STSs distributed across the entire male-specific Y chromosome (MSY) in parallel to uncover known and novel YCMs. We validated this approach with 766 Chinese men with NOA and 683 ethnically matched healthy individuals and detected 481 and 98 STSs that were deleted in the NOA and control group, representing a substantial portion of novel YCMs which significantly influenced the functions of spermatogenic genes. The NOA patients tended to carry more and rarer deletions that were enriched in nearby intragenic regions. Haplogroup O2* was revealed to be a protective lineage for NOA, in which the enrichment of b1/b3 deletion in haplogroup C was also observed. In summary, our work provides a new high-resolution portrait of deletions in the Y chromosome. PMID:26907467

  10. AB039. Novel Y-chromosomal microdeletions associated with non-obstructive azoospermia uncovered by high throughput sequencing of sequence-tagged sites (STSs)

    PubMed Central

    Li, Zesong

    2016-01-01

    Y-chromosomal microdeletion (YCM) serves as an important genetic factor in non-obstructive azoospermia (NOA). Multiplex polymerase chain reaction (PCR) is routinely used to detect YCMs by tracing sequence-tagged sites (STSs) in the Y chromosome. Here we introduce a novel methodology in which we sequence 1,787 (post-filtering) STSs distributed across the entire male-specific Y chromosome (MSY) in parallel to uncover known and novel YCMs. We validated this approach with 766 Chinese men with NOA and 683 ethnically matched healthy individuals and detected 481 and 98 STSs that were deleted in the NOA and control group, representing a substantial portion of novel YCMs which significantly influenced the functions of spermatogenic genes. The NOA patients tended to carry more and rarer deletions that were enriched in nearby intragenic regions. Haplogroup O2* was revealed to be a protective lineage for NOA, in which the enrichment of b1/b3 deletion in haplogroup C was also observed. In summary, our work provides a new high-resolution portrait of deletions in the Y chromosome.

  11. Sequence analysis of 497 mouse brain ESTs expressed in the substantia nigra

    SciTech Connect

    Stewart, G.J.; Savioz, A.; Davies, R.W.

    1997-01-15

    The use of subtracted, region-specific cDNA libraries combined with single-pass cDNA sequencing allows the discovery of novel genes and facilitates molecular description of the tissue or region involved. We report the sequence of 497 mouse expressed sequence tags (ESTs) from two subtracted libraries enriched for cDNAs expressed in the substantia nigra, a brain region with important roles in movement control and Parkinson disease. Of these, 238 ESTs give no database matches and therefore derive from novel genes. A further 115 ESTs show sequence similarity to ESTs from other organisms, which themselves do not yield any significant database matches to genes of known function. Fifty-six ESTs show sequence similarity to previously identified genes whose mouse homologues have not been reported. The total number of ESTs reported that are new for the mouse is 407, which, together with the 90 ESTs corresponding to known mouse genes or cDNAs, contributes to the molecular description of the substantia nigra. 21 refs., 4 tabs.

  12. Sequence analysis of 497 mouse brain ESTs expressed in the substantia nigra.

    PubMed

    Stewart, G J; Savioz, A; Davies, R W

    1997-01-15

    The use of subtracted, region-specific cDNA libraries combined with single-pass cDNA sequencing allows the discovery of novel genes and facilitates molecular description of the tissue or region involved. We report the sequence of 497 mouse expressed sequence tags (ESTs) from two subtracted libraries enriched for cDNAs expressed in the substantia nigra, a brain region with important roles in movement control and Parkinson disease. Of these, 238 ESTs give no database matches and therefore derive from novel genes. A further 115 ESTs show sequence similarity to ESTs from other organisms, which themselves do not yield any significant database matches to genes of known function. Fifty-six ESTs show sequence similarity to previously identified genes whose mouse homologues have not been reported. The total number of ESTs reported that are new for the mouse is 407, which, together with the 90 ESTs corresponding to known mouse genes or cDNAs, contributes to the molecular description of the substantia nigra. PMID:9027501

  13. EST sequencing of the parasitic nematode Haemonchus contortus suggests a shift in gene expression during transition to the parasitic stages.

    PubMed

    Hoekstra, R; Visser, A; Otsen, M; Tibben, J; Lenstra, J A; Roos, M H

    2000-09-01

    Expressed sequence tags from the parasitic nematode Haemonchus contortus were generated in order to identify anchor loci for comparative mapping between nematode genomes and candidate targets for future control measures. In total, 370 SL1 trans-spliced cDNAs from different developmental stages representing 195 different genes were partially sequenced. From these expressed sequence tags 50% were similar to genes with a known or predicted function and 19% were similar to nematode sequences with no ascribed function. From the first, free-living L1 and L3 stages relatively many cDNAs matched to housekeeping genes, and 11% (L1) or 23% (L3) of the encoded proteins were predicted to contain signal peptides. In contrast, no function could be ascribed to most of the cDNAs from the early L5 and adult parasitic stages, but for 30% (L5) or 55% (adult) of the encoded proteins a signal sequence was predicted. This limited analysis suggests that during the transition from the free-living to parasitic stages gene expression shifts towards the synthesis of less conserved extracellular proteins. These proteins offer the best perspectives for vaccine development and the development of anthelmintic drugs. In contrast, cDNAs from the first larval stages may be most suitable for comparative mapping with the free-living nematode Caenorhabditis elegans. PMID:10989145

  14. Purification and characterization of bioactive his6-tagged recombinant human tissue inhibitor of metalloproteinases-1 (TIMP-1) protein expressed at high yields in mammalian cells.

    PubMed

    Vinther, Lena; Lademann, Ulrik; Andersen, Elisabeth Veyhe; Højrup, Peter; Thaysen-Andersen, Morten; Krogh, Berit Olsen; Viuff, Birgitte; Brünner, Nils; Stenvang, Jan; Moreira, José M A

    2014-09-01

    Tissue inhibitor of metalloproteinases-1 (TIMP-1) is an endogenous inhibitor of matrix metalloproteinases (MMPs) with reported tumor promoting, as well as inhibitory, effects. These paradoxical properties are presumably mediated by different biological functions, MMP-dependent as well as -independent, and probably related to TIMP-1 levels of protein expression, post-translational modifications, and cellular localization. TIMP-1 is an N-glycosylated protein that folds into two functional domains, a C- and an N-terminal domain, with six disulfide bonds. Furthermore, TIMP-1 is processed in the N-terminal sequence. These three biochemical properties make TIMP-1 difficult to produce in conventional bacterial, insect, or yeast expression systems. We describe here a HEK293 cell-based strategy for production and purification of secreted and N-glycosylated recombinant his6-tagged human TIMP-1 (his6-rTIMP-1), which resulted in large amounts of highly purified and bioactive protein. Matrix-assisted laser desorption ionization mass spectrometry confirmed the N- and C-termini of his6-rTIMP-1, and N-glycosylation profiling showed a match to the N-glycosylation of human plasma TIMP-1. The his6-rTIMP-1 was bioactive as shown by its proper inhibitory effect on MMP-2 activity, and its stimulatory effect on cell growth when added to the growth medium of four different breast cancer cell lines. This study provides an easy set-up for large scale production and purification of bioactive, tagged recombinant human TIMP-1, which structurally and functionally is similar to endogenous human TIMP-1, while using an expression system that is adaptable to most biochemical and biomedical laboratories including those that do not perform protein purifications routinely. PMID:24998777

  15. Transcriptional Profiling of Newly Generated Dentate Granule Cells Using TU Tagging Reveals Pattern Shifts in Gene Expression during Circuit Integration1,2

    PubMed Central

    Chatzi, Christina; Shen, Rongkun; Goodman, Richard H.

    2016-01-01

    Abstract Despite representing only a small fraction of hippocampal granule cells, adult-generated newborn granule cells have been implicated in learning and memory (Aimone et al., 2011). Newborn granule cells undergo functional maturation and circuit integration over a period of weeks. However, it is difficult to assess the accompanying gene expression profiles in vivo with high spatial and temporal resolution using traditional methods. Here we used a novel method [“thiouracil (TU) tagging”] to map the profiles of nascent mRNAs in mouse immature newborn granule cells compared with mature granule cells. We targeted a nonmammalian uracil salvage enzyme, uracil phosphoribosyltransferase, to newborn neurons and mature granule cells using retroviral and lentiviral constructs, respectively. Subsequent injection of 4-TU tagged nascent RNAs for analysis by RNA sequencing. Several hundred genes were significantly enhanced in the retroviral dataset compared with the lentiviral dataset. We compared a selection of the enriched genes with steady-state levels of mRNAs using quantitative PCR. Ontology analysis revealed distinct patterns of nascent mRNA expression, with newly generated immature neurons showing enhanced expression for genes involved in synaptic function, and neural differentiation and development, as well as genes not previously associated with granule cell maturation. Surprisingly, the nascent mRNAs enriched in mature cells were related to energy homeostasis and metabolism, presumably indicative of the increased energy demands of synaptic transmission and their complex dendritic architecture. The high spatial and temporal resolution of our modified TU-tagging method provides a foundation for comparison with steady-state RNA analyses by traditional transcriptomic approaches in defining the functional roles of newborn neurons. PMID:27011954

  16. High-yield secretion of recombinant proteins expressed in tobacco cell culture with a designer glycopeptide tag: Process development.

    PubMed

    Zhang, Ningning; Gonzalez, Maria; Savary, Brett; Xu, Jianfeng

    2016-03-01

    Low-yield protein production remains the most significant economic hurdle with plant cell culture technology. Fusions of recombinant proteins with hydroxyproline-O-glycosylated designer glycopeptide tags have consistently boosted secreted protein yields. This prompted us to study the process development of this technology aiming to achieve productivity levels necessary for commercial viability. We used a tobacco BY-2 cell culture expressing EGFP as fusion with a glycopeptide tag comprised of 32 repeat of "Ser-Pro" dipeptide, or (SP)32 , to study cell growth and protein secretion, culture scale-up, and establishment of perfusion cultures for continuous production. The BY-2 cells accumulated low levels of cell biomass (∼7.5 g DW/L) in Schenk & Hildebrandt medium, but secreted high yields of (SP)32 -tagged EGFP (125 mg/L). Protein productivity of the cell culture has been stable for 6.0 years. The BY-2 cells cultured in a 5-L bioreactor similarly produced high secreted protein yield at 131 mg/L. Successful operation of a cell perfusion culture for 30 days was achieved under the perfusion rate of 0.25 and 0.5 day(-1) , generating a protein volumetric productivity of 17.6 and 28.9 mg/day/L, respectively. This research demonstrates the great potential of the designer glycopeptide technology for use in commercial production of valuable proteins with plant cell cultures. PMID:26627201

  17. High-throughput Gene Tagging in Trypanosoma brucei.

    PubMed

    Dyer, Philip; Dean, Samuel; Sunter, Jack

    2016-01-01

    Improvements in mass spectrometry, sequencing and bioinformatics have generated large datasets of potentially interesting genes. Tagging these proteins can give insights into their function by determining their localization within the cell and enabling interaction partner identification. We recently published a fast and scalable method to generate Trypanosoma brucei cell lines that express a tagged protein from the endogenous locus. The method was based on a plasmid we generated that, when coupled with long primer PCR, can be used to modify a gene to encode a protein tagged at either terminus. This allows the tagging of dozens of trypanosome proteins in parallel, facilitating the large-scale validation of candidate genes of interest. This system can be used to tag proteins for localization (using a fluorescent protein, epitope tag or electron microscopy tag) or biochemistry (using tags for purification, such as the TAP (tandem affinity purification) tag). Here, we describe a protocol to perform the long primer PCR and the electroporation in 96-well plates, with the recovery and selection of transgenic trypanosomes occurring in 24-well plates. With this workflow, hundreds of proteins can be tagged in parallel; this is an order of magnitude improvement to our previous protocol and genome scale tagging is now possible. PMID:27584862

  18. Astaxanthin lowers plasma TAG concentrations and increases hepatic antioxidant gene expression in diet-induced obesity mice.

    PubMed

    Yang, Yue; Pham, Tho X; Wegner, Casey J; Kim, Bohkyung; Ku, Chai Siah; Park, Young-Ki; Lee, Ji-Young

    2014-12-14

    Non-alcoholic fatty liver disease (NAFLD) is significantly associated with hyperlipidaemia and oxidative stress. We have previously reported that astaxanthin (ASTX), a xanthophyll carotenoid, lowers plasma total cholesterol and TAG concentrations in apoE knockout mice. To investigate whether ASTX supplementation can prevent the development of NAFLD in obesity, male C57BL/6J mice (n 8 per group) were fed a high-fat diet (35%, w/w) supplemented with 0, 0.003, 0.01 or 0.03% of ASTX (w/w) for 12 weeks. The 0.03% ASTX-supplemented group, but not the other groups, exhibited a significant decrease in plasma TAG concentrations, suggesting that ASTX at a 0.03% supplementation dosage exerts a hypotriacylglycerolaemic effect. Although there was an increase in the mRNA expression of fatty acid synthase and diglyceride acyltransferase 2, the mRNA levels of acyl-CoA oxidase 1, a critical enzyme in peroxisomal fatty acid β-oxidation, exhibited an increase in the 0.03% ASTX-supplemented group. There was a decrease in plasma alanine transaminase (ALT) and aspartate transaminase (AST) concentrations in the 0.03% ASTX-supplemented group. There was a significant increase in the hepatic mRNA expression of nuclear factor erythroid 2-related factor 2 and its downstream genes, which are critical for endogenous antioxidant mechanism, in the 0.03% ASTX-supplemented group. Furthermore, there was a significant decrease in the mRNA abundance of IL-6 in the primary splenocytes isolated from the 0.03% ASTX-supplemented group upon lipopolysaccharide (LPS) stimulation when compared with that in the splenocytes isolated from the control group. In conclusion, ASTX supplementation lowered the plasma concentrations of TAG, ALT and AST, increased the hepatic expression of endogenous antioxidant genes, and rendered splenocytes less sensitive to LPS stimulation. Therefore, ASTX may prevent obesity-associated metabolic disturbances and inflammation. PMID:25328157

  19. N-terminal sequence tagging using reliably determined b2 ions: a useful approach to deconvolute tandem mass spectra of co-fragmented peptides in proteomics.

    PubMed

    Kryuchkov, Fedor; Verano-Braga, Thiago; Kjeldsen, Frank

    2014-05-30

    With the recent introduction of higher-energy collisional dissociation (HCD) in Orbitrap mass spectrometry, the popularity of that technique has grown tremendously in the proteomics society. HCD spectra, however, are characterized by a limited distribution of bn-type ions, which permit the generation of reliable sequence tags based on complementary b,y pairs both for de novo sequencing and sequence tagging strategies. Instead, most peptide HCD spectra (~95%) are dominated with b2 ions. In this work, we analyzed positive predictive values of b2 ions in HCD, and found that b2 ions can be determined with >97% certainty in the presence of a2 and its complementary yn-2 ions. Analytically, b2 ions provide information on the composition of the first two N-terminal amino acids in peptides. Their utilization in N-terminal sequence tagging leads to a significant decrease in false discovery rate by filtering out false positives while retaining true positive identifications. As a consequence, the number of peptide spectrum matches (PSMs) increased by 4.8% at fixed FDR (1%). This approach allows for deconvolution of mixture spectra and increased the number of PSM to 9.2% in a complex human sample and to 24% in a complex sample of synthetic peptides at 1% FDR. PMID:24726481

  20. Transcriptome sequencing and development of an expression microarray platform for the domestic ferret

    PubMed Central

    2010-01-01

    Background The ferret (Mustela putorius furo) represents an attractive animal model for the study of respiratory diseases, including influenza. Despite its importance for biomedical research, the number of reagents for molecular and immunological analysis is restricted. We present here a parallel sequencing effort to produce an extensive EST (expressed sequence tags) dataset derived from a normalized ferret cDNA library made from mRNA from ferret blood, liver, lung, spleen and brain. Results We produced more than 500000 sequence reads that were assembled into 16000 partial ferret genes. These genes were combined with the available ferret sequences in the GenBank to develop a ferret specific microarray platform. Using this array, we detected tissue specific expression patterns which were confirmed by quantitative real time PCR assays. We also present a set of 41 ferret genes with even transcription profiles across the tested tissues, indicating their usefulness as housekeeping genes. Conclusion The tools developed in this study allow for functional genomic analysis and make further development of reagents for the ferret model possible. PMID:20403183

  1. The TAG family of cancer/testis antigens is widely expressed in a variety of malignancies and gives rise to HLA-A2-restricted epitopes.

    PubMed

    Adair, Sara J; Carr, Tiffany M; Fink, Mitsú J; Slingluff, Craig L; Hogan, Kevin T

    2008-01-01

    The TAG-1, TAG-2a, TAG-2b, and TAG-2c cancer/testis genes, known to be expressed in an unusually high percentage of melanoma cell lines, are shown here to be expressed in a variety of tumor lines of diverse histologic type, including cancers of the brain, breast, colon, lung, ovary, pharynx, and tongue. The genes are also expressed in fresh, uncultured melanoma, and ovarian cancer cells. Epitope prediction algorithms were used to identify potential HLA-A1, HLA-A2, HLA-A3, HLA-B7, and HLA-B8 epitopes, and these potential epitopes were tested for their ability to stimulate a peptide-specific cytotoxic T lymphocyte response using lymphocytes from healthy donors. Two HLA-A2-restricted epitopes (SLGWLFLLL and LLLRLECNV) were identified using this approach. Cytotoxic T lymphocytes specific for each of these peptides were capable of recognizing tumor cells expressing both the corresponding class I major histocompatibility complex encoded molecule and the TAG genes. These results indicate that TAG-derived peptides may be good components of a therapeutic vaccine designed to target melanoma and a variety of epithelial cell-derived malignancies. PMID:18157007

  2. A safety-modified SV40 Tag developed for human cancer immunotherapy

    PubMed Central

    Tseng-Rogenski, Stephanie S; Arredouani, Mohamed S; Escara-Wilke, June F; Neeley, Yilin C; Imperiale, Michael J; Sanda, Martin G

    2008-01-01

    Simian virus 40 (SV40)-like DNA sequences have been found in a variety of human tumors, raising the possibility that strategies targeting SV40 may provide a potential avenue for immunotherapy directed against SV40 large T Antigen (Tag)-expressing tumors. We generated a recombinant vaccinia (vac-mTag) expressing mTag and herein assessed the ability of mTag to transform cells and to interact with anti-oncoproteins, as well as screened for the presence of potential HLA-A2.1-restricted epitopes within mTag. We found that transfection of cells with mTag did not lead to their transformation. Also, we demonstrated that mTag protein is degraded rapidly in cells. In addition, our work revealed that mTag did not physically interact with certain anti-oncoproteins. Finally, two potential HLA-A2.1-restricted functional epitopes within mTag sequence were identified. Our results show that mTag lacks the oncogenecity of full-length Tag and harbors potential HLA-A2.1-restricted immunogenic epitopes, hence suggesting the safety of vac-mTag for use in cancer immunotherapy. PMID:19920890

  3. Enhanced expression of human prostaglandin H synthase-2 in the yeast Pichia pastoris and removal of the C-terminal tag with bovine carboxypeptidase A.

    PubMed

    Kukk, Kaia; Samel, Nigulas

    2016-08-10

    Vertebrate prostaglandin H synthases (PGHSs) are membrane-bound disulphide-containing hemoglycoproteins. Therefore, eukaryotic expression systems are required for the production of recombinant PGHSs. Recently we announced the expression of human PGHS-2 (hPGHS-2) in the yeast Pichia pastoris. Here we report improved production of hPGHS-2 in P. pastoris and a convenient method for the purification and de-tagging of the protein. An affinity tag comprised of a proline, a glycine and eight histidines was introduced into the C-terminal end of hPGHS-2. The tagged hPGHS-2 was expressed intracellularly in P. pastoris under the control of a constitutive or methanol-inducible promoter. Compared to constitutive expression, methanol-induced expression yielded approximately four times more protein. The analysis of high and low gene copy number recombinants revealed a positive correlation between the gene copy number and the expression level of hPGHS-2. The recombinant hPGHS-2 was purified using immobilised metal ion affinity chromatography. A novel elution method, treatment of the affinity resin with bovine carboxypeptidase A, was employed. The yield of pure de-tagged hPGHS-2 from 1l of yeast culture was approximately 3mg. The protein purification process with simultaneous removal of the C-terminal polyhistidine tag could be easily applied for the affinity purification of other proteins. PMID:27316830

  4. Novel Cysteine Tags for the Sequencing of Non-Tryptic Disulfide Peptides of Anurans: ESI-MS Study of Fragmentation Efficiency

    NASA Astrophysics Data System (ADS)

    Samgina, Tatyana Y.; Vorontsov, Egor A.; Gorshkov, Vladimir A.; Artemenko, Konstantin A.; Nifant'ev, Ilya E.; Kanawati, Basem; Schmitt-Kopplin, Philippe; Zubarev, Roman A.; Lebedev, Albert T.

    2011-12-01

    Mass spectrometry faces considerable difficulties in de novo sequencing of long non-tryptic peptides with S-S bonds. Long disulfide-containing peptides brevinins 1E and 2Ec from frog Rana ridibunda were reduced and alkylated with nine novel and three known derivatizing agents. Eight of the novel reagents are maleimide derivatives. Modified samples were subjected to MS/MS studies on FT-ICR and Orbitrap mass spectrometers using CAD/HCD or ECD/ETD techniques. Procedures, fragmentation patterns, and sequence coverage for two peptides modified with 12 tags are described. ECD/ETD and CAD fragmentation revealed complementary sequence information. Higher-energy collisionally activated dissociation (HCD) sufficiently enhanced y-ions formation for brevinin 1E, but not for brevinin 2Ec. Some novel tags [ N-benzylmaleimide, N-(2,6-dimethylphenyl)maleimide] along with known N-phenylmaleimide and iodoacetic acid showed high total sequence coverage taking into account combined ETD and HCD fragmentation. Moreover, modification of long (34 residues) brevinin 2Ec with N-benzylmaleimide or N-(2,6-dimethylphenyl)maleimide yielded high sequence coverage and full C-terminal sequence determination with ECD alone.

  5. Long-range effects of tag sequence on marginally stabilized structure in HIV-1 p24 capsid protein monitored using NMR.

    PubMed

    Okazaki, Honoka; Kaneko, Chie; Hirahara, Miyuki; Watanabe, Satoru; Tochio, Naoya; Kigawa, Takanori; Nishimura, Chiaki

    2014-09-01

    N-terminal domain of HIV-1 p24 capsid protein is a globular fold composed of seven helices and two β-strands with a flexible structure including the α4-5 loop and both N- and C-terminal ends. However, the protein shows a high tendency (48%) for an intrinsically disordered structure based on the PONDR VL-XT prediction from the primary sequence. To assess the possibility of marginally stabilized structure under physiological conditions, the N-terminal domain of p24 was destabilized by the addition of an artificial flexible tag to either N- or C-terminal ends, and it was analyzed using T1, T2, hetero-nuclear NOE, and amide-proton exchange experiments. When the C-terminal tag (12 residues) was attached, the regions of the α3-4 loop and helix 6 as well as the α4-5 loop attained the flexible structures. Furthermore, in the protein containing the N-terminal tag (27 residues), helix 4 in addition to the above-mentioned area including α3-4 and α4-5 loops as well as helix 6 exhibited highly disordered structures. Thus, the long-range effects of the existence of tag sequence was observed in the stepwise manner of the appearance of disordered structures (step 1: α4-5 loop, step 2: α3-4 loop and helix 6, and step 3: helix 4). Furthermore, the disordered regions in tagged proteins were consistent with the PONDR VL-XT disordered prediction. The dynamic structure located in the middle part (α3-4 loop to helix 6) of the protein shown in this study may be related to the assembly of the viral particle. PMID:24960591

  6. Analytic signal phase-based myocardial motion estimation in tagged MRI sequences by a bilinear model and motion compensation.

    PubMed

    Wang, Liang; Basarab, Adrian; Girard, Patrick R; Croisille, Pierre; Clarysse, Patrick; Delachartre, Philippe

    2015-08-01

    Different mathematical tools, such as multidimensional analytic signals, allow for the calculation of 2D spatial phases of real-value images. The motion estimation method proposed in this paper is based on two spatial phases of the 2D analytic signal applied to cardiac sequences. By combining the information of these phases issued from analytic signals of two successive frames, we propose an analytical estimator for 2D local displacements. To improve the accuracy of the motion estimation, a local bilinear deformation model is used within an iterative estimation scheme. The main advantages of our method are: (1) The phase-based method allows the displacement to be estimated with subpixel accuracy and is robust to image intensity variation in time; (2) Preliminary filtering is not required due to the bilinear model. The proposed algorithm, integrating phase-based optical flow motion estimation and the combination of global motion compensation with local bilinear transform, allows spatio-temporal cardiac motion analysis, e.g. strain and dense trajectory estimation over the cardiac cycle. Results from 7 realistic simulated tagged magnetic resonance imaging (MRI) sequences show that our method is more accurate compared with state-of-the-art method for cardiac motion analysis and with another differential approach from the literature. The motion estimation errors (end point error) of the proposed method are reduced by about 33% compared with that of the two methods. In our work, the frame-to-frame displacements are further accumulated in time, to allow for the calculation of myocardial Lagrangian cardiac strains and point trajectories. Indeed, from the estimated trajectories in time on 11 in vivo data sets (9 patients and 2 healthy volunteers), the shape of myocardial point trajectories belonging to pathological regions are clearly reduced in magnitude compared with the ones from normal regions. Myocardial point trajectories, estimated from our phase-based analytic

  7. Affinity Purification of a Recombinant Protein Expressed as a Fusion with the Maltose-Binding Protein (MBP) Tag

    PubMed Central

    Duong-Ly, Krisna C.; Gabelli, Sandra B.

    2015-01-01

    Expression of fusion proteins such as MBP fusions can be used as a way to improve the solubility of the expressed protein in E. coli (Fox and Waugh, 2003; Nallamsetty et al., 2005; Nallamsetty and Waugh, 2006) and as a way to introduce an affinity purification tag. The protocol that follows was designed by the authors as a first step in the purification of a recombinant protein fused with MBP, using fast protein liquid chromatography (FPLC). Cells should have been thawed, resuspended in binding buffer, and lysed by sonication or microfluidization before mixing with the amylose resin or loading on the column. Slight modifications to this protocol may be made to accommodate both the protein of interest and the availability of equipment. PMID:26096500

  8. Development and optimization of sequence-tagged microsatellite site markers to detect genetic diversity within Colletotrichum capsici, a causal agent of chilli pepper anthracnose disease.

    PubMed

    Ranathunge, N P; Ford, R; Taylor, P W J

    2009-07-01

    Genomic libraries enriched for microsatellites from Colletotrichum capsici, one of the major causal agents of anthracnose disease in chilli pepper (Capsicum spp.), were developed using a modified hybridization procedure. Twenty-seven robust primer pairs were designed from microsatellite flanking sequences and were characterized using 52 isolates from three countries India, Sri Lanka and Thailand. Highest gene diversity of 0.857 was observed at the CCSSR1 with up to 18 alleles among all the isolates whereas the differentiation ranged from 0.05 to 0.45. The sequence-tagged microsatellite site markers developed in this study will be useful for genetic analyses of C. capsici populations. PMID:21564867

  9. Spiral MR myocardial tagging.

    PubMed

    Ryf, Salome; Kissinger, Kraig V; Spiegel, Marcus A; Börnert, Peter; Manning, Warren J; Boesiger, Peter; Stuber, Matthias

    2004-02-01

    In the present study, complementary spatial modulation of magnetization (CSPAMM) myocardial tagging was extended with an interleaved spiral imaging sequence. The use of a spiral sequence enables the acquisition of grid-tagged images with a tagline distance as low as 4 mm in a single breath-hold. Alternatively, a high temporal resolution of 77 frames per second was obtained with 8-mm grid spacing. Ten healthy adult subjects were studied. With this new approach, high-quality images can be obtained and the tags persist throughout the entire cardiac cycle. PMID:14755646

  10. Birbeck granule-like "organized smooth endoplasmic reticulum" resulting from the expression of a cytoplasmic YFP-tagged langerin.

    PubMed

    Lenormand, Cédric; Spiegelhalter, Coralie; Cinquin, Bertrand; Bardin, Sabine; Bausinger, Huguette; Angénieux, Catherine; Eckly, Anita; Proamer, Fabienne; Wall, David; Lich, Ben; Tourne, Sylvie; Hanau, Daniel; Schwab, Yannick; Salamero, Jean; de la Salle, Henri

    2013-01-01

    Langerin is required for the biogenesis of Birbeck granules (BGs), the characteristic organelles of Langerhans cells. We previously used a Langerin-YFP fusion protein having a C-terminal luminal YFP tag to dynamically decipher the molecular and cellular processes which accompany the traffic of Langerin. In order to elucidate the interactions of Langerin with its trafficking effectors and their structural impact on the biogenesis of BGs, we generated a YFP-Langerin chimera with an N-terminal, cytosolic YFP tag. This latter fusion protein induced the formation of YFP-positive large puncta. Live cell imaging coupled to a fluorescence recovery after photobleaching approach showed that this coalescence of proteins in newly formed compartments was static. In contrast, the YFP-positive structures present in the pericentriolar region of cells expressing Langerin-YFP chimera, displayed fluorescent recovery characteristics compatible with active membrane exchanges. Using correlative light-electron microscopy we showed that the coalescent structures represented highly organized stacks of membranes with a pentalaminar architecture typical of BGs. Continuities between these organelles and the rough endoplasmic reticulum allowed us to identify the stacks of membranes as a form of "Organized Smooth Endoplasmic Reticulum" (OSER), with distinct molecular and physiological properties. The involvement of homotypic interactions between cytoplasmic YFP molecules was demonstrated using an A206K variant of YFP, which restored most of the Langerin traffic and BG characteristics observed in Langerhans cells. Mutation of the carbohydrate recognition domain also blocked the formation of OSER. Hence, a "double-lock" mechanism governs the behavior of YFP-Langerin, where asymmetric homodimerization of the YFP tag and homotypic interactions between the lectin domains of Langerin molecules participate in its retention and the subsequent formation of BG-like OSER. These observations confirm that

  11. Expression profiling and comparative sequence derived insights into lipid metabolism

    SciTech Connect

    Callow, Matthew J.; Rubin, Edward M.

    2001-12-19

    Expression profiling and genomic DNA sequence comparisons are increasingly being applied to the identification and analysis of the genes involved in lipid metabolism. Not only has genome-wide expression profiling aided in the identification of novel genes involved in important processes in lipid metabolism such as sterol efflux, but the utilization of information from these studies has added to our understanding of the regulation of pathways participating in the process. Coupled with these gene expression studies, cross species comparison, searching for sequences conserved through evolution, has proven to be a powerful tool to identify important non-coding regulatory sequences as well as the discovery of novel genes relevant to lipid biology. An example of the value of this approach was the recent chance discovery of a new apolipoprotein gene (apo AV) that has dramatic effects upon triglyceride metabolism in mice and humans.

  12. Method to produce acetyldiacylglycerols (ac-TAGs) by expression of an acetyltransferase gene isolated from Euonymus alatus (burning bush)

    DOEpatents

    Durrett, Timothy; Ohlrogge, John; Pollard, Michael

    2016-05-03

    The present invention relates to novel diacylglycerol acyltransferase genes and proteins, and methods of their use. In particular, the invention describes genes encoding proteins having diacylglycerol acetyltransferase activity, specifically for transferring an acetyl group to a diacylglycerol substrate to form acetyl-Triacylglycerols (ac-TAGS), for example, a 3-acetyl-1,2-diacyl-sn-glycerol. The present invention encompasses both native and recombinant wild-type forms of the transferase, as well as mutants and variant forms. The present invention also relates to methods of using novel diacylglycerol acyltransferase genes and proteins, including their expression in transgenic organisms at commercially viable levels, for increasing production of 3-acetyl-1,2-diacyl-sn-glycerols in plant oils and altering the composition of oils produced by microorganisms, such as yeast, by increasing ac-TAG production. Additionally, oils produced by methods of the present inventions comprising genes and proteins are contemplated for use as biodiesel fuel, in polymer production and as naturally produced food oils with reduced calories.

  13. Epitope-Tagged Autotransporters as Single-Cell Reporters for Gene Expression by a Salmonella Typhimurium wbaP Mutant.

    PubMed

    Curkić, Ismeta; Schütz, Monika; Oberhettinger, Philipp; Diard, Médéric; Claassen, Manfred; Linke, Dirk; Hardt, Wolf-Dietrich

    2016-01-01

    Phenotypic diversity is an important trait of bacterial populations and can enhance fitness of the existing genotype in a given environment. To characterize different subpopulations, several studies have analyzed differential gene expression using fluorescent reporters. These studies visualized either single or multiple genes within single cells using different fluorescent proteins. However, variable maturation and folding kinetics of different fluorophores complicate the study of dynamics of gene expression. Here, we present a proof-of-principle study for an alternative gene expression system in a wbaP mutant of Salmonella Typhimurium (S. Tm) lacking the O-sidechain of the lipopolysaccharide. We employed the hemagglutinin (HA)-tagged inverse autotransporter invasin (invAHA) as a transcriptional reporter for the expression of the type three secretion system 1 (T1) in S. Tm. Using a two-reporter approach with GFP and the InvAHA in single cells, we verify that this reporter system can be used for T1 gene expression analysis, at least in strains lacking the O-antigen (wbaP), which are permissive for detection of the surface-exposed HA-epitope. When we placed the two reporters gfp and invAHA under the control of either one or two different promoters of the T1 regulon, we were able to show correlative expression of both reporters. We conclude that the invAHA reporter system is a suitable tool to analyze T1gene expression in S. Tm and propose its applicability as molecular tool for gene expression studies within single cells. PMID:27149272

  14. Epitope-Tagged Autotransporters as Single-Cell Reporters for Gene Expression by a Salmonella Typhimurium wbaP Mutant

    PubMed Central

    Curkić, Ismeta; Schütz, Monika; Oberhettinger, Philipp; Diard, Médéric; Claassen, Manfred; Linke, Dirk; Hardt, Wolf-Dietrich

    2016-01-01

    Phenotypic diversity is an important trait of bacterial populations and can enhance fitness of the existing genotype in a given environment. To characterize different subpopulations, several studies have analyzed differential gene expression using fluorescent reporters. These studies visualized either single or multiple genes within single cells using different fluorescent proteins. However, variable maturation and folding kinetics of different fluorophores complicate the study of dynamics of gene expression. Here, we present a proof-of-principle study for an alternative gene expression system in a wbaP mutant of Salmonella Typhimurium (S. Tm) lacking the O-sidechain of the lipopolysaccharide. We employed the hemagglutinin (HA)-tagged inverse autotransporter invasin (invAHA) as a transcriptional reporter for the expression of the type three secretion system 1 (T1) in S. Tm. Using a two-reporter approach with GFP and the InvAHA in single cells, we verify that this reporter system can be used for T1 gene expression analysis, at least in strains lacking the O-antigen (wbaP), which are permissive for detection of the surface-exposed HA-epitope. When we placed the two reporters gfp and invAHA under the control of either one or two different promoters of the T1 regulon, we were able to show correlative expression of both reporters. We conclude that the invAHA reporter system is a suitable tool to analyze T1gene expression in S. Tm and propose its applicability as molecular tool for gene expression studies within single cells. PMID:27149272

  15. Chromatin Interaction Analysis with Paired-End Tag Sequencing (ChIA-PET) for Mapping Chromatin Interactions and Understanding Transcription Regulation

    PubMed Central

    Poh, Huay Mei; Peh, Su Qin; Ong, Chin Thing; Zhang, Jingyao; Ruan, Xiaoan; Ruan, Yijun

    2012-01-01

    Genomes are organized into three-dimensional structures, adopting higher-order conformations inside the micron-sized nuclear spaces 7, 2, 12. Such architectures are not random and involve interactions between gene promoters and regulatory elements 13. The binding of transcription factors to specific regulatory sequences brings about a network of transcription regulation and coordination 1, 14. Chromatin Interaction Analysis by Paired-End Tag Sequencing (ChIA-PET) was developed to identify these higher-order chromatin structures 5,6. Cells are fixed and interacting loci are captured by covalent DNA-protein cross-links. To minimize non-specific noise and reduce complexity, as well as to increase the specificity of the chromatin interaction analysis, chromatin immunoprecipitation (ChIP) is used against specific protein factors to enrich chromatin fragments of interest before proximity ligation. Ligation involving half-linkers subsequently forms covalent links between pairs of DNA fragments tethered together within individual chromatin complexes. The flanking MmeI restriction enzyme sites in the half-linkers allow extraction of paired end tag-linker-tag constructs (PETs) upon MmeI digestion. As the half-linkers are biotinylated, these PET constructs are purified using streptavidin-magnetic beads. The purified PETs are ligated with next-generation sequencing adaptors and a catalog of interacting fragments is generated via next-generation sequencers such as the Illumina Genome Analyzer. Mapping and bioinformatics analysis is then performed to identify ChIP-enriched binding sites and ChIP-enriched chromatin interactions 8. We have produced a video to demonstrate critical aspects of the ChIA-PET protocol, especially the preparation of ChIP as the quality of ChIP plays a major role in the outcome of a ChIA-PET library. As the protocols are very long, only the critical steps are shown in the video. PMID:22564980

  16. Expression, purification, and immobilization of His-tagged D-amino acid oxidase of Trigonopsis variabilis in Pichia pastoris.

    PubMed

    Zheng, Huabao; Wang, Xiaolan; Chen, Jun; Zhu, Ke; Zhao, Yuhua; Yang, Yunliu; Yang, Sheng; Jiang, Weihong

    2006-05-01

    High-level expression of D: -amino acid oxidase (DAO) has been reported in Pichia pastoris by integrating the DAO gene under the control of the alcohol oxidase promoter (PAOX1). However, the time taken to reach peak product concentration is usually long (approximately 43 h), and cultivation requires tight regulation of methanol feeding. In this paper, we describe the expression of His-tagged DAO (HDAO) in P. pastoris using the glyceraldehydes-3-phosphate dehydrogenase promoter (PGAP). The maximal level of HDAO expression using the PGAP integrant is attained in 13 h and is equal to that obtained using the PAOX1 integrant in 43 h. We also explored the possibility of secreting HDAO in P. pastoris. In-frame fusion of Saccharomyces cerevisiae alpha-factor secretion signal under a PGAP or PAOX1 resulted in low-level secretion of active HDAO, which was not of practical use. The intracellularly expressed HDAO under PGAP was purified by agar-based affinity support and then immobilized on Amberzyme oxirane resin. The immobilized HDAO, with specific activity of 75 U g-1 (wet weight), could be recycled more than 14 times without significant loss of activity. The data suggest that intracellular production of HDAO under PGAP, followed by affinity purification and immobilization on oxirane resin, may serve as an effective process for the manufacture of immobilized DAO for industrial application. PMID:16217653

  17. Sequencing and functional expression of the malonyl-CoA-sensitive carnitine palmitoyltransferase from Drosophila melanogaster.

    PubMed Central

    Jackson, V N; Cameron, J M; Zammit, V A; Price, N T

    1999-01-01

    Using expressed sequence tag data, we obtained a cDNA for a carnitine palmitoyltransferase I (CPT I)-like molecule from Drosophila melanogaster. The cDNA encodes a 782-residue protein that shows 49% and 48% sequence identity with the rat liver and skeletal-muscle isoforms of CPT I respectively. The sequence has two predicted membrane-spanning regions, suggesting that it adopts the same topology as its mammalian counterparts. The sequence contains all the residues that have been shown to be characteristic of carnitine acetyltransferases. Expression in the yeast Pichia pastoris confirmed that the cDNA does encode a CPT enzyme. The activity was found to be associated with a mitochondria-enriched fraction. Kinetic analysis revealed a K(m) for carnitine of 406 microM and a K(m) for palmitoyl-CoA of 105 microM. The CPT activity was very sensitive to inhibition by malonyl-CoA, with an IC(50) of 0.74 microM when the activity was assayed with 35 microM palmitoyl-CoA and 1% (w/v) albumin at pH 7.0. A histidine residue at position 140 in rat liver CPT I has been indicated to be important for inhibition by malonyl-CoA. The equivalent residue (position 136) in Drosophila CPT I is arginine, implying that any basic residue might be compatible with such sensitivity. Evidence is presented that, unlike in mammals, Drosophila has only a single CPT I gene. Sequences suggesting the existence of a splice variant in the 5' untranslated region were found; this was consistent with the existence of two promoters for the CPT I gene. PMID:10417309

  18. Sequence Determinants of Circadian Gene Expression Phase in Cyanobacteria

    PubMed Central

    Vijayan, Vikram

    2013-01-01

    The cyanobacterium Synechococcus elongatus PCC 7942 exhibits global biphasic circadian oscillations in gene expression under constant-light conditions. Class I genes are maximally expressed in the subjective dusk, whereas class II genes are maximally expressed in the subjective dawn. Here, we identify sequence features that encode the phase of circadian gene expression. We find that, for multiple genes, an ∼70-nucleotide promoter fragment is sufficient to specify class I or II phase. We demonstrate that the gene expression phase can be changed by random mutagenesis and that a single-nucleotide substitution is sufficient to change the phase. Our study provides insight into how the gene expression phase is encoded in the cyanobacterial genome. PMID:23204469

  19. Expression and secretion of functional recombinant μO-conotoxin MrVIB-His-tag in Escherichia coli.

    PubMed

    Gao, Bingmiao; Zhangsun, Dongting; Hu, Yuanyan; Wu, Yong; Sheng, Lizi; Fang, Licong; Wu, Xiaosa; Yu, Jinpeng; Luo, Sulan

    2013-09-01

    μO-conotoxin MrVIB is a 31-amino acid peptide containing three disulfide bonds isolated from the venom of Conus marmoreus, which is a selective antagonist of voltage-gated sodium channel (VGSC) Nav1.8 and has a long-lasting analgesic activity. Drug development of MrVIB has long been hindered over 15 years by difficult chemical synthesis and oxidative folding. Herein we describe a different approach based on the recombinant expression of gene MrVIB in Escherichia coli. A secretion vector pET22b(+)-MrVIB fused with pelB leader signal peptide and His-tag was constructed, which was transformed into BL21 (DE3) strain of E. coli. The recombinant conotoxin MrVIB-His-tag (rMrVIB-His) was successfully expressed and secreted into the periplasmic space of BL21 (DE3) cells. The pelB leader signal peptide was properly cleaved and three disulfide bonds were also formed properly to yield biological active rMrVIB-His. Folded rMrVIB-His in the periplasmic fraction was isolated with a Ni-NTA affinity column, which was further purified using reverse-phase high-performance liquid chromatography (RP-HPLC) and identified by liquid chromatography/mass spectrometry-ion trap-time of flight mass spectrometry (LC/MS-IT-TOF). Biological activity assay of rMrVIB-His showed it had good analgesic effects in three pain models. PMID:23811388

  20. Expression and purification of the recombinant His-tagged GST-CD38 fusion protein using the baculovirus/insect cell expression system.

    PubMed

    Khoo, Keng Meng; Chang, Chan Fong; Schubert, Jeffrey; Wondrak, Ewald; Chng, Hiok Hee

    2005-04-01

    CD38 is a type II transmembrane glycoprotein found in myriad mammalian tissues and cell types. It is known for its involvement in the metabolism of cyclic ADP-ribose and nicotinic acid adenine dinucleotide phosphate, two nucleotides with calcium mobilizing activity independent of inositol trisphosphate. CD38 itself has been shown to have clinical significance in certain diseases with possible utilization in diagnostic and prognostic applications. Previous studies on several autoimmune diseases have shown the usefulness of recombinant CD38 protein expressed from Escherichia coli and Pichia pastoris in the detection of autoantibodies to CD38 via Western blot and ELISA. In this study, we produced a 6 x His-tagged GST-CD38 fusion protein using a recombinant baculovirus/insect cell expression technique that was purified as a soluble protein. The fusion protein was purified to homogeneity by affinity and gel filtration chromatography steps. It has an apparent molecular mass of 56 kDa on SDS-PAGE gel stained with Coomassie blue and was recognized on Western blots by antibodies against human CD38 as well as the polyhistidine tag. Peptide mass fingerprinting analysis confirmed the identity of human CD38 in the fusion protein. PMID:15766882

  1. Amplified expression of the tag+ and alkA+ genes in Escherichia coli: identification of gene products and effects on alkylation resistance.

    PubMed Central

    Kaasen, I; Evensen, G; Seeberg, E

    1986-01-01

    We have constructed plasmids which overproduce the tag and alkA gene products of Escherichia coli, i.e., 3-methyladenine DNA glycosylases I and II. The tag and alkA gene products were identified radiochemically in maxi- or minicells as polypeptides of 21 and 30 kilodaltons, respectively, which are consistent with the gel filtration molecular weights of the enzyme activities, thus confirming the identity of the cloned genes. High expression of the tag+-coded glycosylase almost completely suppressed the alkylation sensitivity of alkA mutants, indicating that high levels of 3-methyladenine DNA glycosylase I will eliminate the need for 3-methyladenine DNA glycosylase II in repair of alkylated DNA. Furthermore, overproduction of the alkA+-coded glycosylase greatly sensitizes wild-type cells to alkylation, suggesting that only a limited expression of this enzyme will allow efficient DNA repair. Images PMID:3536857

  2. Haplotypes of the TaGS5-A1 Gene Are Associated with Thousand-Kernel Weight in Chinese Bread Wheat

    PubMed Central

    Wang, Shasha; Yan, Xuefang; Wang, Yongyan; Liu, Hongmei; Cui, Dangqun; Chen, Feng

    2016-01-01

    In previous work, we cloned TaGS5 gene and found the association of TaGS5-A1 alleles with agronomic traits. In this study, the promoter sequence of the TaGS5-A1 gene was isolated from bread wheat. Sequencing results revealed that a G insertion was found in position -1925 bp of the TaGS5-A1 gene (Reference to ATG), which occurred in the Sp1 domain of the promoter sequence. Combined with previous single nucleotide polymorphism (SNP) in the TaGS5-A1 exon sequence, four genotypes were formed at the TaGS5-A1 locus and were designated as TaGS5-A1a-a, TaGS5-A1a-b, TaGS5-A1b-a, and TaGS5-A1b-b, respectively. Analysis of the association of TaGS5-A1 alleles with agronomic traits indicated that cultivars with the TaGS5-A1a-b allele possessed significantly higher thousand-kernel weight (TKW) and lower plant height than cultivars with the TaGS5-A1a-a allele, and cultivars with the TaGS5-A1b-b allele showed higher TKW than cultivars with the TaGS5-A1b-a allele. The differences of these traits between the TaGS5-A1a-a and TaGS5-A1a-b alleles were larger than those of the TaGS5-A1b-a and TaGS5-A1b-b alleles, suggesting that the -1925G insertion plays the more important role in TaGS5-A1a genotypes than in TaGS5-A1b genotypes. qRT-PCR indicated that TaGS5-A1b-b possessed the significantly highest expression level among four TaGS5-A1 haplotypes in mature seeds and further showed a significantly higher expression level than TaGS5-A1b-a at five different developmental stages of the seeds, suggesting that high expression of TaGS5-A1 was positively associated with high TKW in bread wheat. This study could provide a relatively superior genotype in view of TKW in wheat breeding programs and could also provide important information for dissection of the regulatory mechanism of the yield-related traits. PMID:27375643

  3. Versatile Microfluidic Platform for the Assessment of Sialic Acid Expression on Cancer Cells Using Quantum Dots with Phenylboronic Acid Tags.

    PubMed

    Cao, Jun-Tao; Zhang, Peng-Hui; Liu, Yan-Ming; Abdel-Halim, E S; Zhu, Jun-Jie

    2015-07-15

    This work describes a versatile microfluidic platform for evaluation of cell-surface glycan expression at the single-cell level using quantum dots (QDs) tagged with phenylboronic acid. The platform was integrated with dual microwell arrays, allowing the introduction of cells in two states using the same cell culture chamber. The simultaneous analysis of cells in the same environment minimized errors resulting from different culture conditions. As proof-of-concept, the expressions of sialic acid (SA) groups on K562 cells, with or without 3'-azido-3'-deoxythymidine (AZT) treatment, were evaluated in the same chamber. 3-Aminophenylboronic acid functionalized CdSeTe@ZnS-SiO2 QDs (APBA-QDs) were prepared as probes to recognize SA groups on K562 cells with only one-step labeling. The results showed that the expression of SA moieties on K562 cells was increased by 18% and 31% after treatment with 20 and 40 μM AZT, respectively. Performing the drug treatment and control experiments simultaneously in the same chamber significantly improved the robustness and effectiveness of the assay. The strategy presented here provides an alternative tool for glycan analysis in a sensitive, high-throughput, and effective manner. PMID:26086216

  4. Expression and purification of 6xHis-tagged DNA binding domains of functional ecdysteroid receptor from drosophila melanogaster.

    PubMed

    Rusin, A; Niedziela-Majka, A; Rymarczyk, G; Ozyhar, A

    1996-01-01

    Two members of the nuclear receptor superfamily, EcR and Ultraspiracle (Usp) heterodimerize to form a functional receptor for 20-hydroxyecdysone-the key ecdysteroid controlling induction and modulation of morphogenetic events through Drosophila development. In order to study aspects of receptor function and ultimately the structural basis of the ecdysteroid receptor-DNA interaction, it is necessary to produce large quantities of purified EcR and Usp DNA-binding domains. Toward this end, we have expressed the EcR DNA-binding domain and the Usp DNA-binding domain as proteins with an affinity tag consisting of six histidine residues (6xHis-EcRDBD and 6xHis-UspDBD, respectively) using the expression vector pQE-30. Under optimal conditions, elaborated in this study, bacteria can express the recombinant 6xHis-EcRDBD to the levels of 11% of total soluble proteins and the 6xHis-UspDBD to the levels of 16%. Both proteins were purified to homogeneity from the soluble protein fraction using combination of ammonium sulphate fractionation and affinity chromatography on Ni-NTA agarose. The gel mobility shift experiments demonstrated that the purified 6xHis-EcRDBD and the 6xHis-UspDBD interact specifically with an 20-hydroxyecdysone response element from the promoter region of the hsp 27 Drosophila gene. PMID:9104497

  5. In silico identification of coffee genome expressed sequences potentially associated with resistance to diseases

    PubMed Central

    2010-01-01

    Sequences potentially associated with coffee resistance to diseases were identified by in silico analyses using the database of the Brazilian Coffee Genome Project (BCGP). Keywords corresponding to plant resistance mechanisms to pathogens identified in the literature were used as baits for data mining. Expressed sequence tags (ESTs) related to each of these keywords were identified with tools available in the BCGP bioinformatics platform. A total of 11,300 ESTs were mined. These ESTs were clustered and formed 979 EST-contigs with similarities to chitinases, kinases, cytochrome P450 and nucleotide binding site-leucine rich repeat (NBS-LRR) proteins, as well as with proteins related to disease resistance, pathogenesis, hypersensitivity response (HR) and plant defense responses to diseases. The 140 EST-contigs identified through the keyword NBS-LRR were classified according to function. This classification allowed association of the predicted products of EST-contigs with biological processes, including host defense and apoptosis, and with molecular functions such as nucleotide binding and signal transducer activity. Fisher's exact test was used to examine the significance of differences in contig expression between libraries representing the responses to biotic stress challenges and other libraries from the BCGP. This analysis revealed seven contigs highly similar to catalase, chitinase, protein with a BURP domain and unknown proteins. The involvement of these coffee proteins in plant responses to disease is discussed. PMID:21637594

  6. In silico identification of coffee genome expressed sequences potentially associated with resistance to diseases.

    PubMed

    Alvarenga, Samuel Mazzinghy; Caixeta, Eveline Teixeira; Hufnagel, Bárbara; Thiebaut, Flávia; Maciel-Zambolim, Eunize; Zambolim, Laércio; Sakiyama, Ney Sussumu

    2010-10-01

    Sequences potentially associated with coffee resistance to diseases were identified by in silico analyses using the database of the Brazilian Coffee Genome Project (BCGP). Keywords corresponding to plant resistance mechanisms to pathogens identified in the literature were used as baits for data mining. Expressed sequence tags (ESTs) related to each of these keywords were identified with tools available in the BCGP bioinformatics platform. A total of 11,300 ESTs were mined. These ESTs were clustered and formed 979 EST-contigs with similarities to chitinases, kinases, cytochrome P450 and nucleotide binding site-leucine rich repeat (NBS-LRR) proteins, as well as with proteins related to disease resistance, pathogenesis, hypersensitivity response (HR) and plant defense responses to diseases. The 140 EST-contigs identified through the keyword NBS-LRR were classified according to function. This classification allowed association of the predicted products of EST-contigs with biological processes, including host defense and apoptosis, and with molecular functions such as nucleotide binding and signal transducer activity. Fisher's exact test was used to examine the significance of differences in contig expression between libraries representing the responses to biotic stress challenges and other libraries from the BCGP. This analysis revealed seven contigs highly similar to catalase, chitinase, protein with a BURP domain and unknown proteins. The involvement of these coffee proteins in plant responses to disease is discussed. PMID:21637594

  7. Analysis of expressed sequence tags derived from a compatible Mycosphaerella fijiensis-banana interaction.

    PubMed

    Portal, Orelvis; Izquierdo, Yovanny; De Vleesschauwer, David; Sánchez-Rodríguez, Aminael; Mendoza-Rodríguez, Milady; Acosta-Suárez, Mayra; Ocaña, Bárbara; Jiménez, Elio; Höfte, Monica

    2011-05-01

    Mycosphaerella fijiensis, a hemibiotrophic fungus, is the causal agent of black leaf streak disease, the most serious foliar disease of bananas and plantains. To analyze the compatible interaction of M. fijiensis with Musa spp., a suppression subtractive hybridization (SSH) cDNA library was constructed to identify transcripts induced at late stages of infection in the host and the pathogen. In addition, a full-length cDNA library was created from the same mRNA starting material as the SSH library. The SSH procedure was effective in identifying specific genes predicted to be involved in plant-fungal interactions and new information was obtained mainly about genes and pathways activated in the plant. Several plant genes predicted to be involved in the synthesis of phenylpropanoids and detoxification compounds were identified, as well as pathogenesis-related proteins that could be involved in the plant response against M. fijiensis infection. At late stages of infection, jasmonic acid and ethylene signaling transduction pathways appear to be active, which corresponds with the necrotrophic life style of M. fijiensis. Quantitative PCR experiments revealed that antifungal genes encoding PR proteins and GDSL-like lipase are only transiently induced 30 days post inoculation (dpi), indicating that the fungus is probably actively repressing plant defense. The only fungal gene found was induced 37 dpi and encodes UDP-glucose pyrophosphorylase, an enzyme involved in the biosynthesis of trehalose. Trehalose biosynthesis was probably induced in response to prior activation of plant antifungal genes and may act as an osmoprotectant against membrane damage. PMID:21279642

  8. Abiotic stress-related expressed sequence tags from the diploid strawberry Fragaria vesca f. semperflorens

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background Plants of the Rosaceae family such as the large-statured tree crops apple, peach and cherry, the ornamental rose, the cultivated octoploid strawberry, F. ×ananassa, represent a considerable share of horticultural crops worldwide, many of which fail to produce maximally due to environment...

  9. Analysis and functional annotation of expressed sequence tags from the Asian longhorned beetle, Anoplophora glabripennis

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The Asian longhorned beetle (ALB), ‘Anoplophora glabripennis’, is one of the most economically and ecological important non-native, invasive forest pests recently discovered in North America. Despite the substantial impact of this pest, limited effort has been expended in regards to defining the ge...

  10. Expressed sequence tags from the black-winged sharpshooter: Application to biology and vector control

    Technology Transfer Automated Retrieval System (TEKTRAN)

    We identified 14 putative full-length transcripts of proteins important for the survival of the black-winged sharpshooter, BWSS, Oncometopia nigricans. The BWSS is considered a highly competent vector of several strains of the xylem-inhabiting bacterium Xylella fastidiosa, the causal agent of a numb...

  11. Generation of expressed sequence tags (ESTs) from cultivated peanuts for gene discovery and marker development

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Aflatoxin contamination caused by Aspergillus fungi is a great concern in peanut production worldwide. Pre-harvest Aspergilli infection and aflatoxin contamination are usually severe in peanuts that are grown under drought stressed conditions. Genomic research can provide new tools to study plant-mi...

  12. Insert sequence length determines transfection efficiency and gene expression levels in bicistronic mammalian expression vectors

    PubMed Central

    Payne, Andrew J; Gerdes, Bryan C; Kaja, Simon; Koulen, Peter

    2013-01-01

    Bicistronic expression vectors have been widely used for co-expression studies since the initial discovery of the internal ribosome entry site (IRES) about 25 years ago. IRES sequences allow the 5’ cap-independent initiation of translation of multiple genes on a single messenger RNA strand. Using a commercially available mammalian expression vector containing an IRES sequence with a 3’ green fluorescent protein fluorescent marker, we found that sequence length of the gene of interest expressed 5’ of the IRES site influences both expression of the 3’ fluorescent marker and overall transfection efficiency of the vector construct. Furthermore, we generated a novel construct expressing two distinct fluorescent markers and found that high expression of one gene can lower expression of the other. Observations from this study indicate that caution is warranted in the design of experiments utilizing an IRES system with a short 5’ gene of interest sequence (<300 bp), selection of single cells based on the expression profile of the 3’ optogenetic fluorescent marker, and assumptions made during data analysis. PMID:24380024

  13. A ubiquitin-10 promoter-based vector set for fluorescent protein tagging facilitates temporal stability and native protein distribution in transient and stable expression studies.

    PubMed

    Grefen, Christopher; Donald, Naomi; Hashimoto, Kenji; Kudla, Jörg; Schumacher, Karin; Blatt, Michael R

    2010-10-01

    Fluorescent tagging of proteins and confocal imaging techniques have become methods of choice in analysing the distributions and dynamic characteristics of proteins at the subcellular level. In common use are a number of strategies for transient expression that greatly reduce the preparation time in advance of imaging, but their applications are limited in success outside a few tractable species and tissues. We previously developed a simple method to transiently express fluorescently-tagged proteins in Arabidopsis root epidermis and root hairs. We describe here a set of Gateway-compatable vectors with fluorescent tags incorporating the ubiqutin-10 gene promoter (P(UBQ10) ) of Arabidopsis that gives prolonged expression of the fluorescently-tagged proteins, both in tobacco and Arabidopsis tissues, after transient transformation, and is equally useful in generating stably transformed lines. As a proof of principle, we carried out transformations with fluorescent markers for the integral plasma membrane protein SYP121, a member of the SNARE family of vesicle-trafficking proteins, and for DHAR1, a cytosolic protein that facilitates the scavenging of reactive oxygen species. We also carried out transformations with SYP121 and its interacting partner, the KC1 K(+) channel, to demonstrate the utility of the methods in bimolecular fluorescence complementation (BiFC). Transient transformations of Arabidopsis using Agrobacterium co-cultivation methods yielded expression in all epidermal cells, including root hairs and guard cells. Comparative studies showed that the P(UBQ10) promoter gives similar levels of expression to that driven by the native SYP121 promoter, faithfully reproducing the characteristics of protein distributions at the subcellular level. Unlike the 35S-driven construct, expression under the P(UBQ10) promoter remained elevated for periods in excess of 2 weeks after transient transformation. This toolbox of vectors and fluorescent tags promises significant

  14. pT7MT, a metallothionein 2A-tagged novel prokaryotic fusion expression vector.

    PubMed

    Marikar, Faiz M M T; Fang, Lei; Jiang, Shu-Han; Hua, Zi-Chun

    2007-05-01

    In the present article, a novel fusion expression vector for Escherichia coli was developed based on the pTORG plasmid, a derivative of pET32a. This vector, named pT7MT (GenBank Accession No DQ504436), carries a T7 promoter and it drives the downstream gene encoding Metallothionein 2A (MT2A). There are in-framed multiple cloning sites (MCS) downstream of the MT2A gene. A target gene can be cloned into the MCS and fused to the C-terminal of the MT2A gene in a compatible open reading frame (ORF) to achieve fusion expression. The metal-binding capability of MT2A allows the purification of fusion proteins by metal chelating affinity chromatogralhy, known as Ni2+-affinity chromatography. Using this expression vector, we successfully got the stable and high-yield expression of MT2A-GST and MT2A-Troponin I fusion proteins. These two proteins were easily purified from the supernatant of cell lysates by one-step Ni2+ -affinity chromatography. The final yields of MT2A-GST and MT2A-Troponin I were 30 mg/l and 28 mg/l in LB culture, respectively. Taken together, our data suggest that pT7MT can be applied as a useful expression vector for stable and high-yield production of fusion proteins. PMID:18051292

  15. Characterization of transcriptome dynamics during watermelon fruit development: sequencing, assembly, annotation and gene expression profiles

    PubMed Central

    2011-01-01

    Background Cultivated watermelon [Citrullus lanatus (Thunb.) Matsum. & Nakai var. lanatus] is an important agriculture crop world-wide. The fruit of watermelon undergoes distinct stages of development with dramatic changes in its size, color, sweetness, texture and aroma. In order to better understand the genetic and molecular basis of these changes and significantly expand the watermelon transcript catalog, we have selected four critical stages of watermelon fruit development and used Roche/454 next-generation sequencing technology to generate a large expressed sequence tag (EST) dataset and a comprehensive transcriptome profile for watermelon fruit flesh tissues. Results We performed half Roche/454 GS-FLX run for each of the four watermelon fruit developmental stages (immature white, white-pink flesh, red flesh and over-ripe) and obtained 577,023 high quality ESTs with an average length of 302.8 bp. De novo assembly of these ESTs together with 11,786 watermelon ESTs collected from GenBank produced 75,068 unigenes with a total length of approximately 31.8 Mb. Overall 54.9% of the unigenes showed significant similarities to known sequences in GenBank non-redundant (nr) protein database and around two-thirds of them matched proteins of cucumber, the most closely-related species with a sequenced genome. The unigenes were further assigned with gene ontology (GO) terms and mapped to biochemical pathways. More than 5,000 SSRs were identified from the EST collection. Furthermore we carried out digital gene expression analysis of these ESTs and identified 3,023 genes that were differentially expressed during watermelon fruit development and ripening, which provided novel insights into watermelon fruit biology and a comprehensive resource of candidate genes for future functional analysis. We then generated profiles of several interesting metabolites that are important to fruit quality including pigmentation and sweetness. Integrative analysis of metabolite and digital gene

  16. Identification of thyroid hormone response elements in vivo using mice expressing a tagged thyroid hormone receptor α1

    PubMed Central

    Dudazy-Gralla, Susi; Nordström, Kristina; Hofmann, Peter Josef; Meseh, Dina Abdul; Schomburg, Lutz; Vennström, Björn; Mittag, Jens

    2013-01-01

    TRα1 (thyroid hormone receptor α1) is well recognized for its importance in brain development. However, due to the difficulties in predicting TREs (thyroid hormone response elements) in silico and the lack of suitable antibodies against TRα1 for ChIP (chromatin immunoprecipitation), only a few direct TRα1 target genes have been identified in the brain. Here we demonstrate that mice expressing a TRα1–GFP (green fluorescent protein) fusion protein from the endogenous TRα locus provide a valuable animal model to identify TRα1 target genes. To this end, we analysed DNA–TRα1 interactions in vivo using ChIP with an anti-GFP antibody. We validated our system using established TREs from neurogranin and hairless, and by verifying additional TREs from known TRα1 target genes in brain and heart. Moreover, our model system enabled the identification of novel TRα1 target genes such as RNF166 (ring finger protein 166). Our results demonstrate that transgenic mice expressing a tagged nuclear receptor constitute a feasible approach to study receptor–DNA interactions in vivo, circumventing the need for specific antibodies. Models like the TRα1–GFP mice may thus pave the way for genome-wide mapping of nuclear receptor-binding sites, and advance the identification of novel target genes in vivo. PMID:23398480

  17. Photo-convertible tagging for localization and dynamic analyses of low-expression proteins in filamentous fungi.

    PubMed

    Perez-de-Nanclares-Arregi, Elixabet; Etxebeste, Oier

    2014-09-01

    Photo-convertible fluorescent proteins (PCFPs) undergo a dramatic change in their excitation and emission spectra upon irradiation at specific wavelengths, thus rendering a different color. Dendra2 is a commercially available PCFP used to track the redistribution of proteins within cellular compartments, their life-time or interactions. Before photo-conversion Dendra2 exhibits green fluorescence, which becomes red after irradiation with either UV or blue lights. Multiple studies including Dendra2 as a molecular tool have been described in eukaryotes but not in filamentous fungi. Here we present a method to tag low-expression proteins from the filamentous fungus Aspergillus nidulans with Dendra2 and track their cellular dynamics. The regulator of asexual development FlbB was selected as control, a transcription factor that is expressed at low levels and can be used as a marker for the tip and nuclei of vegetative hyphae. This control provided us with a visual way to confirm the functionality of our genomic and plasmid constructs, since a non-functional FlbB protein renders a block in development and a characteristic aconidial phenotype. Our protocol combines standardized cloning and transformation procedures with the use of a mercury lamp microscope to convert and follow Dendra2 within cells. Hence, we present a rapid, simple and inexpensive method that makes tracking analysis of proteins that present technical difficulties to be followed feasible in filamentous fungi. PMID:25014896

  18. Molecular cloning, sequence characterization, and gene expression profiling of a novel water buffalo (Bubalus bubalis) gene, AGPAT6.

    PubMed

    Song, S; Huo, J L; Li, D L; Yuan, Y Y; Yuan, F; Miao, Y W

    2013-01-01

    Several 1-acylglycerol-3-phosphate-O-acyltransferases (AGPATs) can acylate lysophosphatidic acid to produce phosphatidic acid. Of the eight AGPAT isoforms, AGPAT6 is a crucial enzyme for glycerolipids and triacylglycerol biosynthesis in some mammalian tissues. We amplified and identified the complete coding sequence (CDS) of the water buffalo AGPAT6 gene by using the reverse transcription-polymerase chain reaction, based on the conversed sequence information of the cattle or expressed sequence tags of other Bovidae species. This novel gene was deposited in the NCBI database (accession No. JX518941). Sequence analysis revealed that the CDS of this AGPAT6 encodes a 456-amino acid enzyme (molecular mass = 52 kDa; pI = 9.34). Water buffalo AGPAT6 contains three hydrophobic transmembrane regions and a signal 37-amino acid peptide, localized in the cytoplasm. The deduced amino acid sequences share 99, 98, 98, 97, 98, 98, 97 and 95% identity with their homologous sequences from cattle, horse, human, mouse, orangutan, pig, rat, and chicken, respectively. The phylogenetic tree analysis based on the AGPAT6 CDS showed that water buffalo has a closer genetic relationship with cattle than with other species. Tissue expression profile analysis shows that this gene is highly expressed in the mammary gland, moderately expressed in the heart, muscle, liver, and brain; weakly expressed in the pituitary gland, spleen, and lung; and almost silently expressed in the small intestine, skin, kidney, and adipose tissues. Four predicted microRNA target sites are found in the water buffalo AGPAT6 CDS. These results will establish a foundation for further insights into this novel water buffalo gene. PMID:24114207

  19. Overview of affinity tags for protein purification.

    PubMed

    Kimple, Michelle E; Sondek, John

    2004-09-01

    Addition of an affinity tag is a useful method for differentiating recombinant proteins expressed in bacterial and eukaryotic expression systems from the background of total cellular proteins, and for detecting protein-protein interactions. This overview describes the historical basis for the development of affinity tags, affinity tags that are commonly used today, how to choose an appropriate affinity tag for a particular purpose, and several recently developed affinity tag technologies that may prove useful in the near future. PMID:18429272

  20. Overview of affinity tags for protein purification.

    PubMed

    Kimple, Michelle E; Brill, Allison L; Pasker, Renee L

    2013-01-01

    Addition of an affinity tag is a useful method for differentiating recombinant proteins expressed in bacterial and eukaryotic expression systems from the background of total cellular proteins, as well as for detecting protein-protein interactions. This overview describes the historical basis for the development of affinity tags, affinity tags that are commonly used today, how to choose an appropriate affinity tag for a particular purpose, and several recently developed affinity tag technologies that may prove useful in the near future. PMID:24510596

  1. P2A-Fluorophore Tagging of BRAF Tightly Links Expression to Fluorescence In Vivo

    PubMed Central

    McMahon, Martin

    2016-01-01

    The Braf proto-oncogene is a key component of the mitogen-activated protein kinase signaling cascade and is a critical regulator of both normal development and tumorigenesis in a variety of tissues. In order to elucidate BRAF’s differing roles in varying cell types, it is important to understand both the pattern and timing of BRAF expression. Here we report the production of a mouse model that links the expression of Braf with the bright red fluorescent protein, tdTomato. We have utilized a P2A knock-in strategy, ensuring that BRAF and the fluorophore are expressed from the same endogenous promoter and from the same bicistronic mRNA transcript. This mouse model (BrafTOM) shows bright red fluorescence in organs and cell types known to be sensitive to BRAF perturbation. We further show that on a cell-by-cell basis, fluorescence correlates with BRAF protein levels. Finally, we extend the utility of this mouse by demonstrating that the remnant P2A fragment attached to BRAF acts as a suitable epitope for immunoprecipitation and biochemical characterization of BRAF in vivo. PMID:27348307

  2. High-throughput sequencing-based genome-wide identification of microRNAs expressed in developing cotton seeds.

    PubMed

    Wang, YanMei; Ding, Yan; Yu, DingWei; Xue, Wei; Liu, JinYuan

    2015-08-01

    MicroRNAs (miRNAs) have been shown to play critical regulatory roles in gene expression in cotton. Although a large number of miRNAs have been identified in cotton fibers, the functions of miRNAs in seed development remain unexplored. In this study, a small RNA library was constructed from cotton seeds sampled at 15 days post-anthesis (DPA) and was subjected to high-throughput sequencing. A total of 95 known miRNAs were detected to be expressed in cotton seeds. The expression pattern of these identified miRNAs was profiled and 48 known miRNAs were differentially expressed between cotton seeds and fibers at 15 DPA. In addition, 23 novel miRNA candidates were identified in 15-DPA seeds. Putative targets for 21 novel and 87 known miRNAs were successfully predicted and 900 expressed sequence tag (EST) sequences were proposed to be candidate target genes, which are involved in various metabolic and biological processes, suggesting a complex regulatory network in developing cotton seeds. Furthermore, miRNA-mediated cleavage of three important transcripts in vivo was validated by RLM-5' RACE. This study is the first to show the regulatory network of miRNAs that are involved in developing cotton seeds and provides a foundation for future studies on the specific functions of these miRNAs in seed development. PMID:26117827

  3. Increased functional protein expression using nucleotide sequence features enriched in highly expressed genes in zebrafish

    PubMed Central

    Horstick, Eric J.; Jordan, Diana C.; Bergeron, Sadie A.; Tabor, Kathryn M.; Serpe, Mihaela; Feldman, Benjamin; Burgess, Harold A.

    2015-01-01

    Many genetic manipulations are limited by difficulty in obtaining adequate levels of protein expression. Bioinformatic and experimental studies have identified nucleotide sequence features that may increase expression, however it is difficult to assess the relative influence of these features. Zebrafish embryos are rapidly injected with calibrated doses of mRNA, enabling the effects of multiple sequence changes to be compared in vivo. Using RNAseq and microarray data, we identified a set of genes that are highly expressed in zebrafish embryos and systematically analyzed for enrichment of sequence features correlated with levels of protein expression. We then tested enriched features by embryo microinjection and functional tests of multiple protein reporters. Codon selection, releasing factor recognition sequence and specific introns and 3′ untranslated regions each increased protein expression between 1.5- and 3-fold. These results suggested principles for increasing protein yield in zebrafish through biomolecular engineering. We implemented these principles for rational gene design in software for codon selection (CodonZ) and plasmid vectors incorporating the most active non-coding elements. Rational gene design thus significantly boosts expression in zebrafish, and a similar approach will likely elevate expression in other animal models. PMID:25628360

  4. Long Span DNA Paired-End-Tag (DNA-PET) Sequencing Strategy for the Interrogation of Genomic Structural Mutations and Fusion-Point-Guided Reconstruction of Amplicons

    PubMed Central

    Hillmer, Axel M.; Lee, Wah Heng; Li, Guoliang; Teo, Audrey S. M.; Woo, Xing Yi; Zhang, Zhenshui; Chen, Jieqi P.; Poh, Wan Ting; Zawack, Kelson F. B.; Chan, Chee Seng; Leong, See Ting; Neo, Say Chuan; Choi, Poh Sum D.; Gao, Song; Nagarajan, Niranjan; Thoreau, Hervé; Shahab, Atif; Ruan, Xiaoan; Cacheux-Rataboul, Valère; Wei, Chia-Lin; Bourque, Guillaume; Sung, Wing-Kin; Liu, Edison T.; Ruan, Yijun

    2012-01-01

    Structural variations (SVs) contribute significantly to the variability of the human genome and extensive genomic rearrangements are a hallmark of cancer. While genomic DNA paired-end-tag (DNA-PET) sequencing is an attractive approach to identify genomic SVs, the current application of PET sequencing with short insert size DNA can be insufficient for the comprehensive mapping of SVs in low complexity and repeat-rich genomic regions. We employed a recently developed procedure to generate PET sequencing data using large DNA inserts of 10–20 kb and compared their characteristics with short insert (1 kb) libraries for their ability to identify SVs. Our results suggest that although short insert libraries bear an advantage in identifying small deletions, they do not provide significantly better breakpoint resolution. In contrast, large inserts are superior to short inserts in providing higher physical genome coverage for the same sequencing cost and achieve greater sensitivity, in practice, for the identification of several classes of SVs, such as copy number neutral and complex events. Furthermore, our results confirm that large insert libraries allow for the identification of SVs within repetitive sequences, which cannot be spanned by short inserts. This provides a key advantage in studying rearrangements in cancer, and we show how it can be used in a fusion-point-guided-concatenation algorithm to study focally amplified regions in cancer. PMID:23029419

  5. Method for protein tagging in Schizosaccharomyces pombe.

    PubMed

    Petrescu-Dănilă, Elena; Voicu, Pia-Manuela; Poiţelea, M; Stoica, B; Stănescu, Raluca; Rusu, M

    2006-01-01

    Tagging is a useful method for the investigation of proteins. It allows the localization of the proteins in the cell, their purification in order to investigate their function and the determination of their expression. The aim of the present study was to tag the Rad32 protein of fission yeast (which is the homologue of Mre11 protein from humans) at its N-terminus. Rad32p as well as Mre11p are involved in the repair of DNA double strand breaks and in the DNA damage checkpoint. We carried out this tagging using the Cre-loxp recombination system. In a first step, a 2 kb DNA fragment was integrated upstream of the initiating codon of rad32 gene. This fragment encoded the TAP-tag (tandem affinity purification), a loxp site, a selectable marker (sup3-5), an exogenous promoter (nmt1) and a second loxp site, in this sequence. Following transformation of this DNA fragment into S. pombe cells, rad32 was under the control of the artificial promotor, which allows a controlled expression of the gene by thiamine. In a second step, the cells were transformed with a plasmid coding for Cre recombinase, which catalyses the excision of the DNA sequence between the two loxp sites, removing the marker and the artificial promotor. Thus the tag became attached to the rad32 gene upstream of the ATG, placing the gene under the control of its native promotor. The strain thus obtained will be subsequently used for evidencing the tagged protein by Western blotting and then for its purification in order to investigate its function. PMID:17802953

  6. Expressed haplotypes and polymorphisms in prunus species

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Expressed sequence tags (ESTs) represent collective and redundant gene transcripts that contain alleles and paralogs of genes expressed in various genotypes, tissues, developmental stages, and environmental conditions. The former are primarily responsible for distinct individual phenotypes in a segr...

  7. Applying thiouracil (TU)-tagging for mouse transcriptome analysis

    PubMed Central

    Gay, Leslie; Karfilis, Kate V.; Miller, Michael R.; Doe, Chris Q.; Stankunas, Kryn

    2014-01-01

    Transcriptional profiling is a powerful approach to study mouse development, physiology, and disease models. Here, we describe a protocol for mouse thiouracil-tagging (TU-tagging), a transcriptome analysis technology that includes in vivo covalent labeling, purification, and analysis of cell type-specific RNA. TU-tagging enables 1) the isolation of RNA from a given cell population of a complex tissue, avoiding transcriptional changes induced by cell isolation trauma, and 2) the identification of actively transcribed RNAs and not pre-existing transcripts. Therefore, in contrast to other cell-specific transcriptional profiling methods based on purification of tagged ribosomes or nuclei, TU-tagging provides a direct examination of transcriptional regulation. We describe how to: 1) deliver 4-thiouracil to transgenic mice to thio-label cell lineage-specific transcripts, 2) purify TU-tagged RNA and prepare libraries for Illumina sequencing, and 3) follow a straight-forward bioinformatics workflow to identify cell type-enriched or differentially expressed genes. Tissue containing TU-tagged RNA can be obtained in one day, RNA-Seq libraries generated within two days, and, following sequencing, an initial bioinformatics analysis completed in one additional day. PMID:24457332

  8. Serial analysis of gene expression.

    PubMed

    Velculescu, V E; Zhang, L; Vogelstein, B; Kinzler, K W

    1995-10-20

    The characteristics of an organism are determined by the genes expressed within it. A method was developed, called serial analysis of gene expression (SAGE), that allows the quantitative and simultaneous analysis of a large number of transcripts. To demonstrate this strategy, short diagnostic sequence tags were isolated from pancreas, concatenated, and cloned. Manual sequencing of 1000 tags revealed a gene expression pattern characteristic of pancreatic function. New pancreatic transcripts corresponding to novel tags were identified. SAGE should provide a broadly applicable means for the quantitative cataloging and comparison of expressed genes in a variety of normal, developmental, and disease states. PMID:7570003

  9. Batch submission and release of 12,876 EST (expressed sequence tag) nucleotide sequences from peanut leaf tissues

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In the southeastern US, tomato spotted wilt virus disease has become a major limiting factor for many peanut producers, while the control methods are limited. Both early (Cercospora arachidicola) and late (Cercosporidium personatum) leaf spot diseases are among the worst foliar diseases of the culti...

  10. High-yield expression, purification, characterization, and structure determination of tag-free Candida utilis uricase.

    PubMed

    Liu, Xiaojuan; Wen, Mingjie; Li, Jing; Zhai, Fangli; Ruan, Jing; Zhang, Liqing; Li, Shentao

    2011-11-01

    We report the successful high-yield expression of Candida utilis uricase in Escherichia coli and the establishment of an efficient three-step protein purification protocol. The purity of the recombinant protein, which was confirmed to be C. utilis uricase by sodium dodecyl sulfate-polyacrylamide gel electrophoresis and matrix-assisted laser desorption/ionization time-of-flight mass spectrometer analysis, was >98% and the specific activity was 38.4 IU/mg. Crystals of C. utilis uricase were grown at 18°C using 25% polyethylene glycol 3350 as precipitant. Diffraction by the crystals extends to 1.93 Å resolution, and the crystals belong to the space group P2(1)2(1)2(1) with unit cell parameters a = 69.16 Å, b = 139.31 Å, c = 256.33 Å, and α = β = γ = 90°. The crystal structure of C. utilis uricase shares a high similarity with other reported structures of the homologous uricases from other species in protein database, demonstrating that the three-dimensional structure of the protein defines critically to the catalytic activities. PMID:21573940

  11. Expression and Subcellular Distribution of GFP-Tagged Human Tetraspanin Proteins in Saccharomyces cerevisiae

    PubMed Central

    Skaar, Karin; Korza, Henryk J.; Tarry, Michael; Sekyrova, Petra; Högbom, Martin

    2015-01-01

    Tetraspanins are integral membrane proteins that function as organizers of multimolecular complexes and modulate function of associated proteins. Mammalian genomes encode approximately 30 different members of this family and remotely related eukaryotic species also contain conserved tetraspanin homologs. Tetraspanins are involved in a number of fundamental processes such as regulation of cell migration, fusion, immunity and signaling. Moreover, they are implied in numerous pathological states including mental disorders, infectious diseases or cancer. Despite the great interest in tetraspanins, the structural and biochemical basis of their activity is still largely unknown. A major bottleneck lies in the difficulty of obtaining stable and homogeneous protein samples in large quantities. Here we report expression screening of 15 members of the human tetraspanin superfamily and successful protocols for the production in S. cerevisiae of a subset of tetraspanins involved in human cancer development. We have demonstrated the subcellular localization of overexpressed tetraspanin-green fluorescent protein fusion proteins in S. cerevisiae and found that despite being mislocalized, the fusion proteins are not degraded. The recombinantly produced tetraspanins are dispersed within the endoplasmic reticulum membranes or localized in granule-like structures in yeast cells. The recombinantly produced tetraspanins can be extracted from the membrane fraction and purified with detergents or the poly (styrene-co-maleic acid) polymer technique for use in further biochemical or biophysical studies. PMID:26218426

  12. Extending RAD tag analysis to microbial ecology: a comparison between MultiLocus Sequence Typing and 2b-RAD to investigate Listeria monocytogenes genetic structure.

    PubMed

    Pauletto, Marianna; Carraro, Lisa; Babbucci, Massimiliano; Lucchini, Rosaria; Bargelloni, Luca; Cardazzo, Barbara

    2016-05-01

    The advent of next-generation sequencing (NGS) has dramatically changed bacterial typing technologies, increasing our ability to differentiate bacterial isolates. Despite it is now possible to sequence a bacterial genome in a few days and at reasonable costs, most genetic analyses do not require whole-genome sequencing, which also remains impractical for large population samples due to the cost of individual library preparation and bioinformatics. More traditional sequencing approaches, however, such as MultiLocus Sequence Typing (mlst) are quite laborious and time-consuming, especially for large-scale analyses. In this study, a genotyping approach based on restriction site-associated (RAD) tag sequencing, 2b-RAD, was applied to characterize Listeria monocytogenes strains. To verify the feasibility of the method, an in silico analysis was performed on 30 available complete genomes. For the same set of strains, in silico mlst analysis was conducted as well. Subsequently, 2b-RAD and mlst analyses were experimentally carried out on 58 isolates collected from food samples or food-processing sites. The obtained results demonstrate that 2b-RAD predicts mlst types and often provides more detailed information on population structure than mlst. Moreover, the majority of variants differentiating identical sequence type isolates mapped against accessory fragments, thus providing additional information to characterize strains. Although mlst still represents a reliable typing method, large-scale studies on molecular epidemiology and public health, as well as bacterial phylogenetics, population genetics and biosafety could benefit of a low cost and fast turnaround time approach such as the 2b-RAD analysis proposed here. PMID:26613186

  13. Ear tag

    MedlinePlus

    ... are: An inherited tendency to have this facial feature A genetic syndrome that includes having these pits or tags A sinus tract problem (an abnormal connection between the skin and tissue underneath) When to Contact a Medical Professional Your provider will usually find the skin tag ...

  14. Sequence and expression of a halobacterial beta-galactosidase gene.

    PubMed

    Holmes, M L; Dyall-Smith, M L

    2000-04-01

    Studies of gene expression in haloarchaea have been greatly hindered by the lack of a convenient reporter gene. In a previous study, a beta-galactosidase from Haloferax alicantei was purified and several peptide sequences determined. The peptide sequences have now been used to clone the entire beta-galactosidase gene (designated bgaH) along with some flanking chromosomal DNA. The deduced amino acid sequence of BgaH was 665 amino acids (74 kDa) and showed greatest amino acid similarity to members of glycosyl hydrolase family 42 [classification of Henrissat, B., and Bairoch, A. (1993) New families in the classification of glycosyl hydrolases based on amino acid sequence similarities. Biochem J 293: 781-788]. Within this family, BgaH was most similar (42-43% aa identity) to enzymes from extremely thermophilic bacteria such as Thermotoga and Thermus. Family 42 enzymes are only distantly related to the Sulfolobus LacS and Escherichia coli LacZ enzymes (families one and two respectively). Three open reading frames (ORFs) upstream of bgaH were readily identified by database searches as glucose-fructose oxidoreductase, 2-dehydro-3-deoxyphosphogluconate aldolase and 2-keto-3-deoxygluconate kinase, enzymes that are also involved in carbohydrate metabolism. Downstream of bgaH there was an ORF which contained a putative fibronectin III motif. The bgaH gene was engineered into a halobacterial plasmid vector and introduced into Haloferax volcanii, a widely used strain that lacks detectable beta-galactosidase activity. Transformants were shown to express the enzyme; colonies turned blue when sprayed with Xgal and enzyme activity could be easily quantitated using a standard ONPG assay. In an accompanying publication, Patenge et al. (2000) have demonstrated the utility of bgaH as a promoter reporter in Halobacterium salinarum. PMID:10760168

  15. Natural LacI from E. coli yields faster response and higher level of expression than the LVA-tagged LacI.

    PubMed

    Andreassen, Patrick Rosendahl; Fredberg, Sofie; Horan, Mattias; Knudsen, Maria Højberg; Jacobsen, Kirstine; Kjær, Andreas; Krogh, Thøger Jensen; Kronborg, Tina; Mattsson, Nicky Cordua; Schmidt, Sissel Ida; Wille, Heidi; Andersen, Ann

    2014-12-19

    The lac promoter is one of the most commonly used promoters for expression control of recombinant genes in E. coli. In the absence of galactosides, the lac promoter is repressed by its repressor protein LacI. Since the lac promoter is regulated by a repressor, overexpression of LacI is necessary for regulation when the promoter is introduced on a high-copy plasmid. For that purpose, a modified variant of LacI, a LVA-tagged LacI, was submitted to the Registry of Standard Biological Parts and has been used for more than 500 constructs since then. We have found, however, that natural LacI is superior to the LVA-tagged LacI as controller of expression. PMID:25524095

  16. Parallel tagged amplicon sequencing of relatively long PCR products using the Illumina HiSeq platform and transcriptome assembly.

    PubMed

    Feng, Yan-Jie; Liu, Qing-Feng; Chen, Meng-Yun; Liang, Dan; Zhang, Peng

    2016-01-01

    In phylogenetics and population genetics, a large number of loci are often needed to accurately resolve species relationships. Normally, loci are enriched by PCR and sequenced by Sanger sequencing, which is expensive when the number of amplicons is large. Next-generation sequencing (NGS) techniques are increasingly used for parallel amplicon sequencing, which reduces sequencing costs tremendously, but has not reduced preparation costs very much. Moreover, for most current NGS methods, amplicons need to be purified and quantified before sequencing and their lengths are also restricted (normally <700 bp). Here, we describe an approach to sequence pooled amplicons of any length using the Illumina platform. Using this method, amplicons are pooled at equal volume rather than at equal concentration, thus eliminating the laborious purification and quantification steps. We then shear the pooled amplicons, repair the ends, add sample identifying linkers and pool multiple samples prior to Illumina library preparation. Data are then assembled using the transcriptome assembly program trinity, which is optimized to deal with templates of highly varying quantities. We demonstrated the utility of our approach by recovering 93.5% of the target amplicons (size up to 1650 bp) in full length for a 16 taxa × 101 loci project, using ~2.0 GB of Illumina HiSeq paired-end 90-bp data. Overall, we validate a rapid, cost-effective and scalable approach to sequence a large number of targeted loci from a large number of samples that is particularly suitable for both phylogenetics and population genetics studies that require a modest scale of data. PMID:25959587

  17. Expression of epitope-tagged LMW glutenin subunits in the starchy endosperm of transgenic wheat and their incorporation into glutenin polymers.

    PubMed

    Tosi, P; D'Ovidio, R; Napier, J A; Bekes, F; Shewry, P R

    2004-02-01

    The low-molecular-weight (LMW) glutenin subunits are components of the highly cross-linked glutenin polymers that confer viscoelastic properties to gluten and dough. They have both quantitative and qualitative effects on dough quality that may relate to differences in their ability to form the inter-chain disulphide bonds that stabilise the polymers. In order to determine the relationship between dough quality and the amounts and properties of the LMW subunits, we have transformed the pasta wheat cultivars Svevo and Ofanto with three genes encoding proteins, which differ in their numbers or positions of cysteine residues. The transgenes were delivered under control of the high-molecular-weight (HMW) subunit 1Dx5 gene promoter and terminator regions, and the encoded proteins were C-terminally tagged by the introduction of the c-myc epitope. Stable transformants were obtained with both cultivars, and the use of a specific antibody to the c-myc epitope tag allowed the transgene products to be readily detected in the complex mixture of LMW subunits. A range of transgene expression levels was observed. The addition of the epitope tag did not compromise the correct folding of the trangenic subunits and their incorporation into the glutenin polymers. Our results demonstrate that the ability to specifically epitope-tag LMW glutenin transgenes can greatly assist in the elucidation of their individual contributions to the functionality of the complex gluten system. PMID:14574453

  18. Examination of Endogenous Rotund Expression and Function in Developing Drosophila Olfactory System Using CRISPR-Cas9–Mediated Protein Tagging

    PubMed Central

    Li, Qingyun; Barish, Scott; Okuwa, Sumie; Volkan, Pelin C.

    2015-01-01

    The zinc-finger protein Rotund (Rn) plays a critical role in controlling the development of the fly olfactory system. However, little is known about its molecular function in vivo. Here, we added protein tags to the rn locus using CRISPR-Cas9 technology in Drosophila to investigate its subcellular localization and the genes that it regulates . We previously used a reporter construct to show that rn is expressed in a subset of olfactory receptor neuron (ORN) precursors and it is required for the diversification of ORN fates. Here, we show that tagged endogenous Rn protein is functional based on the analysis of ORN phenotypes. Using this method, we also mapped the expression pattern of the endogenous isoform-specific tags in vivo with increased precision. Comparison of the Rn expression pattern from this study with previously published results using GAL4 reporters showed that Rn is mainly present in early steps in antennal disc patterning, but not in pupal stages when ORNs are born. Finally, using chromatin immunoprecipitation, we showed a direct binding of Rotund to a previously identified regulatory element upstream of the bric-a-brac gene locus in the developing antennal disc. PMID:26497147

  19. Use of the myosin motor domain as large-affinity tag for the expression and purification of proteins in Dictyostelium discoideum.

    PubMed

    Kollmar, Martin

    2006-08-15

    The cellular slime mold Dictyostelium discoideum is increasingly be used for the overexpression of proteins. Dictyostelium is amenable to classical and molecular genetic approaches and can easily be grown in large quantities. It contains a variety of chaperones and folding enzymes, and is able to perform all kinds of post-translational protein modifications. Here, new expression vectors are presented that have been designed for the production of proteins in large quantities for biochemical and structural studies. The expression cassettes of the most successful vectors are based on a tandem affinity purification tag consisting of an octahistidine tag followed by the myosin motor domain tag. The myosin motor domain not only strongly enhances the production of fused proteins but is also used for a fast affinity purification step through its ATP-dependent binding to actin. The applicability of the new system has been demonstrated for the expression and purification of subunits of the dynein-dynactin motor protein complex from different species. PMID:16516959

  20. Facile expression and purification of the antimicrobial peptide histatin 1 with a cleavable self-aggregating tag (cSAT) in Escherichia coli.

    PubMed

    Xing, Lei; Xu, Wanghui; Zhou, Bihong; Chen, Yilu; Lin, Zhanglin

    2013-04-01

    Human histatin 1 (Hst1), a member of the histatin family, possesses antimicrobial properties. In this study, we applied a previously developed cleavable self-aggregating tag (cSAT) for the expression and purification of histatin 1 to demonstrate its utility for peptide expression and purification. The tag consists of a self-cleavable intein and a self-assembling peptide ELK16 (I-ELK16). First, an active insoluble aggregate of the recombinant histatin 1-Mxe GyrA intein-ELK16 (Hst1-I-ELK16) fusion protein was produced with a yield of 28.9 μg/mg wet cell pellet. The thiol reagent dithiothreitol (DTT) was then used to induce the intein-mediated cleavage and peptide release into the soluble fraction with a yield of 2.06 μg/mg wet cell pellet and a purity of 70%. The peptide was further purified by high performance liquid chromatography. These results were comparable to the yield and purity achieved when the more conventional glutathione transferase (GST) tag was used. The antimicrobial activities of this recombinant histatin 1 were confirmed against three Candida strains. This cSAT technique offers considerable advantages in terms of its simplicity and speed, eliminating the need for an exogenous protease, and reducing the number of chromatography purification steps. This technique should also be useful for the expression and purification of other AMPs. PMID:23403143

  1. A semi-automated large-scale process for the production of recombinant tagged proteins in the Baculovirus expression system.

    PubMed

    Schlaeppi, Jean-Marc; Henke, Mario; Mahnke, Marion; Hartmann, Steffen; Schmitz, Rita; Pouliquen, Yann; Kerins, Brendan; Weber, Eric; Kolbinger, Frank; Kocher, Hans P

    2006-12-01

    The efficient preparation of recombinant proteins at the lab-scale level is essential for drug discovery, in particular for structural biology, protein interaction studies and drug screening. The Baculovirus insect-cell expression system is one of the most widely applied and highly successful systems for production of recombinant functional proteins. However, the use of eukaryotic cells as host organisms and the multi-step protocol required for the generation of sufficient virus and protein has limited its adaptation to industrialized high-throughput operation. We have developed an integrated large-scale process for continuous and partially automated protein production in the Baculovirus system. The instrumental platform includes parallel insect-cell fermentation in 10L BioWave reactors, cell harvesting and lysis by tangential flow filtration (TFF) using two custom-made filtration units and automated purification by multi-dimensional chromatography. The use of disposable materials (bags, filters and tubing), automated cleaning cycles and column regeneration, prevent any cross-contamination between runs. The preparation of the clear cell lysate by sequential TFF takes less than 2 h and represents considerable time saving compared to standard cell harvesting and lysis by sonication and ultra-centrifugation. The process has been validated with 41 His-tagged proteins with molecular weights ranging from 20 to 160 kDa. These proteins represented several families, and included 23 members of the deubiquitinating enzyme (DUB) family. Each down-stream unit can process four proteins in less than 24 h with final yields between 1 and 100 mg, and purities between 50 and 95%. PMID:16904904

  2. Identification and characterization of expressed resistance gene analogs (RGSs) from peanut (Arachis hypogaea L.) expressed sequence tags (ESTs)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Cultivated peanut (Arachis hypogaea L.) is an important food legume grown worldwide for providing edible oil and protein. However, due to scarcity of genetic diversity, peanut is very vulnerable to a variety of pathogens, such as rust (Puccinia arachidis Speg.), early leaf spot (Cercospora arachidic...

  3. Expression of the cell adhesion proteins BEN/SC1/DM-GRASP and TAG-1 defines early steps of axonogenesis in the human spinal cord.

    PubMed

    Karagogeos, D; Pourquié, C; Kyriakopoulou, K; Tavian, M; Stallcup, W; Péault, B; Pourquié, O

    1997-03-17

    We have studied the expression pattern of two cell adhesion proteins of the immunoglobin (Ig) superfamily, BEN/SC1/DM-GRASP (BEN) and the transient axonal glycoprotein TAG-1, during the development of the human nervous system. This study was performed by immunocytochemistry on sections of human embryos ranging from 4 to 13 weeks postconception. The overall distribution of the two proteins during development is very similar to that reported in other vertebrate species, but several important differences have been observed. Both proteins exhibit a transient expression on selected neuronal populations, which include the motor and the sensory neurons. In addition, BEN was also detected on virtually all neurons derived from the neural crest as well as in nonneuronal tissues. A major difference of expression with the chick embryo is that, in the motor neurons, BEN expression was not observed at early stages of development, thus arguing against a role of this molecule in pathfinding and fasciculation. BEN was observed to be restricted to subsets of motor neurons, such as the medial column at the upper limb level. Expression was also detected in a laterodorsal population of the ventral horn cells, which are likely to correspond to migrating preganglionic neurons that originate from the motor pool at the thoracic level. TAG-1 was found on commissural neurons and weakly on the sympathetic neurons; it was also detected on restricted nonneuronal populations. In addition, we observed TAG-1 expression in fibers that could correspond either to subsets of dorsal root ganglia (DRGs) central afferences (including the Ia fibers) or to the axons of association interneurons and in scattered motoneurons likely to correspond either to preganglionic neurons, to gamma-motoneurons, or to late-born motoneurons. Therefore, our results indicate that the molecular strategies used to establish the axonal scaffolding of the nervous system in humans are extremely conserved among the different

  4. Bacterial diversity assessment of pristine mangrove microbial community from Dhulibhashani, Sundarbans using 16S rRNA gene tag sequencing.

    PubMed

    Basak, Pijush; Pramanik, Arnab; Sengupta, Sohan; Nag, Sudip; Bhattacharyya, Anish; Roy, Debojyoti; Pattanayak, Rudradip; Ghosh, Abhrajyoti; Chattopadhyay, Dhrubajyoti; Bhattacharyya, Maitree

    2016-03-01

    The global knowledge of microbial diversity and function in Sundarbans ecosystem is still scarce, despite global advancement in understanding the microbial diversity. In the present study, we have analyzed the diversity and distribution of bacteria in the tropical mangrove sediments of Sundarbans using 16S rRNA gene amplicon sequencing. Metagenome is comprised of 1,53,926 sequences with 108.8 Mbp data and with 55 ± 2% G + C content. Metagenome sequence data are available at NCBI under the Bioproject database with accession no. PRJNA245459. Bacterial community metagenome sequences were analyzed by MG-RAST software representing the presence of 56,547 species belonging to 44 different phyla. The taxonomic analysis revealed the dominance of phyla Proteobacteria within our dataset. Further taxonomic analysis revealed abundance of Bacteroidetes, Acidobactreia, Firmicutes, Actinobacteria, Nitrospirae, Cyanobacteria, Planctomycetes and Fusobacteria group as the predominant bacterial assemblages in this largely pristine mangrove habitat. The distribution of different community datasets obtained from four sediment samples originated from one sampling station at two different depths providing better understanding of the sediment bacterial diversity and its relationship to the ecosystem dynamics of this pristine mangrove sediment of Dhulibhashani in, Sundarbans. PMID:26981367

  5. Bacterial diversity assessment of pristine mangrove microbial community from Dhulibhashani, Sundarbans using 16S rRNA gene tag sequencing

    PubMed Central

    Basak, Pijush; Pramanik, Arnab; Sengupta, Sohan; Nag, Sudip; Bhattacharyya, Anish; Roy, Debojyoti; Pattanayak, Rudradip; Ghosh, Abhrajyoti; Chattopadhyay, Dhrubajyoti; Bhattacharyya, Maitree

    2015-01-01

    The global knowledge of microbial diversity and function in Sundarbans ecosystem is still scarce, despite global advancement in understanding the microbial diversity. In the present study, we have analyzed the diversity and distribution of bacteria in the tropical mangrove sediments of Sundarbans using 16S rRNA gene amplicon sequencing. Metagenome is comprised of 1,53,926 sequences with 108.8 Mbp data and with 55 ± 2% G + C content. Metagenome sequence data are available at NCBI under the Bioproject database with accession no. PRJNA245459. Bacterial community metagenome sequences were analyzed by MG-RAST software representing the presence of 56,547 species belonging to 44 different phyla. The taxonomic analysis revealed the dominance of phyla Proteobacteria within our dataset. Further taxonomic analysis revealed abundance of Bacteroidetes, Acidobactreia, Firmicutes, Actinobacteria, Nitrospirae, Cyanobacteria, Planctomycetes and Fusobacteria group as the predominant bacterial assemblages in this largely pristine mangrove habitat. The distribution of different community datasets obtained from four sediment samples originated from one sampling station at two different depths providing better understanding of the sediment bacterial diversity and its relationship to the ecosystem dynamics of this pristine mangrove sediment of Dhulibhashani in, Sundarbans. PMID:26981367

  6. Expression platforms for producing eukaryotic proteins: a comparison of E. coli cell-based and wheat germ cell-free synthesis, affinity and solubility tags, and cloning strategies.

    PubMed

    Aceti, David J; Bingman, Craig A; Wrobel, Russell L; Frederick, Ronnie O; Makino, Shin-Ichi; Nichols, Karl W; Sahu, Sarata C; Bergeman, Lai F; Blommel, Paul G; Cornilescu, Claudia C; Gromek, Katarzyna A; Seder, Kory D; Hwang, Soyoon; Primm, John G; Sabat, Grzegorz; Vojtik, Frank C; Volkman, Brian F; Zolnai, Zsolt; Phillips, George N; Markley, John L; Fox, Brian G

    2015-06-01

    Vectors designed for protein production in Escherichia coli and by wheat germ cell-free translation were tested using 21 well-characterized eukaryotic proteins chosen to serve as controls within the context of a structural genomics pipeline. The controls were carried through cloning, small-scale expression trials, large-scale growth or synthesis, and purification. Successfully purified proteins were also subjected to either crystallization trials or (1)H-(15)N HSQC NMR analyses. Experiments evaluated: (1) the relative efficacy of restriction/ligation and recombinational cloning systems; (2) the value of maltose-binding protein (MBP) as a solubility enhancement tag; (3) the consequences of in vivo proteolysis of the MBP fusion as an alternative to post-purification proteolysis; (4) the effect of the level of LacI repressor on the yields of protein obtained from E. coli using autoinduction; (5) the consequences of removing the His tag from proteins produced by the cell-free system; and (6) the comparative performance of E. coli cells or wheat germ cell-free translation. Optimal promoter/repressor and fusion tag configurations for each expression system are discussed. PMID:25854603

  7. Expression dynamics and ultrastructural localization of epitope-tagged Abutilon mosaic virus nuclear shuttle and movement proteins in Nicotiana benthamiana cells

    SciTech Connect

    Kleinow, Tatjana; Tanwir, Fariha; Kocher, Cornelia; Krenz, Bjoern; Wege, Christina; Jeske, Holger

    2009-09-01

    The geminivirus Abutilon mosaic virus (AbMV) encodes two proteins which are essential for viral spread within plants. The nuclear shuttle protein (NSP) transfers viral DNA between the nucleus and cytoplasm, whereas the movement protein (MP) facilitates transport between cells through plasmodesmata and long-distance via phloem. An inducible overexpression system for epitope-tagged NSP and MP in plants yielded unprecedented amounts of both proteins. Western blots revealed extensive posttranslational modification and truncation for MP, but not for NSP. Ultrastructural examination of Nicotiana benthamiana tissues showed characteristic nucleopathic alterations, including fibrillar rings, when epitope-tagged NSP and MP were simultaneously expressed in leaves locally infected with an AbMV DNA A in which the coat protein gene was replaced by a green fluorescent protein encoding gene. Immunogold labelling localized NSP in the nucleoplasm and in the fibrillar rings. MP appeared at the cell periphery, probably the plasma membrane, and plasmodesmata.

  8. Cloning, sequence, and expression of bovine interleukin 2.

    PubMed Central

    Cerretti, D P; McKereghan, K; Larsen, A; Cantrell, M A; Anderson, D; Gillis, S; Cosman, D; Baker, P E

    1986-01-01

    Interleukin 2 (IL-2) cDNA clones have been isolated from both human and murine sources. We report here the isolation of a cDNA clone encoding bovine IL-2. This was accomplished by screening a cDNA library constructed from lectin-stimulated bovine lymph node cells, using a human IL-2 probe. Bovine IL-2 is composed of 155 amino acids and has a predicted molecular weight of 19,555. Alignment of the amino acid sequence with human IL-2 indicates that mature bovine IL-2 is composed of 135 amino acids and has a predicted molecular weight of 15,452. It has an amino acid homology of 65% with human IL-2 and 50% with murine IL-2. Bovine IL-2 is unique among IL-2 homologs in that it has a single N-linked glycosylation site. Biologically active bovine IL-2 was synthesized in an Escherichia coli expression system. Images PMID:3517854

  9. Sequence and expression of ferredoxin mRNA in barley

    SciTech Connect

    Zielinski, R.; Funder, P.M.; Ling, V. )

    1990-05-01

    We have isolated and structurally characterized a full-length cDNA clone encoding ferredoxin from a {lambda}gt10 cDNA library prepared from barley leaf mRNA. The ferredoxin clone (pBFD-1) was fused head-to-head with a partial-length cDNA clone encoding calmodulin, and was fortuitously isolated by screening the library with a calmodulin-specific oligonucleotide probe. The mRNA sequence from which pBFD-1 was derived is expressed exclusively in the leaf tissues of 7-d old barley seedlings. Barley pre-ferredoxin has a predicted size of 15.3 kDal, of which 4.6 kDal are accounted for by the transit peptide. The polypeptide encoded by pBFD-1 is identical to wheat ferredoxin, and shares slightly more amino acid sequence similarity with spinach ferredoxin I than with ferredoxin II. Ferredoxin mRNA levels are rapidly increased 10-fold by white light in etiolated barley leaves.

  10. The dynamics of the bacterial diversity in the redox transition and anoxic zones of the Cariaco Basin assessed by parallel tag sequencing.

    PubMed

    Rodriguez-Mora, Maria J; Scranton, Mary I; Taylor, Gordon T; Chistoserdov, Andrei Y

    2015-09-01

    Massively parallel tag sequencing was applied to describe the bacterial diversity in the redox transition and anoxic zones of the Cariaco Basin. In total, 14 samples from the Cariaco Basin were collected over a period of eight years from two stations. A total of 244 357 unique bacterial V6 amplicons were sequenced. The total number of operational taxonomic units (OTUs) found in this study was 4692, with a range of 511-1491 OTUs per sample. Approximately 95% of the OTUs found in the redox transition zone and anoxic layers of Cariaco are represented by less than 50 amplicons suggesting that only about 5% of the bacterial OTUs are responsible for the bulk of the microbial processes in the basin redox transition and anoxic zones. The same dominant OTUs were observed across all eight years of sampling although periodic fluctuations in their proportion were apparent. No distinctive differences were observed between the bacterial communities from the redox transition and anoxic layers of the Cariaco Basin water column. The largest proportion of amplicons belongs to Gammaproteobacteria represented mostly by sulfide oxidizers, followed by Marine Group A (originally described as SAR406; Gordon and Giovannoni 1996), a group of uncultured bacteria hypothesized to be involved in metal reduction, and sulfate-reducing Deltaproteobacteria. Gammaproteobacteria, Deltaproteobacteria and Marine Group A make up 67-90% of all V6 amplicons sequenced in this study. This strongly suggests that the basin's microbial communities are actively involved in the sulfur-related metabolism and coupling of the sulfur and carbon cycles. According to detrended canonical correspondence analysis, ecological factors such as chemoautotrophy, nitrate and oxidized and reduced sulfur compounds influence the structuring and distribution of the Cariaco microbial communities. PMID:26209697

  11. Analysis of bacterial and archaeal diversity in coastal microbial mats using massive parallel 16S rRNA gene tag sequencing

    PubMed Central

    Bolhuis, Henk; Stal, Lucas J

    2011-01-01

    Coastal microbial mats are small-scale and largely closed ecosystems in which a plethora of different functional groups of microorganisms are responsible for the biogeochemical cycling of the elements. Coastal microbial mats play an important role in coastal protection and morphodynamics through stabilization of the sediments and by initiating the development of salt-marshes. Little is known about the bacterial and especially archaeal diversity and how it contributes to the ecological functioning of coastal microbial mats. Here, we analyzed three different types of coastal microbial mats that are located along a tidal gradient and can be characterized as marine (ST2), brackish (ST3) and freshwater (ST3) systems. The mats were sampled during three different seasons and subjected to massive parallel tag sequencing of the V6 region of the 16S rRNA genes of Bacteria and Archaea. Sequence analysis revealed that the mats are among the most diverse marine ecosystems studied so far and consist of several novel taxonomic levels ranging from classes to species. The diversity between the different mat types was far more pronounced than the changes between the different seasons at one location. The archaeal community for these mats have not been studied before and revealed a strong reaction on a short period of draught during summer resulting in a massive increase in halobacterial sequences, whereas the bacterial community was barely affected. We concluded that the community composition and the microbial diversity were intrinsic of the mat type and depend on the location along the tidal gradient indicating a relation with salinity. PMID:21544102

  12. Transcriptome profiling and digital gene expression by deep sequencing in early somatic embryogenesis of endangered medicinal Eleutherococcus senticosus Maxim.

    PubMed

    Tao, Lei; Zhao, Yue; Wu, Ying; Wang, Qiuyu; Yuan, Hongmei; Zhao, Lijuan; Guo, Wendong; You, Xiangling

    2016-03-01

    Somatic embryogenesis (SE) has been studied as a model system to understand molecular events in physiology, biochemistry, and cytology during plant embryo development. In particular, it is exceedingly difficult to access the morphological and early regulatory events in zygotic embryos. To understand the molecular mechanisms regulating early SE in Eleutherococcus senticosus Maxim., we used high-throughput RNA-Seq technology to investigate its transcriptome. We obtained 58,327,688 reads, which were assembled into 75,803 unique unigenes. To better understand their functions, the unigenes were annotated using the Clusters of Orthologous Groups, Gene Ontology, and Kyoto Encyclopedia of Genes and Genomes databases. Digital gene expression libraries revealed differences in gene expression profiles at different developmental stages (embryogenic callus, yellow embryogenic callus, global embryo). We obtained a sequencing depth of >5.6 million tags per sample and identified many differentially expressed genes at various stages of SE. The initiation of SE affected gene expression in many KEGG pathways, but predominantly that in metabolic pathways, biosynthesis of secondary metabolites, and plant hormone signal transduction. This information on the changes in the multiple pathways related to SE induction in E. senticosus Maxim. embryogenic tissue will contribute to a more comprehensive understanding of the mechanisms involved in early SE. Additionally, the differentially expressed genes may act as molecular markers and could play very important roles in the early stage of SE. The results are a comprehensive molecular biology resource for investigating SE of E. senticosus Maxim. PMID:26657036

  13. Insilico analysis of three different tag polypeptides with dual roles in scFv antibodies.

    PubMed

    Mohammadi, Mozafar; Nejatollahi, Foroogh; Sakhteman, Amirhossein; Zarei, Neda

    2016-08-01

    Single chain fragment variable (scFv) antibodies are composed of variable heavy (VH) and variable light (VL) domains that are joined by a polypeptide linker. Typically, [(Gly4Ser) n] sequence is used as a linker to retain the integrity of the antigen-binding domain. Due to its low immunogenicity, this sequence cannot be used as a tag for scFv detection and purification. Several evidences have shown that the addition of an N or C-terminal tag for scFv detection and purification will result in the decreased expression and binding capacity of this antibody fragment. In this study, we substituted the traditional linker (GGGGS) with His-tag, C-myc or E-tag sequences through molecular modeling. Stability and integrity of all models were assessed by molecular dynamic (MD) simulation. Based on MD simulation analysis, the model containing E-tag sequence as a linker indicated more stability compared to other molecules. The results suggest that E-tag not only can be substituted for the traditional linker, also eliminates the necessity of using additional tag for scFv detection and purification. PMID:27113782

  14. Active populations of rare microbes in oceanic environments as revealed by bromodeoxyuridine incorporation and 454 tag sequencing.

    PubMed

    Hamasaki, Koji; Taniguchi, Akito; Tada, Yuya; Kaneko, Ryo; Miki, Takeshi

    2016-02-01

    The "rare biosphere" consisting of thousands of low-abundance microbial taxa is important as a seed bank or a gene pool to maintain microbial functional redundancy and robustness of the ecosystem. Here we investigated contemporaneous growth of diverse microbial taxa including rare taxa and determined their variability in environmentally distinctive locations along a north-south transect in the Pacific Ocean in order to assess which taxa were actively growing and how environmental factors influenced bacterial community structures. A bromodeoxyuridine-labeling technique in combination with PCR amplicon pyrosequencing of 16S rRNA genes gave 215-793 OTUs from 1200 to 3500 unique sequences in the total communities and 175-299 OTUs nearly 860 to 1800 sequences in the active communities. Unexpectedly, many of the active OTUs were not detected in the total fractions. Among these active but rare OTUs, some taxa (2-4% of rare OTUs) showed much higher abundance (>0.10% of total reads) in the active fraction than in the total fraction, suggesting that their contribution to bacterial community productivity or growth was much larger than that expected from their standing stocks at each location. An ordination plot by the principal component analysis presented that bacterial community compositions among 4 sampling locations and between total and active fractions were distinctive with each other. A redundancy analysis revealed that the variability of community compositions significantly correlated to seawater temperature and dissolved oxygen concentration. Also, a variation partitioning analysis showed that the environmental factors explained 49% of the variability of community compositions and the distance only explained 4.0% of its variability. These results implied very dynamic change of community structures due to environmental filtering. The active bacterial populations are more diverse and spread further in rare biosphere than we have ever seen. This study implied that rare

  15. Cloning, expression and sequencing of Helicobacter felis urease genes.

    PubMed

    Ferrero, R L; Labigne, A

    1993-07-01

    Urease genes from Helicobacter felis were cloned and expressed in Escherichia coli cells. A genomic bank of Sau3A-digested H. felis chromosomal DNA was created using a cosmid vector. Cosmid clones were screened for urease activity following subculture on a nitrogen-limiting medium. Subcloning of DNA from an urease-positive cosmid clone led to the construction of pILL205 (9.5 kb) which conferred a urease activity of 1.2 +/- 0.5 mumole urea min-1 mg-1 bacterial protein to E. coli HB101 bacteria grown on a nitrogen-limiting medium. Random mutagenesis using a MiniTn3-Km transposable element permitted the identification of three DNA regions on pILL205 which were necessary for the expression of an urease-positive phenotype in E. coli clones. To localize the putative structural genes of H. felis on pILL205, extracts of clones harbouring the mutated copies of the plasmid were analysed by Western blotting with anti-H. felis rabbit serum. One mutant clone did not synthesize the putative UreB subunit of H. felis urease and it was postulated that the transposable element had disrupted the corresponding structural gene. By sequencing the DNA region adjacent to the transposon insertion site two open reading frames, designated ureA and ureB, were identified. The polypeptides encoded by these genes had calculated molecular masses of 26,074 and 61,663 Da, respectively, and shared 73.5% and 88.2% identity with the corresponding gene products of Helicobacter pylori urease. PMID:8412683

  16. Identification of Tuber borchii Vittad. mycelium proteins separated by two-dimensional polyacrylamide gel electrophoresis using amino acid analysis and sequence tagging.

    PubMed

    Vallorani, L; Bernardini, F; Sacconi, C; Pierleoni, R; Pieretti, B; Piccoli, G; Buffalini, M; Stocchi, V

    2000-11-01

    This paper reports the first results in the proteome analysis of Tuber borchii Vittad. mycelium, an ectomycorrhizal fungus poorly defined genetically, but known for its generation of edible fruit bodies known as white truffles. Employing isoelectric focusing on immobilized pH gradients, followed by sodium dodecyl sulfate-polyacrylamide gel electrophoresis, we obtained an electropherogram presenting over 800 spots within the window of isoelectric points (pI) 3.5-9 and a molecular mass of 10-200 kDa. Different reducing agents were tested in the sample preparation buffers, and the standard lysis buffer plus 2% w/v polyvinylpolypyrrolidone allowed the best solubilization and resolution of the proteins. The T. borchii proteins separated in micropreparative gels were electroblotted onto polyvinylidene difluoride membranes and visualized by Coomassie staining. Twenty-three proteins were excised and analyzed by the combination of amino acid and N-terminal analysis. One protein was identified by matching its amino acid composition, estimated isoelectric point and molecular mass against the SWISS-PROT and EMBL databases. Four spots were successfully tagged by Edman microsequencing but no homologous sequences were found in databases. PMID:11271490

  17. A high-density genetic recombination map of sequence-tagged sites for sorghum, as a framework for comparative structural and evolutionary genomics of tropical grains and grasses.

    PubMed Central

    Bowers, John E; Abbey, Colette; Anderson, Sharon; Chang, Charlene; Draye, Xavier; Hoppe, Alison H; Jessup, Russell; Lemke, Cornelia; Lennington, Jennifer; Li, Zhikang; Lin, Yann-Rong; Liu, Sin-Chieh; Luo, Lijun; Marler, Barry S; Ming, Reiguang; Mitchell, Sharon E; Qiang, Dou; Reischmann, Kim; Schulze, Stefan R; Skinner, D Neil; Wang, Yue-Wen; Kresovich, Stephen; Schertz, Keith F; Paterson, Andrew H

    2003-01-01

    We report a genetic recombination map for Sorghum of 2512 loci spaced at average 0.4 cM ( approximately 300 kb) intervals based on 2050 RFLP probes, including 865 heterologous probes that foster comparative genomics of Saccharum (sugarcane), Zea (maize), Oryza (rice), Pennisetum (millet, buffelgrass), the Triticeae (wheat, barley, oat, rye), and Arabidopsis. Mapped loci identify 61.5% of the recombination events in this progeny set and reveal strong positive crossover interference acting across intervals of sequence-tagged sites will foster many structural, functional and evolutionary genomic studies in major food, feed, and biomass crops. PMID:14504243

  18. The Xenopus laevis Atg4B Protease: Insights into Substrate Recognition and Application for Tag Removal from Proteins Expressed in Pro- and Eukaryotic Hosts

    PubMed Central

    Frey, Steffen; Görlich, Dirk

    2015-01-01

    During autophagy, members of the ubiquitin-like Atg8 protein family get conjugated to phosphatidylethanolamine and act as protein-recruiting scaffolds on the autophagosomal membrane. The Atg4 protease produces mature Atg8 from C-terminally extended precursors and deconjugates lipid-bound Atg8. We now found that Xenopus laevis Atg4B (xAtg4B) is ideally suited for proteolytic removal of N-terminal tags from recombinant proteins. To implement this strategy, an Atg8 cleavage module is inserted in between tag and target protein. An optimized xAtg4B protease fragment includes the so far uncharacterized C-terminus, which crucially contributes to recognition of the Xenopus Atg8 homologs xLC3B and xGATE16. xAtg4B-mediated tag cleavage is very robust in solution or on-column, efficient at 4°C and orthogonal to TEV protease and the recently introduced proteases bdSENP1, bdNEDP1 and xUsp2. Importantly, xLC3B fusions are stable in wheat germ extract or when expressed in Saccharomyces cerevisiae, but cleavable by xAtg4B during or following purification. We also found that fusions to the bdNEDP1 substrate bdNEDD8 are stable in S. cerevisiae. In combination, or findings now provide a system, where proteins and complexes fused to xLC3B or bdNEDD8 can be expressed in a eukaryotic host and purified by successive affinity capture and proteolytic release steps. PMID:25923686

  19. The Xenopus laevis Atg4B Protease: Insights into Substrate Recognition and Application for Tag Removal from Proteins Expressed in Pro- and Eukaryotic Hosts.

    PubMed

    Frey, Steffen; Görlich, Dirk

    2015-01-01

    During autophagy, members of the ubiquitin-like Atg8 protein family get conjugated to phosphatidylethanolamine and act as protein-recruiting scaffolds on the autophagosomal membrane. The Atg4 protease produces mature Atg8 from C-terminally extended precursors and deconjugates lipid-bound Atg8. We now found that Xenopus laevis Atg4B (xAtg4B) is ideally suited for proteolytic removal of N-terminal tags from recombinant proteins. To implement this strategy, an Atg8 cleavage module is inserted in between tag and target protein. An optimized xAtg4B protease fragment includes the so far uncharacterized C-terminus, which crucially contributes to recognition of the Xenopus Atg8 homologs xLC3B and xGATE16. xAtg4B-mediated tag cleavage is very robust in solution or on-column, efficient at 4°C and orthogonal to TEV protease and the recently introduced proteases bdSENP1, bdNEDP1 and xUsp2. Importantly, xLC3B fusions are stable in wheat germ extract or when expressed in Saccharomyces cerevisiae, but cleavable by xAtg4B during or following purification. We also found that fusions to the bdNEDP1 substrate bdNEDD8 are stable in S. cerevisiae. In combination, or findings now provide a system, where proteins and complexes fused to xLC3B or bdNEDD8 can be expressed in a eukaryotic host and purified by successive affinity capture and proteolytic release steps. PMID:25923686

  20. A Review of Recommendations for Sequencing Receptive and Expressive Language Instruction

    ERIC Educational Resources Information Center

    Petursdottir, Anna Ingeborg; Carr, James E.

    2011-01-01

    We review recommendations for sequencing instruction in receptive and expressive language objectives in early and intensive behavioral intervention (EIBI) programs. Several books recommend completing receptive protocols before introducing corresponding expressive protocols. However, this recommendation has little empirical support, and some…

  1. Expressed Sequence Reference Standards for Evaluating Stage-specific Gene Expression in Southern Green Lacewings, Chrysoperla rufilabris

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Five developmental stages of Chrysoperla rufilabris were tested using nine primer pairs. Three sequences were highly expressed at all life stages and six were differentially expressed. These primer pairs may be used as standards to quantitate functional gene expression associated with physiological ...

  2. Substantial prevalence of microdeletions of the Y-chromosome in infertile men with idiopathic azoospermia and oligozoospermia detected using a sequence-tagged site-based mapping strategy

    SciTech Connect

    Najmabadi, H.; Huang, V.; Bhasin, D.

    1996-04-01

    Genes on the long arm of Y (Yq), particularly within interval 6, are believed to play a critical role in human spermatogenesis. Cytogenetically detectable deletions of this region are associated with azoospermia in men, but are relatively uncommon. The objective of this study was to validate a sequence-tagged site (STS)-mapping strategy for the detection of Yq microdeletions and to use this method to determine the proportion of men with idiopathic azoospermia or severe oligozoospermia who carry microdeletions in Yq. STS mapping of a sufficiently large sample of infertile men should also help further localize the putative gene(s) involved in the pathogenesis of male infertility. Genomic DNA was extracted from peripheral leukocytes of 16 normal fertile men, 7 normal fertile women, 60 infertile men, and 15 patients with the X-linked disorder, ichthyosis. PCR primers were synthesized for 26 STSs that span Yq interval 6. None of the 16 normal men of known fertility had microdeletions. Seven normal fertile women failed to amplify any of the 26 STSs, providing evidence of their Y specificity. No microdeletions were detected in any of the 15 patients with ichthyosis. Of the 60 infertile men typed with 26 STSs, 11 (18%; 10 azoospermic and 1 oligozoospermic) failed to amplify 1 or more STS. Interestingly, 4 of the 11 patients had microdeletions in a region that is outside the Yq region from which the DAZ (deleted in azoospermia gene region) gene was cloned. In an additional 3 patients, microdeletions were present both inside and outside the DAZ region. The physical locations of these microdeletions provide further support for the concept that a gene(s) on Yq deletion interval 6 plays an important role in spermatogenesis. The presence of deletions that do not overlap with the DAZ region suggests that genes other than the DAZ gene may also be implicated in the pathogenesis of some subsets of male infertility. 48 refs., 2 figs., 2 tabs.

  3. Targeted RNA Sequencing Assay to Characterize Gene Expression and Genomic Alterations.

    PubMed

    Martin, Dorrelyn P; Miya, Jharna; Reeser, Julie W; Roychowdhury, Sameek

    2016-01-01

    RNA sequencing (RNAseq) is a versatile method that can be utilized to detect and characterize gene expression, mutations, gene fusions, and noncoding RNAs. Standard RNAseq requires 30 - 100 million sequencing reads and can include multiple RNA products such as mRNA and noncoding RNAs. We demonstrate how targeted RNAseq (capture) permits a focused study on selected RNA products using a desktop sequencer. RNAseq capture can characterize unannotated, low, or transiently expressed transcripts that may otherwise be missed using traditional RNAseq methods. Here we describe the extraction of RNA from cell lines, ribosomal RNA depletion, cDNA synthesis, preparation of barcoded libraries, hybridization and capture of targeted transcripts and multiplex sequencing on a desktop sequencer. We also outline the computational analysis pipeline, which includes quality control assessment, alignment, fusion detection, gene expression quantification and identification of single nucleotide variants. This assay allows for targeted transcript sequencing to characterize gene expression, gene fusions, and mutations. PMID:27585245

  4. Construction of a Genetic Linkage Map Based on Amplified Fragment Length Polymorphism Markers and Development of Sequence-Tagged Site Markers for Marker-Assisted Selection of the Sporeless Trait in the Oyster Mushroom (Pleurotus eryngii)

    PubMed Central

    Ueda, Jun; Obatake, Yasushi; Murakami, Shigeyuki; Fukumasa, Yukitaka; Matsumoto, Teruyuki

    2012-01-01

    A large number of spores from fruiting bodies can lead to allergic reactions and other problems during the cultivation of edible mushrooms, including Pleurotus eryngii (DC.) Quél. A cultivar harboring a sporulation-deficient (sporeless) mutation would be useful for preventing these problems, but traditional breeding requires extensive time and labor. In this study, using a sporeless P. eryngii strain, we constructed a genetic linkage map to introduce a molecular breeding program like marker-assisted selection. Based on the segregation of 294 amplified fragment length polymorphism markers, two mating type factors, and the sporeless trait, the linkage map consisted of 11 linkage groups with a total length of 837.2 centimorgans (cM). The gene region responsible for the sporeless trait was located in linkage group IX with 32 amplified fragment length polymorphism markers and the B mating type factor. We also identified eight markers closely linked (within 1.2 cM) to the sporeless locus using bulked-segregant analysis-based amplified fragment length polymorphism. One such amplified fragment length polymorphism marker was converted into two sequence-tagged site markers, SD488-I and SD488-II. Using 14 wild isolates, sequence-tagged site analysis indicated the potential usefulness of the combination of two sequence-tagged site markers in cross-breeding of the sporeless strain. It also suggested that a map constructed for P. eryngii has adequate accuracy for marker-assisted selection. PMID:22210222

  5. Live-cell imaging of migrating cells expressing fluorescently-tagged proteins in a three-dimensional matrix.

    PubMed

    Shih, Wenting; Yamada, Soichiro

    2011-01-01

    Traditionally, cell migration has been studied on two-dimensional, stiff plastic surfaces. However, during important biological processes such as wound healing, tissue regeneration, and cancer metastasis, cells must navigate through complex, three-dimensional extracellular tissue. To better understand the mechanisms behind these biological processes, it is important to examine the roles of the proteins responsible for driving cell migration. Here, we outline a protocol to study the mechanisms of cell migration using the epithelial cell line (MDCK), and a three-dimensional, fibrous, self-polymerizing matrix as a model system. This optically clear extracellular matrix is easily amenable to live-cell imaging studies and better mimics the physiological, soft tissue environment. This report demonstrates a technique for directly visualizing protein localization and dynamics, and deformation of the surrounding three-dimensional matrix. Examination of protein localization and dynamics during cellular processes provides key insight into protein functions. Genetically encoded fluorescent tags provide a unique method for observing protein localization and dynamics. Using this technique, we can analyze the subcellular accumulation of key, force-generating cytoskeletal components in real-time as the cell maneuvers through the matrix. In addition, using multiple fluorescent tags with different wavelengths, we can examine the localization of multiple proteins simultaneously, thus allowing us to test, for example, whether different proteins have similar or divergent roles. Furthermore, the dynamics of fluorescently tagged proteins can be quantified using Fluorescent Recovery After Photobleaching (FRAP) analysis. This measurement assays the protein mobility and how stably bound the proteins are to the cytoskeletal network. By combining live-cell imaging with the treatment of protein function inhibitors, we can examine in real-time the changes in the distribution of proteins and

  6. A Single-Nucleotide Polymorphism of TaGS5 Gene Revealed its Association with Kernel Weight in Chinese Bread Wheat

    PubMed Central

    Wang, Shasha; Zhang, Xiangfen; Chen, Feng; Cui, Dangqun

    2015-01-01

    TaGS5 genes were cloned from bread wheat and were physically mapped on 3AS and 3DS. Sequencing results revealed that a SNP was found in the sixth exon of TaGS5-A1 gene. The SNP resulted in amino acid change from alanine to serine at the 303 bp position of TaGS5-A1. These two alleles were designated as TaGS5-A1a (alanine at the 303 bp position) and TaGS5-A1b genes (serine at the 303-bp position). Analysis of association of TaGS5-A1 alleles with agronomic traits indicated that cultivars with TaGS5-A1b possessed wider kernel width and higher thousand-kernel weight, as well as significantly lower plant height, spike length, and internode length below spike than those of cultivars with TaGS5-A1a over 3 years. These trait differences between TaGS5-A1a and TaGS5-A1b genotypes were larger in landraces than in modern cultivars. This finding suggested that TaGS5 gene played an important role in modulating yield-related traits in the landraces, which possibly resulted from numerous superior genes gathering in modern cultivars after strong artificial selection. The preferred TaGS5-A1b haplotype underwent very strong positive selection in Chinese modern wheat breeding, but not in Chinese landraces. Expression analysis of the TaGS5-A1 gene indicated that TaGS5-A1b allele possessed significantly higher expression level than TaGS5-A1b allele in differently developmental seeds. This study could provide relatively superior genotype in view of agronomic traits in wheat breeding programs. Likewise, this study could offer important information for the dissection of molecular and genetic basis of yield-related traits. PMID:26779195

  7. Proximal and distal sequences control UV cone pigment gene expression in transgenic zebrafish.

    PubMed

    Luo, Wenqin; Williams, John; Smallwood, Philip M; Touchman, Jeffrey W; Roman, Laura M; Nathans, Jeremy

    2004-04-30

    The molecular basis of cone photoreceptor-specific gene expression is largely unknown. In this study, we define cis-acting DNA sequences that control the cell type-specific expression of the zebrafish UV cone pigment gene by transient expression of green fluorescent protein transgenes following their injection into zebrafish embryos. These experiments show that 4.8 kb of 5'-flanking sequences from the zebrafish UV pigment gene direct expression specifically to UV cones and that this activity requires both distal and proximal sequences. In addition, we demonstrate that a proximal region located between -215 and -110 bp (with respect to the initiator methionine codon) can function in the context of a zebrafish rhodopsin promotor to convert its specificity from rod-only expression to rod and UV cone expression. These experiments demonstrate the power of transient transgenesis in zebrafish to efficiently define cis-acting regulatory sequences in an intact vertebrate. PMID:14966125

  8. The nucleotide composition of the spacer sequence influences the expression yield of heterologously expressed genes in Bacillus subtilis.

    PubMed

    Liebeton, Klaus; Lengefeld, Jette; Eck, Jürgen

    2014-12-10

    Bacillus subtilis is a commonly used host for the heterologous expression of genes in academia and industry. Many factors are known to influence the expression yield in this organism e.g. the complementarity between the Shine-Dalgarno sequence (SD) and the 16S-rRNA or secondary structures in the translation initiation region of the transcript. In this study, we analysed the impact of the nucleotide composition between the SD sequence and the start codon (the spacer sequence) on the expression yield. We demonstrated that a polyadenylate-moiety spacer sequence moderately increases the expression level of laccase CotA from B. subtilis. By screening a library of artificially generated spacer variants, we identified clones with greatly increased expression levels of two model enzymes, the laccase CotA from B. subtilis (11 fold) and the metagenome derived protease H149 (30 fold). Furthermore, we demonstrated that the effect of the spacer sequence is specific to the gene of interest. These results prove the high impact of the spacer sequence on the expression yield in B. subtilis. PMID:24997355

  9. Small nucleolar RNA clusters in trypanosomatid Leptomonas collosoma. Genome organization, expression studies, and the potential role of sequences present upstream from the first repeated cluster.

    PubMed

    Liang, Xue-hai; Ochaion, Avivit; Xu, Yu-xin; Liu, Qing; Michaeli, Shulamit

    2004-02-13

    Trypanosomatid small nucleolar RNA (snoRNA) genes are clustered in the genome. snoRNAs are transcribed polycistronically and processed into mature RNAs. In this study, we characterized four snoRNA clusters in Leptomonas collosoma. All of the clusters analyzed carry both C/D and H/ACA RNAs. The H/ACA RNAs are composed of a single hairpin, a structure typical to trypanosome and archaea guide RNAs. Using deletion and mutational analysis of a tagged C/D snoRNA situated within the snoRNA cluster, we identified 10-nucleotide flanking sequences that are essential for processing snoRNA from its precursor. Chromosome walk was performed on a snoRNA cluster, and a sequence of 700 bp was identified between the first repeat and the upstream open reading frame. Cloning of this sequence in an episome vector enhanced the expression of a tagged snoRNA gene in an orientation-dependent manner. However, continuous transcript spanning of this region was detected in steady-state RNA, suggesting that snoRNA transcription also originates from an upstream-long polycistronic transcriptional unit. The 700-bp fragment may therefore represent an example of many more elements to be discovered that enhance transcription along the chromosome, especially when transcription from the upstream gene is reduced or when enhanced transcription is needed. PMID:14645367

  10. FINDING REGULATORY ELEMENTS USING JOINT LIKELIHOODS FOR SEQUENCE AND EXPRESSION PROFILE DATA.

    SciTech Connect

    IAN HOLMES, UC BERKELEY, CA, WILLIAM J. BRUNO, LANL

    2000-08-20

    A recent, popular method of finding promoter sequences is to look for conserved motifs up-stream of genes clustered on the basis of expression data. This method presupposes that the clustering is correct. Theoretically, one should be better able to find promoter sequences and create more relevant gene clusters by taking a unified approach to these two problems. We present a likelihood function for a sequence-expression model giving a joint likelihood for a promoter sequence and its corresponding expression levels. An algorithm to estimate sequence-expression model parameters using Gibbs sampling and Expectation/Maximization is described. A program, called kimono, that implements this algorithm has been developed and the source code is freely available over the internet.

  11. Heparin-binding peptide as a novel affinity tag for purification of recombinant proteins.

    PubMed

    Morris, Jacqueline; Jayanthi, Srinivas; Langston, Rebekah; Daily, Anna; Kight, Alicia; McNabb, David S; Henry, Ralph; Kumar, Thallapuranam Krishnaswamy Suresh

    2016-10-01

    Purification of recombinant proteins constitutes a significant part of the downstream processing in biopharmaceutical industries. Major costs involved in the production of bio-therapeutics mainly depend on the number of purification steps used during the downstream process. Affinity chromatography is a widely used method for the purification of recombinant proteins expressed in different expression host platforms. Recombinant protein purification is achieved by fusing appropriate affinity tags to either N- or C- terminus of the target recombinant proteins. Currently available protein/peptide affinity tags have proved quite useful in the purification of recombinant proteins. However, these affinity tags suffer from specific limitations in their use under different conditions of purification. In this study, we have designed a novel 34-amino acid heparin-binding affinity tag (HB-tag) for the purification of recombinant proteins expressed in Escherichia coli (E. coli) cells. HB-tag fused recombinant proteins were overexpressed in E. coli in high yields. A one-step heparin-Sepharose-based affinity chromatography protocol was developed to purify HB-fused recombinant proteins to homogeneity using a simple sodium chloride step gradient elution. The HB-tag has also been shown to facilitate the purification of target recombinant proteins from their 8 M urea denatured state(s). The HB-tag has been demonstrated to be successfully released from the fusion protein by an appropriate protease treatment to obtain the recombinant target protein(s) in high yields. Results of the two-dimensional NMR spectroscopy experiments indicate that the purified recombinant target protein(s) exist in the native conformation. Polyclonal antibodies raised against the HB-peptide sequence, exhibited high binding specificity and sensitivity to the HB-fused recombinant proteins (∼10 ng) in different crude cell extracts obtained from diverse expression hosts. In our opinion, the HB-tag provides a

  12. Sequence and Expression Analyses of Ethylene Response Factors Highly Expressed in Latex Cells from Hevea brasiliensis

    PubMed Central

    Piyatrakul, Piyanuch; Yang, Meng; Putranto, Riza-Arief; Pirrello, Julien; Dessailly, Florence; Hu, Songnian; Summo, Marilyne; Theeravatanasuk, Kannikar; Leclercq, Julie; Kuswanhadi; Montoro, Pascal

    2014-01-01

    The AP2/ERF superfamily encodes transcription factors that play a key role in plant development and responses to abiotic and biotic stress. In Hevea brasiliensis, ERF genes have been identified by RNA sequencing. This study set out to validate the number of HbERF genes, and identify ERF genes involved in the regulation of latex cell metabolism. A comprehensive Hevea transcriptome was improved using additional RNA reads from reproductive tissues. Newly assembled contigs were annotated in the Gene Ontology database and were assigned to 3 main categories. The AP2/ERF superfamily is the third most represented compared with other transcription factor families. A comparison with genomic scaffolds led to an estimation of 114 AP2/ERF genes and 1 soloist in Hevea brasiliensis. Based on a phylogenetic analysis, functions were predicted for 26 HbERF genes. A relative transcript abundance analysis was performed by real-time RT-PCR in various tissues. Transcripts of ERFs from group I and VIII were very abundant in all tissues while those of group VII were highly accumulated in latex cells. Seven of the thirty-five ERF expression marker genes were highly expressed in latex. Subcellular localization and transactivation analyses suggested that HbERF-VII candidate genes encoded functional transcription factors. PMID:24971876

  13. Widespread Differential Expression of Coding Region and 3' UTR Sequences in Neurons and Other Tissues.

    PubMed

    Kocabas, Arif; Duarte, Terence; Kumar, Saranya; Hynes, Mary A

    2015-12-16

    Mature messenger RNAs (mRNAs) consist of coding sequence (CDS) and 5' and 3' UTRs, typically expected to show similar abundance within a given neuron. Examining mRNA from defined neurons, we unexpectedly show extremely common unbalanced expression of cognate 3' UTR and CDS sequences; many genes show high 3' UTR relative to CDS, others show high CDS to 3' UTR. In situ hybridization (19 of 19 genes) shows a broad range of 3' UTR-to-CDS expression ratios across neurons and tissues. Ratios may be spatially graded or change with developmental age but are consistent across animals. Further, for two genes examined, a 3' UTR-to-CDS ratio above a particular threshold in any given neuron correlated with reduced or undetectable protein expression. Our findings raise questions about the role of isolated 3' UTR sequences in regulation of protein expression and highlight the importance of separately examining 3' UTR and CDS sequences in gene expression analyses. PMID:26687222

  14. Shark Tagging Activities.

    ERIC Educational Resources Information Center

    Current: The Journal of Marine Education, 1998

    1998-01-01

    In this group activity, children learn about the purpose of tagging and how scientists tag a shark. Using a cut-out of a shark, students identify, measure, record data, read coordinates, and tag a shark. Includes introductory information about the purpose of tagging and the procedure, a data sheet showing original tagging data from Tampa Bay, and…

  15. Molecular assembly and dynamics of fluorescent protein-tagged single KCa1.1 channel in expression system and vascular smooth muscle cells.

    PubMed

    Yamamura, Hisao; Ikeda, Chikako; Suzuki, Yoshiaki; Ohya, Susumu; Imaizumi, Yuji

    2012-04-15

    The large-conductance Ca(2+)-activated K(+) (K(Ca)1.1, BK) channel has pivotal roles in the regulation of vascular tone. To clarify the molecular dynamics of BK channels and their functionally coupled protein on the membrane surface, we examined single-molecule imaging of fluorescent-labeled BK subunits in the plasma membrane using total internal reflection fluorescence (TIRF) microscopy. The dynamic mobility of yellow fluorescent protein (YFP)-tagged BKα subunit (BKα-YFP) expressed in human embryo kidney 293 (HEK) cells was detected in TIRF regions at the level of individual channels and their clusters on the plasma membrane with a diffusion coefficient of 6.7 × 10(3) nm(2)/s. When BKα-YFP was coexpressed with cyan fluorescent protein (CFP)-tagged BKβ1 subunit (BKβ1-CFP) in HEK cells, the mobility was reduced by ∼50%. Fluorescent image analyses suggest that green fluorescent protein (GFP)-tagged BKα subunit (BKα-GFP) expressed in vascular smooth muscle cells (VSMCs), at low density, preferentially formed a heterotetrameric molecular assembly with native BKα subunits, rather than homotetrameric BKα-GFP. Movement of BKα-YFP in VSMCs (0.29 × 10(3) nm(2)/s) was far more restricted than BKα-YFP/BKβ1-CFP in HEK cells (2.5 × 10(3) nm(2)/s). Actin disruption by pretreatment with cytochalasin D in VSMCs appeared to increase the mobile behavior of BKα-YFP, which was then significantly reduced by addition of jasplakinolide. Most BKα-YFP colocalized with caveolin 1 (Cav1)-CFP in VSMCs, but unexpectedly not frequently in HEK cells. Fluorescence resonance energy transfer analyses showed the direct interaction between BKα-YFP and Cav1-CFP, particularly in VSMCs. These results, obtained by single molecule imaging in living cells, indicate that the dynamics of BKα molecules on the membrane surface are strongly restricted or regulated by its auxiliary β-subunit, cytoskeleton, and direct interaction with Cav1 in VSMCs. PMID:22301058

  16. Transferable green fluorescence-tagged pEI2 in Edwardsiella ictaluri

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The pEI2 plasmid of Edwardsiella ictaluri isolate, I49, was tagged using a Tn10-GFP-kan cassette to create the green fluorescence-expressing derivative I49-gfp. The Tn10-GFP-kan insertion site was mapped by plasmid sequencing to 663 bp upstream of orf2 and appeared to be at a neutral site in the pla...

  17. Regulated expression of repetitive sequences including the identifier sequence during myotube formation in culture.

    PubMed Central

    Herget, T; Reich, M; Stüber, K; Starzinski-Powitz, A

    1986-01-01

    We have isolated and characterized a cDNA of 1183 bp, pL6-411, from rat L6 muscle cells. This cDNA contains repetitive sequences - including two inverted copies of the previously described identifier sequence - as shown by sequence analysis. Repetitive sequences from pL6-411 characterize a family of RNAs which is specifically induced during L6 myotube formation. Another part of the pL6-411 sequence, existing at low-copy number per haploid rat genome, hybridized to two RNAs of 5 kb and 2 kb from L6 myoblasts as well as from L6 myotubes. A third pL6-411-related RNA of 150 bases was detected which hybridized with the repetitive sequence but did not hybridize with the low-copy number part of pL6-411. It appears that the 'identifier' sequence in this population of small RNAs is complementary to one of the 'identifier' copies in the pL6-411-related RNA. Finally, we identified on cDNA pL6-411 the recognition site for the TGGCA-binding protein and in both orientations a total of four putative promoters for RNA polymerase III. Images Fig.1. Fig.2. Fig.3. PMID:2423328

  18. Site-specific chemical protein conjugation using genetically encoded aldehyde tags.

    PubMed

    Rabuka, David; Rush, Jason S; deHart, Gregory W; Wu, Peng; Bertozzi, Carolyn R

    2012-06-01

    We describe a method for modifying proteins site-specifically using a chemoenzymatic bioconjugation approach. Formylglycine generating enzyme (FGE) recognizes a pentapeptide consensus sequence, CxPxR, and it specifically oxidizes the cysteine in this sequence to an unusual aldehyde-bearing formylglyine. The FGE recognition sequence, or aldehyde tag, can be inserted into heterologous recombinant proteins produced in either prokaryotic or eukaryotic expression systems. The conversion of cysteine to formylglycine is accomplished by co-overexpression of FGE, either transiently or as a stable cell line, and the resulting aldehyde can be selectively reacted with α-nucleophiles to generate a site-selectively modified bioconjugate. This protocol outlines both the generation and the analysis of proteins aldehyde-tagged at their termini and the methods for chemical conjugation to the formylglycine. The process of generating aldehyde-tagged protein followed by chemical conjugation and purification takes 20 d. PMID:22576105

  19. Using The Affymetrix Wheat Microarray As An Oat Expression Platform

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Recent advances in sequencing have resulted in the sequence of a large number of plant expressed sequence tags (ESTs) to entire plant genomes. Using these EST sequences, oligonucleotide microarray chips have been developed for several species including barley (Hordeum vulgare), maize (Zea mays), ric...

  20. Sequence and gene expression evolution of paralogous genes in willows.

    PubMed

    Harikrishnan, Srilakshmy L; Pucholt, Pascal; Berlin, Sofia

    2015-01-01

    Whole genome duplications (WGD) have had strong impacts on species diversification by triggering evolutionary novelties, however, relatively little is known about the balance between gene loss and forces involved in the retention of duplicated genes originating from a WGD. We analyzed putative Salicoid duplicates in willows, originating from the Salicoid WGD, which took place more than 45 Mya. Contigs were constructed by de novo assembly of RNA-seq data derived from leaves and roots from two genotypes. Among the 48,508 contigs, 3,778 pairs were, based on fourfold synonymous third-codon transversion rates and syntenic positions, predicted to be Salicoid duplicates. Both copies were in most cases expressed in both tissues and 74% were significantly differentially expressed. Mean Ka/Ks was 0.23, suggesting that the Salicoid duplicates are evolving by purifying selection. Gene Ontology enrichment analyses showed that functions related to DNA- and nucleic acid binding were over-represented among the non-differentially expressed Salicoid duplicates, while functions related to biosynthesis and metabolism were over-represented among the differentially expressed Salicoid duplicates. We propose that the differentially expressed Salicoid duplicates are regulatory neo- and/or subfunctionalized, while the non-differentially expressed are dose sensitive, hence, functionally conserved. Multiple evolutionary processes, thus drive the retention of Salicoid duplicates in willows. PMID:26689951

  1. Sequence and gene expression evolution of paralogous genes in willows

    PubMed Central

    Harikrishnan, Srilakshmy L.; Pucholt, Pascal; Berlin, Sofia

    2015-01-01

    Whole genome duplications (WGD) have had strong impacts on species diversification by triggering evolutionary novelties, however, relatively little is known about the balance between gene loss and forces involved in the retention of duplicated genes originating from a WGD. We analyzed putative Salicoid duplicates in willows, originating from the Salicoid WGD, which took place more than 45 Mya. Contigs were constructed by de novo assembly of RNA-seq data derived from leaves and roots from two genotypes. Among the 48,508 contigs, 3,778 pairs were, based on fourfold synonymous third-codon transversion rates and syntenic positions, predicted to be Salicoid duplicates. Both copies were in most cases expressed in both tissues and 74% were significantly differentially expressed. Mean Ka/Ks was 0.23, suggesting that the Salicoid duplicates are evolving by purifying selection. Gene Ontology enrichment analyses showed that functions related to DNA- and nucleic acid binding were over-represented among the non-differentially expressed Salicoid duplicates, while functions related to biosynthesis and metabolism were over-represented among the differentially expressed Salicoid duplicates. We propose that the differentially expressed Salicoid duplicates are regulatory neo- and/or subfunctionalized, while the non-differentially expressed are dose sensitive, hence, functionally conserved. Multiple evolutionary processes, thus drive the retention of Salicoid duplicates in willows. PMID:26689951

  2. Analysis of the Changes in Expression Levels of Sialic Acid on Influenza-Virus-Infected Cells Using Lectin-Tagged Polymeric Nanoparticles

    PubMed Central

    Cho, Jaebum; Miyake, Yukari; Honda, Ayae; Kushiro, Keiichiro; Takai, Madoka

    2016-01-01

    Viral infections affect millions around the world, sometimes leading to severe consequences or even epidemics. Understanding the molecular dynamics during viral infections would provide crucial information for preventing or stopping the progress of infections. However, the current methods often involve the disruption of the infected cells or expensive and time-consuming procedures. In this study, fluorescent polymeric nanoparticles were fabricated and used as bioimaging nanoprobes that can monitor the progression of influenza viral infection through the changes in the expression levels of sialic acids expressed on the cell membrane. The nanoparticles were composed of a biocompatible monomer to prevent non-specific interactions, a hydrophobic monomer to form the core, a fluorescent monomer, and a protein-binding monomer to conjugate lectin, which binds sialic acids. It was shown that these lectin-tagged nanoparticles that specifically target sialic acids could track the changes in the expression levels of sialic acids caused by influenza viral infections in human lung epithelial cells. There was a sudden drop in the levels of sialic acid at the initial onset of virus infection (t = 0~1 h) and at approximately 4~5 h post-infection. The latter drop correlated with the production of viral proteins that was confirmed using traditional techniques. Thus, the accuracy, the rapidity and the efficacy of the nanoprobes were demonstrated. Such molecular bioimaging tools, which allow easy-handling and in situ monitoring, would be useful to directly observe and decipher the viral infection mechanisms. PMID:27493646

  3. Breast Cancer Proteomics – Differences in Protein Expression between Estrogen Receptor-Positive and -Negative Tumors Identified by Tandem Mass Tag Technology

    PubMed Central

    Ruckhäberle, Eugen; Karn, Thomas; Hanker, Lars; Schwarz, Josef; Schulz-Knappe, Peter; Kuhn, Karsten; Böhm, Gitte; Selzer, Stefan; Erhard, Neukum; Engels, Knut; Holtrich, Uwe; Kaufmann, Manfred; Rody, Achim

    2010-01-01

    Background Proteomic analysis has become an effective tool in breast cancer research. In this study, we applied the new gel-free tandem mass tag (TMT) reference method for the first time in breast cancer. Materials and Methods Proteomic analysis was used to compare 10 estrogen receptor (ER)-positive and 10 ER-negative samples. The results of the proteomic approach were validated by Western blot, immunohistochemistry and gene expression analysis. Results 17 proteins with significant differences in expression were identified. 13 proteins were overexpressed in ER-negative tumors and 4 were overexpressed in ER-positive samples. All these proteins were characterized by relatively high cellular abundance. Conclusions Our results demonstrate that the gel-free TMT approach allows the quantification of differences in protein expression levels. Further improvement of the sensitivity by subfractionation of the tissue should allow also the identification of low-abundance proteins and might lead to the use of this method in breast cancer research. PMID:22619634

  4. Experimental design, preprocessing, normalization and differential expression analysis of small RNA sequencing experiments

    PubMed Central

    2011-01-01

    Prior to the advent of new, deep sequencing methods, small RNA (sRNA) discovery was dependent on Sanger sequencing, which was time-consuming and limited knowledge to only the most abundant sRNA. The innovation of large-scale, next-generation sequencing has exponentially increased knowledge of the biology, diversity and abundance of sRNA populations. In this review, we discuss issues involved in the design of sRNA sequencing experiments, including choosing a sequencing platform, inherent biases that affect sRNA measurements and replication. We outline the steps involved in preprocessing sRNA sequencing data and review both the principles behind and the current options for normalization. Finally, we discuss differential expression analysis in the absence and presence of biological replicates. While our focus is on sRNA sequencing experiments, many of the principles discussed are applicable to the sequencing of other RNA populations. PMID:21356093

  5. Behavioral tagging of extinction learning.

    PubMed

    de Carvalho Myskiw, Jociane; Benetti, Fernando; Izquierdo, Iván

    2013-01-15

    Extinction of contextual fear in rats is enhanced by exposure to a novel environment at 1-2 h before or 1 h after extinction training. This effect is antagonized by administration of protein synthesis inhibitors anisomycin and rapamycin into the hippocampus, but not into the amygdala, immediately after either novelty or extinction training, as well as by the gene expression blocker 5,6-dichloro-1-beta-D-ribofuranosylbenzimidazole administered after novelty training, but not after extinction training. Thus, this effect can be attributed to a mechanism similar to synaptic tagging, through which long-term potentiation can be enhanced by other long-term potentiations or by exposure to a novel environment in a protein synthesis-dependent fashion. Extinction learning produces a tag at the appropriate synapses, whereas novelty learning causes the synthesis of plasticity-related proteins that are captured by the tag, strengthening the synapses that generated this tag. PMID:23277583

  6. Use of the amicyanin signal sequence for efficient periplasmic expression in E. coli of a human antibody light chain variable domain.

    PubMed

    Dow, Brian A; Tatulian, Suren A; Davidson, Victor L

    2015-04-01

    Periplasmic localization of recombinant proteins offers advantages over cytoplasmic protein expression. In this study signal sequence of amicyanin, which is encoded by the mauC gene of Paracoccus denitrificans, was used to express the light chain variable domain of the human κIO8/O18 germline antibody in the periplasm of Escherichiacoli. The expressed protein was purified in good yield (70mg/L of culture) in one step from the periplasmic fraction by affinity chromatography using an engineered hexahistidine tag. Circular dichroism spectroscopy was used to determine if the secondary and tertiary structures of the protein and its thermal stability corresponded to those of the native folded protein. The expressed and purified protein was indeed properly folded and exhibited a reasonable thermal transition temperature of 53°C. These results indicate that the amicyanin signal sequence may be particularly useful for prokaryotic expression of proteins which are prone to mis-folding, aggregation or formation of inclusion bodies, all of which were circumvented in this study. PMID:25573388

  7. Nucleotide sequence and expression of a Drosophila metallothionein.

    PubMed

    Lastowski-Perry, D; Otto, E; Maroni, G

    1985-02-10

    A Drosophila melanogaster cDNA clone was isolated based on its more intense hybridization to RNA sequences from copper-fed larvae than from control larval RNA. This clone showed strong hybridization to mouse metallothionein I cDNA at reduced stringency. Its nucleotide sequence includes an open reading segment which codes for a 40-amino acid protein; this protein is identified as metallothionein based on its similarity to the amino-terminal portion of mammalian and crab metalloproteins. The 10 cysteine residues present occur in five pairs of near vicinal cysteines (Cys-X-Cys). This cDNA sequence hybridized to a 400-nucleotide polyadenylated RNA whose presence in the cells of the alimentary canal of larvae was stimulated by ingestion of cadmium or copper; in other tissues this RNA was present at much lower levels. Mercury, silver, and zinc induced metallothionein to a lesser extent. The level of metallothionein RNA increased very soon after the initiation of metal treatment and reached a maximum after approximately 36 h. PMID:2578462

  8. Heterotopic endochondrial ossification with mixed tumor formation in C3(1)/Tag transgenic mice is associated with elevated TGF-beta1 and BMP-2 expression.

    PubMed

    Maroulakou, I G; Shibata, M A; Anver, M; Jorcyk, C L; Liu, M l; Roche, N; Roberts, A B; Tsarfaty, I; Reseau, J; Ward, J; Green, J E

    1999-09-23

    Transgenic mice which express the simian virus 40 large T-antigen (Tag) under the regulatory control of the hormone responsive rat C3(1) gene develop unusual lesions of heterotopic bone growth associated with mixed tumor formation arising from eccrine sweat glands found only in the foot pads of mice, ischiocavernosus muscle adjacent to bulbourethral glands and occasionally the salivary and mammary glands. These lesions are very similar to mixed tumors arising in several types of human cancers. Based upon electron microscopic examination and immunocytochemical analyses of cellular differentiation markers, the mixed proliferative lesions in this transgenic mouse model begin with the Tag-induced proliferation of epithelial and myoepithelial cells. The proliferation of these two types of cells results in hyperplasia and adenomatous transformation of the epithelial component, whereas the proliferating myoepithelial cells undergo metaplasia to form chondrocytes which deposit extracellular matrix, including collagen fibers. Cartilage develops focally between areas of epithelial proliferation and subsequently ossifies through a process of endochondrial bone formation. The metaplasia of myoepithelial cells to chondrocytes appears to require the inductive interaction of factors produced by the closely associated proliferating epithelial cells, including members of the TGF-beta superfamily. We demonstrate that TGF-beta1 protein accumulates in the extracellular matrix of the lesions, whereas RNA in situ hybridization reveals that BMP-2, another strong inducer of heterotopic bone formation, is overexpressed by the proliferating epithelial cells during the development of ectopic bone. The formation of sarcomatous tumors within the mixed tumors appears to be androgen-dependent and more frequent in mice lacking a normal allele of p53. This process of cartilage and bone induction may mimic epithelial-mesenchymal interactions which occur during embryonic bone formation. These

  9. Biostratigraphic expression of pleistocene sequence boundaries, Gulf of Mexico

    SciTech Connect

    Martin, R.E. ); Neff, E.D. ); Johnson, G.W. ); Krantz, D.E. )

    1993-04-01

    The quaternary section west of the Mississippi River delta consists of thousands of meters of terrigenous sediments, but the stratigraphic and paleoclimatic history recorded in these sequences is often distorted as a result of salt and shale diapirism. Quaternary sequences of the western Gulf of Mexico often reflect highly variable sediment accumulation rates within and between isolated salt-withdrawal basins and missing section resulting from unconformities and extensive faulting. The sedimentary record of Ocean Drilling Program's Core 625B (northeast Gulf of Mexico) contains significant unconformaties that represent a record of sea-level change during the Pleistocene. The core may thus serve as a standard for timing of sea-level changes of the Western Gulf. Utilizing primarily relative abundances of the warm-water Globorotalia menardii complex and cool water G. inflata, we have subdivided the pre-zone W Pleistocene of Core 625B into 17 subzones, resulting in an average duration of approximately 100,000 years per unit. Based on graphic correlation, subzonal boundaries are largely coeval between sites and can provide high-resolution biostratigraphic subdivision of the Pleistocene of industrial wells on an operational basis. Also, the subzonation delineates anomalous paleotops that are reworked, erosionally truncated at sequence boundaries or delta-depressed as a result of localized sediment influx. Graphic correlation of subzonal boundaries coupled with available biostratigraphic and magnetostrategraphic datums has demonstrated the near synchronomy of subzonal boundaries and their utility in the subdivision of the Pleistocene. Using graphic correlation, the paleontologist can build viable exploration models that can be used to predict the occurrence of hydrocarbon reservoir sands. 87 refs., 13 figs.

  10. Delineation of Cis-Acting Sequences Required for Expression of Drosophila Mojavensis Adh-1

    PubMed Central

    Bayer, C. A.; Curtiss, S. W.; Weaver, J. A.; Sullivan, D. T.

    1992-01-01

    The control of expression of the Adh-1 gene of Drosophila mojavensis has been analyzed by transforming ADH null Drosophila melanogaster hosts with P element constructs which contain D. mojavensis Adh-1 having deletions of different extent in the 5' and 3' ends. Adh-1 expression in the D. melanogaster hosts is qualitatively similar to expression in D. mojavensis, although expression is quantitatively lower in transformants. Deletions of the 5' end indicate that information required for normal temporal and tissue expression in larvae is contained within 70 bp of the transcription start site. However, deletion constructs to -70 are deficient in ovarian nurse cell expression, whereas the additional upstream sequences present in constructs containing deletions to -257 do support expression in the ovary. Comparison of the nucleotide sequence in the -257 to -70 region of Adh-1 of four species: D. mojavensis and Drosophila arizona, which express Adh-1 in the ovary, and Drosophila mulleri and Drosophila navojoa, which do not, has led to the identification of regions of sequence similarity that correlate with ovary expression. One of these bears a striking similarity to a conserved sequence located upstream of the three heat shock genes that have constitutive ovarian expression and may be an ovarian control element. We have identified an aberrant aspect of Adh-1 expression. In transformants which carry an Adh-1 gene without a functional upstream Adh-2 gene Adh-1 expression continues into the adult stage instead of ceasing at the onset of metamorphosis. In transformants with a functional Adh-2 gene, Adh-1 expression ceases in the third larval instar stage and aberrant expression in the adult stage does not occur. PMID:1317314

  11. Evaluation of codon biology in citrus and Poncirus trifoliata based on genomic features and frame corrected expressed sequence tags.

    PubMed

    Ahmad, Touqeer; Sablok, Gaurav; Tatarinova, Tatiana V; Xu, Qiang; Deng, Xiu-Xin; Guo, Wen-Wu

    2013-04-01

    Citrus, as one of the globally important fruit trees, has been an object of interest for understanding genetics and evolutionary process in fruit crops. Meta-analyses of 19 Citrus species, including 4 globally and economically important Citrus sinensis, Citrus clementina, Citrus reticulata, and 1 Citrus relative Poncirus trifoliata, were performed. We observed that codons ending with A- or T- at the wobble position were preferred in contrast to C- or G- ending codons, indicating a close association with AT richness of Citrus species and P. trifoliata. The present study postulates a large repertoire of a set of optimal codons for the Citrus genus and P. trifoliata and demonstrates that GCT and GGT are evolutionary conserved optimal codons. Our observation suggested that mutational bias is the dominating force in shaping the codon usage bias (CUB) in Citrus and P. trifoliata. Correspondence analysis (COA) revealed that the principal axis [axis 1; COA/relative synonymous codon usage (RSCU)] contributes only a minor portion (∼10.96%) of the recorded variance. In all analysed species, except P. trifoliata, Gravy and aromaticity played minor roles in resolving CUB. Compositional constraints were found to be strongly associated with the amino acid signatures in Citrus species and P. trifoliata. Our present analysis postulates compositional constraints in Citrus species and P. trifoliata and plausible role of the stress with GC3 and coevolution pattern of amino acid. PMID:23315666

  12. Expressed-sequence tag (EST) analysis of the response of apple (Malus x domestica) to low temperature and water deficit

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Apple leaf, bark, xylem, and root tissues were used to make cDNA libraries from non-stressed (control) trees and trees exposed to either low-temperature (5 C for 24 h) or water deficit (45% of saturated pot mass for 2 weeks). Over 22,600 clones from the nine libraries were subjected to 5' single pa...

  13. Evaluation of Codon Biology in Citrus and Poncirus trifoliata Based on Genomic Features and Frame Corrected Expressed Sequence Tags

    PubMed Central

    Ahmad, Touqeer; Sablok, Gaurav; Tatarinova, Tatiana V.; Xu, Qiang; Deng, Xiu-Xin; Guo, Wen-Wu

    2013-01-01

    Citrus, as one of the globally important fruit trees, has been an object of interest for understanding genetics and evolutionary process in fruit crops. Meta-analyses of 19 Citrus species, including 4 globally and economically important Citrus sinensis, Citrus clementina, Citrus reticulata, and 1 Citrus relative Poncirus trifoliata, were performed. We observed that codons ending with A- or T- at the wobble position were preferred in contrast to C- or G- ending codons, indicating a close association with AT richness of Citrus species and P. trifoliata. The present study postulates a large repertoire of a set of optimal codons for the Citrus genus and P. trifoliata and demonstrates that GCT and GGT are evolutionary conserved optimal codons. Our observation suggested that mutational bias is the dominating force in shaping the codon usage bias (CUB) in Citrus and P. trifoliata. Correspondence analysis (COA) revealed that the principal axis [axis 1; COA/relative synonymous codon usage (RSCU)] contributes only a minor portion (∼10.96%) of the recorded variance. In all analysed species, except P. trifoliata, Gravy and aromaticity played minor roles in resolving CUB. Compositional constraints were found to be strongly associated with the amino acid signatures in Citrus species and P. trifoliata. Our present analysis postulates compositional constraints in Citrus species and P. trifoliata and plausible role of the stress with GC3 and coevolution pattern of amino acid. PMID:23315666

  14. Response of apple (Malus x domestica) to low temperature and water deficit: an expressed-sequence tag (EST) analysis

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Apple leaf, bark, xylem, and root tissues were used to make cDNA libraries from non-stressed (control) trees, and trees exposed to either low-temperature (5 C for 24 h) or water deficit (45% of saturated pot mass for 2 weeks). Over 22,600 clones from the nine libraries were subjected to 5’ single p...

  15. Analysis of Expressed Sequence Tags from the Liver and Brain of Estrogen Treated Yellow Perch (Perca flavescens)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Over the last decade, there has been strong interest in the aquaculture of yellow perch (Perca flavescens) in North America. We have been developing genetically defined yellow perch broodstocks for aquaculture using genetic analyses and selection methods. However, aside from microsatellites, there...

  16. Generation and analysis of expression sequence tags from haustoria of the wheat stripe rust fungus Puccinia striiformis f. sp. tritici

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background: Stripe rust, caused by Puccinia striiformis f. sp. tritici (Pst), is one of the most destructive diseases of wheat (Triticum aestivum L.) worldwide. In spite of its agricultural importance, the genomics and genetics of the pathogen are poorly characterized. Pst transcripts from uredinios...

  17. Cloning and constitutive expression of His-tagged xylanase GH 11 from Penicillium occitanis Pol6 in Pichia pastoris X33: purification and characterization.

    PubMed

    Driss, Dorra; Bhiri, Fatma; Ghorbel, Raoudha; Chaabouni, Semia Ellouz

    2012-05-01

    High-level constitutive expression of xylanase GH11 from Penicillium occitanis Pol6 termed PoXyn2 was achieved using the methylotrophic yeast Pichia pastoris. The PoXyn2 cDNA encoding for a mature xylanase of 320 amino acids was subcloned into the pGAPZαA vector, to construct recombinant xylanse with six histidine residues at the N-terminal and further integrated into the genome of P. pastoris X-33 under the control of the glyceraldehyde 3-phosphate dehydrogenase (GAP) constitutive promoter. Activity assay and SDS-PAGE demonstrate that the His-tagged xylanase was extracellularly expressed in P. pastoris and purified to homogeneity by a simple, one-step purification protocol using immobilized metal affinity chromatography (Ni-NTA resin). The purified PoXyn2 showed a single band on SDS-PAGE with an apparent molecular weight of 30 kDa. The xylanase activity was optimal at pH 3.0 and 50°C. The specific activity measured for Oat Spelt Xylan was 8549.85 U mg(-1). The apparent The K(M) and V(max) values were 8.33±0.7 mg ml(-1)and 58.82±0.9 μmol min(-1) ml(-1), respectively, as measured on Oat Spelt Xylan. This is the first report demonstrating the possibility of mass production of P. occitanis xylanase using P. pastoris. PMID:22402470

  18. Fluorescently Tagged pUL47 of Marek's Disease Virus Reveals Differential Tissue Expression of the Tegument Protein In Vivo

    PubMed Central

    Arndt, Sina; Kaufer, Benedikt B.; Osterrieder, Nikolaus

    2012-01-01

    Marek's disease virus (MDV), a lymphotropic alphaherpesvirus, causes Marek's disease (MD) in chickens. MD is characterized by neurological signs, chronic wasting, and T cell lymphomas that predominate in the visceral organs. MDV replicates in a highly cell-associated manner in vitro and in vivo, with infectious virus particles being released only from feather follicle epithelial (FFE) cells in the skin. Virus produced and shed from FFE cells allows transmission of MDV from infected to naïve chickens, but the mechanisms or roles of differential virus gene expression have remained elusive. Here, we generated recombinant MDV in which we fused enhanced green fluorescent protein (EGFP) to the C terminus of the tegument protein pUL47 (vUL47-EGFP) or pUL49 (vUL49-EGFP). While vUL49-EGFP was highly attenuated in vitro and in vivo, vUL47-EGFP showed unaltered pathogenic potential and stable production of pUL47-EGFP, which facilitated direct analysis of pUL47 expression in cells and tissues. Our studies revealed that pUL47-EGFP is expressed at low levels and localizes to the nucleus during lytic replication in vitro and in lymphocytes in the spleen in vivo, while it is undetectable in tumors. In contrast, pUL47-EGFP is highly abundant and localizes predominantly in the cytoplasm in FFE cells in the skin, where MDV is shed into the environment. We concluded that differential expression and localization of MDV pUL47-EGFP tegument protein is potentially important for the unique cell-associated nature of MDV in vitro and in lymphocytes in vivo, as well as production of free virus in FFE cells. PMID:22190714

  19. Fluorescently tagged pUL47 of Marek's disease virus reveals differential tissue expression of the tegument protein in vivo.

    PubMed

    Jarosinski, Keith W; Arndt, Sina; Kaufer, Benedikt B; Osterrieder, Nikolaus

    2012-03-01