expression sequence tags: Topics by Science.gov

Sample records for expression sequence tags

Analyses of Expressed Sequence Tags from Apple1

PubMed Central

Newcomb, Richard D.; Crowhurst, Ross N.; Gleave, Andrew P.; Rikkerink, Erik H.A.; Allan, Andrew C.; Beuning, Lesley L.; Bowen, Judith H.; Gera, Emma; Jamieson, Kim R.; Janssen, Bart J.; Laing, William A.; McArtney, Steve; Nain, Bhawana; Ross, Gavin S.; Snowden, Kimberley C.; Souleyre, Edwige J.F.; Walton, Eric F.; Yauk, Yar-Khing

2006-01-01

The domestic apple (Malus domestica; also known as Malus pumila Mill.) has become a model fruit crop in which to study commercial traits such as disease and pest resistance, grafting, and flavor and health compound biosynthesis. To speed the discovery of genes involved in these traits, develop markers to map genes, and breed new cultivars, we have produced a substantial expressed sequence tag collection from various tissues of apple, focusing on fruit tissues of the cultivar Royal Gala. Over 150,000 expressed sequence tags have been collected from 43 different cDNA libraries representing 34 different tissues and treatments. Clustering of these sequences results in a set of 42,938 nonredundant sequences comprising 17,460 tentative contigs and 25,478 singletons, together representing what we predict are approximately one-half the expressed genes from apple. Many potential molecular markers are abundant in the apple transcripts. Dinucleotide repeats are found in 4,018 nonredundant sequences, mainly in the 5′-untranslated region of the gene, with a bias toward one repeat type (containing AG, 88%) and against another (repeats containing CG, 0.1%). Trinucleotide repeats are most common in the predicted coding regions and do not show a similar degree of sequence bias in their representation. Bi-allelic single-nucleotide polymorphisms are highly abundant with one found, on average, every 706 bp of transcribed DNA. Predictions of the numbers of representatives from protein families indicate the presence of many genes involved in disease resistance and the biosynthesis of flavor and health-associated compounds. Comparisons of some of these gene families with Arabidopsis (Arabidopsis thaliana) suggest instances where there have been duplications in the lineages leading to apple of biosynthetic and regulatory genes that are expressed in fruit. This resource paves the way for a concerted functional genomics effort in this important temperate fruit crop. PMID:16531485
Identification of human chromosome 22 transcribed sequences with ORF expressed sequence tags

PubMed Central

de Souza, Sandro J.; Camargo, Anamaria A.; Briones, Marcelo R. S.; Costa, Fernando F.; Nagai, Maria Aparecida; Verjovski-Almeida, Sergio; Zago, Marco A.; Andrade, Luis Eduardo C.; Carrer, Helaine; El-Dorry, Hamza F. A.; Espreafico, Enilza M.; Habr-Gama, Angelita; Giannella-Neto, Daniel; Goldman, Gustavo H.; Gruber, Arthur; Hackel, Christine; Kimura, Edna T.; Maciel, Rui M. B.; Marie, Suely K. N.; Martins, Elizabeth A. L.; Nóbrega, Marina P.; Paçó-Larson, Maria Luisa; Pardini, Maria Inês M. C.; Pereira, Gonçalo G.; Pesquero, João Bosco; Rodrigues, Vanderlei; Rogatto, Silvia R.; da Silva, Ismael D. C. G.; Sogayar, Mari C.; de Fátima Sonati, Maria; Tajara, Eloiza H.; Valentini, Sandro R.; Acencio, Marcio; Alberto, Fernando L.; Amaral, Maria Elisabete J.; Aneas, Ivy; Bengtson, Mário Henrique; Carraro, Dirce M.; Carvalho, Alex F.; Carvalho, Lúcia Helena; Cerutti, Janete M.; Corrêa, Maria Lucia C.; Costa, Maria Cristina R.; Curcio, Cyntia; Gushiken, Tsieko; Ho, Paulo L.; Kimura, Elza; Leite, Luciana C. C.; Maia, Gustavo; Majumder, Paromita; Marins, Mozart; Matsukuma, Adriana; Melo, Analy S. A.; Mestriner, Carlos Alberto; Miracca, Elisabete C.; Miranda, Daniela C.; Nascimento, Ana Lucia T. O.; Nóbrega, Francisco G.; Ojopi, Élida P. B.; Pandolfi, José Rodrigo C.; Pessoa, Luciana Gilbert; Rahal, Paula; Rainho, Claudia A.; da Ro's, Nancy; de Sá, Renata G.; Sales, Magaly M.; da Silva, Neusa P.; Silva, Tereza C.; da Silva, Wilson; Simão, Daniel F.; Sousa, Josane F.; Stecconi, Daniella; Tsukumo, Fernando; Valente, Valéria; Zalcberg, Heloisa; Brentani, Ricardo R.; Reis, Luis F. L.; Dias-Neto, Emmanuel; Simpson, Andrew J. G.

2000-01-01

Transcribed sequences in the human genome can be identified with confidence only by alignment with sequences derived from cDNAs synthesized from naturally occurring mRNAs. We constructed a set of 250,000 cDNAs that represent partial expressed gene sequences and that are biased toward the central coding regions of the resulting transcripts. They are termed ORF expressed sequence tags (ORESTES). The 250,000 ORESTES were assembled into 81,429 contigs. Of these, 1,181 (1.45%) were found to match sequences in chromosome 22 with at least one ORESTES contig for 162 (65.6%) of the 247 known genes, for 67 (44.6%) of the 150 related genes, and for 45 of the 148 (30.4%) EST-predicted genes on this chromosome. Using a set of stringent criteria to validate our sequences, we identified a further 219 previously unannotated transcribed sequences on chromosome 22. Of these, 171 were in fact also defined by EST or full length cDNA sequences available in GenBank but not utilized in the initial annotation of the first human chromosome sequence. Thus despite representing less than 15% of all expressed human sequences in the public databases at the time of the present analysis, ORESTES sequences defined 48 transcribed sequences on chromosome 22 not defined by other sequences. All of the transcribed sequences defined by ORESTES coincided with DNA regions predicted as encoding exons by genscan. (http://genes.mit.edu/GENSCAN.html). PMID:11070084
Analysis and Functional Annotation of an Expressed Sequence Tag Collection for Tropical Crop Sugarcane

PubMed Central

Vettore, André L.; da Silva, Felipe R.; Kemper, Edson L.; Souza, Glaucia M.; da Silva, Aline M.; Ferro, Maria Inês T.; Henrique-Silva, Flavio; Giglioti, Éder A.; Lemos, Manoel V.F.; Coutinho, Luiz L.; Nobrega, Marina P.; Carrer, Helaine; França, Suzelei C.; Bacci, Maurício; Goldman, Maria Helena S.; Gomes, Suely L.; Nunes, Luiz R.; Camargo, Luis E.A.; Siqueira, Walter J.; Van Sluys, Marie-Anne; Thiemann, Otavio H.; Kuramae, Eiko E.; Santelli, Roberto V.; Marino, Celso L.; Targon, Maria L.P.N.; Ferro, Jesus A.; Silveira, Henrique C.S.; Marini, Danyelle C.; Lemos, Eliana G.M.; Monteiro-Vitorello, Claudia B.; Tambor, José H.M.; Carraro, Dirce M.; Roberto, Patrícia G.; Martins, Vanderlei G.; Goldman, Gustavo H.; de Oliveira, Regina C.; Truffi, Daniela; Colombo, Carlos A.; Rossi, Magdalena; de Araujo, Paula G.; Sculaccio, Susana A.; Angella, Aline; Lima, Marleide M.A.; de Rosa, Vicente E.; Siviero, Fábio; Coscrato, Virginia E.; Machado, Marcos A.; Grivet, Laurent; Di Mauro, Sonia M.Z.; Nobrega, Francisco G.; Menck, Carlos F.M.; Braga, Marilia D.V.; Telles, Guilherme P.; Cara, Frank A.A.; Pedrosa, Guilherme; Meidanis, João; Arruda, Paulo

2003-01-01

To contribute to our understanding of the genome complexity of sugarcane, we undertook a large-scale expressed sequence tag (EST) program. More than 260,000 cDNA clones were partially sequenced from 26 standard cDNA libraries generated from different sugarcane tissues. After the processing of the sequences, 237,954 high-quality ESTs were identified. These ESTs were assembled into 43,141 putative transcripts. Of the assembled sequences, 35.6% presented no matches with existing sequences in public databases. A global analysis of the whole SUCEST data set indicated that 14,409 assembled sequences (33% of the total) contained at least one cDNA clone with a full-length insert. Annotation of the 43,141 assembled sequences associated almost 50% of the putative identified sugarcane genes with protein metabolism, cellular communication/signal transduction, bioenergetics, and stress responses. Inspection of the translated assembled sequences for conserved protein domains revealed 40,821 amino acid sequences with 1415 Pfam domains. Reassembling the consensus sequences of the 43,141 transcripts revealed a 22% redundancy in the first assembling. This indicated that possibly 33,620 unique genes had been identified and indicated that >90% of the sugarcane expressed genes were tagged. PMID:14613979
Single nucleotide polymorphisms from Theobroma cacao expressed sequence tags associated with witches' broom disease in cacao.

PubMed

Lima, L S; Gramacho, K P; Carels, N; Novais, R; Gaiotto, F A; Lopes, U V; Gesteira, A S; Zaidan, H A; Cascardo, J C M; Pires, J L; Micheli, F

2009-07-14

In order to increase the efficiency of cacao tree resistance to witches' broom disease, which is caused by Moniliophthora perniciosa (Tricholomataceae), we looked for molecular markers that could help in the selection of resistant cacao genotypes. Among the different markers useful for developing marker-assisted selection, single nucleotide polymorphisms (SNPs) constitute the most common type of sequence difference between alleles and can be easily detected by in silico analysis from expressed sequence tag libraries. We report the first detection and analysis of SNPs from cacao-M. perniciosa interaction expressed sequence tags, using bioinformatics. Selection based on analysis of these SNPs should be useful for developing cacao varieties resistant to this devastating disease.
An expressed sequence tag (EST) data mining strategy succeeding in the discovery of new G-protein coupled receptors.

PubMed

Wittenberger, T; Schaller, H C; Hellebrand, S

2001-03-30

We have developed a comprehensive expressed sequence tag database search method and used it for the identification of new members of the G-protein coupled receptor superfamily. Our approach proved to be especially useful for the detection of expressed sequence tag sequences that do not encode conserved parts of a protein, making it an ideal tool for the identification of members of divergent protein families or of protein parts without conserved domain structures in the expressed sequence tag database. At least 14 of the expressed sequence tags found with this strategy are promising candidates for new putative G-protein coupled receptors. Here, we describe the sequence and expression analysis of five new members of this receptor superfamily, namely GPR84, GPR86, GPR87, GPR90 and GPR91. We also studied the genomic structure and chromosomal localization of the respective genes applying in silico methods. A cluster of six closely related G-protein coupled receptors was found on the human chromosome 3q24-3q25. It consists of four orphan receptors (GPR86, GPR87, GPR91, and H963), the purinergic receptor P2Y1, and the uridine 5'-diphosphoglucose receptor KIAA0001. It seems likely that these receptors evolved from a common ancestor and therefore might have related ligands. In conclusion, we describe a data mining procedure that proved to be useful for the identification and first characterization of new genes and is well applicable for other gene families. Copyright 2001 Academic Press.
Studies of a biochemical factory: tomato trichome deep expressed sequence tag sequencing and proteomics.

PubMed

Schilmiller, Anthony L; Miner, Dennis P; Larson, Matthew; McDowell, Eric; Gang, David R; Wilkerson, Curtis; Last, Robert L

2010-07-01

Shotgun proteomics analysis allows hundreds of proteins to be identified and quantified from a single sample at relatively low cost. Extensive DNA sequence information is a prerequisite for shotgun proteomics, and it is ideal to have sequence for the organism being studied rather than from related species or accessions. While this requirement has limited the set of organisms that are candidates for this approach, next generation sequencing technologies make it feasible to obtain deep DNA sequence coverage from any organism. As part of our studies of specialized (secondary) metabolism in tomato (Solanum lycopersicum) trichomes, 454 sequencing of cDNA was combined with shotgun proteomics analyses to obtain in-depth profiles of genes and proteins expressed in leaf and stem glandular trichomes of 3-week-old plants. The expressed sequence tag and proteomics data sets combined with metabolite analysis led to the discovery and characterization of a sesquiterpene synthase that produces beta-caryophyllene and alpha-humulene from E,E-farnesyl diphosphate in trichomes of leaf but not of stem. This analysis demonstrates the utility of combining high-throughput cDNA sequencing with proteomics experiments in a target tissue. These data can be used for dissection of other biochemical processes in these specialized epidermal cells.
Application of Cydia pomonella expressed sequence tags: identification and expression of three general odorant binding proteins in codling moth

USDA-ARS?s Scientific Manuscript database

The codling moth, Cydia pomonella, is one of the most important pests of pome fruits in the world, yet the molecular genetics and physiology of this insect remains poorly understood. A combined assembly of 8340 expressed sequence tags (ESTs) was generated from Roche 454 GS-FLX sequencing of 8 tissu...
Rapid in silico cloning of genes using expressed sequence tags (ESTs).

PubMed

Gill, R W; Sanseau, P

2000-01-01

Expressed sequence tags (ESTs) are short single-pass DNA sequences obtained from either end of cDNA clones. These ESTs are derived from a vast number of cDNA libraries obtained from different species. Human ESTs are the bulk of the data and have been widely used to identify new members of gene families, as markers on the human chromosomes, to discover polymorphism sites and to compare expression patterns in different tissues or pathologies states. Information strategies have been devised to query EST databases. Since most of the analysis is performed with a computer, the term "in silico" strategy has been coined. In this chapter we will review the current status of EST databases, the pros and cons of EST-type data and describe possible strategies to retrieve meaningful information.
Genome-wide characterization and selection of expressed sequence tag simple sequence repeat primers for optimized marker distribution and reliability in peach

USDA-ARS?s Scientific Manuscript database

Expressed sequence tag (EST) simple sequence repeats (SSRs) in Prunus were mined, and flanking primers designed and used for genome-wide characterization and selection of primers to optimize marker distribution and reliability. A total of 12,618 contigs were assembled from 84,727 ESTs, along with 34...
Evaluation of anonymous and expressed sequence tag derived polymorphic microsatellite markers in the tobacco budworm Heliothis virescens (Lepidoptera: noctuidae)

USDA-ARS?s Scientific Manuscript database

Polymorphic genetic markers were identified and characterized using a partial genomic library of Heliothis virescens enriched for simple sequence repeats (SSR) and nucleotide sequences of expressed sequence tags (EST). Nucleotide sequences of 192 clones from the partial genomic library yielded 147 u...
Expressed sequence tags from the plant trypanosomatid Phytomonas serpens.

PubMed

Pappas, Georgios J; Benabdellah, Karim; Zingales, Bianca; González, Antonio

2005-08-01

We have generated 2190 expressed sequence tags (ESTs) from a cDNA library of the plant trypanosomatid Phytomonas serpens. Upon processing and clustering the set of 1893 accepted sequences was reduced to 697 clusters consisting of 452 singletons and 245 contigs. Functional categories were assigned based on BLAST searches against a database of the eukaryotic orthologous groups of proteins (KOG). Thirty six percent of the generated sequences showed no hits against the KOG database and 39.6% presented similarity to the KOG classes corresponding to translation, ribosomal structure and biogenesis. The most populated cluster contained 45 ESTs homologous to members of the glucose transporter family. This fact can be immediately correlated to the reported Phytomonas dependence on anaerobic glycolytic ATP production due to the lack of cytochrome-mediated respiratory chain. In this context, not only a number of enzymes of the glycolytic pathway were identified but also of the Krebs cycle as well as specific components of the respiratory chain. The data here reported, including a few hundred unique sequences and the description of tandemly repeated motifs and putative transcript stability motifs at untranslated mRNA ends, represent an initial approach to overcome the lack of information on the molecular biology of this organism.
Expressed sequence tags from Atta laevigata and identification of candidate genes for the control of pest leaf-cutting ants

PubMed Central

2011-01-01

Background Leafcutters are the highest evolved within Neotropical ants in the tribe Attini and model systems for studying caste formation, labor division and symbiosis with microorganisms. Some species of leafcutters are agricultural pests controlled by chemicals which affect other animals and accumulate in the environment. Aiming to provide genetic basis for the study of leafcutters and for the development of more specific and environmentally friendly methods for the control of pest leafcutters, we generated expressed sequence tag data from Atta laevigata, one of the pest ants with broad geographic distribution in South America. Results The analysis of the expressed sequence tags allowed us to characterize 2,006 unique sequences in Atta laevigata. Sixteen of these genes had a high number of transcripts and are likely positively selected for high level of gene expression, being responsible for three basic biological functions: energy conservation through redox reactions in mitochondria; cytoskeleton and muscle structuring; regulation of gene expression and metabolism. Based on leafcutters lifestyle and reports of genes involved in key processes of other social insects, we identified 146 sequences potential targets for controlling pest leafcutters. The targets are responsible for antixenobiosis, development and longevity, immunity, resistance to pathogens, pheromone function, cell signaling, behavior, polysaccharide metabolism and arginine kynase activity. Conclusion The generation and analysis of expressed sequence tags from Atta laevigata have provided important genetic basis for future studies on the biology of leaf-cutting ants and may contribute to the development of a more specific and environmentally friendly method for the control of agricultural pest leafcutters. PMID:21682882
Expressed sequence tags from Atta laevigata and identification of candidate genes for the control of pest leaf-cutting ants.

PubMed

Rodovalho, Cynara M; Ferro, Milene; Fonseca, Fernando Pp; Antonio, Erik A; Guilherme, Ivan R; Henrique-Silva, Flávio; Bacci, Maurício

2011-06-17

Leafcutters are the highest evolved within Neotropical ants in the tribe Attini and model systems for studying caste formation, labor division and symbiosis with microorganisms. Some species of leafcutters are agricultural pests controlled by chemicals which affect other animals and accumulate in the environment. Aiming to provide genetic basis for the study of leafcutters and for the development of more specific and environmentally friendly methods for the control of pest leafcutters, we generated expressed sequence tag data from Atta laevigata, one of the pest ants with broad geographic distribution in South America. The analysis of the expressed sequence tags allowed us to characterize 2,006 unique sequences in Atta laevigata. Sixteen of these genes had a high number of transcripts and are likely positively selected for high level of gene expression, being responsible for three basic biological functions: energy conservation through redox reactions in mitochondria; cytoskeleton and muscle structuring; regulation of gene expression and metabolism. Based on leafcutters lifestyle and reports of genes involved in key processes of other social insects, we identified 146 sequences potential targets for controlling pest leafcutters. The targets are responsible for antixenobiosis, development and longevity, immunity, resistance to pathogens, pheromone function, cell signaling, behavior, polysaccharide metabolism and arginine kynase activity. The generation and analysis of expressed sequence tags from Atta laevigata have provided important genetic basis for future studies on the biology of leaf-cutting ants and may contribute to the development of a more specific and environmentally friendly method for the control of agricultural pest leafcutters.
Analysis of expressed sequence tags from Prunus mume flower and fruit and development of simple sequence repeat markers

PubMed Central

2010-01-01

Background Expressed Sequence Tag (EST) has been a cost-effective tool in molecular biology and represents an abundant valuable resource for genome annotation, gene expression, and comparative genomics in plants. Results In this study, we constructed a cDNA library of Prunus mume flower and fruit, sequenced 10,123 clones of the library, and obtained 8,656 expressed sequence tag (EST) sequences with high quality. The ESTs were assembled into 4,473 unigenes composed of 1,492 contigs and 2,981 singletons and that have been deposited in NCBI (accession IDs: GW868575 - GW873047), among which 1,294 unique ESTs were with known or putative functions. Furthermore, we found 1,233 putative simple sequence repeats (SSRs) in the P. mume unigene dataset. We randomly tested 42 pairs of PCR primers flanking potential SSRs, and 14 pairs were identified as true-to-type SSR loci and could amplify polymorphic bands from 20 individual plants of P. mume. We further used the 14 EST-SSR primer pairs to test the transferability on peach and plum. The result showed that nearly 89% of the primer pairs produced target PCR bands in the two species. A high level of marker polymorphism was observed in the plum species (65%) and low in the peach (46%), and the clustering analysis of the three species indicated that these SSR markers were useful in the evaluation of genetic relationships and diversity between and within the Prunus species. Conclusions We have constructed the first cDNA library of P. mume flower and fruit, and our data provide sets of molecular biology resources for P. mume and other Prunus species. These resources will be useful for further study such as genome annotation, new gene discovery, gene functional analysis, molecular breeding, evolution and comparative genomics between Prunus species. PMID:20626882
Development of expressed sequence tag-simple sequence repeat markers for genetic characterization and population structure analysis of Praxelis clematidea (Asteraceae).

PubMed

Wang, Q Z; Huang, M; Downie, S R; Chen, Z X

2016-05-23

Invasive plants tend to spread aggressively in new habitats and an understanding of their genetic diversity and population structure is useful for their management. In this study, expressed sequence tag-simple sequence repeat (EST-SSR) markers were developed for the invasive plant species Praxelis clematidea (Asteraceae) from 5548 Stevia rebaudiana (Asteraceae) expressed sequence tags (ESTs). A total of 133 microsatellite-containing ESTs (2.4%) were identified, of which 56 (42.1%) were hexanucleotide repeat motifs and 50 (37.6%) were trinucleotide repeat motifs. Of the 24 primer pairs designed from these 133 ESTs, 7 (29.2%) resulted in significant polymorphisms. The number of alleles per locus ranged from 5 to 9. The relatively high genetic diversity (H = 0.2667, I = 0.4212, and P = 100%) of P. clematidea was related to high gene flow (Nm = 1.4996) among populations. The coefficient of population differentiation (GST = 0.2500) indicated that most genetic variation occurred within populations. A Mantel test suggested that there was significant correlation between genetic distance and geographical distribution (r = 0.3192, P = 0.012). These results further support the transferability of EST-SSR markers between closely related genera of the same family.
Studies of a Biochemical Factory: Tomato Trichome Deep Expressed Sequence Tag Sequencing and Proteomics1[W][OA

PubMed Central

Schilmiller, Anthony L.; Miner, Dennis P.; Larson, Matthew; McDowell, Eric; Gang, David R.; Wilkerson, Curtis; Last, Robert L.

2010-01-01

Shotgun proteomics analysis allows hundreds of proteins to be identified and quantified from a single sample at relatively low cost. Extensive DNA sequence information is a prerequisite for shotgun proteomics, and it is ideal to have sequence for the organism being studied rather than from related species or accessions. While this requirement has limited the set of organisms that are candidates for this approach, next generation sequencing technologies make it feasible to obtain deep DNA sequence coverage from any organism. As part of our studies of specialized (secondary) metabolism in tomato (Solanum lycopersicum) trichomes, 454 sequencing of cDNA was combined with shotgun proteomics analyses to obtain in-depth profiles of genes and proteins expressed in leaf and stem glandular trichomes of 3-week-old plants. The expressed sequence tag and proteomics data sets combined with metabolite analysis led to the discovery and characterization of a sesquiterpene synthase that produces β-caryophyllene and α-humulene from E,E-farnesyl diphosphate in trichomes of leaf but not of stem. This analysis demonstrates the utility of combining high-throughput cDNA sequencing with proteomics experiments in a target tissue. These data can be used for dissection of other biochemical processes in these specialized epidermal cells. PMID:20431087
Analysis and functional annotation of expressed sequence tags from the fall armyworm Spodoptera frugiperda

PubMed Central

Deng, Youping; Dong, Yinghua; Thodima, Venkata; Clem, Rollie J; Passarelli, A Lorena

2006-01-01

Background Little is known about the genome sequences of lepidopteran insects, although this group of insects has been studied extensively in the fields of endocrinology, development, immunity, and pathogen-host interactions. In addition, cell lines derived from Spodoptera frugiperda and other lepidopteran insects are routinely used for baculovirus foreign gene expression. This study reports the results of an expressed sequence tag (EST) sequencing project in cells from the lepidopteran insect S. frugiperda, the fall armyworm. Results We have constructed an EST database using two cDNA libraries from the S. frugiperda-derived cell line, SF-21. The database consists of 2,367 ESTs which were assembled into 244 contigs and 951 singlets for a total of 1,195 unique sequences. Conclusion S. frugiperda is an agriculturally important pest insect and genomic information will be instrumental for establishing initial transcriptional profiling and gene function studies, and for obtaining information about genes manipulated during infections by insect pathogens such as baculoviruses. PMID:17052344
Expressed sequence tags from heat-shocked seagrass Zostera noltii (Hornemann) from its southern distribution range.

PubMed

Massa, Sónia I; Pearson, Gareth A; Aires, Tânia; Kube, Michael; Olsen, Jeanine L; Reinhardt, Richard; Serrão, Ester A; Arnaud-Haond, Sophie

2011-09-01

Predicted global climate change threatens the distributional ranges of species worldwide. We identified genes expressed in the intertidal seagrass Zostera noltii during recovery from a simulated low tide heat-shock exposure. Five Expressed Sequence Tag (EST) libraries were compared, corresponding to four recovery times following sub-lethal temperature stress, and a non-stressed control. We sequenced and analyzed 7009 sequence reads from 30min, 2h, 4h and 24h after the beginning of the heat-shock (AHS), and 1585 from the control library, for a total of 8594 sequence reads. Among 51 Tentative UniGenes (TUGs) exhibiting significantly different expression between libraries, 19 (37.3%) were identified as 'molecular chaperones' and were over-expressed following heat-shock, while 12 (23.5%) were 'photosynthesis TUGs' generally under-expressed in heat-shocked plants. A time course analysis of expression showed a rapid increase in expression of the molecular chaperone class, most of which were heat-shock proteins; which increased from 2 sequence reads in the control library to almost 230 in the 30min AHS library, followed by a slow decrease during further recovery. In contrast, 'photosynthesis TUGs' were under-expressed 30min AHS compared with the control library, and declined progressively with recovery time in the stress libraries, with a total of 29 sequence reads 24h AHS, compared with 125 in the control. A total of 4734 TUGs were screened for EST-Single Sequence Repeats (EST-SSRs) and 86 microsatellites were identified. Copyright © 2011 Elsevier B.V. All rights reserved.
Expressed sequence tags from the flower pathogen Claviceps purpurea.

PubMed

Oeser, Birgitt; Beaussart, François; Haarmann, Thomas; Lorenz, Nicole; Nathues, Eva; Rolke, Yvonne; Scheffer, Jan; Weiner, January; Tudzynski, Paul

2009-09-01

SUMMARY The ascomycete Claviceps purpurea (ergot) is a biotrophic flower pathogen of rye and other grasses. The deleterious toxic effects of infected rye seeds on humans and grazing animals have been known since the Middle Ages. To gain further insight into the molecular basis of this disease, we generated about 10 000 expressed sequence tags (ESTs)-about 25% originating from axenic fungal culture and about 75% from tissues collected 6-20 days after infection of rye spikes. The pattern of axenic vs. in planta gene expression was compared. About 200 putative plant genes were identified within the in planta library. A high percentage of these were predicted to function in plant defence against the ergot fungus and other pathogens, for example pathogenesis-related proteins. Potential fungal pathogenicity and virulence genes were found via comparison with the pathogen-host interaction database (PHI-base; http://www.phi-base.org) and with genes known to be highly expressed in the haustoria of the bean rust fungus. Comparative analysis of Claviceps and two other fungal flower pathogens (necrotrophic Fusarium graminearum and biotrophic Ustilago maydis) highlighted similarities and differences in their lifestyles, for example all three fungi have signalling components and cell wall-degrading enzymes in their arsenal. In summary, the analysis of axenic and in planta ESTs yielded a collection of candidate genes to be evaluated for functional roles in this plant-microbe interaction.
Genomic analysis of expressed sequence tags in American black bear Ursus americanus

PubMed Central

2010-01-01

Background Species of the bear family (Ursidae) are important organisms for research in molecular evolution, comparative physiology and conservation biology, but relatively little genetic sequence information is available for this group. Here we report the development and analyses of the first large scale Expressed Sequence Tag (EST) resource for the American black bear (Ursus americanus). Results Comprehensive analyses of molecular functions, alternative splicing, and tissue-specific expression of 38,757 black bear EST sequences were conducted using the dog genome as a reference. We identified 18 genes, involved in functions such as lipid catabolism, cell cycle, and vesicle-mediated transport, that are showing rapid evolution in the bear lineage Three genes, Phospholamban (PLN), cysteine glycine-rich protein 3 (CSRP3) and Troponin I type 3 (TNNI3), are related to heart contraction, and defects in these genes in humans lead to heart disease. Two genes, biphenyl hydrolase-like (BPHL) and CSRP3, contain positively selected sites in bear. Global analysis of evolution rates of hibernation-related genes in bear showed that they are largely conserved and slowly evolving genes, rather than novel and fast-evolving genes. Conclusion We provide a genomic resource for an important mammalian organism and our study sheds new light on the possible functions and evolution of bear genes. PMID:20338065

Genomic analysis of expressed sequence tags in American black bear Ursus americanus.

PubMed

Zhao, Sen; Shao, Chunxuan; Goropashnaya, Anna V; Stewart, Nathan C; Xu, Yichi; Tøien, Øivind; Barnes, Brian M; Fedorov, Vadim B; Yan, Jun

2010-03-26

Species of the bear family (Ursidae) are important organisms for research in molecular evolution, comparative physiology and conservation biology, but relatively little genetic sequence information is available for this group. Here we report the development and analyses of the first large scale Expressed Sequence Tag (EST) resource for the American black bear (Ursus americanus). Comprehensive analyses of molecular functions, alternative splicing, and tissue-specific expression of 38,757 black bear EST sequences were conducted using the dog genome as a reference. We identified 18 genes, involved in functions such as lipid catabolism, cell cycle, and vesicle-mediated transport, that are showing rapid evolution in the bear lineage Three genes, Phospholamban (PLN), cysteine glycine-rich protein 3 (CSRP3) and Troponin I type 3 (TNNI3), are related to heart contraction, and defects in these genes in humans lead to heart disease. Two genes, biphenyl hydrolase-like (BPHL) and CSRP3, contain positively selected sites in bear. Global analysis of evolution rates of hibernation-related genes in bear showed that they are largely conserved and slowly evolving genes, rather than novel and fast-evolving genes. We provide a genomic resource for an important mammalian organism and our study sheds new light on the possible functions and evolution of bear genes.
Distinct profiles of expressed sequence tags during intestinal regeneration in the sea cucumber Holothuria glaberrima

PubMed Central

Rojas-Cartagena, Carmencita; Ortíz-Pineda, Pablo; Ramírez-Gómez, Francisco; Suárez-Castillo, Edna C.; Matos-Cruz, Vanessa; Rodríguez, Carlos; Ortíz-Zuazaga, Humberto; García-Arrarás, José E.

2010-01-01

Repair and regeneration are key processes for tissue maintenance, and their disruption may lead to disease states. Little is known about the molecular mechanisms that underline the repair and regeneration of the digestive tract. The sea cucumber Holothuria glaberrima represents an excellent model to dissect and characterize the molecular events during intestinal regeneration. To study the gene expression profile, cDNA libraries were constructed from normal, 3-day, and 7-day regenerating intestines of H. glaberrima. Clones were randomly sequenced and queried against the nonredundant protein database at the National Center for Biotechnology Information. RT-PCR analyses were made of several genes to determine their expression profile during intestinal regeneration. A total of 5,173 sequences from three cDNA libraries were obtained. About 46.2, 35.6, and 26.2% of the sequences for the normal, 3-days, and 7-days cDNA libraries, respectively, shared significant similarity with known sequences in the protein database of GenBank but only present 10% of similarity among them. Analysis of the libraries in terms of functional processes, protein domains, and most common sequences suggests that a differential expression profile is taking place during the regeneration process. Further examination of the expressed sequence tag dataset revealed that 12 putative genes are differentially expressed at significant level (R > 6). Experimental validation by RT-PCR analysis reveals that at least three genes (unknown C-4677-1, melanotransferrin, and centaurin) present a differential expression during regeneration. These findings strongly suggest that the gene expression profile varies among regeneration stages and provide evidence for the existence of differential gene expression. PMID:17579180
Expressed sequence tags related to nitrogen metabolism in maize inoculated with Azospirillum brasilense.

PubMed

Pereira-Defilippi, L; Pereira, E M; Silva, F M; Moro, G V

2017-05-31

The relative quantitative real-time expression of two expressed sequence tags (ESTs) codifying for key enzymes in nitrogen metabolism in maize, nitrate reductase (ZmNR), and glutamine synthetase (ZmGln1-3) was performed for genotypes inoculated with Azospirillum brasilense. Two commercial single-cross hybrids (AG7098 and 2B707) and two experimental synthetic varieties (V2 and V4) were raised under controlled greenhouse conditions, in six treatment groups corresponding to different forms of inoculation and different levels of nitrogen application by top-dressing. The genotypes presented distinct responses to inoculation with A. brasilense. Increases in the expression of ZmNR were observed for the hybrids, while V4 only displayed a greater level of expression when the plants received nitrogenous fertilization by top-dressing and there was no inoculation. The expression of the ZmGln1-3EST was induced by A. brasilense in the hybrids and the variety V4. In contrast, the variety V2 did not respond to inoculation.
Development of an Expressed Sequence Tag (EST) Resource for Wheat (Triticum aestivum L.)

PubMed Central

Lazo, G. R.; Chao, S.; Hummel, D. D.; Edwards, H.; Crossman, C. C.; Lui, N.; Matthews, D. E.; Carollo, V. L.; Hane, D. L.; You, F. M.; Butler, G. E.; Miller, R. E.; Close, T. J.; Peng, J. H.; Lapitan, N. L. V.; Gustafson, J. P.; Qi, L. L.; Echalier, B.; Gill, B. S.; Dilbirligi, M.; Randhawa, H. S.; Gill, K. S.; Greene, R. A.; Sorrells, M. E.; Akhunov, E. D.; Dvořák, J.; Linkiewicz, A. M.; Dubcovsky, J.; Hossain, K. G.; Kalavacharla, V.; Kianian, S. F.; Mahmoud, A. A.; Miftahudin; Ma, X.-F.; Conley, E. J.; Anderson, J. A.; Pathan, M. S.; Nguyen, H. T.; McGuire, P. E.; Qualset, C. O.; Anderson, O. D.

2004-01-01

This report describes the rationale, approaches, organization, and resource development leading to a large-scale deletion bin map of the hexaploid (2n = 6x = 42) wheat genome (Triticum aestivum L.). Accompanying reports in this issue detail results from chromosome bin-mapping of expressed sequence tags (ESTs) representing genes onto the seven homoeologous chromosome groups and a global analysis of the entire mapped wheat EST data set. Among the resources developed were the first extensive public wheat EST collection (113,220 ESTs). Described are protocols for sequencing, sequence processing, EST nomenclature, and the assembly of ESTs into contigs. These contigs plus singletons (unassembled ESTs) were used for selection of distinct sequence motif unigenes. Selected ESTs were rearrayed, validated by 5′ and 3′ sequencing, and amplified for probing a series of wheat aneuploid and deletion stocks. Images and data for all Southern hybridizations were deposited in databases and were used by the coordinators for each of the seven homoeologous chromosome groups to validate the mapping results. Results from this project have established the foundation for future developments in wheat genomics. PMID:15514037
A blackberry (Rubus L.) expressed sequence tag library for the development of simple sequence repeat markers

PubMed Central

Lewers, Kim S; Saski, Chris A; Cuthbertson, Brandon J; Henry, David C; Staton, Meg E; Main, Dorrie S; Dhanaraj, Anik L; Rowland, Lisa J; Tomkins, Jeff P

2008-01-01

Background The recent development of novel repeat-fruiting types of blackberry (Rubus L.) cultivars, combined with a long history of morphological marker-assisted selection for thornlessness by blackberry breeders, has given rise to increased interest in using molecular markers to facilitate blackberry breeding. Yet no genetic maps, molecular markers, or even sequences exist specifically for cultivated blackberry. The purpose of this study is to begin development of these tools by generating and annotating the first blackberry expressed sequence tag (EST) library, designing primers from the ESTs to amplify regions containing simple sequence repeats (SSR), and testing the usefulness of a subset of the EST-SSRs with two blackberry cultivars. Results A cDNA library of 18,432 clones was generated from expanding leaf tissue of the cultivar Merton Thornless, a progenitor of many thornless commercial cultivars. Among the most abundantly expressed of the 3,000 genes annotated were those involved with energy, cell structure, and defense. From individual sequences containing SSRs, 673 primer pairs were designed. Of a randomly chosen set of 33 primer pairs tested with two blackberry cultivars, 10 detected an average of 1.9 polymorphic PCR products. Conclusion This rate predicts that this library may yield as many as 940 SSR primer pairs detecting 1,786 polymorphisms. This may be sufficient to generate a genetic map that can be used to associate molecular markers with phenotypic traits, making possible molecular marker-assisted breeding to compliment existing morphological marker-assisted breeding in blackberry. PMID:18570660
Micropreparative capillary gel electrophoresis of DNA: rapid expressed sequence tag library construction.

PubMed

Shi, Liang; Khandurina, Julia; Ronai, Zsolt; Li, Bi-Yu; Kwan, Wai King; Wang, Xun; Guttman, András

2003-01-01

A capillary gel electrophoresis based automated DNA fraction collection technique was developed to support a novel DNA fragment-pooling strategy for expressed sequence tag (EST) library construction. The cDNA population is first cleaved by BsaJ I and EcoR I restriction enzymes, and then subpooled by selective ligation with specific adapters followed by polymerase chain reaction (PCR) amplification and labeling. Combination of this cDNA fingerprinting method with high-resolution capillary gel electrophoresis separation and precise fractionation of individual cDNA transcript representatives avoids redundant fragment selection and concomitant repetitive sequencing of abundant transcripts. Using a computer-controlled capillary electrophoresis device the transcript representatives were separated by their size and fractions were automatically collected in every 30 s into 96-well plates. The high resolving power of the sieving matrix ensured sequencing grade separation of the DNA fragments (i.e., single-base resolution) and successful fraction collection. Performance and precision of the fraction collection procedure was validated by PCR amplification of the collected DNA fragments followed by capillary electrophoresis analysis for size and purity verification. The collected and PCR-amplified transcript representatives, ranging up to several hundred base pairs, were then sequenced to create an EST library.
Mining of haplotype-based expressed sequence tag single nucleotide polymorphisms in citrus

PubMed Central

2013-01-01

Background Single nucleotide polymorphisms (SNPs), the most abundant variations in a genome, have been widely used in various studies. Detection and characterization of citrus haplotype-based expressed sequence tag (EST) SNPs will greatly facilitate further utilization of these gene-based resources. Results In this paper, haplotype-based SNPs were mined out of publicly available citrus expressed sequence tags (ESTs) from different citrus cultivars (genotypes) individually and collectively for comparison. There were a total of 567,297 ESTs belonging to 27 cultivars in varying numbers and consequentially yielding different numbers of haplotype-based quality SNPs. Sweet orange (SO) had the most (213,830) ESTs, generating 11,182 quality SNPs in 3,327 out of 4,228 usable contigs. Summed from all the individually mining results, a total of 25,417 quality SNPs were discovered – 15,010 (59.1%) were transitions (AG and CT), 9,114 (35.9%) were transversions (AC, GT, CG, and AT), and 1,293 (5.0%) were insertion/deletions (indels). A vast majority of SNP-containing contigs consisted of only 2 haplotypes, as expected, but the percentages of 2 haplotype contigs varied widely in these citrus cultivars. BLAST of the 25,417 25-mer SNP oligos to the Clementine reference genome scaffolds revealed 2,947 SNPs had “no hits found”, 19,943 had 1 unique hit / alignment, 1,571 had one hit and 2+ alignments per hit, and 956 had 2+ hits and 1+ alignment per hit. Of the total 24,293 scaffold hits, 23,955 (98.6%) were on the main scaffolds 1 to 9, and only 338 were on 87 minor scaffolds. Most alignments had 100% (25/25) or 96% (24/25) nucleotide identities, accounting for 93% of all the alignments. Considering almost all the nucleotide discrepancies in the 24/25 alignments were at the SNP sites, it served well as in silico validation of these SNPs, in addition to and consistent with the rate (81%) validated by sequencing and SNaPshot assay. Conclusions High-quality EST-SNPs from different
Cell-free translational screening of an expression sequence tag library of Clonorchis sinensis for novel antigen discovery.

PubMed

Kasi, Devi; Catherine, Christy; Lee, Seung-Won; Lee, Kyung-Ho; Kim, Yu Jung; Ro Lee, Myeong; Ju, Jung Won; Kim, Dong-Myung

2017-05-01

The rapidly evolving cloning and sequencing technologies have enabled understanding of genomic structure of parasite genomes, opening up new ways of combatting parasite-related diseases. To make the most of the exponentially accumulating genomic data, however, it is crucial to analyze the proteins encoded by these genomic sequences. In this study, we adopted an engineered cell-free protein synthesis system for large-scale expression screening of an expression sequence tag (EST) library of Clonorchis sinensis to identify potential antigens that can be used for diagnosis and treatment of clonorchiasis. To allow high-throughput expression and identification of individual genes comprising the library, a cell-free synthesis reaction was designed such that both the template DNA and the expressed proteins were co-immobilized on the same microbeads, leading to microbead-based linkage of the genotype and phenotype. This reaction configuration allowed streamlined expression, recovery, and analysis of proteins. This approach enabled us to identify 21 antigenic proteins. © 2017 American Institute of Chemical Engineers Biotechnol. Prog., 33:832-837, 2017. © 2017 American Institute of Chemical Engineers.
Analysis of expressed sequence tags for Frankliniella occidentalis, the western flower thrips.

PubMed

Rotenberg, D; Whitfield, A E

2010-08-01

Thrips are members of the insect order Thysanoptera and Frankliniella occidentalis (the western flower thrips) is the most economically important pest within this order. F. occidentalis is both a direct pest of crops and an efficient vector of plant viruses, including Tomato spotted wilt virus (TSWV). Despite the world-wide importance of thrips in agriculture, there is little knowledge of the F. occidentalis genome or gene functions at this time. A normalized cDNA library was constructed from first instar thrips and 13 839 expressed sequence tags (ESTs) were obtained. Our EST data assembled into 894 contigs and 11 806 singletons (12 700 nonredundant sequences). We found that 31% of these sequences had significant similarity (E< or = 10(-10)) to protein sequences in the National Center for Biotechnology Information nonredundant (nr) protein database, and 25% were functionally annotated using Blast 2GO. We identified 74 sequences with putative homology to proteins associated with insect innate immunity. Sixteen sequences had significant similarity to proteins associated with small RNA-mediated gene silencing pathways (RNA interference; RNAi), including the antiviral pathway (short interfering RNA-mediated pathway). Our EST collection provides new sequence resources for characterizing gene functions in F. occidentalis and other thrips species with regards to vital biological processes, studying the mechanism of interactions with the viruses harboured and transmitted by the vector, and identifying new insect gene-centred targets for plant disease and insect control.
Generation of a total of 6483 expressed sequence tags from 60 day-old bovine whole fetus and fetal placenta.

PubMed

Oishi, M; Gohma, H; Lejukole, H Y; Taniguchi, Y; Yamada, T; Suzuki, K; Shinkai, H; Uenishi, H; Yasue, H; Sasaki, Y

2004-05-01

Expressed sequence tags (ESTs) generated based on characterization of clones isolated randomly from cDNA libraries are used to study gene expression profiles in specific tissues and to provide useful information for characterizing tissue physiology. In this study, two directionally cloned cDNA libraries were constructed from 60 day-old bovine whole fetus and fetal placenta. We have characterized 5357 and 1126 clones, and then identified 3464 and 795 unique sequences for the fetus and placenta cDNA libraries: 1851 and 504 showed homology to already identified genes, and 1613 and 291 showed no significant matches to any of the sequences in DNA databases, respectively. Further, we found 94 unique sequences overlapping in both the fetus and the placenta, leading to a catalog of 4165 genes expressed in 60 day-old fetus and placenta. The catalog is used to examine expression profile of genes in 60 day-old bovine fetus and placenta.
Cloning, analysis and functional annotation of expressed sequence tags from the Earthworm Eisenia fetida

PubMed Central

Pirooznia, Mehdi; Gong, Ping; Guan, Xin; Inouye, Laura S; Yang, Kuan; Perkins, Edward J; Deng, Youping

2007-01-01

Background Eisenia fetida, commonly known as red wiggler or compost worm, belongs to the Lumbricidae family of the Annelida phylum. Little is known about its genome sequence although it has been extensively used as a test organism in terrestrial ecotoxicology. In order to understand its gene expression response to environmental contaminants, we cloned 4032 cDNAs or expressed sequence tags (ESTs) from two E. fetida libraries enriched with genes responsive to ten ordnance related compounds using suppressive subtractive hybridization-PCR. Results A total of 3144 good quality ESTs (GenBank dbEST accession number EH669363–EH672369 and EL515444–EL515580) were obtained from the raw clone sequences after cleaning. Clustering analysis yielded 2231 unique sequences including 448 contigs (from 1361 ESTs) and 1783 singletons. Comparative genomic analysis showed that 743 or 33% of the unique sequences shared high similarity with existing genes in the GenBank nr database. Provisional function annotation assigned 830 Gene Ontology terms to 517 unique sequences based on their homology with the annotated genomes of four model organisms Drosophila melanogaster, Mus musculus, Saccharomyces cerevisiae, and Caenorhabditis elegans. Seven percent of the unique sequences were further mapped to 99 Kyoto Encyclopedia of Genes and Genomes pathways based on their matching Enzyme Commission numbers. All the information is stored and retrievable at a highly performed, web-based and user-friendly relational database called EST model database or ESTMD version 2. Conclusion The ESTMD containing the sequence and annotation information of 4032 E. fetida ESTs is publicly accessible at . PMID:18047730
Expressed sequence tag analysis of functional genes associated with adventitious rooting in Liriodendron hybrids.

PubMed

Zhong, Y D; Sun, X Y; Liu, E Y; Li, Y Q; Gao, Z; Yu, F X

2016-06-24

Liriodendron hybrids (Liriodendron chinense x L. tulipifera) are important landscaping and afforestation hardwood trees. To date, little genomic research on adventitious rooting has been reported in these hybrids, as well as in the genus Liriodendron. In the present study, we used adventitious roots to construct the first cDNA library for Liriodendron hybrids. A total of 5176 expressed sequence tags (ESTs) were generated and clustered into 2921 unigenes. Among these unigenes, 2547 had significant homology to the non-redundant protein database representing a wide variety of putative functions. Homologs of these genes regulated many aspects of adventitious rooting, including those for auxin signal transduction and root hair development. Results of quantitative real-time polymerase chain reaction showed that AUX1, IRE, and FB1 were highly expressed in adventitious roots and the expression of AUX1, ARF1, NAC1, RHD1, and IRE increased during the development of adventitious roots. Additionally, 181 simple sequence repeats were identified from 166 ESTs and more than 91.16% of these were dinucleotide and trinucleotide repeats. To the best of our knowledge, the present study reports the identification of the genes associated with adventitious rooting in the genus Liriodendron for the first time and provides a valuable resource for future genomic studies. Expression analysis of selected genes could allow us to identify regulatory genes that may be essential for adventitious rooting.
Generation and Analysis of the Expressed Sequence Tags from the Mycelium of Ganoderma lucidum

PubMed Central

Huang, Yen-Hua; Wu, Hung-Yi; Wu, Keh-Ming; Liu, Tze-Tze; Liou, Ruey-Fen; Tsai, Shih-Feng; Shiao, Ming-Shi; Ho, Low-Tone; Tzean, Shean-Shong; Yang, Ueng-Cheng

2013-01-01

Ganoderma lucidum (G. lucidum) is a medicinal mushroom renowned in East Asia for its potential biological effects. To enable a systematic exploration of the genes associated with the various phenotypes of the fungus, the genome consortium of G. lucidum has carried out an expressed sequence tag (EST) sequencing project. Using a Sanger sequencing based approach, 47,285 ESTs were obtained from in vitro cultures of G. lucidum mycelium of various durations. These ESTs were further clustered and merged into 7,774 non-redundant expressed loci. The features of these expressed contigs were explored in terms of over-representation, alternative splicing, and natural antisense transcripts. Our results provide an invaluable information resource for exploring the G. lucidum transcriptome and its regulation. Many cases of the genes over-represented in fast-growing dikaryotic mycelium are closely related to growth, such as cell wall and bioactive compound synthesis. In addition, the EST-genome alignments containing putative cassette exons and retained introns were manually curated and then used to make inferences about the predominating splice-site recognition mechanism of G. lucidum. Moreover, a number of putative antisense transcripts have been pinpointed, from which we noticed that two cases are likely to reveal hitherto undiscovered biological pathways. To allow users to access the data and the initial analysis of the results of this project, a dedicated web site has been created at http://csb2.ym.edu.tw/est/. PMID:23658685
Analysis of Expressed Sequence Tags (EST) in Date Palm.

PubMed

Al-Faifi, Sulieman A; Migdadi, Hussein M; Algamdi, Salem S; Khan, Mohammad Altaf; Al-Obeed, Rashid S; Ammar, Megahed H; Jakse, Jerenj

2017-01-01

Expressed sequence tags (EST) were generated from a normalized cDNA library of the date palm Sukkari cv. to understand the high-quality and better field performance of this well-known commercial cultivar. A total of 6943 high-quality ESTs were generated, out of them 6671 are submitted to the GenBank dbEST (LIBEST_028537). The generated ESTs were assembled into 6362 unigenes, consisting of 494 (14.4%) contigs and 5868 (84.53%) singletons. The functional annotation shows that the majority of the ESTs are associated with binding (44%), catalytic (40%), transporter (5%), and structural molecular (5%) activities. The blastx results show that 73% of unigenes are significantly similar to known plant genes and 27% are novel. The latter could be of particular interest in date palm genetic studies. Further analysis shows that some ESTs are categorized as stress/defense- and fruit development-related genes. These newly generated ESTs could significantly enhance date palm EST databases in the public domain and are available to scientists and researchers across the globe. This knowledge will facilitate the discovery of candidate genes that govern important developmental and agronomical traits in date palm. It will provide important resources for developing genetic tools, comparative genomics, and genome evolution among date palm cultivars.
Identification of single nucleotide polymorphism in ginger using expressed sequence tags

PubMed Central

Chandrasekar, Arumugam; Riju, Aikkal; Sithara, Kandiyl; Anoop, Sahadevan; Eapen, Santhosh J

2009-01-01

Ginger (Zingiber officinale Rosc) (Family: Zingiberaceae) is a herbaceous perennial, the rhizomes of which are used as a spice. Ginger is a plant which is well known for its medicinal applications. Recently EST-derived SNPs are a free by-product of the currently expanding EST (Expressed Sequence Tag) databases. The development of high-throughput methods for the detection of SNPs (Single Nucleotide Polymorphism) and small indels (insertion/deletion) has led to a revolution in their use as molecular markers. Available (38139) Ginger EST sequences were mined from dbEST of NCBI. CAP3 program was used to assemble EST sequences into contigs. Candidate SNPs and Indel polymorphisms were detected using the perl script AutoSNP version 1.0 which has used 31905 ESTs for detecting SNPs and Indel sites. We found 64026 SNP sites and 7034 indel polymorphisms with frequency of 0.84 SNPs / 100 bp. Among the three tissues from which the EST libraries had been generated, Rhizomes had high frequency of 1.08 SNPs/indels per 100 bp whereas the leaves had lowest frequency of 0.63 per 100 bp and root is showing relative frequency 0.82/100bp. Transitions and transversion ratio is 0.90. In overall detected SNP, transversion is high when compare to transition. These detected SNPs can be used as markers for genetic studies. Availability The results of the present study hosted in our webserver www.spices.res.in/spicesnip PMID:20198184
Characteristics of the Lotus japonicus gene repertoire deduced from large-scale expressed sequence tag (EST) analysis.

PubMed

Asamizu, Erika; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi

2004-02-01

To perform a comprehensive analysis of genes expressed in a model legume, Lotus japonicus, a total of 74472 3'-end expressed sequence tags (EST) were generated from cDNA libraries produced from six different organs. Clustering of sequences was performed with an identity criterion of 95% for 50 bases, and a total of 20457 non-redundant sequences, 8503 contigs and 11954 singletons were generated. EST sequence coverage was analyzed by using the annotated L. japonicus genomic sequence and 1093 of the 1889 predicted protein-encoding genes (57.9%) were hit by the EST sequence(s). Gene content was compared to several plant species. Among the 8503 contigs, 471 were identified as sequences conserved only in leguminous species and these included several disease resistance-related genes. This suggested that in legumes, these genes may have evolved specifically to resist pathogen attack. The rate of gene sequence divergence was assessed by comparing similarity level and functional category based on the Gene Ontology (GO) annotation of Arabidopsis genes. This revealed that genes encoding ribosomal proteins, as well as those related to translation, photosynthesis, and cellular structure were more abundantly represented in the highly conserved class, and that genes encoding transcription factors and receptor protein kinases were abundantly represented in the less conserved class. To make the sequence information and the cDNA clones available to the research community, a Web database with useful services was created at http://www.kazusa.or.jp/en/plant/lotus/EST/.
A High-Throughput Data Mining of Single Nucleotide Polymorphisms in Coffea Species Expressed Sequence Tags Suggests Differential Homeologous Gene Expression in the Allotetraploid Coffea arabica1[W

PubMed Central

Vidal, Ramon Oliveira; Mondego, Jorge Maurício Costa; Pot, David; Ambrósio, Alinne Batista; Andrade, Alan Carvalho; Pereira, Luiz Filipe Protasio; Colombo, Carlos Augusto; Vieira, Luiz Gonzaga Esteves; Carazzolle, Marcelo Falsarella; Pereira, Gonçalo Amarante Guimarães

2010-01-01

Polyploidization constitutes a common mode of evolution in flowering plants. This event provides the raw material for the divergence of function in homeologous genes, leading to phenotypic novelty that can contribute to the success of polyploids in nature or their selection for use in agriculture. Mounting evidence underlined the existence of homeologous expression biases in polyploid genomes; however, strategies to analyze such transcriptome regulation remained scarce. Important factors regarding homeologous expression biases remain to be explored, such as whether this phenomenon influences specific genes, how paralogs are affected by genome doubling, and what is the importance of the variability of homeologous expression bias to genotype differences. This study reports the expressed sequence tag assembly of the allopolyploid Coffea arabica and one of its direct ancestors, Coffea canephora. The assembly was used for the discovery of single nucleotide polymorphisms through the identification of high-quality discrepancies in overlapped expressed sequence tags and for gene expression information indirectly estimated by the transcript redundancy. Sequence diversity profiles were evaluated within C. arabica (Ca) and C. canephora (Cc) and used to deduce the transcript contribution of the Coffea eugenioides (Ce) ancestor. The assignment of the C. arabica haplotypes to the C. canephora (CaCc) or C. eugenioides (CaCe) ancestral genomes allowed us to analyze gene expression contributions of each subgenome in C. arabica. In silico data were validated by the quantitative polymerase chain reaction and allele-specific combination TaqMAMA-based method. The presence of differential expression of C. arabica homeologous genes and its implications in coffee gene expression, ontology, and physiology are discussed. PMID:20864545
Generation and analysis of expressed sequence tags from the bone marrow of Chinese Sika deer.

PubMed

Yao, Baojin; Zhao, Yu; Zhang, Mei; Li, Juan

2012-03-01

Sika deer is one of the best-known and highly valued animals of China. Despite its economic, cultural, and biological importance, there has not been a large-scale sequencing project for Sika deer to date. With the ultimate goal of sequencing the complete genome of this organism, we first established a bone marrow cDNA library for Sika deer and generated a total of 2,025 reads. After processing the sequences, 2,017 high-quality expressed sequence tags (ESTs) were obtained. These ESTs were assembled into 1,157 unigenes, including 238 contigs and 919 singletons. Comparative analyses indicated that 888 (76.75%) of the unigenes had significant matches to sequences in the non-redundant protein database, In addition to highly expressed genes, such as stearoyl-CoA desaturase, cytochrome c oxidase, adipocyte-type fatty acid-binding protein, adiponectin and thymosin beta-4, we also obtained vascular endothelial growth factor-A and heparin-binding growth-associated molecule, both of which are of great importance for angiogenesis research. There were 244 (21.09%) unigenes with no significant match to any sequence in current protein or nucleotide databases, and these sequences may represent genes with unknown function in Sika deer. Open reading frame analysis of the sequences was performed using the getorf program. In addition, the sequences were functionally classified using the gene ontology hierarchy, clusters of orthologous groups of proteins and Kyoto encyclopedia of genes and genomes databases. Analysis of ESTs described in this paper provides an important resource for the transcriptome exploration of Sika deer, and will also facilitate further studies on functional genomics, gene discovery and genome annotation of Sika deer.
Mining and gene ontology based annotation of SSR markers from expressed sequence tags of Humulus lupulus

PubMed Central

Singh, Swati; Gupta, Sanchita; Mani, Ashutosh; Chaturvedi, Anoop

2012-01-01

Humulus lupulus is commonly known as hops, a member of the family moraceae. Currently many projects are underway leading to the accumulation of voluminous genomic and expressed sequence tag sequences in public databases. The genetically characterized domains in these databases are limited due to non-availability of reliable molecular markers. The large data of EST sequences are available in hops. The simple sequence repeat markers extracted from EST data are used as molecular markers for genetic characterization, in the present study. 25,495 EST sequences were examined and assembled to get full-length sequences. Maximum frequency distribution was shown by mononucleotide SSR motifs i.e. 60.44% in contig and 62.16% in singleton where as minimum frequency are observed for hexanucleotide SSR in contig (0.09%) and pentanucleotide SSR in singletons (0.12%). Maximum trinucleotide motifs code for Glutamic acid (GAA) while AT/TA were the most frequent repeat of dinucleotide SSRs. Flanking primer pairs were designed in-silico for the SSR containing sequences. Functional categorization of SSRs containing sequences was done through gene ontology terms like biological process, cellular component and molecular function. PMID:22368382
Application of an E. coli signal sequence as a versatile inclusion body tag.

PubMed

Jong, Wouter S P; Vikström, David; Houben, Diane; van den Berg van Saparoea, H Bart; de Gier, Jan-Willem; Luirink, Joen

2017-03-21

Heterologous protein production in Escherichia coli often suffers from bottlenecks such as proteolytic degradation, complex purification procedures and toxicity towards the expression host. Production of proteins in an insoluble form in inclusion bodies (IBs) can alleviate these problems. Unfortunately, the propensity of heterologous proteins to form IBs is variable and difficult to predict. Hence, fusing the target protein to an aggregation prone polypeptide or IB-tag is a useful strategy to produce difficult-to-express proteins in an insoluble form. When screening for signal sequences that mediate optimal targeting of heterologous proteins to the periplasmic space of E. coli, we observed that fusion to the 39 amino acid signal sequence of E. coli TorA (ssTorA) did not promote targeting but rather directed high-level expression of the human proteins hEGF, Pla2 and IL-3 in IBs. Further analysis revealed that ssTorA even mediated IB formation of the highly soluble endogenous E. coli proteins TrxA and MBP. The ssTorA also induced aggregation when fused to the C-terminus of target proteins and appeared functional as IB-tag in E. coli K-12 as well as B strains. An additive effect on IB-formation was observed upon fusion of multiple ssTorA sequences in tandem, provoking almost complete aggregation of TrxA and MBP. The ssTorA-moiety was successfully used to produce the intrinsically unstable hEGF and the toxic fusion partner SymE, demonstrating its applicability as an IB-tag for difficult-to-express and toxic proteins. We present proof-of-concept for the use of ssTorA as a small, versatile tag for robust E. coli-based expression of heterologous proteins in IBs.

In silico Analysis of 3′-End-Processing Signals in Aspergillus oryzae Using Expressed Sequence Tags and Genomic Sequencing Data

PubMed Central

Tanaka, Mizuki; Sakai, Yoshifumi; Yamada, Osamu; Shintani, Takahiro; Gomi, Katsuya

2011-01-01

To investigate 3′-end-processing signals in Aspergillus oryzae, we created a nucleotide sequence data set of the 3′-untranslated region (3′ UTR) plus 100 nucleotides (nt) sequence downstream of the poly(A) site using A. oryzae expressed sequence tags and genomic sequencing data. This data set comprised 1065 sequences derived from 1042 unique genes. The average 3′ UTR length in A. oryzae was 241 nt, which is greater than that in yeast but similar to that in plants. The 3′ UTR and 100 nt sequence downstream of the poly(A) site is notably U-rich, while the region located 15–30 nt upstream of the poly(A) site is markedly A-rich. The most frequently found hexanucleotide in this A-rich region is AAUGAA, although this sequence accounts for only 6% of all transcripts. These data suggested that A. oryzae has no highly conserved sequence element equivalent to AAUAAA, a mammalian polyadenylation signal. We identified that putative 3′-end-processing signals in A. oryzae, while less well conserved than those in mammals, comprised four sequence elements: the furthest upstream U-rich element, A-rich sequence, cleavage site, and downstream U-rich element flanking the cleavage site. Although these putative 3′-end-processing signals are similar to those in yeast and plants, some notable differences exist between them. PMID:21586533
An Ambystoma mexicanum EST sequencing project: analysis of 17,352 expressed sequence tags from embryonic and regenerating blastema cDNA libraries

PubMed Central

Habermann, Bianca; Bebin, Anne-Gaelle; Herklotz, Stephan; Volkmer, Michael; Eckelt, Kay; Pehlke, Kerstin; Epperlein, Hans Henning; Schackert, Hans Konrad; Wiebe, Glenis; Tanaka, Elly M

2004-01-01

Background The ambystomatid salamander, Ambystoma mexicanum (axolotl), is an important model organism in evolutionary and regeneration research but relatively little sequence information has so far been available. This is a major limitation for molecular studies on caudate development, regeneration and evolution. To address this lack of sequence information we have generated an expressed sequence tag (EST) database for A. mexicanum. Results Two cDNA libraries, one made from stage 18-22 embryos and the other from day-6 regenerating tail blastemas, generated 17,352 sequences. From the sequenced ESTs, 6,377 contigs were assembled that probably represent 25% of the expressed genes in this organism. Sequence comparison revealed significant homology to entries in the NCBI non-redundant database. Further examination of this gene set revealed the presence of genes involved in important cell and developmental processes, including cell proliferation, cell differentiation and cell-cell communication. On the basis of these data, we have performed phylogenetic analysis of key cell-cycle regulators. Interestingly, while cell-cycle proteins such as the cyclin B family display expected evolutionary relationships, the cyclin-dependent kinase inhibitor 1 gene family shows an unusual evolutionary behavior among the amphibians. Conclusions Our analysis reveals the importance of a comprehensive sequence set from a representative of the Caudata and illustrates that the EST sequence database is a rich source of molecular, developmental and regeneration studies. To aid in data mining, the ESTs have been organized into an easily searchable database that is freely available online. PMID:15345051
Analysis of expressed sequence tags of the cyclically parthenogenetic rotifer Brachionus plicatilis.

PubMed

Suga, Koushirou; Welch, David Mark; Tanaka, Yukari; Sakakura, Yoshitaka; Hagiwara, Atsushi

2007-08-01

Rotifers are among the most common non-arthropod animals and are the most experimentally tractable members of the basal assemblage of metazoan phyla known as Gnathifera. The monogonont rotifer Brachionus plicatilis is a developing model system for ecotoxicology, aquatic ecology, cryptic speciation, and the evolution of sex, and is an important food source for finfish aquaculture. However, basic knowledge of the genome and transcriptome of any rotifer species has been lacking. We generated and partially sequenced a cDNA library from B. plicatilis and constructed a database of over 2300 expressed sequence tags corresponding to more than 450 transcripts. About 20% of the transcripts had no significant similarity to database sequences by BLAST; most of these contained open reading frames of significant length but few had recognized Pfam motifs. Sixteen transcripts accounted for 25% of the ESTs; four of these had no significant similarity to BLAST or Pfam databases. Putative up- and downstream untranslated regions are relatively short and AT rich. In contrast to bdelloid rotifers, there was no evidence of a conserved trans-spliced leader sequence among the transcripts and most genes were single-copy. Despite the small size of this EST project it revealed several important features of the rotifer transcriptome and of individual monogonont genes. Because there is little genomic data for Gnathifera, the transcripts we found with no known function may represent genes that are species-, class-, phylum- or even superphylum-specific; the fact that some are among the most highly expressed indicates their importance. The absence of trans-spliced leader exons in this monogonont species contrasts with their abundance in bdelloid rotifers and indicates that the presence of this phenomenon can vary at the subphylum level. Our EST database provides a relatively large quantity of transcript-level data for B. plicatilis, and more generally of rotifers and other gnathiferan phyla, and
Analysis of Expressed Sequence Tags of the Cyclically Parthenogenetic Rotifer Brachionus plicatilis

PubMed Central

Suga, Koushirou; Mark Welch, David; Tanaka, Yukari; Sakakura, Yoshitaka; Hagiwara, Atsushi

2007-01-01

Background Rotifers are among the most common non-arthropod animals and are the most experimentally tractable members of the basal assemblage of metazoan phyla known as Gnathifera. The monogonont rotifer Brachionus plicatilis is a developing model system for ecotoxicology, aquatic ecology, cryptic speciation, and the evolution of sex, and is an important food source for finfish aquaculture. However, basic knowledge of the genome and transcriptome of any rotifer species has been lacking. Methodology/Principal Findings We generated and partially sequenced a cDNA library from B. plicatilis and constructed a database of over 2300 expressed sequence tags corresponding to more than 450 transcripts. About 20% of the transcripts had no significant similarity to database sequences by BLAST; most of these contained open reading frames of significant length but few had recognized Pfam motifs. Sixteen transcripts accounted for 25% of the ESTs; four of these had no significant similarity to BLAST or Pfam databases. Putative up- and downstream untranslated regions are relatively short and AT rich. In contrast to bdelloid rotifers, there was no evidence of a conserved trans-spliced leader sequence among the transcripts and most genes were single-copy. Conclusions/Significance Despite the small size of this EST project it revealed several important features of the rotifer transcriptome and of individual monogonont genes. Because there is little genomic data for Gnathifera, the transcripts we found with no known function may represent genes that are species-, class-, phylum- or even superphylum-specific; the fact that some are among the most highly expressed indicates their importance. The absence of trans-spliced leader exons in this monogonont species contrasts with their abundance in bdelloid rotifers and indicates that the presence of this phenomenon can vary at the subphylum level. Our EST database provides a relatively large quantity of transcript-level data for B. plicatilis
Construction of a Lotus japonicus late nodulin expressed sequence tag library and identification of novel nodule-specific genes.

PubMed Central

Szczyglowski, K; Hamburger, D; Kapranov, P; de Bruijn, F J

1997-01-01

A range of novel expressed sequence tags (ESTs) associated with late developmental events during nodule organogenesis in the legume Lotus japonicus were identified using mRNA differential display; 110 differentially displayed polymerase chain reaction products were cloned and analyzed. Of 88 unique cDNAs obtained, 22 shared significant homology to DNA/protein sequences in the respective databases. This group comprises, among others, a nodule-specific homolog of protein phosphatase 2C, a peptide transporter protein, and a nodule-specific form of cytochrome P450. RNA gel-blot analysis of 16 differentially displayed ESTs confirmed their nodule-specific expression pattern. The kinetics of mRNA accumulation of the majority of the ESTs analyzed were found to resemble the expression pattern observed for the L. japonicus leghemoglobin gene. These results indicate that the newly isolated molecular markers correspond to genes induced during late developmental stages of L. japonicus nodule organogenesis and provide important, novel tools for the study of nodulation. PMID:9276951
Analysis of expressed sequence tags from a NaHCO(3)-treated alkali-tolerant plant, Chloris virgata.

PubMed

Nishiuchi, Shunsaku; Fujihara, Kazumasa; Liu, Shenkui; Takano, Tetsuo

2010-04-01

Chloris virgata Swartz (C. virgata) is a gramineous wild plant that can survive in saline-alkali areas in northeast China. To examine the tolerance mechanisms of C. virgata, we constructed a cDNA library from whole plants of C. virgata that had been treated with 100 mM NaHCO(3) for 24 h and sequenced 3168 randomly selected clones. Most (2590) of the expressed sequence tags (ESTs) showed significant similarity to sequences in the NCBI database. Of the 2590 genes, 1893 were unique. Gene Ontology (GO) Slim annotations were obtained for 1081 ESTs by BLAST2GO and it was found that 75 genes of them were annotated with GO terms "response to stress", "response to abiotic stimulus", and "response to biotic stimulus", indicating these genes were likely to function in tolerance mechanism of C. virgata. In a separate experiment, 24 genes that are known from previous studies to be associated with abiotic stress tolerance were further examined by real-time RT-PCR to see how their expressions were affected by NaHCO(3) stress. NaHCO(3) treatment up-regulated the expressions of pathogenesis-related gene (DC998527), Win1 precursor gene (DC998617), catalase gene (DC999385), ribosome inactivating protein 1 (DC999555), Na(+)/H(+) antiporter gene (DC998043), and two-component regulator gene (DC998236). Copyright 2010 Elsevier Masson SAS. All rights reserved.
Differential gene expression in the siphonophore Nanomia bijuga (Cnidaria) assessed with multiple next-generation sequencing workflows.

PubMed

Siebert, Stefan; Robinson, Mark D; Tintori, Sophia C; Goetz, Freya; Helm, Rebecca R; Smith, Stephen A; Shaner, Nathan; Haddock, Steven H D; Dunn, Casey W

2011-01-01

We investigated differential gene expression between functionally specialized feeding polyps and swimming medusae in the siphonophore Nanomia bijuga (Cnidaria) with a hybrid long-read/short-read sequencing strategy. We assembled a set of partial gene reference sequences from long-read data (Roche 454), and generated short-read sequences from replicated tissue samples that were mapped to the references to quantify expression. We collected and compared expression data with three short-read expression workflows that differ in sample preparation, sequencing technology, and mapping tools. These workflows were Illumina mRNA-Seq, which generates sequence reads from random locations along each transcript, and two tag-based approaches, SOLiD SAGE and Helicos DGE, which generate reads from particular tag sites. Differences in expression results across workflows were mostly due to the differential impact of missing data in the partial reference sequences. When all 454-derived gene reference sequences were considered, Illumina mRNA-Seq detected more than twice as many differentially expressed (DE) reference sequences as the tag-based workflows. This discrepancy was largely due to missing tag sites in the partial reference that led to false negatives in the tag-based workflows. When only the subset of reference sequences that unambiguously have tag sites was considered, we found broad congruence across workflows, and they all identified a similar set of DE sequences. Our results are promising in several regards for gene expression studies in non-model organisms. First, we demonstrate that a hybrid long-read/short-read sequencing strategy is an effective way to collect gene expression data when an annotated genome sequence is not available. Second, our replicated sampling indicates that expression profiles are highly consistent across field-collected animals in this case. Third, the impacts of partial reference sequences on the ability to detect DE can be mitigated through
Differential Gene Expression in the Siphonophore Nanomia bijuga (Cnidaria) Assessed with Multiple Next-Generation Sequencing Workflows

PubMed Central

Siebert, Stefan; Robinson, Mark D.; Tintori, Sophia C.; Goetz, Freya; Helm, Rebecca R.; Smith, Stephen A.; Shaner, Nathan; Haddock, Steven H. D.; Dunn, Casey W.

2011-01-01

We investigated differential gene expression between functionally specialized feeding polyps and swimming medusae in the siphonophore Nanomia bijuga (Cnidaria) with a hybrid long-read/short-read sequencing strategy. We assembled a set of partial gene reference sequences from long-read data (Roche 454), and generated short-read sequences from replicated tissue samples that were mapped to the references to quantify expression. We collected and compared expression data with three short-read expression workflows that differ in sample preparation, sequencing technology, and mapping tools. These workflows were Illumina mRNA-Seq, which generates sequence reads from random locations along each transcript, and two tag-based approaches, SOLiD SAGE and Helicos DGE, which generate reads from particular tag sites. Differences in expression results across workflows were mostly due to the differential impact of missing data in the partial reference sequences. When all 454-derived gene reference sequences were considered, Illumina mRNA-Seq detected more than twice as many differentially expressed (DE) reference sequences as the tag-based workflows. This discrepancy was largely due to missing tag sites in the partial reference that led to false negatives in the tag-based workflows. When only the subset of reference sequences that unambiguously have tag sites was considered, we found broad congruence across workflows, and they all identified a similar set of DE sequences. Our results are promising in several regards for gene expression studies in non-model organisms. First, we demonstrate that a hybrid long-read/short-read sequencing strategy is an effective way to collect gene expression data when an annotated genome sequence is not available. Second, our replicated sampling indicates that expression profiles are highly consistent across field-collected animals in this case. Third, the impacts of partial reference sequences on the ability to detect DE can be mitigated through
Expressed sequence tag based identification and expression analysis of some cold inducible elements in seabuckthorn (Hippophae rhamnoides L.).

PubMed

Ghangal, Rajesh; Raghuvanshi, Saurabh; Sharma, Prakash C

2012-02-01

A cDNA library was constructed from the mature leaves of seabuckthorn (Hippophae rhamnoides). Expressed Sequence Tags (ESTs) were generated by single pass sequencing of 4500 cDNA clones. We submitted 3412 ESTs to dbEST of NCBI. Clustering of these ESTs yielded 1665 unigenes comprising of 345 contigs and 1320 singletons. Out of 1665 unigenes, 1278 unigenes were annotated by similarity search while the remaining 387 unannotated unigenes were considered as organism specific. Gene Ontology (GO) analysis of the unigene dataset showed 691 unigenes related to biological processes, 727 to molecular functions and 588 to cellular component category. On the basis of similarity search and GO annotation, 43 unigenes were found responsive to biotic and abiotic stresses. To validate this observation, 13 genes that are known to be associated with cold stress tolerance from previous studies in Arabidopsis and 3 novel transcripts were examined by Real time RT-PCR to understand the change in expression pattern under cold/freeze stress. In silico study of occurrence of microsatellites in these ESTs revealed the presence of 62 Simple Sequence Repeats (SSRs), some of which are being explored to assess genetic diversity among seabuckthorn collections. This is the first report of generation of transcriptome data providing information about genes involved in managing plant abiotic stress in seabuckthorn, a plant known for its enormous medicinal and ecological value. Copyright © 2011 Elsevier Masson SAS. All rights reserved.
Expressed sequence tag analysis of guinea pig (Cavia porcellus) eye tissues for NEIBank

PubMed Central

Simpanya, Mukoma F.; Wistow, Graeme; Gao, James; David, Larry L.; Giblin, Frank J.

2008-01-01

Purpose To characterize gene expression patterns in guinea pig ocular tissues and identify orthologs of human genes from NEIBank expressed sequence tags. Methods RNA was extracted from dissected eye tissues of 2.5-month-old guinea pigs to make three unamplified and unnormalized cDNA libraries in the pCMVSport-6 vector for the lens, retina, and eye minus lens and retina. Over 4,000 clones were sequenced from each library and were analyzed using GRIST for clustering and gene identification. Lens crystallin EST data were validated using two-dimensional electrophoresis (2-DE), matrix assisted laser desorption (MALDI), and electrospray ionization mass spectrometry (ESIMS). Results Combined data from the three libraries generated a total of 6,694 distinctive gene clusters, with each library having between 1,000 and 3,000 clusters. Approximately 60% of the total gene clusters were novel cDNA sequences and had significant homologies to other mammalian sequences in GenBank. Complete cDNA sequences were obtained for many guinea pig lens proteins, including αA/αAinsert-, γN-, and γS-crystallins, lengsin and GRIFIN. The ratio of αA- to αB-crystallin on 2-DE gels was 8: 1 in the lens nucleus and 6.5: 1 in the cortex. Analysis of ESTs, genome sequence, and proteins (by MALDI), did not reveal any evidence for the presence of γD-, γE-, and γF-crystallin in the guinea pig. Predicted masses of many guinea pig lens crystallins were confirmed by ESIMS analysis. For the retina, orthologs of human phototransduction genes were found, such as Rhodopsin, S-antigen (Sag, Arrestin), and Transducin. The guinea-pig ortholog of NRL, a key rod photoreceptor-specific transcription factor, was also represented in EST data. In the ‘rest-of-eye’ library, the most abundant transcripts included decorin and keratin 12, representative of the cornea. Conclusions Genomic analysis of guinea pig eye tissues provides sequence-verified clones for future studies. Guinea pig orthologs of many human
Uneven distribution of expressed sequence tag loci on maize pachytene chromosomes

PubMed Central

Anderson, Lorinda K.; Lai, Ann; Stack, Stephen M.; Rizzon, Carene; Gaut, Brandon S.

2006-01-01

Examining the relationships among DNA sequence, meiotic recombination, and chromosome structure at a genome-wide scale has been difficult because only a few markers connect genetic linkage maps with physical maps. Here, we have positioned 1195 genetically mapped expressed sequence tag (EST) markers onto the 10 pachytene chromosomes of maize by using a newly developed resource, the RN-cM map. The RN-cM map charts the distribution of crossing over in the form of recombination nodules (RNs) along synaptonemal complexes (SCs, pachytene chromosomes) and allows genetic cM distances to be converted into physical micrometer distances on chromosomes. When this conversion is made, most of the EST markers used in the study are located distally on the chromosomes in euchromatin. ESTs are significantly clustered on chromosomes, even when only euchromatic chromosomal segments are considered. Gene density and recombination rate (as measured by EST and RN frequencies, respectively) are strongly correlated. However, crossover frequencies for telomeric intervals are much higher than was expected from their EST frequencies. For pachytene chromosomes, EST density is about fourfold higher in euchromatin compared with heterochromatin, while DNA density is 1.4 times higher in heterochromatin than in euchromatin. Based on DNA density values and the fraction of pachytene chromosome length that is euchromatic, we estimate that ∼1500 Mbp of the maize genome is in euchromatin. This overview of the organization of the maize genome will be useful in examining genome and chromosome evolution in plants. PMID:16339046
Characterizing the Grape Transcriptome. Analysis of Expressed Sequence Tags from Multiple Vitis Species and Development of a Compendium of Gene Expression during Berry Development1[w

PubMed Central

Silva, Francisco Goes da; Iandolino, Alberto; Al-Kayal, Fadi; Bohlmann, Marlene C.; Cushman, Mary Ann; Lim, Hyunju; Ergul, Ali; Figueroa, Rubi; Kabuloglu, Elif K.; Osborne, Craig; Rowe, Joan; Tattersall, Elizabeth; Leslie, Anna; Xu, Jane; Baek, JongMin; Cramer, Grant R.; Cushman, John C.; Cook, Douglas R.

2005-01-01

We report the analysis and annotation of 146,075 expressed sequence tags from Vitis species. The majority of these sequences were derived from different cultivars of Vitis vinifera, comprising an estimated 25,746 unique contig and singleton sequences that survey transcription in various tissues and developmental stages and during biotic and abiotic stress. Putatively homologous proteins were identified for over 17,752 of the transcripts, with 1,962 transcripts further subdivided into one or more Gene Ontology categories. A simple structured vocabulary, with modules for plant genotype, plant development, and stress, was developed to describe the relationship between individual expressed sequence tags and cDNA libraries; the resulting vocabulary provides query terms to facilitate data mining within the context of a relational database. As a measure of the extent to which characterized metabolic pathways were encompassed by the data set, we searched for homologs of the enzymes leading from glycolysis, through the oxidative/nonoxidative pentose phosphate pathway, and into the general phenylpropanoid pathway. Homologs were identified for 65 of these 77 enzymes, with 86% of enzymatic steps represented by paralogous genes. Differentially expressed transcripts were identified by means of a stringent believability index cutoff of ≥98.4%. Correlation analysis and two-dimensional hierarchical clustering grouped these transcripts according to similarity of expression. In the broadest analysis, 665 differentially expressed transcripts were identified across 29 cDNA libraries, representing a range of developmental and stress conditions. The groupings revealed expected associations between plant developmental stages and tissue types, with the notable exception of abiotic stress treatments. A more focused analysis of flower and berry development identified 87 differentially expressed transcripts and provides the basis for a compendium that relates gene expression and annotation
A wing expressed sequence tag resource for Bicyclus anynana butterflies, an evo-devo model

PubMed Central

Beldade, Patrícia; Rudd, Stephen; Gruber, Jonathan D; Long, Anthony D

2006-01-01

Background Butterfly wing color patterns are a key model for integrating evolutionary developmental biology and the study of adaptive morphological evolution. Yet, despite the biological, economical and educational value of butterflies they are still relatively under-represented in terms of available genomic resources. Here, we describe an Expression Sequence Tag (EST) project for Bicyclus anynana that has identified the largest available collection to date of expressed genes for any butterfly. Results By targeting cDNAs from developing wings at the stages when pattern is specified, we biased gene discovery towards genes potentially involved in pattern formation. Assembly of 9,903 ESTs from a subtracted library allowed us to identify 4,251 genes of which 2,461 were annotated based on BLAST analyses against relevant gene collections. Gene prediction software identified 2,202 peptides, of which 215 longer than 100 amino acids had no homology to any known proteins and, thus, potentially represent novel or highly diverged butterfly genes. We combined gene and Single Nucleotide Polymorphism (SNP) identification by constructing cDNA libraries from pools of outbred individuals, and by sequencing clones from the 3' end to maximize alignment depth. Alignments of multi-member contigs allowed us to identify over 14,000 putative SNPs, with 316 genes having at least one high confidence double-hit SNP. We furthermore identified 320 microsatellites in transcribed genes that can potentially be used as genetic markers. Conclusion Our project was designed to combine gene and sequence polymorphism discovery and has generated the largest gene collection available for any butterfly and many potential markers in expressed genes. These resources will be invaluable for exploring the potential of B. anynana in particular, and butterflies in general, as models in ecological, evolutionary, and developmental genetics. PMID:16737530
Simple sequence repeat marker development from bacterial artificial chromosome end sequences and expressed sequence tags of flax (Linum usitatissimum L.).

PubMed

Cloutier, Sylvie; Miranda, Evelyn; Ward, Kerry; Radovanovic, Natasa; Reimer, Elsa; Walichnowski, Andrzej; Datla, Raju; Rowland, Gordon; Duguid, Scott; Ragupathy, Raja

2012-08-01

Flax is an important oilseed crop in North America and is mostly grown as a fibre crop in Europe. As a self-pollinated diploid with a small estimated genome size of ~370 Mb, flax is well suited for fast progress in genomics. In the last few years, important genetic resources have been developed for this crop. Here, we describe the assessment and comparative analyses of 1,506 putative simple sequence repeats (SSRs) of which, 1,164 were derived from BAC-end sequences (BESs) and 342 from expressed sequence tags (ESTs). The SSRs were assessed on a panel of 16 flax accessions with 673 (58 %) and 145 (42 %) primer pairs being polymorphic in the BESs and ESTs, respectively. With 818 novel polymorphic SSR primer pairs reported in this study, the repertoire of available SSRs in flax has more than doubled from the combined total of 508 of all previous reports. Among nucleotide motifs, trinucleotides were the most abundant irrespective of the class, but dinucleotides were the most polymorphic. SSR length was also positively correlated with polymorphism. Two dinucleotide (AT/TA and AG/GA) and two trinucleotide (AAT/ATA/TAA and GAA/AGA/AAG) motifs and their iterations, different from those reported in many other crops, accounted for more than half of all the SSRs and were also more polymorphic (63.4 %) than the rest of the markers (42.7 %). This improved resource promises to be useful in genetic, quantitative trait loci (QTL) and association mapping as well as for anchoring the physical/genetic map with the whole genome shotgun reference sequence of flax.
Analysis of expressed sequence tags from the Ulva prolifera (Chlorophyta)

NASA Astrophysics Data System (ADS)

Niu, Jianfeng; Hu, Haiyan; Hu, Songnian; Wang, Guangce; Peng, Guang; Sun, Song

2010-01-01

In 2008, a green tide broke out before the sailing competition of the 29th Olympic Games in Qingdao. The causative species was determined to be Enteromorpha prolifera ( Ulva prolifera O. F. Müller), a familiar green macroalga along the coastline of China. Rapid accumulation of a large biomass of floating U. prolifera prompted research on different aspects of this species. In this study, we constructed a nonnormalized cDNA library from the thalli of U. prolifera and acquired 10 072 high-quality expressed sequence tags (ESTs). These ESTs were assembled into 3 519 nonredundant gene groups, including 1 446 clusters and 2 073 singletons. After annotation with the nr database, a large number of genes were found to be related with chloroplast and ribosomal protein, GO functional classification showed 1 418 ESTs participated in photosynthesis and 1 359 ESTs were responsible for the generation of precursor metabolites and energy. In addition, rather comprehensive carbon fixation pathways were found in U. prolifera using KEGG. Some stress-related and signal transduction-related genes were also found in this study. All the evidences displayed that U. prolifera had substance and energy foundation for the intense photosynthesis and the rapid proliferation. Phylogenetic analysis of cytochrome c oxidase subunit I revealed that this green-tide causative species is most closely affiliated to Pseudendoclonium akinetum (Ulvophyceae).
Characterization of genic microsatellite markers derived from expressed sequence tags in Pacific abalone ( Haliotis discus hannai)

NASA Astrophysics Data System (ADS)

Li, Qi; Shu, Jing; Zhao, Cui; Liu, Shikai; Kong, Lingfeng; Zheng, Xiaodong

2010-01-01

Simple sequence repeat (SSR) markers were developed from the expressed sequence tags (ESTs) of Pacific abalone ( Haliotis discus hannai). Repeat motifs were found in 4.95% of the ESTs at a frequency of one repeat every 10.04 kb of EST sequences, after redundancy elimination. Seventeen polymorphic EST-SSRs were developed. The number of alleles per locus varied from 2-17, with an average of 6.8 alleles per locus. The expected and observed heterozygosities ranged from 0.159 to 0.928 and from 0.132 to 0.922, respectively. Twelve of the 17 loci (70.6%) were successfully amplified in H. diversicolor. Seventeen loci segregated in three families, with three showing the presence of null alleles (17.6%). The adequate level of variability and low frequency of null alleles observed in H. discus hannai, together with the high rate of transportability across Haliotis species, make this set of EST-SSR markers an important tool for comparative mapping, marker-assisted selection, and evolutionary studies, not only in the Pacific abalone, but also in related species.
Sequence tagging reveals unexpected modifications in toxicoproteomics

PubMed Central

Dasari, Surendra; Chambers, Matthew C.; Codreanu, Simona G.; Liebler, Daniel C.; Collins, Ben C.; Pennington, Stephen R.; Gallagher, William M.; Tabb, David L.

2010-01-01

Toxicoproteomic samples are rich in posttranslational modifications (PTMs) of proteins. Identifying these modifications via standard database searching can incur significant performance penalties. Here we describe the latest developments in TagRecon, an algorithm that leverages inferred sequence tags to identify modified peptides in toxicoproteomic data sets. TagRecon identifies known modifications more effectively than the MyriMatch database search engine. TagRecon outperformed state of the art software in recognizing unanticipated modifications from LTQ, Orbitrap, and QTOF data sets. We developed user-friendly software for detecting persistent mass shifts from samples. We follow a three-step strategy for detecting unanticipated PTMs in samples. First, we identify the proteins present in the sample with a standard database search. Next, identified proteins are interrogated for unexpected PTMs with a sequence tag-based search. Finally, additional evidence is gathered for the detected mass shifts with a refinement search. Application of this technology on toxicoproteomic data sets revealed unintended cross-reactions between proteins and sample processing reagents. Twenty five proteins in rat liver showed signs of oxidative stress when exposed to potentially toxic drugs. These results demonstrate the value of mining toxicoproteomic data sets for modifications. PMID:21214251
Expressed sequence tags from the oomycete fish pathogen Saprolegnia parasitica reveal putative virulence factors

PubMed Central

Torto-Alalibo, Trudy; Tian, Miaoying; Gajendran, Kamal; Waugh, Mark E; van West, Pieter; Kamoun, Sophien

2005-01-01

Background The oomycete Saprolegnia parasitica is one of the most economically important fish pathogens. There is a dramatic recrudescence of Saprolegnia infections in aquaculture since the use of the toxic organic dye malachite green was banned in 2002. Little is known about the molecular mechanisms underlying pathogenicity in S. parasitica and other animal pathogenic oomycetes. In this study we used a genomics approach to gain a first insight into the transcriptome of S. parasitica. Results We generated 1510 expressed sequence tags (ESTs) from a mycelial cDNA library of S. parasitica. A total of 1279 consensus sequences corresponding to 525944 base pairs were assembled. About half of the unigenes showed similarities to known protein sequences or motifs. The S. parasitica sequences tended to be relatively divergent from Phytophthora sequences. Based on the sequence alignments of 18 conserved proteins, the average amino acid identity between S. parasitica and three Phytophthora species was 77% compared to 93% within Phytophthora. Several S. parasitica cDNAs, such as those with similarity to fungal type I cellulose binding domain proteins, PAN/Apple module proteins, glycosyl hydrolases, proteases, as well as serine and cysteine protease inhibitors, were predicted to encode secreted proteins that could function in virulence. Some of these cDNAs were more similar to fungal proteins than to other eukaryotic proteins confirming that oomycetes and fungi share some virulence components despite their evolutionary distance Conclusion We provide a first glimpse into the gene content of S. parasitica, a reemerging oomycete fish pathogen. These resources will greatly accelerate research on this important pathogen. The data is available online through the Oomycete Genomics Database [1]. PMID:16076392
Expressed sequence tag analysis of human RPE/choroid for the NEIBank Project: over 6000 non-redundant transcripts, novel genes and splice variants.

PubMed

Wistow, Graeme; Bernstein, Steven L; Wyatt, M Keith; Fariss, Robert N; Behal, Amita; Touchman, Jeffrey W; Bouffard, Gerald; Smith, Don; Peterson, Katherine

2002-06-15

The retinal pigment epithelium (RPE) and choroid comprise a functional unit of the eye that is essential to normal retinal health and function. Here we describe expressed sequence tag (EST) analysis of human RPE/choroid as part of a project for ocular bioinformatics. A cDNA library (cs) was made from human RPE/choroid and sequenced. Data were analyzed and assembled using the program GRIST (GRouping and Identification of Sequence Tags). Complete sequencing, Northern and Western blots, RH mapping, peptide antibody synthesis and immunofluorescence (IF) have been used to examine expression patterns and genome location for selected transcripts and proteins. Ten thousand individual sequence reads yield over 6300 unique gene clusters of which almost half have no matches with named genes. One of the most abundant transcripts is from a gene (named "alpha") that maps to the BBS1 region of chromosome 11. A number of tissue preferred transcripts are common to both RPE/choroid and iris. These include oculoglycan/opticin, for which an alternative splice form is detected in RPE/choroid, and "oculospanin" (Ocsp), a novel tetraspanin that maps to chromosome 17q. Antiserum to Ocsp detects expression in RPE, iris, ciliary body, and retinal ganglion cells by IF. A newly identified gene for a zinc-finger protein (TIRC) maps to 19q13.4. Variant transcripts of several genes were also detected. Most notably, the predominant form of Bestrophin represented in cs contains a longer open reading frame as a result of splice junction skipping. The unamplified cs library gives a view of the transcriptional repertoire of the adult RPE/choroid. A large number of potentially novel genes and splice forms and candidates for genetic diseases are revealed. Clones from this collection are being included in a large, nonredundant set for cDNA microarray construction.
Re-evaluating microglia expression profiles using RiboTag and cell isolation strategies.

PubMed

Haimon, Zhana; Volaski, Alon; Orthgiess, Johannes; Boura-Halfon, Sigalit; Varol, Diana; Shemer, Anat; Yona, Simon; Zuckerman, Binyamin; David, Eyal; Chappell-Maor, Louise; Bechmann, Ingo; Gericke, Martin; Ulitsky, Igor; Jung, Steffen

2018-06-01

Transcriptome profiling is widely used to infer functional states of specific cell types, as well as their responses to stimuli, to define contributions to physiology and pathophysiology. Focusing on microglia, the brain's macrophages, we report here a side-by-side comparison of classical cell-sorting-based transcriptome sequencing and the 'RiboTag' method, which avoids cell retrieval from tissue context and yields translatome sequencing information. Conventional whole-cell microglial transcriptomes were found to be significantly tainted by artifacts introduced by tissue dissociation, cargo contamination and transcripts sequestered from ribosomes. Conversely, our data highlight the added value of RiboTag profiling for assessing the lineage accuracy of Cre recombinase expression in transgenic mice. Collectively, this study indicates method-based biases, reveals observer effects and establishes RiboTag-based translatome profiling as a valuable complement to standard sorting-based profiling strategies.

Identification and validation of Asteraceae miRNAs by the expressed sequence tag analysis.

PubMed

Monavar Feshani, Aboozar; Mohammadi, Saeed; Frazier, Taylor P; Abbasi, Abbas; Abedini, Raha; Karimi Farsad, Laleh; Ehya, Farveh; Salekdeh, Ghasem Hosseini; Mardi, Mohsen

2012-02-10

MicroRNAs (miRNAs) are small non-coding RNA molecules that play a vital role in the regulation of gene expression. Despite their identification in hundreds of plant species, few miRNAs have been identified in the Asteraceae, a large family that comprises approximately one tenth of all flowering plants. In this study, we used the expressed sequence tag (EST) analysis to identify potential conserved miRNAs and their putative target genes in the Asteraceae. We applied quantitative Real-Time PCR (qRT-PCR) to confirm the expression of eight potential miRNAs in Carthamus tinctorius and Helianthus annuus. We also performed qRT-PCR analysis to investigate the differential expression pattern of five newly identified miRNAs during five different cotyledon growth stages in safflower. Using these methods, we successfully identified and characterized 151 potentially conserved miRNAs, belonging to 26 miRNA families, in 11 genus of Asteraceae. EST analysis predicted that the newly identified conserved Asteraceae miRNAs target 130 total protein-coding ESTs in sunflower and safflower, as well as 433 additional target genes in other plant species. We experimentally confirmed the existence of seven predicted miRNAs, (miR156, miR159, miR160, miR162, miR166, miR396, and miR398) in safflower and sunflower seedlings. We also observed that five out of eight miRNAs are differentially expressed during cotyledon development. Our results indicate that miRNAs may be involved in the regulation of gene expression during seed germination and the formation of the cotyledons in the Asteraceae. The findings of this study might ultimately help in the understanding of miRNA-mediated gene regulation in important crop species. Copyright © 2011 Elsevier B.V. All rights reserved.
Analysis of expressed sequence tags from the four main developmental stages of Trypanosoma congolense

PubMed Central

Helm, Jared R.; Hertz-Fowler, Christiane; Aslett, Martin; Berriman, Matthew; Sanders, Mandy; Quail, Michael A.; Soares, Marcelo B.; Bonaldo, Maria F.; Sakurai, Tatsuya; Inoue, Noboru; Donelson, John E.

2009-01-01

Trypanosoma congolense is one of the most economically important pathogens of livestock in Africa. Culture-derived parasites of each of the three main insect stages of the T. congolense life cycle, i.e., the procyclic, epimastigote and metacyclic stages, and bloodstream stage parasites isolated from infected mice, were used to construct stage-specific cDNA libraries and expressed sequence tags (ESTs or cDNA clones) in each library were sequenced. Thirteen EST clusters encoding different variant surface glycoproteins (VSGs) were detected in the metacyclic library and twenty-six VSG EST clusters were found in the bloodstream library, six of which are shared by the metacyclic library. Rare VSG ESTs are present in the epimastigote library, and none were detected in the procyclic library. ESTs encoding enzymes that catalyze oxidative phosphorylation and amino acid metabolism are about twice as abundant in the procyclic and epimastigote stages as in the metacyclic and bloodstream stages. In contrast, ESTs encoding enzymes involved in glycolysis, the citric acid cycle and nucleotide metabolism are about the same in all four developmental stages. Cysteine proteases, kinases and phosphatases are the most abundant enzyme groups represented by the ESTs. All four libraries contain T. congolense-specific expressed sequences not present in the T. brucei and T. cruzi genomes. Normalized cDNA libraries were constructed from the metacyclic and bloodstream stages, and found to be further enriched for T. congolense-specific ESTs. Given that cultured T. congolense offers an experimental advantage over other African trypanosome species, these ESTs provide a basis for further investigation of the molecular properties of these four developmental stages, especially the epimastigote and metacyclic stages for which it is difficult to obtain large quantities of organisms. The T. congolense EST databases are available at: http://www.sanger.ac.uk/Projects/T_congolense/EST_index.shtml. PMID
Expressed sequence tags (ESTs) and phylogenetic analysis of floral genes from a paleoherb species, Asarum caudigerum.

PubMed

Zhao, Yinhe; Wang, Guoying; Zhang, Jinpeng; Yang, Junbo; Peng, Shang; Gao, Lianming; Li, Chengyun; Hu, Jinyong; Li, Dezhu; Gao, Lizhi

2006-07-01

Asarum caudigerum (Aristolochiaceae) is an important species of paleoherb in relation to understanding the origin and evolution of angiosperm flowers, due to its basal position in the angiosperms. The aim of this study was to isolate floral-related genes from A. caudigerum, and to infer evolutionary relationships among florally expression-related genes, to further illustrate the origin and diversification of flowers in angiosperms. A subtracted floral cDNA library was constructed from floral buds using suppression subtractive hybridization (SSH). The cDNA of floral buds and leaves at the seedling stage were used as a tester and a driver, respectively. To further identify the function of putative MADS-box transcription factors, phylogenetic trees were reconstructed in order to infer evolutionary relationships within the MADS-box gene family. In the forward-subtracted floral cDNA library, 1920 clones were randomly sequenced, from which 567 unique expressed sequence tags (ESTs) were obtained. Among them, 127 genes failed to show significant similarity to any published sequences in GenBank and thus are putatively novel genes. Phylogenetic analysis indicated that a total of 29 MADS-box transcription factors were members of the APETALA3(AP3) subfamily, while nine others were putative MADS-box transcription factors that formed a cluster with MADS-box genes isolated from Amborella, the basal-most angiosperm, and those from the gymnosperms. This suggests that the origin of A. caudigerum is intermediate between the angiosperms and gymnosperms.
In vivo expression and purification of aptamer-tagged small RNA regulators

PubMed Central

Said, Nelly; Rieder, Renate; Hurwitz, Robert; Deckert, Jochen; Urlaub, Henning; Vogel, Jörg

2009-01-01

Small non-coding RNAs (sRNAs) are an emerging class of post-transcriptional regulators of bacterial gene expression. To study sRNAs and their potential protein interaction partners, it is desirable to purify sRNAs from cells in their native form. Here, we used RNA-based affinity chromatography to purify sRNAs following their expression as aptamer-tagged variants in vivo. To this end, we developed a family of plasmids to express sRNAs with any of three widely used aptamer sequences (MS2, boxB, eIF4A), and systematically tested how the aptamer tagging impacted on intracellular accumulation and target regulation of the Salmonella GcvB, InvR or RybB sRNAs. In addition, we successfully tagged the chromosomal rybB gene with MS2 to observe that RybB-MS2 is fully functional as an envelope stress-induced repressor of ompN mRNA following induction of sigmaE. We further demonstrate that the common sRNA-binding protein, Hfq, co-purifies with MS2-tagged sRNAs of Salmonella. The presented affinity purification strategy may facilitate the isolation of in vivo assembled sRNA–protein complexes in a wide range of bacteria. PMID:19726584
Expressed sequence tag analysis of adult human lens for the NEIBank Project: over 2000 non-redundant transcripts, novel genes and splice variants.

PubMed

Wistow, Graeme; Bernstein, Steven L; Wyatt, M Keith; Behal, Amita; Touchman, Jeffrey W; Bouffard, Gerald; Smith, Don; Peterson, Katherine

2002-06-15

To explore the expression profile of the human lens and to provide a resource for microarray studies, expressed sequence tag (EST) analysis has been performed on cDNA libraries from adult lenses. A cDNA library was constructed from two adult (40 year old) human lenses. Over two thousand clones were sequenced from the unamplified, un-normalized library. The library was then normalized and a further 2200 sequences were obtained. All the data were analyzed using GRIST (GRouping and Identification of Sequence Tags), a procedure for gene identification and clustering. The lens library (by) contains a low percentage of non-mRNA contaminants and a high fraction (over 75%) of apparently full length cDNA clones. Approximately 2000 reads from the unamplified library yields 810 clusters, potentially representing individual genes expressed in the lens. After normalization, the content of crystallins and other abundant cDNAs is markedly reduced and a similar number of reads from this library (fs) yields 1455 unique groups of which only two thirds correspond to named genes in GenBank. Among the most abundant cDNAs is one for a novel gene related to glutamine synthetase, which was designated "lengsin" (LGS). Analyses of ESTs also reveal examples of alternative transcripts, including a major alternative splice form for the lens specific membrane protein MP19. Variant forms for other transcripts, including those encoding the apoptosis inhibitor Livin and the armadillo repeat protein ARVCF, are also described. The lens cDNA libraries are a resource for gene discovery, full length cDNAs for functional studies and microarrays. The discovery of an abundant, novel transcript, lengsin, and a major novel splice form of MP19 reflect the utility of unamplified libraries constructed from dissected tissue. Many novel transcripts and splice forms are represented, some of which may be candidates for genetic diseases.
Developing expressed sequence tag libraries and the discovery of simple sequence repeat markers for two species of raspberry (Rubus L.).

PubMed

Bushakra, Jill M; Lewers, Kim S; Staton, Margaret E; Zhebentyayeva, Tetyana; Saski, Christopher A

2015-10-26

Due to a relatively high level of codominant inheritance and transferability within and among taxonomic groups, simple sequence repeat (SSR) markers are important elements in comparative mapping and delineation of genomic regions associated with traits of economic importance. Expressed sequence tags (ESTs) are a source of SSRs that can be used to develop markers to facilitate plant breeding and for more basic research across genera and higher plant orders. Leaf and meristem tissue from 'Heritage' red raspberry (Rubus idaeus) and 'Bristol' black raspberry (R. occidentalis) were utilized for RNA extraction. After conversion to cDNA and library construction, ESTs were sequenced, quality verified, assembled and scanned for SSRs. Primers flanking the SSRs were designed and a subset tested for amplification, polymorphism and transferability across species. ESTs containing SSRs were functionally annotated using the GenBank non-redundant (nr) database and further classified using the gene ontology database. To accelerate development of EST-SSRs in the genus Rubus (Rosaceae), 1149 and 2358 cDNA sequences were generated from red raspberry and black raspberry, respectively. The cDNA sequences were screened using rigorous filtering criteria which resulted in the identification of 121 and 257 SSR loci for red and black raspberry, respectively. Primers were designed from the surrounding sequences resulting in 131 and 288 primer pairs, respectively, as some sequences contained more than one SSR locus. Sequence analysis revealed that the SSR-containing genes span a diversity of functions and share more sequence identity with strawberry genes than with other Rosaceous species. This resource of Rubus-specific, gene-derived markers will facilitate the construction of linkage maps composed of transferable markers for studying and manipulating important traits in this economically important genus.
Development, characterization and cross species amplification of polymorphic microsatellite markers from expressed sequence tags of turmeric (Curcuma longa L.).

PubMed

Siju, S; Dhanya, K; Syamkumar, S; Sasikumar, B; Sheeja, T E; Bhat, A I; Parthasarathy, V A

2010-02-01

Expressed sequence tags (ESTs) from turmeric (Curcuma longa L.) were used for the screening of type and frequency of Class I (hypervariable) simple sequence repeats (SSRs). A total of 231 microsatellite repeats were detected from 12,593 EST sequences of turmeric after redundancy elimination. The average density of Class I SSRs accounts to one SSR per 17.96 kb of EST. Mononucleotides were the most abundant class of microsatellite repeat in turmeric ESTs followed by trinucleotides. A robust set of 17 polymorphic EST-SSRs were developed and used for evaluating 20 turmeric accessions. The number of alleles detected ranged from 3 to 8 per loci. The developed markers were also evaluated in 13 related species of C. longa confirming high rate (100%) of cross species transferability. The polymorphic microsatellite markers generated from this study could be used for genetic diversity analysis and resolving the taxonomic confusion prevailing in the genus.
Analysis and functional annotation of expressed sequence tags (ESTs) from multiple tissues of oil palm (Elaeis guineensis Jacq.)

PubMed Central

Ho, Chai-Ling; Kwan, Yen-Yen; Choi, Mei-Chooi; Tee, Sue-Sean; Ng, Wai-Har; Lim, Kok-Ang; Lee, Yang-Ping; Ooi, Siew-Eng; Lee, Weng-Wah; Tee, Jin-Ming; Tan, Siang-Hee; Kulaveerasingam, Harikrishna; Alwee, Sharifah Shahrul Rabiah Syed; Abdullah, Meilina Ong

2007-01-01

Background Oil palm is the second largest source of edible oil which contributes to approximately 20% of the world's production of oils and fats. In order to understand the molecular biology involved in in vitro propagation, flowering, efficient utilization of nitrogen sources and root diseases, we have initiated an expressed sequence tag (EST) analysis on oil palm. Results In this study, six cDNA libraries from oil palm zygotic embryos, suspension cells, shoot apical meristems, young flowers, mature flowers and roots, were constructed. We have generated a total of 14537 expressed sequence tags (ESTs) from these libraries, from which 6464 tentative unique contigs (TUCs) and 2129 singletons were obtained. Approximately 6008 of these tentative unique genes (TUGs) have significant matches to the non-redundant protein database, from which 2361 were assigned to one or more Gene Ontology categories. Predominant transcripts and differentially expressed genes were identified in multiple oil palm tissues. Homologues of genes involved in many aspects of flower development were also identified among the EST collection, such as CONSTANS-like, AGAMOUS-like (AGL)2, AGL20, LFY-like, SQUAMOSA, SQUAMOSA binding protein (SBP) etc. Majority of them are the first representatives in oil palm, providing opportunities to explore the cause of epigenetic homeotic flowering abnormality in oil palm, given the importance of flowering in fruit production. The transcript levels of two flowering-related genes, EgSBP and EgSEP were analysed in the flower tissues of various developmental stages. Gene homologues for enzymes involved in oil biosynthesis, utilization of nitrogen sources, and scavenging of oxygen radicals, were also uncovered among the oil palm ESTs. Conclusion The EST sequences generated will allow comparative genomic studies between oil palm and other monocotyledonous and dicotyledonous plants, development of gene-targeted markers for the reference genetic map, design and
A novel expression system for intracellular production and purification of recombinant affinity-tagged proteins in Aspergillus niger.

PubMed

Roth, Andreas H F J; Dersch, Petra

2010-03-01

A set of different integrative expression vectors for the intracellular production of recombinant proteins with or without affinity tag in Aspergillus niger was developed. Target genes can be expressed under the control of the highly efficient, constitutive pkiA promoter or the novel sucrose-inducible promoter of the beta-fructofuranosidase (sucA) gene of A. niger in the presence or absence of alternative carbon sources. All expression plasmids contain an identical multiple cloning sequence that allows parallel construction of N- or C-terminally His6- and StrepII-tagged versions of the target proteins. Production of two heterologous model proteins, the green fluorescence protein and the Thermobifida fusca hydrolase, proved the functionality of the vector system. Efficient production and easy detection of the target proteins as well as their fast purification by a one-step affinity chromatography, using the His6- or StrepII-tag sequence, was demonstrated.
Generation of expressed sequence tags for discovery of genes responsible for floral traits of Chrysanthemum morifolium by next-generation sequencing technology.

PubMed

Sasaki, Katsutomo; Mitsuda, Nobutaka; Nashima, Kenji; Kishimoto, Kyutaro; Katayose, Yuichi; Kanamori, Hiroyuki; Ohmiya, Akemi

2017-09-04

Chrysanthemum morifolium is one of the most economically valuable ornamental plants worldwide. Chrysanthemum is an allohexaploid plant with a large genome that is commercially propagated by vegetative reproduction. New cultivars with different floral traits, such as color, morphology, and scent, have been generated mainly by classical cross-breeding and mutation breeding. However, only limited genetic resources and their genome information are available for the generation of new floral traits. To obtain useful information about molecular bases for floral traits of chrysanthemums, we read expressed sequence tags (ESTs) of chrysanthemums by high-throughput sequencing using the 454 pyrosequencing technology. We constructed normalized cDNA libraries, consisting of full-length, 3'-UTR, and 5'-UTR cDNAs derived from various tissues of chrysanthemums. These libraries produced a total number of 3,772,677 high-quality reads, which were assembled into 213,204 contigs. By comparing the data obtained with those of full genome-sequenced species, we confirmed that our chrysanthemum contig set contained the majority of all expressed genes, which was sufficient for further molecular analysis in chrysanthemums. We confirmed that our chrysanthemum EST set (contigs) contained a number of contigs that encoded transcription factors and enzymes involved in pigment and aroma compound metabolism that was comparable to that of other species. This information can serve as an informative resource for identifying genes involved in various biological processes in chrysanthemums. Moreover, the findings of our study will contribute to a better understanding of the floral characteristics of chrysanthemums including the myriad cultivars at the molecular level.
Generation and analysis of expressed sequence tags from a cDNA library of the fruiting body of Ganoderma lucidum

PubMed Central

2010-01-01

Background Little genomic or trancriptomic information on Ganoderma lucidum (Lingzhi) is known. This study aims to discover the transcripts involved in secondary metabolite biosynthesis and developmental regulation of G. lucidum using an expressed sequence tag (EST) library. Methods A cDNA library was constructed from the G. lucidum fruiting body. Its high-quality ESTs were assembled into unique sequences with contigs and singletons. The unique sequences were annotated according to sequence similarities to genes or proteins available in public databases. The detection of simple sequence repeats (SSRs) was preformed by online analysis. Results A total of 1,023 clones were randomly selected from the G. lucidum library and sequenced, yielding 879 high-quality ESTs. These ESTs showed similarities to a diverse range of genes. The sequences encoding squalene epoxidase (SE) and farnesyl-diphosphate synthase (FPS) were identified in this EST collection. Several candidate genes, such as hydrophobin, MOB2, profilin and PHO84 were detected for the first time in G. lucidum. Thirteen (13) potential SSR-motif microsatellite loci were also identified. Conclusion The present study demonstrates a successful application of EST analysis in the discovery of transcripts involved in the secondary metabolite biosynthesis and the developmental regulation of G. lucidum. PMID:20230644
Identification and characterization of 43 microsatellite markers derived from expressed sequence tags of the sea cucumber ( Apostichopus japonicus)

NASA Astrophysics Data System (ADS)

Jiang, Qun; Li, Qi; Yu, Hong; Kong, Lingfeng

2011-06-01

The sea cucumber Apostichopus japonicus is a commercially and ecologically important species in China. A total of 3056 potential unigenes were generated after assembling 7597 A. japonicus expressed sequence tags (ESTs) downloaded from Gen-Bank. Two hundred and fifty microsatellite-containing ESTs (8.18%) and 299 simple sequence repeats (SSRs) were detected. The average density of SSRs was 1 per 7.403 kb of EST after redundancy elimination. Di-nucleotide repeat motifs appeared to be the most abundant type with a percentage of 69.90%. Of the 126 primer pairs designed, 90 amplified the expected products and 43 showed polymorphism in 30 individuals tested. The number of alleles per locus ranged from 2 to 26 with an average of 7.0 alleles, and the observed and expected heterozygosities varied from 0.067 to 1.000 and from 0.066 to 0.959, respectively. These new EST-derived microsatellite markers would provide sufficient polymorphism for population genetic studies and genome mapping of this sea cucumber species.
Analysis of expressed sequence tags from a single wheat cultivar facilitates interpretation of tandem mass spectrometry data and discrimination of gamma gliadin proteins that may play different functional roles in flour

USDA-ARS?s Scientific Manuscript database

The complement of gamma gliadin genes expressed in the wheat cultivar Butte 86 was evaluated by analyzing publicly available expressed sequence tag (EST) data. Eleven contigs were assembled from 153 Butte 86 ESTs. Nine of the contigs encoded full-length proteins and four of the proteins contained an...
TagDust2: a generic method to extract reads from sequencing data.

PubMed

Lassmann, Timo

2015-01-28

Arguably the most basic step in the analysis of next generation sequencing data (NGS) involves the extraction of mappable reads from the raw reads produced by sequencing instruments. The presence of barcodes, adaptors and artifacts subject to sequencing errors makes this step non-trivial. Here I present TagDust2, a generic approach utilizing a library of hidden Markov models (HMM) to accurately extract reads from a wide array of possible read architectures. TagDust2 extracts more reads of higher quality compared to other approaches. Processing of multiplexed single, paired end and libraries containing unique molecular identifiers is fully supported. Two additional post processing steps are included to exclude known contaminants and filter out low complexity sequences. Finally, TagDust2 can automatically detect the library type of sequenced data from a predefined selection. Taken together TagDust2 is a feature rich, flexible and adaptive solution to go from raw to mappable NGS reads in a single step. The ability to recognize and record the contents of raw reads will help to automate and demystify the initial, and often poorly documented, steps in NGS data analysis pipelines. TagDust2 is freely available at: http://tagdust.sourceforge.net .
Bioinformatics and reanalysis of subtracted expressed sequence tags from the human ciliary body: Identification of novel biological functions.

PubMed

Escribano, Julio; Coca-Prados, Miguel

2002-08-28

The ciliary body is largely known for its major roles in the regulation of aqueous humor secretion, intraocular pressure, and accommodation of the lens. In this review article we applied bioinformatics to re-examine hundreds of expressed sequence tags (ESTs) previously isolated by subtractive hybridization from a human ciliary body library [1]. The DNA sequences of these clones have been recently added to the web site of NEIBank. DNA sequence comparisons of subtracted ESTs were performed against all entries in the last available release of the non-redundant database containing GenBank, EMBL, DDBJ and PDB sequences using the BlastN program accessed through NCBI's BLAST services on the internet (NCBI). Sequences were also compared and mapped using the Blast search program provided through the Internet by the Human Genome Project (UCSC). A total number of 284 independent ESTs were classified in 17 functional groups. Analysis of their relationships allowed to define the expression of five major groups of known genes: (i) protein synthesis, folding, secretion and degradation (20%); (ii) energy supply and biosynthesis (12%); (iii) contractility and cytoskeleton structure (6%); (iv) cellular signaling and cell cycle regulation (7%); and (v) nerve cell related tasks (2%), including neuropeptide processing and putative non-visual phototransduction and circadian rhythm control. The largest group contain unidentified sequences, a total of 105 sequences, accounting for 37% of ESTs. The unidentified sequences show similarity to genomic non-coding regions, or genes of unknown function. The most highly represented EST, correspond to myocilin, a gene involved in glaucoma. The data also confirms the secretory functions of the ciliary epithelium, and its high metabolism; the presence of a neuroendocrine peptidergic system presumably involved in the regulation of the intraocular pressure and/or aqueous humor secretion. Additional genes may be related to a non-visual phototransduction
Porcine transcriptome analysis based on 97 non-normalized cDNA libraries and assembly of 1,021,891 expressed sequence tags

PubMed Central

Gorodkin, Jan; Cirera, Susanna; Hedegaard, Jakob; Gilchrist, Michael J; Panitz, Frank; Jørgensen, Claus; Scheibye-Knudsen, Karsten; Arvin, Troels; Lumholdt, Steen; Sawera, Milena; Green, Trine; Nielsen, Bente J; Havgaard, Jakob H; Rosenkilde, Carina; Wang, Jun; Li, Heng; Li, Ruiqiang; Liu, Bin; Hu, Songnian; Dong, Wei; Li, Wei; Yu, Jun; Wang, Jian; Stærfeldt, Hans-Henrik; Wernersson, Rasmus; Madsen, Lone B; Thomsen, Bo; Hornshøj, Henrik; Bujie, Zhan; Wang, Xuegang; Wang, Xuefei; Bolund, Lars; Brunak, Søren; Yang, Huanming; Bendixen, Christian; Fredholm, Merete

2007-01-01

Background Knowledge of the structure of gene expression is essential for mammalian transcriptomics research. We analyzed a collection of more than one million porcine expressed sequence tags (ESTs), of which two-thirds were generated in the Sino-Danish Pig Genome Project and one-third are from public databases. The Sino-Danish ESTs were generated from one normalized and 97 non-normalized cDNA libraries representing 35 different tissues and three developmental stages. Results Using the Distiller package, the ESTs were assembled to roughly 48,000 contigs and 73,000 singletons, of which approximately 25% have a high confidence match to UniProt. Approximately 6,000 new porcine gene clusters were identified. Expression analysis based on the non-normalized libraries resulted in the following findings. The distribution of cluster sizes is scaling invariant. Brain and testes are among the tissues with the greatest number of different expressed genes, whereas tissues with more specialized function, such as developing liver, have fewer expressed genes. There are at least 65 high confidence housekeeping gene candidates and 876 cDNA library-specific gene candidates. We identified differential expression of genes between different tissues, in particular brain/spinal cord, and found patterns of correlation between genes that share expression in pairs of libraries. Finally, there was remarkable agreement in expression between specialized tissues according to Gene Ontology categories. Conclusion This EST collection, the largest to date in pig, represents an essential resource for annotation, comparative genomics, assembly of the pig genome sequence, and further porcine transcription studies. PMID:17407547
Identification of Anhydrobiosis-related Genes from an Expressed Sequence Tag Database in the Cryptobiotic Midge Polypedilum vanderplanki (Diptera; Chironomidae)*

PubMed Central

Cornette, Richard; Kanamori, Yasushi; Watanabe, Masahiko; Nakahara, Yuichi; Gusev, Oleg; Mitsumasu, Kanako; Kadono-Okuda, Keiko; Shimomura, Michihiko; Mita, Kazuei; Kikawada, Takahiro; Okuda, Takashi

2010-01-01

Some organisms are able to survive the loss of almost all their body water content, entering a latent state known as anhydrobiosis. The sleeping chironomid (Polypedilum vanderplanki) lives in the semi-arid regions of Africa, and its larvae can survive desiccation in an anhydrobiotic form during the dry season. To unveil the molecular mechanisms of this resistance to desiccation, an anhydrobiosis-related Expressed Sequence Tag (EST) database was obtained from the sequences of three cDNA libraries constructed from P. vanderplanki larvae after 0, 12, and 36 h of desiccation. The database contained 15,056 ESTs distributed into 4,807 UniGene clusters. ESTs were classified according to gene ontology categories, and putative expression patterns were deduced for all clusters on the basis of the number of clones in each library; expression patterns were confirmed by real-time PCR for selected genes. Among up-regulated genes, antioxidants, late embryogenesis abundant (LEA) proteins, and heat shock proteins (Hsps) were identified as important groups for anhydrobiosis. Genes related to trehalose metabolism and various transporters were also strongly induced by desiccation. Those results suggest that the oxidative stress response plays a central role in successful anhydrobiosis. Similarly, protein denaturation and aggregation may be prevented by marked up-regulation of Hsps and the anhydrobiosis-specific LEA proteins. A third major feature is the predicted increase in trehalose synthesis and in the expression of various transporter proteins allowing the distribution of trehalose and other solutes to all tissues. PMID:20833722
Population structure of pigs determined by single nucleotide polymorphisms observed in assembled expressed sequence tags.

PubMed

Matsumoto, Toshimi; Okumura, Naohiko; Uenishi, Hirohide; Hayashi, Takeshi; Hamasima, Noriyuki; Awata, Takashi

2012-01-01

We have collected more than 190000 porcine expressed sequence tags (ESTs) from full-length complementary DNA (cDNA) libraries and identified more than 2800 single nucleotide polymorphisms (SNPs). In this study, we tentatively chose 222 SNPs observed in assembled ESTs to study pigs of different breeds; 104 were selected by comparing the cDNA sequences of a Meishan pig and samples of three-way cross pigs (Landrace, Large White, and Duroc: LWD), and 118 were selected from LWD samples. To evaluate the genetic variation between the chosen SNPs from pig breeds, we determined the genotypes for 192 pig samples (11 pig groups) from our DNA reference panel with matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Of the 222 reference SNPs, 186 were successfully genotyped. A neighbor-joining tree showed that the pig groups were classified into two large clusters, namely, Euro-American and East Asian pig populations. F-statistics and the analysis of molecular variance of Euro-American pig groups revealed that approximately 25% of the genetic variations occurred because of intergroup differences. As the F(IS) values were less than the F(ST) values(,) the clustering, based on the Bayesian inference, implied that there was strong genetic differentiation among pig groups and less divergence within the groups in our samples. © 2011 The Authors. Animal Science Journal © 2011 Japanese Society of Animal Science.
Generation and Analysis of Expressed Sequence Tags from Olea europaea L.

PubMed Central

Ozdemir Ozgenturk, Nehir; Oruç, Fatma; Sezerman, Ugur; Kuçukural, Alper; Vural Korkut, Senay; Toksoz, Feriha; Un, Cemal

2010-01-01

Olive (Olea europaea L.) is an important source of edible oil which was originated in Near-East region. In this study, two cDNA libraries were constructed from young olive leaves and immature olive fruits for generation of ESTs to discover the novel genes and search the function of unknown genes of olive. The randomly selected 3840 colonies were sequenced for EST collection from both libraries. Readable 2228 sequences for olive leaf and 1506 sequences for olive fruit were assembled into 205 and 69 contigs, respectively, whereas 2478 were singletons. Putative functions of all 2752 differentially expressed unique sequences were designated by gene homology based on BLAST and annotated using BLAST2GO. While 1339 ESTs show no homology to the database, 2024 ESTs have homology (under 80%) with hypothetical proteins, putative proteins, expressed proteins, and unknown proteins in NCBI-GenBank. 635 EST's unique genes sequence have been identified by over 80% homology to known function in other species which were not previously described in Olea family. Only 3.1% of total EST's was shown similarity with olive database existing in NCBI. This generated EST's data and consensus sequences were submitted to NCBI as valuable source for functional genome studies of olive. PMID:21197085
Generation and analysis of expressed sequence tags in the extreme large genomes Lilium and Tulipa

PubMed Central

2012-01-01

Background Bulbous flowers such as lily and tulip (Liliaceae family) are monocot perennial herbs that are economically very important ornamental plants worldwide. However, there are hardly any genetic studies performed and genomic resources are lacking. To build genomic resources and develop tools to speed up the breeding in both crops, next generation sequencing was implemented. We sequenced and assembled transcriptomes of four lily and five tulip genotypes using 454 pyro-sequencing technology. Results Successfully, we developed the first set of 81,791 contigs with an average length of 514 bp for tulip, and enriched the very limited number of 3,329 available ESTs (Expressed Sequence Tags) for lily with 52,172 contigs with an average length of 555 bp. The contigs together with singletons covered on average 37% of lily and 39% of tulip estimated transcriptome. Mining lily and tulip sequence data for SSRs (Simple Sequence Repeats) showed that di-nucleotide repeats were twice more abundant in UTRs (UnTranslated Regions) compared to coding regions, while tri-nucleotide repeats were equally spread over coding and UTR regions. Two sets of single nucleotide polymorphism (SNP) markers suitable for high throughput genotyping were developed. In the first set, no SNPs flanking the target SNP (50 bp on either side) were allowed. In the second set, one SNP in the flanking regions was allowed, which resulted in a 2 to 3 fold increase in SNP marker numbers compared with the first set. Orthologous groups between the two flower bulbs: lily and tulip (12,017 groups) and among the three monocot species: lily, tulip, and rice (6,900 groups) were determined using OrthoMCL. Orthologous groups were screened for common SNP markers and EST-SSRs to study synteny between lily and tulip, which resulted in 113 common SNP markers and 292 common EST-SSR. Lily and tulip contigs generated were annotated and described according to Gene Ontology terminology. Conclusions Two transcriptome sets

Generation and analysis of expressed sequence tags in the extreme large genomes Lilium and Tulipa.

PubMed

Shahin, Arwa; van Kaauwen, Martijn; Esselink, Danny; Bargsten, Joachim W; van Tuyl, Jaap M; Visser, Richard G F; Arens, Paul

2012-11-20

Bulbous flowers such as lily and tulip (Liliaceae family) are monocot perennial herbs that are economically very important ornamental plants worldwide. However, there are hardly any genetic studies performed and genomic resources are lacking. To build genomic resources and develop tools to speed up the breeding in both crops, next generation sequencing was implemented. We sequenced and assembled transcriptomes of four lily and five tulip genotypes using 454 pyro-sequencing technology. Successfully, we developed the first set of 81,791 contigs with an average length of 514 bp for tulip, and enriched the very limited number of 3,329 available ESTs (Expressed Sequence Tags) for lily with 52,172 contigs with an average length of 555 bp. The contigs together with singletons covered on average 37% of lily and 39% of tulip estimated transcriptome. Mining lily and tulip sequence data for SSRs (Simple Sequence Repeats) showed that di-nucleotide repeats were twice more abundant in UTRs (UnTranslated Regions) compared to coding regions, while tri-nucleotide repeats were equally spread over coding and UTR regions. Two sets of single nucleotide polymorphism (SNP) markers suitable for high throughput genotyping were developed. In the first set, no SNPs flanking the target SNP (50 bp on either side) were allowed. In the second set, one SNP in the flanking regions was allowed, which resulted in a 2 to 3 fold increase in SNP marker numbers compared with the first set. Orthologous groups between the two flower bulbs: lily and tulip (12,017 groups) and among the three monocot species: lily, tulip, and rice (6,900 groups) were determined using OrthoMCL. Orthologous groups were screened for common SNP markers and EST-SSRs to study synteny between lily and tulip, which resulted in 113 common SNP markers and 292 common EST-SSR. Lily and tulip contigs generated were annotated and described according to Gene Ontology terminology. Two transcriptome sets were built that are valuable
Expressed Sequence Tag Analysis of the Human Pathogen Paracoccidioides brasiliensis Yeast Phase: Identification of Putative Homologues of Candida albicans Virulence and Pathogenicity Genes

PubMed Central

Goldman, Gustavo H.; dos Reis Marques, Everaldo; Custódio Duarte Ribeiro, Diógenes; Ângelo de Souza Bernardes, Luciano; Quiapin, Andréa Carla; Vitorelli, Patrícia Marostica; Savoldi, Marcela; Semighini, Camile P.; de Oliveira, Regina C.; Nunes, Luiz R.; Travassos, Luiz R.; Puccia, Rosana; Batista, Wagner L.; Ferreira, Leslie Ecker; Moreira, Júlio C.; Bogossian, Ana Paula; Tekaia, Fredj; Nobrega, Marina Pasetto; Nobrega, Francisco G.; Goldman, Maria Helena S.

2003-01-01

Paracoccidioides brasiliensis, a thermodimorphic fungus, is the causative agent of the prevalent systemic mycosis in Latin America, paracoccidioidomycosis. We present here a survey of expressed genes in the yeast pathogenic phase of P. brasiliensis. We obtained 13,490 expressed sequence tags from both 5′ and 3′ ends. Clustering analysis yielded the partial sequences of 4,692 expressed genes that were functionally classified by similarity to known genes. We have identified several Candida albicans virulence and pathogenicity homologues in P. brasiliensis. Furthermore, we have analyzed the expression of some of these genes during the dimorphic yeast-mycelium-yeast transition by real-time quantitative reverse transcription-PCR. Clustering analysis of the mycelium-yeast transition revealed three groups: (i) RBT, hydrophobin, and isocitrate lyase; (ii) malate dehydrogenase, contigs Pb1067 and Pb1145, GPI, and alternative oxidase; and (iii) ubiquitin, delta-9-desaturase, HSP70, HSP82, and HSP104. The first two groups displayed high mRNA expression in the mycelial phase, whereas the third group showed higher mRNA expression in the yeast phase. Our results suggest the possible conservation of pathogenicity and virulence mechanisms among fungi, expand considerably gene identification in P. brasiliensis, and provide a broader basis for further progress in understanding its biological peculiarities. PMID:12582121
Diversity analysis in Cannabis sativa based on large-scale development of expressed sequence tag-derived simple sequence repeat markers.

PubMed

Gao, Chunsheng; Xin, Pengfei; Cheng, Chaohua; Tang, Qing; Chen, Ping; Wang, Changbiao; Zang, Gonggu; Zhao, Lining

2014-01-01

Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SSRs were identified from 3,624 ESTs. From there, a total of 3,442 complementary primer pairs were designed as SSR markers. Among these markers, trinucleotide repeat motifs (50.99%) were the most abundant, followed by hexanucleotide (25.13%), dinucleotide (16.34%), tetranucloetide (3.8%), and pentanucleotide (3.74%) repeat motifs, respectively. The AAG/CTT trinucleotide repeat (17.96%) was the most abundant motif detected in the SSRs. One hundred and seventeen EST-SSR markers were randomly selected to evaluate primer quality in 24 cannabis varieties. Among these 117 markers, 108 (92.31%) were successfully amplified and 87 (74.36%) were polymorphic. Forty-five polymorphic primer pairs were selected to evaluate genetic diversity and relatedness among the 115 cannabis genotypes. The results showed that 115 varieties could be divided into 4 groups primarily based on geography: Northern China, Europe, Central China, and Southern China. Moreover, the coefficient of similarity when comparing cannabis from Northern China with the European group cannabis was higher than that when comparing with cannabis from the other two groups, owing to a similar climate. This study outlines the first large-scale development of SSR markers for cannabis. These data may serve as a foundation for the development of genetic linkage, quantitative trait loci mapping, and marker-assisted breeding of cannabis.
Diversity Analysis in Cannabis sativa Based on Large-Scale Development of Expressed Sequence Tag-Derived Simple Sequence Repeat Markers

PubMed Central

Cheng, Chaohua; Tang, Qing; Chen, Ping; Wang, Changbiao; Zang, Gonggu; Zhao, Lining

2014-01-01

Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SSRs were identified from 3,624 ESTs. From there, a total of 3,442 complementary primer pairs were designed as SSR markers. Among these markers, trinucleotide repeat motifs (50.99%) were the most abundant, followed by hexanucleotide (25.13%), dinucleotide (16.34%), tetranucloetide (3.8%), and pentanucleotide (3.74%) repeat motifs, respectively. The AAG/CTT trinucleotide repeat (17.96%) was the most abundant motif detected in the SSRs. One hundred and seventeen EST-SSR markers were randomly selected to evaluate primer quality in 24 cannabis varieties. Among these 117 markers, 108 (92.31%) were successfully amplified and 87 (74.36%) were polymorphic. Forty-five polymorphic primer pairs were selected to evaluate genetic diversity and relatedness among the 115 cannabis genotypes. The results showed that 115 varieties could be divided into 4 groups primarily based on geography: Northern China, Europe, Central China, and Southern China. Moreover, the coefficient of similarity when comparing cannabis from Northern China with the European group cannabis was higher than that when comparing with cannabis from the other two groups, owing to a similar climate. This study outlines the first large-scale development of SSR markers for cannabis. These data may serve as a foundation for the development of genetic linkage, quantitative trait loci mapping, and marker-assisted breeding of cannabis. PMID:25329551
Pilot survey of expressed sequence tags (ESTs) from the asexual blood stages of Plasmodium vivax in human patients.

PubMed

Merino, Emilio F; Fernandez-Becerra, Carmen; Madeira, Alda M B N; Machado, Ariane L; Durham, Alan; Gruber, Arthur; Hall, Neil; del Portillo, Hernando A

2003-07-21

Plasmodium vivax is the most widely distributed human malaria, responsible for 70-80 million clinical cases each year and large socio-economical burdens for countries such as Brazil where it is the most prevalent species. Unfortunately, due to the impossibility of growing this parasite in continuous in vitro culture, research on P. vivax remains largely neglected. A pilot survey of expressed sequence tags (ESTs) from the asexual blood stages of P. vivax was performed. To do so, 1,184 clones from a cDNA library constructed with parasites obtained from 10 different human patients in the Brazilian Amazon were sequenced. Sequences were automatedly processed to remove contaminants and low quality reads. A total of 806 sequences with an average length of 586 bp met such criteria and their clustering revealed 666 distinct events. The consensus sequence of each cluster and the unique sequences of the singlets were used in similarity searches against different databases that included P. vivax, Plasmodium falciparum, Plasmodium yoelii, Plasmodium knowlesi, Apicomplexa and the GenBank non-redundant database. An E-value of <10(-30) was used to define a significant database match. ESTs were manually assigned a gene ontology (GO) terminology A total of 769 ESTs could be assigned a putative identity based upon sequence similarity to known proteins in GenBank. Moreover, 292 ESTs were annotated and a GO terminology was assigned to 164 of them. These are the first ESTs reported for P. vivax and, as such, they represent a valuable resource to assist in the annotation of the P. vivax genome currently being sequenced. Moreover, since the GC-content of the P. vivax genome is strikingly different from that of P. falciparum, these ESTs will help in the validation of gene predictions for P. vivax and to create a gene index of this malaria parasite.
Expressed sequence tag (EST) analysis of two subspecies of Metarhizium anisopliae reveals a plethora of secreted proteins with potential activity in insect hosts.

PubMed

Freimoser, Florian M; Screen, Steven; Bagga, Savita; Hu, Gang; St Leger, Raymond J

2003-01-01

Expressed sequence tag (EST) libraries for Metarhizium anisopliae, the causative agent of green muscardine disease, were developed from the broad host-range pathogen Metarhizium anisopliae sf. anisopliae and the specific grasshopper pathogen, M. anisopliae sf. acridum. Approximately 1,700 5' end sequences from each subspecies were generated from cDNA libraries representing fungi grown under conditions that maximize secretion of cuticle-degrading enzymes. Both subspecies had ESTs for virtually all pathogenicity-related genes cloned to date from M. anisopliae, but many novel genes encoding potential virulence factors were also tagged. Enzymes with potential targets in the insect host included proteases, chitinases, phospholipases, lipases, esterases, phosphatases and enzymes producing toxic secondary metabolites. A diverse array of proteases composed 36 % of all M. anisopliae sf. anisopliae ESTs. Eighty percent of the ESTs that could be clustered into functional groups had significant matches (E<10(-5)) in other ascomycete fungi. These included genes reported to have specific roles in pathogens with plant or vertebrate hosts. Many of the remaining ESTs had their best BLAST match among animal, plant and bacterial sequences. These include genes with plant and microbial counterparts that produce potent antimicrobials. The abundance of transcripts discovered for different functional groups varied between the two subspecies of M. anisopliae in a manner consistent with ecological adaptations of the two pathogens. By hastening gene discovery this project has enhanced development of improved mycoinsecticides. In addition, the M. anisopliae ESTs represent a significant contribution to the extensive database of sequences from ascomycetes that are saprophytes or plant and vertebrate pathogens. Comparative analyses of these sequences is providing important information about the biology and evolutionary history of this clade.
The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome

PubMed Central

Camargo, Anamaria A.; Samaia, Helena P. B.; Dias-Neto, Emmanuel; Simão, Daniel F.; Migotto, Italo A.; Briones, Marcelo R. S.; Costa, Fernando F.; Aparecida Nagai, Maria; Verjovski-Almeida, Sergio; Zago, Marco A.; Andrade, Luis Eduardo C.; Carrer, Helaine; El-Dorry, Hamza F. A.; Espreafico, Enilza M.; Habr-Gama, Angelita; Giannella-Neto, Daniel; Goldman, Gustavo H.; Gruber, Arthur; Hackel, Christine; Kimura, Edna T.; Maciel, Rui M. B.; Marie, Suely K. N.; Martins, Elizabeth A. L.; Nóbrega, Marina P.; Paçó-Larson, Maria Luisa; Pardini, Maria Inês M. C.; Pereira, Gonçalo G.; Pesquero, João Bosco; Rodrigues, Vanderlei; Rogatto, Silvia R.; da Silva, Ismael D. C. G.; Sogayar, Mari C.; Sonati, Maria de Fátima; Tajara, Eloiza H.; Valentini, Sandro R.; Alberto, Fernando L.; Amaral, Maria Elisabete J.; Aneas, Ivy; Arnaldi, Liliane A. T.; de Assis, Angela M.; Bengtson, Mário Henrique; Bergamo, Nadia Aparecida; Bombonato, Vanessa; de Camargo, Maria E. R.; Canevari, Renata A.; Carraro, Dirce M.; Cerutti, Janete M.; Corrêa, Maria Lucia C.; Corrêa, Rosana F. R.; Costa, Maria Cristina R.; Curcio, Cyntia; Hokama, Paula O. M.; Ferreira, Ari J. S.; Furuzawa, Gilberto K.; Gushiken, Tsieko; Ho, Paulo L.; Kimura, Elza; Krieger, José E.; Leite, Luciana C. C.; Majumder, Paromita; Marins, Mozart; Marques, Everaldo R.; Melo, Analy S. A.; Melo, Monica; Mestriner, Carlos Alberto; Miracca, Elisabete C.; Miranda, Daniela C.; Nascimento, Ana Lucia T. O.; Nóbrega, Francisco G.; Ojopi, Élida P. B.; Pandolfi, José Rodrigo C.; Pessoa, Luciana G.; Prevedel, Aline C.; Rahal, Paula; Rainho, Claudia A.; Reis, Eduardo M. R.; Ribeiro, Marcelo L.; da Rós, Nancy; de Sá, Renata G.; Sales, Magaly M.; Sant'anna, Simone Cristina; dos Santos, Mariana L.; da Silva, Aline M.; da Silva, Neusa P.; Silva, Wilson A.; da Silveira, Rosana A.; Sousa, Josane F.; Stecconi, Daniella; Tsukumo, Fernando; Valente, Valéria; Soares, Fernando; Moreira, Eloisa S.; Nunes, Diana N.; Correa, Ricardo G.; Zalcberg, Heloisa; Carvalho, Alex F.; Reis, Luis F. L.; Brentani, Ricardo R.; Simpson, Andrew J. G.; de Souza, Sandro J.

2001-01-01

Open reading frame expressed sequences tags (ORESTES) differ from conventional ESTs by providing sequence data from the central protein coding portion of transcripts. We generated a total of 696,745 ORESTES sequences from 24 human tissues and used a subset of the data that correspond to a set of 15,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most thoroughly sequenced tissue, the breast, the 130,000 ORESTES generated are derived from transcripts from an estimated 70% of all genes expressed in that tissue, with an equally efficient representation of both highly and poorly expressed genes. In this respect, we find that the capacity of the ORESTES strategy both for gene discovery and shotgun transcript sequence generation significantly exceeds that of conventional ESTs. The distribution of ORESTES is such that many human transcripts are now represented by a scaffold of partial sequences distributed along the length of each gene product. The experimental joining of the scaffold components, by reverse transcription–PCR, represents a direct route to transcript finishing that may represent a useful alternative to full-length cDNA cloning. PMID:11593022
The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome.

PubMed

Camargo, A A; Samaia, H P; Dias-Neto, E; Simão, D F; Migotto, I A; Briones, M R; Costa, F F; Nagai, M A; Verjovski-Almeida, S; Zago, M A; Andrade, L E; Carrer, H; El-Dorry, H F; Espreafico, E M; Habr-Gama, A; Giannella-Neto, D; Goldman, G H; Gruber, A; Hackel, C; Kimura, E T; Maciel, R M; Marie, S K; Martins, E A; Nobrega, M P; Paco-Larson, M L; Pardini, M I; Pereira, G G; Pesquero, J B; Rodrigues, V; Rogatto, S R; da Silva, I D; Sogayar, M C; Sonati, M F; Tajara, E H; Valentini, S R; Alberto, F L; Amaral, M E; Aneas, I; Arnaldi, L A; de Assis, A M; Bengtson, M H; Bergamo, N A; Bombonato, V; de Camargo, M E; Canevari, R A; Carraro, D M; Cerutti, J M; Correa, M L; Correa, R F; Costa, M C; Curcio, C; Hokama, P O; Ferreira, A J; Furuzawa, G K; Gushiken, T; Ho, P L; Kimura, E; Krieger, J E; Leite, L C; Majumder, P; Marins, M; Marques, E R; Melo, A S; Melo, M B; Mestriner, C A; Miracca, E C; Miranda, D C; Nascimento, A L; Nobrega, F G; Ojopi, E P; Pandolfi, J R; Pessoa, L G; Prevedel, A C; Rahal, P; Rainho, C A; Reis, E M; Ribeiro, M L; da Ros, N; de Sa, R G; Sales, M M; Sant'anna, S C; dos Santos, M L; da Silva, A M; da Silva, N P; Silva, W A; da Silveira, R A; Sousa, J F; Stecconi, D; Tsukumo, F; Valente, V; Soares, F; Moreira, E S; Nunes, D N; Correa, R G; Zalcberg, H; Carvalho, A F; Reis, L F; Brentani, R R; Simpson, A J; de Souza, S J; Melo, M

2001-10-09

Open reading frame expressed sequences tags (ORESTES) differ from conventional ESTs by providing sequence data from the central protein coding portion of transcripts. We generated a total of 696,745 ORESTES sequences from 24 human tissues and used a subset of the data that correspond to a set of 15,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most thoroughly sequenced tissue, the breast, the 130,000 ORESTES generated are derived from transcripts from an estimated 70% of all genes expressed in that tissue, with an equally efficient representation of both highly and poorly expressed genes. In this respect, we find that the capacity of the ORESTES strategy both for gene discovery and shotgun transcript sequence generation significantly exceeds that of conventional ESTs. The distribution of ORESTES is such that many human transcripts are now represented by a scaffold of partial sequences distributed along the length of each gene product. The experimental joining of the scaffold components, by reverse transcription-PCR, represents a direct route to transcript finishing that may represent a useful alternative to full-length cDNA cloning.
Expressed sequence tag analysis of adult human optic nerve for NEIBank: Identification of cell type and tissue markers

PubMed Central

Bernstein, Steven L; Guo, Yan; Peterson, Katherine; Wistow, Graeme

2009-01-01

Background The optic nerve is a pure white matter central nervous system (CNS) tract with an isolated blood supply, and is widely used in physiological studies of white matter response to various insults. We examined the gene expression profile of human optic nerve (ON) and, through the NEIBANK online resource, to provide a resource of sequenced verified cDNA clones. An un-normalized cDNA library was constructed from pooled human ON tissues and was used in expressed sequence tag (EST) analysis. Location of an abundant oligodendrocyte marker was examined by immunofluorescence. Quantitative real time polymerase chain reaction (qRT-PCR) and Western analysis were used to compare levels of expression for key calcium channel protein genes and protein product in primate and rodent ON. Results Our analyses revealed a profile similar in many respects to other white matter related tissues, but significantly different from previously available ON cDNA libraries. The previous libraries were found to include specific markers for other eye tissues, suggesting contamination. Immune/inflammatory markers were abundant in the new ON library. The oligodendrocyte marker QKI was abundant at the EST level. Immunofluorescence revealed that this protein is a useful oligodendrocyte cell-type marker in rodent and primate ONs. L-type calcium channel EST abundance was found to be particularly low. A qRT-PCR-based comparative mammalian species analysis reveals that L-type calcium channel expression levels are significantly lower in primate than in rodent ON, which may help account for the class-specific difference in responsiveness to calcium channel blocking agents. Several known eye disease genes are abundantly expressed in ON. Many genes associated with normal axonal function, mRNAs associated with axonal transport, inflammation and neuroprotection are observed. Conclusion We conclude that the new cDNA library is a faithful representation of human ON and EST data provide an initial overview
Characterization of expressed sequence tag-derived simple sequence repeat markers for Aspergillus flavus: emphasis on variability of isolates from the southern United States.

PubMed

Wang, Xinwang; Wadl, Phillip A; Wood-Jones, Alicia; Windham, Gary; Trigiano, Robert N; Scruggs, Mary; Pilgrim, Candace; Baird, Richard

2012-12-01

Simple sequence repeat (SSR) markers were developed from Aspergillus flavus expressed sequence tag (EST) database to conduct an analysis of genetic relationships of Aspergillus isolates from numerous host species and geographical regions, but primarily from the United States. Twenty-nine primers were designed from 362 tri-nucleotide EST-SSR sequences. Eighteen polymorphic loci were used to genotype 96 Aspergillus species isolates. The number of alleles detected per locus ranged from 2 to 24 with a mean of 8.2 alleles. Haploid diversity ranged from 0.28 to 0.91. Genetic distance matrix was used to perform principal coordinates analysis (PCA) and to generate dendrograms using unweighted pair group method with arithmetic mean (UPGMA). Two principal coordinates explained more than 75 % of the total variation among the isolates. One clade was identified for A. flavus isolates (n = 87) with the other Aspergillus species (n = 7) using PCA, but five distinct clusters were present when the others taxa were excluded from the analysis. Six groups were noted when the EST-SSR data were compared using UPGMA. However, the latter PCA or UPGMA comparison resulted in no direct associations with host species, geographical region or aflatoxin production. Furthermore, there was no direct correlation to visible morphological features such as sclerotial types. The isolates from Mississippi Delta region, which contained the largest percentage of isolates, did not show any unusual clustering except for isolates K32, K55, and 199. Further studies of these three isolates are warranted to evaluate their pathogenicity, aflatoxin production potential, additional gene sequences (e.g., RPB2), and morphological comparisons.
Expressed sequence tag (EST) analysis of the pine wood nematode Bursaphelenchus xylophilus and B. mucronatus.

PubMed

Kikuchi, Taisei; Aikawa, Takuya; Kosaka, Hajime; Pritchard, Leighton; Ogura, Nobuo; Jones, John T

2007-09-01

Most Bursaphelenchus species feed on fungi that colonise dead or dying trees. However, Bursaphelenchus xylophilus is unique in that in addition to feeding on fungi it has the capacity to be a parasite of live pine trees. We present an analysis of over 13,000 expressed sequence tags (ESTs) from B. xylophilus and, by way of contrast, over 3000 ESTs from a closely related species that does not parasitise plants as readily; B. mucronatus. Four libraries from B. xylophilus, from a variety of life stages including fungal feeding nematodes, nematodes extracted from plants and dauer-like stage nematodes, and one library from B. mucronatus were constructed and used to generate ESTs. Contig analysis showed that the 13,327 B. xylophilus ESTs could be grouped into 2110 contigs and 4377 singletons giving a total of 6487 identified genes. Similarly the 3193 B. mucronatus ESTs yielded a total of 2219 identified genes from 425 contigs and 1794 singletons. A variety of proteins potentially important in the parasitic process of B. xylophilus and B. mucronatus, including plant and fungal cell wall degrading enzymes and a novel gene potentially encoding a expansin-like protein that may disrupt non-covalent bonds in the plant cell wall were identified in the libraries. Additionally several gene candidates potentially involved in dauer entry or maintenance were also identified in the EST dataset. The EST sequences from this study will provide a solid base for future research on the biology, pathogenicity and evolutionary history of this nematode group.
Generation of non-genomic oligonucleotide tag sequences for RNA template-specific PCR

PubMed Central

Pinto, Fernando Lopes; Svensson, Håkan; Lindblad, Peter

2006-01-01

Background In order to overcome genomic DNA contamination in transcriptional studies, reverse template-specific polymerase chain reaction, a modification of reverse transcriptase polymerase chain reaction, is used. The possibility of using tags whose sequences are not found in the genome further improves reverse specific polymerase chain reaction experiments. Given the absence of software available to produce genome suitable tags, a simple tool to fulfill such need was developed. Results The program was developed in Perl, with separate use of the basic local alignment search tool, making the tool platform independent (known to run on Windows XP and Linux). In order to test the performance of the generated tags, several molecular experiments were performed. The results show that Tagenerator is capable of generating tags with good priming properties, which will deliberately not result in PCR amplification of genomic DNA. Conclusion The program Tagenerator is capable of generating tag sequences that combine genome absence with good priming properties for RT-PCR based experiments, circumventing the effects of genomic DNA contamination in an RNA sample. PMID:16820068
Discovery and mapping of a new expressed sequence tag-single nucleotide polymorphism and simple sequence repeat panel for large-scale genetic studies and breeding of Theobroma cacao L.

PubMed Central

Allegre, Mathilde; Argout, Xavier; Boccara, Michel; Fouet, Olivier; Roguet, Yolande; Bérard, Aurélie; Thévenin, Jean Marc; Chauveau, Aurélie; Rivallan, Ronan; Clement, Didier; Courtois, Brigitte; Gramacho, Karina; Boland-Augé, Anne; Tahi, Mathias; Umaharan, Pathmanathan; Brunel, Dominique; Lanaud, Claire

2012-01-01

Theobroma cacao is an economically important tree of several tropical countries. Its genetic improvement is essential to provide protection against major diseases and improve chocolate quality. We discovered and mapped new expressed sequence tag-single nucleotide polymorphism (EST-SNP) and simple sequence repeat (SSR) markers and constructed a high-density genetic map. By screening 149 650 ESTs, 5246 SNPs were detected in silico, of which 1536 corresponded to genes with a putative function, while 851 had a clear polymorphic pattern across a collection of genetic resources. In addition, 409 new SSR markers were detected on the Criollo genome. Lastly, 681 new EST-SNPs and 163 new SSRs were added to the pre-existing 418 co-dominant markers to construct a large consensus genetic map. This high-density map and the set of new genetic markers identified in this study are a milestone in cocoa genomics and for marker-assisted breeding. The data are available at http://tropgenedb.cirad.fr. PMID:22210604
Anchoring 9,371 Maize Expressed Sequence Tagged Unigenes to the Bacterial Artificial Chromosome Contig Map by Two-Dimensional Overgo Hybridization1

PubMed Central

Gardiner, Jack; Schroeder, Steven; Polacco, Mary L.; Sanchez-Villeda, Hector; Fang, Zhiwei; Morgante, Michele; Landewe, Tim; Fengler, Kevin; Useche, Francisco; Hanafey, Michael; Tingey, Scott; Chou, Hugh; Wing, Rod; Soderlund, Carol; Coe, Edward H.

2004-01-01

Our goal is to construct a robust physical map for maize (Zea mays) comprehensively integrated with the genetic map. We have used a two-dimensional 24 × 24 overgo pooling strategy to anchor maize expressed sequence tagged (EST) unigenes to 165,888 bacterial artificial chromosomes (BACs) on high-density filters. A set of 70,716 public maize ESTs seeded derivation of 10,723 EST unigene assemblies. From these assemblies, 10,642 overgo sequences of 40 bp were applied as hybridization probes. BAC addresses were obtained for 9,371 overgo probes, representing an 88% success rate. More than 96% of the successful overgo probes identified two or more BACs, while 5% identified more than 50 BACs. The majority of BACs identified (79%) were hybridized with one or two overgos. A small number of BACs hybridized with eight or more overgos, suggesting that these BACs must be gene rich. Approximately 5,670 overgos identified BACs assembled within one contig, indicating that these probes are highly locus specific. A total of 1,795 megabases (Mb; 87%) of the total 2,050 Mb in BAC contigs were associated with one or more overgos, which are serving as sequence-tagged sites for single nucleotide polymorphism development. Overgo density ranged from less than one overgo per megabase to greater than 20 overgos per megabase. The majority of contigs (52%) hit by overgos contained three to nine overgos per megabase. Analysis of approximately 1,022 Mb of genetically anchored BAC contigs indicates that 9,003 of the total 13,900 overgo-contig sites are genetically anchored. Our results indicate overgos are a powerful approach for generating gene-specific hybridization probes that are facilitating the assembly of an integrated genetic and physical map for maize. PMID:15020742
Expression and purification of recombinant proteins in Escherichia coli tagged with the metal-binding protein CusF.

PubMed

Cantu-Bustos, J Enrique; Vargas-Cortez, Teresa; Morones-Ramirez, Jose Ruben; Balderas-Renteria, Isaias; Galbraith, David W; McEvoy, Megan M; Zarate, Xristo

2016-05-01

Production of recombinant proteins in Escherichia coli has been improved considerably through the use of fusion proteins, because they increase protein solubility and facilitate purification via affinity chromatography. In this article, we propose the use of CusF as a new fusion partner for expression and purification of recombinant proteins in E. coli. Using a cell-free protein expression system, based on the E. coli S30 extract, Green Fluorescent Protein (GFP) was expressed with a series of different N-terminal tags, immobilized on self-assembled protein microarrays, and its fluorescence quantified. GFP tagged with CusF showed the highest fluorescence intensity, and this was greater than the intensities from corresponding GFP constructs that contained MBP or GST tags. Analysis of protein production in vivo showed that CusF produces large amounts of soluble protein with low levels of inclusion bodies. Furthermore, fusion proteins can be exported to the cellular periplasm, if CusF contains the signal sequence. Taking advantage of its ability to bind copper ions, recombinant proteins can be purified with readily available IMAC resins charged with this metal ion, producing pure proteins after purification and tag removal. We therefore recommend the use of CusF as a viable alternative to MBP or GST as a fusion protein/affinity tag for the production of soluble recombinant proteins in E. coli. Copyright © 2016 Elsevier Inc. All rights reserved.
The First Molecular Identification of an Olive Collection Applying Standard Simple Sequence Repeats and Novel Expressed Sequence Tag Markers.

PubMed

Mousavi, Soraya; Mariotti, Roberto; Regni, Luca; Nasini, Luigi; Bufacchi, Marina; Pandolfi, Saverio; Baldoni, Luciana; Proietti, Primo

2017-01-01

Germplasm collections of tree crop species represent fundamental tools for conservation of diversity and key steps for its characterization and evaluation. For the olive tree, several collections were created all over the world, but only few of them have been fully characterized and molecularly identified. The olive collection of Perugia University (UNIPG), established in the years' 60, represents one of the first attempts to gather and safeguard olive diversity, keeping together cultivars from different countries. In the present study, a set of 370 olive trees previously uncharacterized was screened with 10 standard simple sequence repeats (SSRs) and nine new EST-SSR markers, to correctly and thoroughly identify all genotypes, verify their representativeness of the entire cultivated olive variation, and validate the effectiveness of new markers in comparison to standard genotyping tools. The SSR analysis revealed the presence of 59 genotypes, corresponding to 72 well known cultivars, 13 of them resulting exclusively present in this collection. The new EST-SSRs have shown values of diversity parameters quite similar to those of best standard SSRs. When compared to hundreds of Mediterranean cultivars, the UNIPG olive accessions were splitted into the three main populations (East, Center and West Mediterranean), confirming that the collection has a good representativeness of the entire olive variability. Furthermore, Bayesian analysis, performed on the 59 genotypes of the collection by the use of both sets of markers, have demonstrated their splitting into four clusters, with a well balanced membership obtained by EST respect to standard SSRs. The new OLEST ( Olea expressed sequence tags) SSR markers resulted as effective as the best standard markers. The information obtained from this study represents a high valuable tool for ex situ conservation and management of olive genetic resources, useful to build a common database from worldwide olive cultivar collections
Real-time single-molecule electronic DNA sequencing by synthesis using polymer-tagged nucleotides on a nanopore array

PubMed Central

Fuller, Carl W.; Kumar, Shiv; Porel, Mintu; Chien, Minchen; Bibillo, Arek; Stranges, P. Benjamin; Dorwart, Michael; Tao, Chuanjuan; Li, Zengmin; Guo, Wenjing; Shi, Shundi; Korenblum, Daniel; Trans, Andrew; Aguirre, Anne; Liu, Edward; Harada, Eric T.; Pollard, James; Bhat, Ashwini; Cech, Cynthia; Yang, Alexander; Arnold, Cleoma; Palla, Mirkó; Hovis, Jennifer; Chen, Roger; Morozova, Irina; Kalachikov, Sergey; Russo, James J.; Kasianowicz, John J.; Davis, Randy; Roever, Stefan; Church, George M.; Ju, Jingyue

2016-01-01

DNA sequencing by synthesis (SBS) offers a robust platform to decipher nucleic acid sequences. Recently, we reported a single-molecule nanopore-based SBS strategy that accurately distinguishes four bases by electronically detecting and differentiating four different polymer tags attached to the 5′-phosphate of the nucleotides during their incorporation into a growing DNA strand catalyzed by DNA polymerase. Further developing this approach, we report here the use of nucleotides tagged at the terminal phosphate with oligonucleotide-based polymers to perform nanopore SBS on an α-hemolysin nanopore array platform. We designed and synthesized several polymer-tagged nucleotides using tags that produce different electrical current blockade levels and verified they are active substrates for DNA polymerase. A highly processive DNA polymerase was conjugated to the nanopore, and the conjugates were complexed with primer/template DNA and inserted into lipid bilayers over individually addressable electrodes of the nanopore chip. When an incoming complementary-tagged nucleotide forms a tight ternary complex with the primer/template and polymerase, the tag enters the pore, and the current blockade level is measured. The levels displayed by the four nucleotides tagged with four different polymers captured in the nanopore in such ternary complexes were clearly distinguishable and sequence-specific, enabling continuous sequence determination during the polymerase reaction. Thus, real-time single-molecule electronic DNA sequencing data with single-base resolution were obtained. The use of these polymer-tagged nucleotides, combined with polymerase tethering to nanopores and multiplexed nanopore sensors, should lead to new high-throughput sequencing methods. PMID:27091962
An efficient tag derived from the common epitope of tospoviral NSs proteins for monitoring recombinant proteins expressed in both bacterial and plant systems.

PubMed

Cheng, Hao-Wen; Chen, Kuan-Chun; Raja, Joseph A J; Li, Jian-Xian; Yeh, Shyi-Dong

2013-04-15

NSscon (23 aa), a common epitope in the gene silencing suppressor NSs proteins of the members of the Watermelon silver mottle virus (WSMoV) serogroup, was previously identified. In this investigation, we expressed different green fluorescent protein (GFP)-fused deletions of NSscon in bacteria and reacted with NSscon monoclonal antibody (MAb). Our results indicated that the core 9 amino acids, "(109)KFTMHNQIF(117)", denoted as "nss", retain the reactivity of NSscon. In bacterial pET system, four different recombinant proteins labeled with nss, either at N- or C-extremes, were readily detectable without position effects, with sensitivity superior to that for the polyhistidine-tag. When the nss-tagged Zucchini yellow mosaic virus (ZYMV) helper component-protease (HC-Pro) and WSMoV nucleocapsid protein were transiently expressed by agroinfiltration in tobacco, they were readily detectable and the tag's possible efficacy for gene silencing suppression was not noticed. Co-immunoprecipitation of nss-tagged and non-tagged proteins expressed from bacteria confirmed the interaction of potyviral HC-Pro and coat protein. Thus, we conclude that this novel nss sequence is highly valuable for tagging recombinant proteins in both bacterial and plant expression systems. Copyright © 2013 Elsevier B.V. All rights reserved.
Characterization of constitutive and putative differentially expressed mRNAs by means of expressed sequence tags, differential display reverse transcriptase-PCR and randomly amplified polymorphic DNA-PCR from the sand fly vector Lutzomyia longipalpis.

PubMed

Ramalho-Ortigão, J M; Temporal, P; de Oliveira , S M; Barbosa, A F; Vilela, M L; Rangel, E F; Brazil, R P; Traub-Cseko, Y M

2001-01-01

Molecular studies of insect disease vectors are of paramount importance for understanding parasite-vector relationship. Advances in this area have led to important findings regarding changes in vectors' physiology upon blood feeding and parasite infection. Mechanisms for interfering with the vectorial capacity of insects responsible for the transmission of diseases such as malaria, Chagas disease and dengue fever are being devised with the ultimate goal of developing transgenic insects. A primary necessity for this goal is information on gene expression and control in the target insect. Our group is investigating molecular aspects of the interaction between Leishmania parasites and Lutzomyia sand flies. As an initial step in our studies we have used random sequencing of cDNA clones from two expression libraries made from head/thorax and abdomen of sugar fed L. longipalpis for the identification of expressed sequence tags (EST). We applied differential display reverse transcriptase-PCR and randomly amplified polymorphic DNA-PCR to characterize differentially expressed mRNA from sugar and blood fed insects, and, in one case, from a L. (V.) braziliensis-infected L. longipalpis. We identified 37 cDNAs that have shown homology to known sequences from GeneBank. Of these, 32 cDNAs code for constitutive proteins such as zinc finger protein, glutamine synthetase, G binding protein, ubiquitin conjugating enzyme. Three are putative differentially expressed cDNAs from blood fed and Leishmania-infected midgut, a chitinase, a V-ATPase and a MAP kinase. Finally, two sequences are homologous to Drosophila melanogaster gene products recently discovered through the Drosophila genome initiative.
Peptides derivatized with bicyclic quaternary ammonium ionization tags. Sequencing via tandem mass spectrometry.

PubMed

Setner, Bartosz; Rudowska, Magdalena; Klem, Ewelina; Cebrat, Marek; Szewczuk, Zbigniew

2014-10-01

Improving the sensitivity of detection and fragmentation of peptides to provide reliable sequencing of peptides is an important goal of mass spectrometric analysis. Peptides derivatized by bicyclic quaternary ammonium ionization tags: 1-azabicyclo[2.2.2]octane (ABCO) or 1,4-diazabicyclo[2.2.2]octane (DABCO), are characterized by an increased detection sensitivity in electrospray ionization mass spectrometry (ESI-MS) and longer retention times on the reverse-phase (RP) chromatography columns. The improvement of the detection limit was observed even for peptides dissolved in 10 mM NaCl. Collision-induced dissociation tandem mass spectrometry of quaternary ammonium salts derivatives of peptides showed dominant a- and b-type ions, allowing facile sequencing of peptides. The bicyclic ionization tags are stable in collision-induced dissociation experiments, and the resulted fragmentation pattern is not significantly influenced by either acidic or basic amino acid residues in the peptide sequence. Obtained results indicate the general usefulness of the bicyclic quaternary ammonium ionization tags for ESI-MS/MS sequencing of peptides. Copyright © 2014 John Wiley & Sons, Ltd.

Chromosome-specific physical localisation of expressed sequence tag loci in Corchorus olitorius L.

PubMed

Joshi, A; Das, S K; Samanta, P; Paria, P; Sen, S K; Basu, A

2014-11-01

Jute (Corchorus spp.), as a natural fibre-producing species, ranks next only to cotton. Inadequate understanding of its genetic architecture is a major lacuna for genetic improvement of this crop in terms of yield and quality. Establishment of a physical map provides a genomic tool that helps in positional cloning of valuable genes. In this report, an attempt was initiated to study association and localisation of single copy expressed sequence tag (EST) loci in the genome of Corchorus olitorius. The chromosome-specific association of EST was determined based on the appearance of an extra signal for a single copy cDNA probe in mitotic interphase nuclei of specific trisomic(s) for fluorescence in situ hybridisation, and validated using a cDNA fragment of the 26S rRNA gene (600 bp) as molecular probe. The probe exhibited three signals in meiotic interphase nuclei of trisomic 5, instead of two as observed in diploids and other trisomics, indicating its association with chromosome 5. Subsequent hybridisation of the same probe on the pachytene chromosomes of diploids confirmed that 26S rRNA occupies the terminal end of the short arm of chromosome 5 in C. olitorius. Subsequently, chromosome-specific association of 63 single copy EST and their physical localisation were determined on chromosomes 2, 4, 5 and 7. The study describes chromosome-specific physical localisation of genes in jute. The approach used here could be a step towards construction of genome-wide physical maps for any recalcitrant plant species like jute. © 2014 German Botanical Society and The Royal Botanical Society of the Netherlands.
A large scale analysis of cDNA in Arabidopsis thaliana: generation of 12,028 non-redundant expressed sequence tags from normalized and size-selected cDNA libraries.

PubMed

Asamizu, E; Nakamura, Y; Sato, S; Tabata, S

2000-06-30

For comprehensive analysis of genes expressed in the model dicotyledonous plant, Arabidopsis thaliana, expressed sequence tags (ESTs) were accumulated. Normalized and size-selected cDNA libraries were constructed from aboveground organs, flower buds, roots, green siliques and liquid-cultured seedlings, respectively, and a total of 14,026 5'-end ESTs and 39,207 3'-end ESTs were obtained. The 3'-end ESTs could be clustered into 12,028 non-redundant groups. Similarity search of the non-redundant ESTs against the public non-redundant protein database indicated that 4816 groups show similarity to genes of known function, 1864 to hypothetical genes, and the remaining 5348 are novel sequences. Gene coverage by the non-redundant ESTs was analyzed using the annotated genomic sequences of approximately 10 Mb on chromosomes 3 and 5. A total of 923 regions were hit by at least one EST, among which only 499 regions were hit by the ESTs deposited in the public database. The result indicates that the EST source generated in this project complements the EST data in the public database and facilitates new gene discovery.
Sequencing, Analysis, and Annotation of Expressed Sequence Tags for Camelus dromedarius

PubMed Central

Al-Swailem, Abdulaziz M.; Shehata, Maher M.; Abu-Duhier, Faisel M.; Al-Yamani, Essam J.; Al-Busadah, Khalid A.; Al-Arawi, Mohammed S.; Al-Khider, Ali Y.; Al-Muhaimeed, Abdullah N.; Al-Qahtani, Fahad H.; Manee, Manee M.; Al-Shomrani, Badr M.; Al-Qhtani, Saad M.; Al-Harthi, Amer S.; Akdemir, Kadir C.; Otu, Hasan H.

2010-01-01

Despite its economical, cultural, and biological importance, there has not been a large scale sequencing project to date for Camelus dromedarius. With the goal of sequencing complete DNA of the organism, we first established and sequenced camel EST libraries, generating 70,272 reads. Following trimming, chimera check, repeat masking, cluster and assembly, we obtained 23,602 putative gene sequences, out of which over 4,500 potentially novel or fast evolving gene sequences do not carry any homology to other available genomes. Functional annotation of sequences with similarities in nucleotide and protein databases has been obtained using Gene Ontology classification. Comparison to available full length cDNA sequences and Open Reading Frame (ORF) analysis of camel sequences that exhibit homology to known genes show more than 80% of the contigs with an ORF>300 bp and ∼40% hits extending to the start codons of full length cDNAs suggesting successful characterization of camel genes. Similarity analyses are done separately for different organisms including human, mouse, bovine, and rat. Accompanying web portal, CAGBASE (http://camel.kacst.edu.sa/), hosts a relational database containing annotated EST sequences and analysis tools with possibility to add sequences from public domain. We anticipate our results to provide a home base for genomic studies of camel and other comparative studies enabling a starting point for whole genome sequencing of the organism. PMID:20502665
Generation, Annotation, and Analysis of a Large-Scale Expressed Sequence Tag Library from Arabidopsis pumila to Explore Salt-Responsive Genes.

PubMed

Huang, Xianzhong; Yang, Lifei; Jin, Yuhuan; Lin, Jun; Liu, Fang

2017-01-01

Arabidopsis pumila is an ephemeral plant, and a close relative of the model plant Arabidopsis thaliana , but it possesses higher photosynthetic efficiency, higher propagation rate, and higher salinity tolerance compared to those A. thaliana , thus providing a candidate plant system for gene mining for environmental adaption and salt tolerance. However, A. pumila is an under-explored resource for understanding the genetic mechanisms underlying abiotic stress adaptation. To improve our understanding of the molecular and genetic mechanisms of salt stress adaptation, more than 19,900 clones randomly selected from a cDNA library constructed previously from leaf tissue exposed to high-salinity shock were sequenced. A total of 16,014 high-quality expressed sequence tags (ESTs) were generated, which have been deposited in the dbEST GenBank under accession numbers JZ932319 to JZ948332. Clustering and assembly of these ESTs resulted in the identification of 8,835 unique sequences, consisting of 2,469 contigs and 6,366 singletons. The blastx results revealed 8,011 unigenes with significant similarity to known genes, while only 425 unigenes remained uncharacterized. Functional classification demonstrated an abundance of unigenes involved in binding, catalytic, structural or transporter activities, and in pathways of energy, carbohydrate, amino acid, or lipid metabolism. At least seven main classes of genes were related to salt-tolerance among the 8,835 unigenes. Many previously reported salt tolerance genes were also manifested in this library, for example VP1, H + -ATPase, NHX1, SOS2, SOS3, NAC, MYB, ERF, LEA, P5CS1 . In addition, 251 transcription factors were identified from the library, classified into 42 families. Lastly, changes in expression of the 12 most abundant unigenes, 12 transcription factor genes, and 19 stress-related genes in the first 24 h of exposure to high-salinity stress conditions were monitored by qRT-PCR. The large-scale EST library obtained in this
N-terminal SKIK peptide tag markedly improves expression of difficult-to-express proteins in Escherichia coli and Saccharomyces cerevisiae.

PubMed

Ojima-Kato, Teruyo; Nagai, Satomi; Nakano, Hideo

2017-05-01

Despite advances in microbial protein expression systems, low production of proteins remains a great concern for some genes. Here we report that the insertion of a short peptide tag, consisting of Ser-Lys-Ile-Lys (SKIK), adjacent to the start codon of genes encoding difficult-to-express proteins can increase protein expression in Escherichia coli and Saccharomyces cerevisiae. Protein expression levels of a mouse monoclonal antibody (mAb), rabbit mAbs obtained from clonal B cells, and an artificially designed peptide were significantly increased simply by the addition of the SKIK tag in E. coli systems. In particular, a ∼30-fold increase in protein production was observed for the mouse mAb, and the artificially designed peptide band became detectable in sodium dodecyl sulfate-poly acrylamide gel electrophoresis after coomassie brilliant blue staining or western blotting on adding the SKIK tag. The tag also increased the expression of tagged proteins in S. cerevisiae and an E. coli cell-free protein synthesis system. Although the mechanism of high protein expression on addition of the tag is unclear, our findings offer great benefits to biotechnology research and industry. Copyright © 2016 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
Identification and functional characterization of effectors in expressed sequence tags from various life cycle stages of the potato cyst nematode Globodera pallida.

PubMed

Jones, John T; Kumar, Amar; Pylypenko, Liliya A; Thirugnanasambandam, Amarnath; Castelli, Lydia; Chapman, Sean; Cock, Peter J A; Grenier, Eric; Lilley, Catherine J; Phillips, Mark S; Blok, Vivian C

2009-11-01

In this article, we describe the analysis of over 9000 expressed sequence tags (ESTs) from cDNA libraries obtained from various life cycle stages of Globodera pallida. We have identified over 50 G. pallida effectors from this dataset using bioinformatics analysis, by screening clones in order to identify secreted proteins up-regulated after the onset of parasitism and using in situ hybridization to confirm the expression in pharyngeal gland cells. A substantial gene family encoding G. pallida SPRYSEC proteins has been identified. The expression of these genes is restricted to the dorsal pharyngeal gland cell. Different members of the SPRYSEC family of proteins from G. pallida show different subcellular localization patterns in plants, with some localized to the cytoplasm and others to the nucleus and nucleolus. Differences in subcellular localization may reflect diverse functional roles for each individual protein or, more likely, variety in the compartmentalization of plant proteins targeted by the nematode. Our data are therefore consistent with the suggestion that the SPRYSEC proteins suppress host defences, as suggested previously, and that they achieve this through interaction with a range of host targets.
Comprehensive Genetic Database of Expressed Sequence Tags for Coccolithophorids

NASA Astrophysics Data System (ADS)

Ranji, Mohammad; Hadaegh, Ahmad R.

Coccolithophorids are unicellular, marine, golden-brown, single-celled algae (Haptophyta) commonly found in near-surface waters in patchy distributions. They belong to the Phytoplankton family that is known to be responsible for much of the earth reproduction. Phytoplankton, just like plants live based on the energy obtained by Photosynthesis which produces oxygen. Substantial amount of oxygen in the earth's atmosphere is produced by Phytoplankton through Photosynthesis. The single-celled Emiliana Huxleyi is the most commonly known specie of Coccolithophorids and is known for extracting bicarbonate (HCO3) from its environment and producing calcium carbonate to form Coccoliths. Coccolithophorids are one of the world's primary producers, contributing about 15% of the average oceanic phytoplankton biomass to the oceans. They produce elaborate, minute calcite platelets (Coccoliths), covering the cell to form a Coccosphere and supplying up to 60% of the bulk pelagic calcite deposited on the sea floors. In order to understand the genetics of Coccolithophorid and the complexities of their biochemical reactions, we decided to build a database to store a complete profile of these organisms' genomes. Although a variety of such databases currently exist, (http://www.geneservice.co.uk/home/) none have yet been developed to comprehensively address the sequencing efforts underway by the Coccolithophorid research community. This database is called CocooExpress and is available to public (http://bioinfo.csusm.edu) for both data queries and sequence contribution.
Chromatin modification contributes to the expression divergence of three TaGS2 homoeologs in hexaploid wheat

PubMed Central

Zhang, Wei; Fan, Xiaoli; Gao, Yingjie; Liu, Lei; Sun, Lijing; Su, Qiannan; Han, Jie; Zhang, Na; Cui, Fa; Ji, Jun; Tong, Yiping; Li, Junming

2017-01-01

Plastic glutamine synthetase (GS2) is responsible for ammonium assimilation. The reason that TaGS2 homoeologs in hexaploid wheat experience different selection pressures in the breeding process remains unclear. TaGS2 were minimally expressed in roots but predominantly expressed in leaves, and TaGS2-B had higher expression than TaGS2-A and TaGS2-D. ChIP assays revealed that the activation of TaGS2-B expression in leaves was correlated with increased H3K4 trimethylation. The transcriptional silencing of TaGS2 in roots was correlated with greater cytosine methylation and less H3K4 trimethylation. Micrococcal nuclease and DNase I accessibility experiments indicated that the promoter region was more resistant to digestion in roots than leaves, which indicated that the closed nucleosome conformation of the promoter region was important to the transcription initiation for the spatial-temporal expression of TaGS2. In contrast, the transcribed regions possess different nuclease accessibilities of three TaGS2 homoeologs in the same tissue, suggesting that nucleosome conformation of the transcribed region was part of the fine adjustment of TaGS2 homoeologs. This study provides evidence that histone modification, DNA methylation and nuclease accessibility coordinated the control of the transcription of TaGS2 homoeologs. Our results provided important evidence that TaGS2-B experienced the strongest selection pressures during the breeding process. PMID:28300215
High Level Expression and Purification of Recombinant Proteins from Escherichia coli with AK-TAG

PubMed Central

Luo, Dan; Wen, Caixia; Zhao, Rongchuan; Liu, Xinyu; Liu, Xinxin; Cui, Jingjing; Liang, Joshua G.; Liang, Peng

2016-01-01

Adenylate kinase (AK) from Escherichia coli was used as both solubility and affinity tag for recombinant protein production. When fused to the N-terminus of a target protein, an AK fusion protein could be expressed in soluble form and purified to near homogeneity in a single step from Blue-Sepherose via affinity elution with micromolar concentration of P1, P5- di (adenosine—5’) pentaphosphate (Ap5A), a transition-state substrate analog of AK. Unlike any other affinity tags, the level of a recombinant protein expression in soluble form and its yield of recovery during each purification step could be readily assessed by AK enzyme activity in near real time. Coupled to a His-Tag installed at the N-terminus and a thrombin cleavage site at the C terminus of AK, the streamlined method, here we dubbed AK-TAG, could also allow convenient expression and retrieval of a cleaved recombinant protein in high yield and purity via dual affinity purification steps. Thus AK-TAG is a new addition to the arsenal of existing affinity tags for recombinant protein expression and purification, and is particularly useful where soluble expression and high degree of purification are at stake. PMID:27214237
Broad host range vectors for expression of proteins with (Twin-) Strep-tag, His-tag and engineered, export optimized yellow fluorescent protein

PubMed Central

2013-01-01

Background In current protein research, a limitation still is the production of active recombinant proteins or native protein associations to assess their function. Especially the localization and analysis of protein-complexes or the identification of modifications and small molecule interaction partners by co-purification experiments requires a controllable expression of affinity- and/or fluorescence tagged variants of a protein of interest in its native cellular background. Advantages of periplasmic and/or homologous expressions can frequently not be realized due to a lack of suitable tools. Instead, experiments are often limited to the heterologous production in one of the few well established expression strains. Results Here, we introduce a series of new RK2 based broad host range expression plasmids for inducible production of affinity- and fluorescence tagged proteins in the cytoplasm and periplasm of a wide range of Gram negative hosts which are designed to match the recently suggested modular Standard European Vector Architecture and database. The vectors are equipped with a yellow fluorescent protein variant which is engineered to fold and brightly fluoresce in the bacterial periplasm following Sec-mediated export, as shown from fractionation and imaging studies. Expression of Strep-tag®II and Twin-Strep-tag® fusion proteins in Pseudomonas putida KT2440 is demonstrated for various ORFs. Conclusion The broad host range constructs we have produced enable good and controlled expression of affinity tagged protein variants for single-step purification and qualify for complex co-purification experiments. Periplasmic export variants enable production of affinity tagged proteins and generation of fusion proteins with a novel engineered Aequorea-based yellow fluorescent reporter protein variant with activity in the periplasm of the tested Gram-negative model bacteria Pseudomonas putida KT2440 and Escherichia coli K12 for production, localization or co
Sorghum Expressed Sequence Tags Identify Signature Genes for Drought, Pathogenesis, and Skotomorphogenesis from a Milestone Set of 16,801 Unique Transcripts1[w

PubMed Central

Pratt, Lee H.; Liang, Chun; Shah, Manish; Sun, Feng; Wang, Haiming; Reid, St. Patrick; Gingle, Alan R.; Paterson, Andrew H.; Wing, Rod; Dean, Ralph; Klein, Robert; Nguyen, Henry T.; Ma, Hong-mei; Zhao, Xin; Morishige, Daryl T.; Mullet, John E.; Cordonnier-Pratt, Marie-Michèle

2005-01-01

Improved knowledge of the sorghum transcriptome will enhance basic understanding of how plants respond to stresses and serve as a source of genes of value to agriculture. Toward this goal, Sorghum bicolor L. Moench cDNA libraries were prepared from light- and dark-grown seedlings, drought-stressed plants, Colletotrichum-infected seedlings and plants, ovaries, embryos, and immature panicles. Other libraries were prepared with meristems from Sorghum propinquum (Kunth) Hitchc. that had been photoperiodically induced to flower, and with rhizomes from S. propinquum and johnsongrass (Sorghum halepense L. Pers.). A total of 117,682 expressed sequence tags (ESTs) were obtained representing both 3′ and 5′ sequences from about half that number of cDNA clones. A total of 16,801 unique transcripts, representing tentative UniScripts (TUs), were identified from 55,783 3′ ESTs. Of these TUs, 9,032 are represented by two or more ESTs. Collectively, these libraries were predicted to contain a total of approximately 31,000 TUs. Individual libraries, however, were predicted to contain no more than about 6,000 to 9,000, with the exception of light-grown seedlings, which yielded an estimate of close to 13,000. In addition, each library exhibits about the same level of complexity with respect to both the number of TUs preferentially expressed in that library and the frequency with which two or more ESTs is found in only that library. These results indicate that the sorghum genome is expressed in highly selective fashion in the individual organs and in response to the environmental conditions surveyed here. Close to 2,000 differentially expressed TUs were identified among the cDNA libraries examined, of which 775 were differentially expressed at a confidence level of 98%. From these 775 TUs, signature genes were identified defining drought, Colletotrichum infection, skotomorphogenesis (etiolation), ovary, immature panicle, and embryo. PMID:16169961
Insulin chains as efficient fusion tags for prokaryotic expression of short peptides.

PubMed

Deng, Ligang; Xue, Xiaoying; Shen, Cangjie; Song, Xiaohan; Wang, Chunyang; Wang, Nan

2017-10-01

Insulin chains are usually expressed in Escherichia coli as fusion proteins with different tags, including various low molecular weight peptide tags. The objective of this study was to determine if insulin chains could facilitate the recombinant expression of other target proteins, with an emphasis on low molecular weight peptides. A series of short peptides were fused to mini-proinsulin, chain B or chain A, and induced for expression in Escherichia coli. All the tested peptides including glucagon-like peptide 1 (GLP-1), a C-terminal extended GLP-1, oxyntomodulin, enfuvirtide, linaclotide, and an unstructured artificial peptide were expressed with reasonable yields, identified by Tricine-SDS-PAGE and immunoblotting. All recombinant products were expressed in inclusion bodies. The effective accumulation of products was largely attributed to the insoluble expression induced by fusion with insulin chains, and was confirmed by the fusion expression of transthyretin. Insulin chains thus show promise as efficient fusion tags for mass production of heterologous peptides in prokaryotes. Copyright © 2017 Elsevier Inc. All rights reserved.
Fluorescent Labeling of COS-7 Expressing SNAP-tag Fusion Proteins for Live Cell Imaging

PubMed Central

Provost, Christopher R.; Sun, Luo

2010-01-01

SNAP-tag and CLIP-tag protein labeling systems enable the specific, covalent attachment of molecules, including fluorescent dyes, to a protein of interest in live cells. These systems offer a broad selection of fluorescent substrates optimized for a range of imaging instrumentation. Once cloned and expressed, the tagged protein can be used with a variety of substrates for numerous downstream applications without having to clone again. There are two steps to using this system: cloning and expression of the protein of interest as a SNAP-tag fusion, and labeling of the fusion with the SNAP-tag substrate of choice. The SNAP-tag is a small protein based on human O6-alkylguanine-DNA-alkyltransferase (hAGT), a DNA repair protein. SNAP-tag labels are dyes conjugated to guanine or chloropyrimidine leaving groups via a benzyl linker. In the labeling reaction, the substituted benzyl group of the substrate is covalently attached to the SNAP-tag. CLIP-tag is a modified version of SNAP-tag, engineered to react with benzylcytosine rather than benzylguanine derivatives. When used in conjunction with SNAP-tag, CLIP-tag enables the orthogonal and complementary labeling of two proteins simultaneously in the same cells. PMID:20485262
Cloning, Expression, and Purification of Histidine-Tagged Escherichia coli Dihydrodipicolinate Reductase.

PubMed

Trigoso, Yvonne D; Evans, Russell C; Karsten, William E; Chooback, Lilian

2016-01-01

The enzyme dihydrodipicolinate reductase (DHDPR) is a component of the lysine biosynthetic pathway in bacteria and higher plants. DHDPR catalyzes the NAD(P)H dependent reduction of 2,3-dihydrodipicolinate to the cyclic imine L-2,3,4,5,-tetrahydropicolinic acid. The dapB gene that encodes dihydrodipicolinate reductase has previously been cloned, but the expression of the enzyme is low and the purification is time consuming. Therefore the E. coli dapB gene was cloned into the pET16b vector to improve the protein expression and simplify the purification. The dapB gene sequence was utilized to design forward and reverse oligonucleotide primers that were used to PCR the gene from Escherichia coli genomic DNA. The primers were designed with NdeI or BamHI restriction sites on the 5'and 3' terminus respectively. The PCR product was sequenced to confirm the identity of dapB. The gene was cloned into the expression vector pET16b through NdeI and BamHI restriction endonuclease sites. The resulting plasmid containing dapB was transformed into the bacterial strain BL21 (DE3). The transformed cells were utilized to grow and express the histidine-tagged reductase and the protein was purified using Ni-NTA affinity chromatography. SDS/PAGE gel analysis has shown that the protein was 95% pure and has approximate subunit molecular weight of 28 kDa. The protein purification is completed in one day and 3 liters of culture produced approximately 40-50 mgs of protein, an improvement on the previous protein expression and multistep purification.
SPlinted Ligation Adapter Tagging (SPLAT), a novel library preparation method for whole genome bisulphite sequencing

PubMed Central

Manlig, Erika; Wahlberg, Per

2017-01-01

Abstract Sodium bisulphite treatment of DNA combined with next generation sequencing (NGS) is a powerful combination for the interrogation of genome-wide DNA methylation profiles. Library preparation for whole genome bisulphite sequencing (WGBS) is challenging due to side effects of the bisulphite treatment, which leads to extensive DNA damage. Recently, a new generation of methods for bisulphite sequencing library preparation have been devised. They are based on initial bisulphite treatment of the DNA, followed by adaptor tagging of single stranded DNA fragments, and enable WGBS using low quantities of input DNA. In this study, we present a novel approach for quick and cost effective WGBS library preparation that is based on splinted adaptor tagging (SPLAT) of bisulphite-converted single-stranded DNA. Moreover, we validate SPLAT against three commercially available WGBS library preparation techniques, two of which are based on bisulphite treatment prior to adaptor tagging and one is a conventional WGBS method. PMID:27899585
A dual tag system for facilitated detection of surface expressed proteins in Escherichia coli

PubMed Central

2012-01-01

Background The discovery of the autotransporter family has provided a mechanism for surface expression of proteins in laboratory strains of Escherichia coli. We have previously reported the use of the AIDA-I autotransport system to express the Salmonella enterica serovar Enteritidis proteins SefA and H:gm. The SefA protein was successfully exposed to the medium, but the orientation of H:gm in the outer membrane could not be determined due to proteolytic cleavage of the N-terminal detection-tag. The goal of the present work was therefore to construct a vector containing elements that facilitates analysis of surface expression, especially for proteins that are sensitive to proteolysis or otherwise difficult to express. Results The surface expression system pAIDA1 was created with two detection tags flanking the passenger protein. Successful expression of SefA and H:gm on the surface of E. coli was confirmed with fluorescently labeled antibodies specific for the N-terminal His6-tag and the C-terminal Myc-tag. While both tags were detected during SefA expression, only the Myc-tag could be detected for H:gm. The negative signal indicates a proteolytic cleavage of this protein that removes the His6-tag facing the medium. Conclusions Expression levels from pAIDA1 were comparable to or higher than those achieved with the formerly used vector. The presence of the Myc- but not of the His6-tag on the cell surface during H:gm expression allowed us to confirm the hypothesis that this fusion protein was present on the surface and oriented towards the cell exterior. Western blot analysis revealed degradation products of the same molecular weight for SefA and H:gm. The size of these fragments suggests that both fusion proteins have been cleaved at a specific site close to the C-terminal end of the passenger. This proteolysis was concluded to take place either in the outer membrane or in the periplasm. Since H:gm was cleaved to a much greater extent then the three times smaller Sef
Exploring the Host Parasitism of the Migratory Plant-Parasitic Nematode Ditylenchus destuctor by Expressed Sequence Tags Analysis

PubMed Central

Peng, Huan; Gao, Bing-li; Kong, Ling-an; Yu, Qing; Huang, Wen-kun; He, Xu-feng; Long, Hai-bo; Peng, De-liang

2013-01-01

The potato rot nematode, Ditylenchus destructor, is a very destructive nematode pest on many agriculturally important crops worldwide, but the molecular characterization of its parasitism of plant has been limited. The effectors involved in nematode parasitism of plant for several sedentary endo-parasitic nematodes such as Heterodera glycines, Globodera rostochiensis and Meloidogyne incognita have been identified and extensively studied over the past two decades. Ditylenchus destructor, as a migratory plant parasitic nematode, has different feeding behavior, life cycle and host response. Comparing the transcriptome and parasitome among different types of plant-parasitic nematodes is the way to understand more fully the parasitic mechanism of plant nematodes. We undertook the approach of sequencing expressed sequence tags (ESTs) derived from a mixed stage cDNA library of D. destructor. This is the first study of D. destructor ESTs. A total of 9800 ESTs were grouped into 5008 clusters including 3606 singletons and 1402 multi-member contigs, representing a catalog of D. destructor genes. Implementing a bioinformatics' workflow, we found 1391 clusters have no match in the available gene database; 31 clusters only have similarities to genes identified from D. africanus, the most closely related species to D. destructor; 1991 clusters were annotated using Gene Ontology (GO); 1550 clusters were assigned enzyme commission (EC) numbers; and 1211 clusters were mapped to 181 KEGG biochemical pathways. 22 ESTs had similarities to reported nematode effectors. Interestedly, most of the effectors identified in this study are involved in host cell wall degradation or modification, such as 1,4-beta-glucanse, 1,3-beta-glucanse, pectate lyase, chitinases and expansin, or host defense suppression such as calreticulin, annexin and venom allergen-like protein. This result implies that the migratory plant-parasitic nematode D. destructor secrets similar effectors to those of sedentary
Chasing Migration Genes: A Brain Expressed Sequence Tag Resource for Summer and Migratory Monarch Butterflies (Danaus plexippus)

PubMed Central

Zhu, Haisun; Casselman, Amy; Reppert, Steven M.

2008-01-01

North American monarch butterflies (Danaus plexippus) undergo a spectacular fall migration. In contrast to summer butterflies, migrants are juvenile hormone (JH) deficient, which leads to reproductive diapause and increased longevity. Migrants also utilize time-compensated sun compass orientation to help them navigate to their overwintering grounds. Here, we describe a brain expressed sequence tag (EST) resource to identify genes involved in migratory behaviors. A brain EST library was constructed from summer and migrating butterflies. Of 9,484 unique sequences, 6068 had positive hits with the non-redundant protein database; the EST database likely represents ∼52% of the gene-encoding potential of the monarch genome. The brain transcriptome was cataloged using Gene Ontology and compared to Drosophila. Monarch genes were well represented, including those implicated in behavior. Three genes involved in increased JH activity (allatotropin, juvenile hormone acid methyltransfersase, and takeout) were upregulated in summer butterflies, compared to migrants. The locomotion-relevant turtle gene was marginally upregulated in migrants, while the foraging and single-minded genes were not differentially regulated. Many of the genes important for the monarch circadian clock mechanism (involved in sun compass orientation) were in the EST resource, including the newly identified cryptochrome 2. The EST database also revealed a novel Na+/K+ ATPase allele predicted to be more resistant to the toxic effects of milkweed than that reported previously. Potential genetic markers were identified from 3,486 EST contigs and included 1599 double-hit single nucleotide polymorphisms (SNPs) and 98 microsatellite polymorphisms. These data provide a template of the brain transcriptome for the monarch butterfly. Our “snap-shot” analysis of the differential regulation of candidate genes between summer and migratory butterflies suggests that unbiased, comprehensive transcriptional profiling
Cloning, Expression, and Purification of Histidine-Tagged Escherichia coli Dihydrodipicolinate Reductase

PubMed Central

Trigoso, Yvonne D.; Evans, Russell C.; Karsten, William E.; Chooback, Lilian

2016-01-01

The enzyme dihydrodipicolinate reductase (DHDPR) is a component of the lysine biosynthetic pathway in bacteria and higher plants. DHDPR catalyzes the NAD(P)H dependent reduction of 2,3-dihydrodipicolinate to the cyclic imine L-2,3,4,5,-tetrahydropicolinic acid. The dapB gene that encodes dihydrodipicolinate reductase has previously been cloned, but the expression of the enzyme is low and the purification is time consuming. Therefore the E. coli dapB gene was cloned into the pET16b vector to improve the protein expression and simplify the purification. The dapB gene sequence was utilized to design forward and reverse oligonucleotide primers that were used to PCR the gene from Escherichia coli genomic DNA. The primers were designed with NdeI or BamHI restriction sites on the 5’and 3’ terminus respectively. The PCR product was sequenced to confirm the identity of dapB. The gene was cloned into the expression vector pET16b through NdeI and BamHI restriction endonuclease sites. The resulting plasmid containing dapB was transformed into the bacterial strain BL21 (DE3). The transformed cells were utilized to grow and express the histidine-tagged reductase and the protein was purified using Ni-NTA affinity chromatography. SDS/PAGE gel analysis has shown that the protein was 95% pure and has approximate subunit molecular weight of 28 kDa. The protein purification is completed in one day and 3 liters of culture produced approximately 40–50 mgs of protein, an improvement on the previous protein expression and multistep purification. PMID:26815040
Large-scale identification of odorant-binding proteins and chemosensory proteins from expressed sequence tags in insects

PubMed Central

2009-01-01

Background Insect odorant binding proteins (OBPs) and chemosensory proteins (CSPs) play an important role in chemical communication of insects. Gene discovery of these proteins is a time-consuming task. In recent years, expressed sequence tags (ESTs) of many insect species have accumulated, thus providing a useful resource for gene discovery. Results We have developed a computational pipeline to identify OBP and CSP genes from insect ESTs. In total, 752,841 insect ESTs were examined from 54 species covering eight Orders of Insecta. From these ESTs, 142 OBPs and 177 CSPs were identified, of which 117 OBPs and 129 CSPs are new. The complete open reading frames (ORFs) of 88 OBPs and 123 CSPs were obtained by electronic elongation. We randomly chose 26 OBPs from eight species of insects, and 21 CSPs from four species for RT-PCR validation. Twenty two OBPs and 16 CSPs were confirmed by RT-PCR, proving the efficiency and reliability of the algorithm. Together with all family members obtained from the NCBI (OBPs) or the UniProtKB (CSPs), 850 OBPs and 237 CSPs were analyzed for their structural characteristics and evolutionary relationship. Conclusions A large number of new OBPs and CSPs were found, providing the basis for deeper understanding of these proteins. In addition, the conserved motif and evolutionary analysis provide some new insights into the evolution of insect OBPs and CSPs. Motif pattern fine-tune the functions of OBPs and CSPs, leading to the minor difference in binding sex pheromone or plant volatiles in different insect Orders. PMID:20034407

Identification of differentially expressed genes in cucumber (Cucumis sativus L.) root under waterlogging stress by digital gene expression profile.

PubMed

Qi, Xiao-Hua; Xu, Xue-Wen; Lin, Xiao-Jian; Zhang, Wen-Jie; Chen, Xue-Hao

2012-03-01

High-throughput tag-sequencing (Tag-seq) analysis based on the Solexa Genome Analyzer platform was applied to analyze the gene expression profiling of cucumber plant at 5 time points over a 24h period of waterlogging treatment. Approximately 5.8 million total clean sequence tags per library were obtained with 143013 distinct clean tag sequences. Approximately 23.69%-29.61% of the distinct clean tags were mapped unambiguously to the unigene database, and 53.78%-60.66% of the distinct clean tags were mapped to the cucumber genome database. Analysis of the differentially expressed genes revealed that most of the genes were down-regulated in the waterlogging stages, and the differentially expressed genes mainly linked to carbon metabolism, photosynthesis, reactive oxygen species generation/scavenging, and hormone synthesis/signaling. Finally, quantitative real-time polymerase chain reaction using nine genes independently verified the tag-mapped results. This present study reveals the comprehensive mechanisms of waterlogging-responsive transcription in cucumber. Copyright Â© 2011 Elsevier Inc. All rights reserved.
Expression and purification of ELP-intein-tagged target proteins in high cell density E. coli fermentation.

PubMed

Fong, Baley A; Wood, David W

2010-10-19

Elastin-like polypeptides (ELPs) are useful tools that can be used to non-chromatographically purify proteins. When paired with self-cleaving inteins, they can be used as economical self-cleaving purification tags. However, ELPs and ELP-tagged target proteins have been traditionally expressed using highly enriched media in shake flask cultures, which are generally not amenable to scale-up. In this work, we describe the high cell-density expression of self-cleaving ELP-tagged targets in a supplemented minimal medium at a 2.5 liter fermentation scale, with increased yields and purity compared to traditional shake flask cultures. This demonstration of ELP expression in supplemented minimal media is juxtaposed to previous expression of ELP tags in extract-based rich media. We also describe several sets of fed-batch conditions and their impact on ELP expression and growth medium cost. By using fed batch E. coli fermentation at high cell density, ELP-intein-tagged proteins can be expressed and purified at high yield with low cost. Further, the impact of media components and fermentation design can significantly impact the overall process cost, particularly at large scale. This work thus demonstrates an important advances in the scale up of self-cleaving ELP tag-mediated processes.
Expression and purification of ELP-intein-tagged target proteins in high cell density E. coli fermentation

PubMed Central

2010-01-01

Background Elastin-like polypeptides (ELPs) are useful tools that can be used to non-chromatographically purify proteins. When paired with self-cleaving inteins, they can be used as economical self-cleaving purification tags. However, ELPs and ELP-tagged target proteins have been traditionally expressed using highly enriched media in shake flask cultures, which are generally not amenable to scale-up. Results In this work, we describe the high cell-density expression of self-cleaving ELP-tagged targets in a supplemented minimal medium at a 2.5 liter fermentation scale, with increased yields and purity compared to traditional shake flask cultures. This demonstration of ELP expression in supplemented minimal media is juxtaposed to previous expression of ELP tags in extract-based rich media. We also describe several sets of fed-batch conditions and their impact on ELP expression and growth medium cost. Conclusions By using fed batch E. coli fermentation at high cell density, ELP-intein-tagged proteins can be expressed and purified at high yield with low cost. Further, the impact of media components and fermentation design can significantly impact the overall process cost, particularly at large scale. This work thus demonstrates an important advances in the scale up of self-cleaving ELP tag-mediated processes. PMID:20959011
Evaluating information content of SNPs for sample-tagging in re-sequencing projects.

PubMed

Hu, Hao; Liu, Xiang; Jin, Wenfei; Hilger Ropers, H; Wienker, Thomas F

2015-05-15

Sample-tagging is designed for identification of accidental sample mix-up, which is a major issue in re-sequencing studies. In this work, we develop a model to measure the information content of SNPs, so that we can optimize a panel of SNPs that approach the maximal information for discrimination. The analysis shows that as low as 60 optimized SNPs can differentiate the individuals in a population as large as the present world, and only 30 optimized SNPs are in practice sufficient in labeling up to 100 thousand individuals. In the simulated populations of 100 thousand individuals, the average Hamming distances, generated by the optimized set of 30 SNPs are larger than 18, and the duality frequency, is lower than 1 in 10 thousand. This strategy of sample discrimination is proved robust in large sample size and different datasets. The optimized sets of SNPs are designed for Whole Exome Sequencing, and a program is provided for SNP selection, allowing for customized SNP numbers and interested genes. The sample-tagging plan based on this framework will improve re-sequencing projects in terms of reliability and cost-effectiveness.
Sequence analysis of 497 mouse brain ESTs expressed in the substantia nigra

DOE Office of Scientific and Technical Information (OSTI.GOV)

Stewart, G.J.; Savioz, A.; Davies, R.W.

1997-01-15

The use of subtracted, region-specific cDNA libraries combined with single-pass cDNA sequencing allows the discovery of novel genes and facilitates molecular description of the tissue or region involved. We report the sequence of 497 mouse expressed sequence tags (ESTs) from two subtracted libraries enriched for cDNAs expressed in the substantia nigra, a brain region with important roles in movement control and Parkinson disease. Of these, 238 ESTs give no database matches and therefore derive from novel genes. A further 115 ESTs show sequence similarity to ESTs from other organisms, which themselves do not yield any significant database matches to genesmore » of known function. Fifty-six ESTs show sequence similarity to previously identified genes whose mouse homologues have not been reported. The total number of ESTs reported that are new for the mouse is 407, which, together with the 90 ESTs corresponding to known mouse genes or cDNAs, contributes to the molecular description of the substantia nigra. 21 refs., 4 tabs.« less
Microbial Diversity in Deep-sea Methane Seep Sediments Presented by SSU rRNA Gene Tag Sequencing

PubMed Central

Nunoura, Takuro; Takaki, Yoshihiro; Kazama, Hiromi; Hirai, Miho; Ashi, Juichiro; Imachi, Hiroyuki; Takai, Ken

2012-01-01

Microbial community structures in methane seep sediments in the Nankai Trough were analyzed by tag-sequencing analysis for the small subunit (SSU) rRNA gene using a newly developed primer set. The dominant members of Archaea were Deep-sea Hydrothermal Vent Euryarchaeotic Group 6 (DHVEG 6), Marine Group I (MGI) and Deep Sea Archaeal Group (DSAG), and those in Bacteria were Alpha-, Gamma-, Delta- and Epsilonproteobacteria, Chloroflexi, Bacteroidetes, Planctomycetes and Acidobacteria. Diversity and richness were examined by 8,709 and 7,690 tag-sequences from sediments at 5 and 25 cm below the seafloor (cmbsf), respectively. The estimated diversity and richness in the methane seep sediment are as high as those in soil and deep-sea hydrothermal environments, although the tag-sequences obtained in this study were not sufficient to show whole microbial diversity in this analysis. We also compared the diversity and richness of each taxon/division between the sediments from the two depths, and found that the diversity and richness of some taxa/divisions varied significantly along with the depth. PMID:22510646
A normalization strategy for comparing tag count data

PubMed Central

2012-01-01

Background High-throughput sequencing, such as ribonucleic acid sequencing (RNA-seq) and chromatin immunoprecipitation sequencing (ChIP-seq) analyses, enables various features of organisms to be compared through tag counts. Recent studies have demonstrated that the normalization step for RNA-seq data is critical for a more accurate subsequent analysis of differential gene expression. Development of a more robust normalization method is desirable for identifying the true difference in tag count data. Results We describe a strategy for normalizing tag count data, focusing on RNA-seq. The key concept is to remove data assigned as potential differentially expressed genes (DEGs) before calculating the normalization factor. Several R packages for identifying DEGs are currently available, and each package uses its own normalization method and gene ranking algorithm. We compared a total of eight package combinations: four R packages (edgeR, DESeq, baySeq, and NBPSeq) with their default normalization settings and with our normalization strategy. Many synthetic datasets under various scenarios were evaluated on the basis of the area under the curve (AUC) as a measure for both sensitivity and specificity. We found that packages using our strategy in the data normalization step overall performed well. This result was also observed for a real experimental dataset. Conclusion Our results showed that the elimination of potential DEGs is essential for more accurate normalization of RNA-seq data. The concept of this normalization strategy can widely be applied to other types of tag count data and to microarray data. PMID:22475125
An expressed sequence tag (EST) library for Drosophila serrata, a model system for sexual selection and climatic adaptation studies.

PubMed

Frentiu, Francesca D; Adamski, Marcin; McGraw, Elizabeth A; Blows, Mark W; Chenoweth, Stephen F

2009-01-21

The native Australian fly Drosophila serrata belongs to the highly speciose montium subgroup of the melanogaster species group. It has recently emerged as an excellent model system with which to address a number of important questions, including the evolution of traits under sexual selection and traits involved in climatic adaptation along latitudinal gradients. Understanding the molecular genetic basis of such traits has been limited by a lack of genomic resources for this species. Here, we present the first expressed sequence tag (EST) collection for D. serrata that will enable the identification of genes underlying sexually-selected phenotypes and physiological responses to environmental change and may help resolve controversial phylogenetic relationships within the montium subgroup. A normalized cDNA library was constructed from whole fly bodies at several developmental stages, including larvae and adults. Assembly of 11,616 clones sequenced from the 3' end allowed us to identify 6,607 unique contigs, of which at least 90% encoded peptides. Partial transcripts were discovered from a variety of genes of evolutionary interest by BLASTing contigs against the 12 Drosophila genomes currently sequenced. By incorporating into the cDNA library multiple individuals from populations spanning a large portion of the geographical range of D. serrata, we were able to identify 11,057 putative single nucleotide polymorphisms (SNPs), with 278 different contigs having at least one "double hit" SNP that is highly likely to be a real polymorphism. At least 394 EST-associated microsatellite markers, representing 355 different contigs, were also found, providing an additional set of genetic markers. The assembled EST library is available online at http://www.chenowethlab.org/serrata/index.cgi. We have provided the first gene collection and largest set of polymorphic genetic markers, to date, for the fly D. serrata. The EST collection will provide much needed genomic resources for
Uncovering the Salt Response of Soybean by Unraveling Its Wild and Cultivated Functional Genomes Using Tag Sequencing

PubMed Central

Ali, Zulfiqar; Zhang, Da Yong; Xu, Zhao Long; Xu, Ling; Yi, Jin Xin; He, Xiao Lan; Huang, Yi Hong; Liu, Xiao Qing; Khan, Asif Ali; Trethowan, Richard M.; Ma, Hong Xiang

2012-01-01

Soil salinity has very adverse effects on growth and yield of crop plants. Several salt tolerant wild accessions and cultivars are reported in soybean. Functional genomes of salt tolerant Glycine soja and a salt sensitive genotype of Glycine max were investigated to understand the mechanism of salt tolerance in soybean. For this purpose, four libraries were constructed for Tag sequencing on Illumina platform. We identify around 490 salt responsive genes which included a number of transcription factors, signaling proteins, translation factors and structural genes like transporters, multidrug resistance proteins, antiporters, chaperons, aquaporins etc. The gene expression levels and ratio of up/down-regulated genes was greater in tolerant plants. Translation related genes remained stable or showed slightly higher expression in tolerant plants under salinity stress. Further analyses of sequenced data and the annotations for gene ontology and pathways indicated that soybean adapts to salt stress through ABA biosynthesis and regulation of translation and signal transduction of structural genes. Manipulation of these pathways may mitigate the effect of salt stress thus enhancing salt tolerance. PMID:23209559
Genetic diversity analysis in Malaysian giant prawns using expressed sequence tag microsatellite markers for stock improvement program.

PubMed

Atin, K H; Christianus, A; Fatin, N; Lutas, A C; Shabanimofrad, M; Subha, B

2017-08-17

The Malaysian giant prawn is among the most commonly cultured species of the genus Macrobrachium. Stocks of giant prawns from four rivers in Peninsular Malaysia have been used for aquaculture over the past 25 years, which has led to repeated harvesting, restocking, and transplantation between rivers. Consequently, a stock improvement program is now important to avoid the depletion of wild stocks and the loss of genetic diversity. However, the success of such an improvement program depends on our knowledge of the genetic variation of these base populations. The aim of the current study was to estimate genetic variation and differentiation of these riverine sources using novel expressed sequence tag-microsatellite (EST-SSR) markers, which not only are informative on genetic diversity but also provide information on immune and metabolic traits. Our findings indicated that the tested stocks have inbreeding depression due to a significant deficiency in heterozygotes, and F IS was estimated as 0.15538 to 0.31938. An F-statistics analysis suggested that the stocks are composed of one large panmictic population. Among the four locations, stocks from Johor, in the southern region of the peninsular, showed higher allelic and genetic diversity than the other stocks. To overcome inbreeding problems, the Johor population could be used as a base population in a stock improvement program by crossing to the other populations. The study demonstrated that EST-SSR markers can be incorporated in future marker assisted breeding to aid the proper management of the stocks by breeders and stakeholders in Malaysia.
A molecular analysis of desiccation tolerance mechanisms in the anhydrobiotic nematode Panagrolaimus superbus using expressed sequenced tags

PubMed Central

2012-01-01

Background Some organisms can survive extreme desiccation by entering into a state of suspended animation known as anhydrobiosis. Panagrolaimus superbus is a free-living anhydrobiotic nematode that can survive rapid environmental desiccation. The mechanisms that P. superbus uses to combat the potentially lethal effects of cellular dehydration may include the constitutive and inducible expression of protective molecules, along with behavioural and/or morphological adaptations that slow the rate of cellular water loss. In addition, inducible repair and revival programmes may also be required for successful rehydration and recovery from anhydrobiosis. Results To identify constitutively expressed candidate anhydrobiotic genes we obtained 9,216 ESTs from an unstressed mixed stage population of P. superbus. We derived 4,009 unigenes from these ESTs. These unigene annotations and sequences can be accessed at http://www.nematodes.org/nembase4/species_info.php?species=PSC. We manually annotated a set of 187 constitutively expressed candidate anhydrobiotic genes from P. superbus. Notable among those is a putative lineage expansion of the lea (late embryogenesis abundant) gene family. The most abundantly expressed sequence was a member of the nematode specific sxp/ral-2 family that is highly expressed in parasitic nematodes and secreted onto the surface of the nematodes' cuticles. There were 2,059 novel unigenes (51.7% of the total), 149 of which are predicted to encode intrinsically disordered proteins lacking a fixed tertiary structure. One unigene may encode an exo-β-1,3-glucanase (GHF5 family), most similar to a sequence from Phytophthora infestans. GHF5 enzymes have been reported from several species of plant parasitic nematodes, with horizontal gene transfer (HGT) from bacteria proposed to explain their evolutionary origin. This P. superbus sequence represents another possible HGT event within the Nematoda. The expression of five of the 19 putative stress response
Expression and purification of the non-tagged LipL32 of pathogenic Leptospira.

PubMed

Hauk, P; Carvalho, E; Ho, P L

2011-04-01

Leptospirosis is a reemerging infectious disease and the most disseminated zoonosis worldwide. A leptospiral surface protein, LipL32, only occurs in pathogenic Leptospira, and is the most abundant protein on the bacterial surface, being described as an important factor in host immunogenic response and also in bacterial infection. We describe here an alternative and simple purification protocol for non-tagged recombinant LipL32. The recombinant LipL32(21-272) was expressed in Escherichia coli without His-tag or any other tag used to facilitate recombinant protein purification. The recombinant protein was expressed in the soluble form, and the purification was based on ion exchange (anionic and cationic) and hydrophobic interactions. The final purification yielded 3 mg soluble LipL32(21-272) per liter of the induced culture. Antiserum produced against the recombinant protein was effective to detect native LipL32 from cell extracts of several Leptospira serovars. The purified recombinant LipL32(21-272) produced by this protocol can be used for structural, biochemical and functional studies and avoids the risk of possible interactions and interferences of the tags commonly used as well as the time consuming and almost always inefficient methods to cleave these tags when a tag-free LipL32 is needed. Non-tagged LipL32 may represent an alternative antigen for biochemical studies, for serodiagnosis and for the development of a vaccine against leptospirosis.
Generation and analysis of expressed sequence tags from six developing xylem libraries in Pinus radiata D. Don

PubMed Central

Li, Xinguo; Wu, Harry X; Dillon, Shannon K; Southerton, Simon G

2009-01-01

Background Wood is a major renewable natural resource for the timber, fibre and bioenergy industry. Pinus radiata D. Don is the most important commercial plantation tree species in Australia and several other countries; however, genomic resources for this species are very limited in public databases. Our primary objective was to sequence a large number of expressed sequence tags (ESTs) from genes involved in wood formation in radiata pine. Results Six developing xylem cDNA libraries were constructed from earlywood and latewood tissues sampled at juvenile (7 yrs), transition (11 yrs) and mature (30 yrs) ages, respectively. These xylem tissues represent six typical development stages in a rotation period of radiata pine. A total of 6,389 high quality ESTs were collected from 5,952 cDNA clones. Assembly of 5,952 ESTs from 5' end sequences generated 3,304 unigenes including 952 contigs and 2,352 singletons. About 97.0% of the 5,952 ESTs and 96.1% of the unigenes have matches in the UniProt and TIGR databases. Of the 3,174 unigenes with matches, 42.9% were not assigned GO (Gene Ontology) terms and their functions are unknown or unclassified. More than half (52.1%) of the 5,952 ESTs have matches in the Pfam database and represent 772 known protein families. About 18.0% of the 5,952 ESTs matched cell wall related genes in the MAIZEWALL database, representing all 18 categories, 91 of all 174 families and possibly 557 genes. Fifteen cell wall-related genes are ranked in the 30 most abundant genes, including CesA, tubulin, AGP, SAMS, actin, laccase, CCoAMT, MetE, phytocyanin, pectate lyase, cellulase, SuSy, expansin, chitinase and UDP-glucose dehydrogenase. Based on the PlantTFDB database 41 of the 64 transcription factor families in the poplar genome were identified as being involved in radiata pine wood formation. Comparative analysis of GO term abundance revealed a distinct transcriptome in juvenile earlywood formation compared to other stages of wood development
Analysis of expressed sequence tags from Maize mosaic rhabdovirus-infected gut tissues of Peregrinus maidis reveals the presence of key components of insect innate immunity.

PubMed

Whitfield, A E; Rotenberg, D; Aritua, V; Hogenhout, S A

2011-04-01

The corn planthopper, Peregrinus maidis, causes direct feeding damage to plants and transmits Maize mosaic rhabdovirus (MMV) in a persistent-propagative manner. MMV must cross several insect tissue layers for successful transmission to occur, and the gut serves as an important barrier for rhabdovirus transmission. In order to facilitate the identification of proteins that may interact with MMV either by facilitating acquisition or responding to virus infection, we generated and analysed the gut transcriptome of P. maidis. From two normalized cDNA libraries, we generated a P. maidis gut transcriptome composed of 20,771 expressed sequence tags (ESTs). Assembly of the sequences yielded 1860 contigs and 14,032 singletons, and biological roles were assigned to 5793 (36%). Comparison of P. maidis ESTs with other insect amino acid sequences revealed that P. maidis shares greatest sequence similarity with another hemipteran, the brown planthopper Nilaparvata lugens. We identified 202 P. maidis transcripts with putative homology to proteins associated with insect innate immunity, including those implicated in the Toll, Imd, JAK/STAT, Jnk and the small-interfering RNA-mediated pathways. Sequence comparisons between our P. maidis gut EST collection and the currently available National Center for Biotechnology Information EST database collection for Ni. lugens revealed that a pathogen recognition receptor in the Imd pathway, peptidoglycan recognition protein-long class (PGRP-LC), is present in these two members of the family Delphacidae; however, these recognition receptors are lacking in the model hemipteran Acyrthosiphon pisum. In addition, we identified sequences in the P. maidis gut transcriptome that share significant amino acid sequence similarities with the rhabdovirus receptor molecule, acetylcholine receptor (AChR), found in other hosts. This EST analysis sheds new light on immune response pathways in hemipteran guts that will be useful for further dissecting innate
Developing expressed sequence tag libraries and the discovery of simple sequence repeat markers for two species of raspberry (Rubus L.)

USDA-ARS?s Scientific Manuscript database

Background: Due to a relatively high level of codominant inheritance and transferability within and among taxonomic groups, simple sequence repeat (SSR) markers are important elements in comparative mapping and delineation of genomic regions associated with traits of economic importance. Expressed S...
In silico analysis of expressed sequence tags from Trichostrongylus vitrinus (Nematoda): comparison of the automated ESTExplorer workflow platform with conventional database searches.

PubMed

Nagaraj, Shivashankar H; Gasser, Robin B; Nisbet, Alasdair J; Ranganathan, Shoba

2008-01-01

The analysis of expressed sequence tags (EST) offers a rapid and cost effective approach to elucidate the transcriptome of an organism, but requires several computational methods for assembly and annotation. Researchers frequently analyse each step manually, which is laborious and time consuming. We have recently developed ESTExplorer, a semi-automated computational workflow system, in order to achieve the rapid analysis of EST datasets. In this study, we evaluated EST data analysis for the parasitic nematode Trichostrongylus vitrinus (order Strongylida) using ESTExplorer, compared with database matching alone. We functionally annotated 1776 ESTs obtained via suppressive-subtractive hybridisation from T. vitrinus, an important parasitic trichostrongylid of small ruminants. Cluster and comparative genomic analyses of the transcripts using ESTExplorer indicated that 290 (41%) sequences had homologues in Caenorhabditis elegans, 329 (42%) in parasitic nematodes, 202 (28%) in organisms other than nematodes, and 218 (31%) had no significant match to any sequence in the current databases. Of the C. elegans homologues, 90 were associated with 'non-wildtype' double-stranded RNA interference (RNAi) phenotypes, including embryonic lethality, maternal sterility, sterile progeny, larval arrest and slow growth. We could functionally classify 267 (38%) sequences using the Gene Ontologies (GO) and establish pathway associations for 230 (33%) sequences using the Kyoto Encyclopedia of Genes and Genomes (KEGG). Further examination of this EST dataset revealed a number of signalling molecules, proteases, protease inhibitors, enzymes, ion channels and immune-related genes. In addition, we identified 40 putative secreted proteins that could represent potential candidates for developing novel anthelmintics or vaccines. We further compared the automated EST sequence annotations, using ESTExplorer, with database search results for individual T. vitrinus ESTs. ESTExplorer reliably and
RAD tag sequencing as a source of SNP markers in Cynara cardunculus L

PubMed Central

2012-01-01

Background The globe artichoke (Cynara cardunculus L. var. scolymus) genome is relatively poorly explored, especially compared to those of the other major Asteraceae crops sunflower and lettuce. No SNP markers are in the public domain. We have combined the recently developed restriction-site associated DNA (RAD) approach with the Illumina DNA sequencing platform to effect the rapid and mass discovery of SNP markers for C. cardunculus. Results RAD tags were sequenced from the genomic DNA of three C. cardunculus mapping population parents, generating 9.7 million reads, corresponding to ~1 Gbp of sequence. An assembly based on paired ends produced ~6.0 Mbp of genomic sequence, separated into ~19,000 contigs (mean length 312 bp), of which ~21% were fragments of putative coding sequence. The shared sequences allowed for the discovery of ~34,000 SNPs and nearly 800 indels, equivalent to a SNP frequency of 5.6 per 1,000 nt, and an indel frequency of 0.2 per 1,000 nt. A sample of heterozygous SNP loci was mapped by CAPS assays and this exercise provided validation of our mining criteria. The repetitive fraction of the genome had a high representation of retrotransposon sequence, followed by simple repeats, AT-low complexity regions and mobile DNA elements. The genomic k-mers distribution and CpG rate of C. cardunculus, compared with data derived from three whole genome-sequenced dicots species, provided a further evidence of the random representation of the C. cardunculus genome generated by RAD sampling. Conclusion The RAD tag sequencing approach is a cost-effective and rapid method to develop SNP markers in a highly heterozygous species. Our approach permitted to generate a large and robust SNP datasets by the adoption of optimized filtering criteria. PMID:22214349
Rediscovering Medicinal Plants' Potential with OMICS: Microsatellite Survey in Expressed Sequence Tags of Eleven Traditional Plants with Potent Antidiabetic Properties

PubMed Central

Sahu, Jagajjit; Sen, Priyabrata; Choudhury, Manabendra Dutta; Dehury, Budheswar; Barooah, Madhumita; Modi, Mahendra Kumar

2014-01-01

Abstract Herbal medicines and traditionally used medicinal plants present an untapped potential for novel molecular target discovery using systems science and OMICS biotechnology driven strategies. Since up to 40% of the world's poor people have no access to government health services, traditional and folk medicines are often the only therapeutics available to them. In this vein, North East (NE) India is recognized for its rich bioresources. As part of the Indo-Burma hotspot, it is regarded as an epicenter of biodiversity for several plants having myriad traditional uses, including medicinal use. However, the improvement of these valuable bioresources through molecular breeding strategies, for example, using genic microsatellites or Simple Sequence Repeats (SSRs) or Expressed Sequence Tags (ESTs)-derived SSRs has not been fully utilized in large scale to date. In this study, we identified a total of 47,700 microsatellites from 109,609 ESTs of 11 medicinal plants (pineapple, papaya, noyontara, bitter orange, bermuda brass, ratalu, barbados nut, mango, mulberry, lotus, and guduchi) having proven antidiabetic properties. A total of 58,159 primer pairs were designed for the non-redundant 8060 SSR-positive ESTs and putative functions were assigned to 4483 unique contigs. Among the identified microsatellites, excluding mononucleotide repeats, di-/trinucleotides are predominant, among which repeat motifs of AG/CT and AAG/CTT were most abundant. Similarity search of SSR containing ESTs and antidiabetic gene sequences revealed 11 microsatellites linked to antidiabetic genes in five plants. GO term enrichment analysis revealed a total of 80 enriched GO terms widely distributed in 53 biological processes, 17 molecular functions, and 10 cellular components associated with the 11 markers. The present study therefore provides concrete insights into the frequency and distribution of SSRs in important medicinal resources. The microsatellite markers reported here markedly add to
Rediscovering medicinal plants' potential with OMICS: microsatellite survey in expressed sequence tags of eleven traditional plants with potent antidiabetic properties.

PubMed

Sahu, Jagajjit; Sen, Priyabrata; Choudhury, Manabendra Dutta; Dehury, Budheswar; Barooah, Madhumita; Modi, Mahendra Kumar; Talukdar, Anupam Das

2014-05-01

Herbal medicines and traditionally used medicinal plants present an untapped potential for novel molecular target discovery using systems science and OMICS biotechnology driven strategies. Since up to 40% of the world's poor people have no access to government health services, traditional and folk medicines are often the only therapeutics available to them. In this vein, North East (NE) India is recognized for its rich bioresources. As part of the Indo-Burma hotspot, it is regarded as an epicenter of biodiversity for several plants having myriad traditional uses, including medicinal use. However, the improvement of these valuable bioresources through molecular breeding strategies, for example, using genic microsatellites or Simple Sequence Repeats (SSRs) or Expressed Sequence Tags (ESTs)-derived SSRs has not been fully utilized in large scale to date. In this study, we identified a total of 47,700 microsatellites from 109,609 ESTs of 11 medicinal plants (pineapple, papaya, noyontara, bitter orange, bermuda brass, ratalu, barbados nut, mango, mulberry, lotus, and guduchi) having proven antidiabetic properties. A total of 58,159 primer pairs were designed for the non-redundant 8060 SSR-positive ESTs and putative functions were assigned to 4483 unique contigs. Among the identified microsatellites, excluding mononucleotide repeats, di-/trinucleotides are predominant, among which repeat motifs of AG/CT and AAG/CTT were most abundant. Similarity search of SSR containing ESTs and antidiabetic gene sequences revealed 11 microsatellites linked to antidiabetic genes in five plants. GO term enrichment analysis revealed a total of 80 enriched GO terms widely distributed in 53 biological processes, 17 molecular functions, and 10 cellular components associated with the 11 markers. The present study therefore provides concrete insights into the frequency and distribution of SSRs in important medicinal resources. The microsatellite markers reported here markedly add to the genetic
High-level expression of soluble recombinant proteins in Escherichia coli using an HE-maltotriose-binding protein fusion tag.

PubMed

Han, Yingqian; Guo, Wanying; Su, Bingqian; Guo, Yujie; Wang, Jiang; Chu, Beibei; Yang, Guoyu

2018-02-01

Recombinant proteins are commonly expressed in prokaryotic expression systems for large-scale production. The use of genetically engineered affinity and solubility enhancing fusion proteins has increased greatly in recent years, and there now exists a considerable repertoire of these that can be used to enhance the expression, stability, solubility, folding, and purification of their fusion partner. Here, a modified histidine tag (HE) used as an affinity tag was employed together with a truncated maltotriose-binding protein (MBP; consisting of residues 59-433) from Pyrococcus furiosus as a solubility enhancing tag accompanying a tobacco etch virus protease-recognition site for protein expression and purification in Escherichia coli. Various proteins tagged at the N-terminus with HE-MBP(Pyr) were expressed in E. coli BL21(DE3) cells to determine expression and solubility relative to those tagged with His6-MBP or His6-MBP(Pyr). Furthermore, four HE-MBP(Pyr)-fused proteins were purified by immobilized metal affinity chromatography to assess the affinity of HE with immobilized Ni 2+ . Our results showed that HE-MBP(Pyr) represents an attractive fusion protein allowing high levels of soluble expression and purification of recombinant protein in E. coli. Copyright © 2017 Elsevier Inc. All rights reserved.

Ginger and turmeric expressed sequence tags identify signature genes for rhizome identity and development and the biosynthesis of curcuminoids, gingerols and terpenoids

PubMed Central

2013-01-01

Background Ginger (Zingiber officinale) and turmeric (Curcuma longa) accumulate important pharmacologically active metabolites at high levels in their rhizomes. Despite their importance, relatively little is known regarding gene expression in the rhizomes of ginger and turmeric. Results In order to identify rhizome-enriched genes and genes encoding specialized metabolism enzymes and pathway regulators, we evaluated an assembled collection of expressed sequence tags (ESTs) from eight different ginger and turmeric tissues. Comparisons to publicly available sorghum rhizome ESTs revealed a total of 777 gene transcripts expressed in ginger/turmeric and sorghum rhizomes but apparently absent from other tissues. The list of rhizome-specific transcripts was enriched for genes associated with regulation of tissue growth, development, and transcription. In particular, transcripts for ethylene response factors and AUX/IAA proteins appeared to accumulate in patterns mirroring results from previous studies regarding rhizome growth responses to exogenous applications of auxin and ethylene. Thus, these genes may play important roles in defining rhizome growth and development. Additional associations were made for ginger and turmeric rhizome-enriched MADS box transcription factors, their putative rhizome-enriched homologs in sorghum, and rhizomatous QTLs in rice. Additionally, analysis of both primary and specialized metabolism genes indicates that ginger and turmeric rhizomes are primarily devoted to the utilization of leaf supplied sucrose for the production and/or storage of specialized metabolites associated with the phenylpropanoid pathway and putative type III polyketide synthase gene products. This finding reinforces earlier hypotheses predicting roles of this enzyme class in the production of curcuminoids and gingerols. Conclusion A significant set of genes were found to be exclusively or preferentially expressed in the rhizome of ginger and turmeric. Specific
An integrated PCR colony hybridization approach to screen cDNA libraries for full-length coding sequences.

PubMed

Pollier, Jacob; González-Guzmán, Miguel; Ardiles-Diaz, Wilson; Geelen, Danny; Goossens, Alain

2011-01-01

cDNA-Amplified Fragment Length Polymorphism (cDNA-AFLP) is a commonly used technique for genome-wide expression analysis that does not require prior sequence knowledge. Typically, quantitative expression data and sequence information are obtained for a large number of differentially expressed gene tags. However, most of the gene tags do not correspond to full-length (FL) coding sequences, which is a prerequisite for subsequent functional analysis. A medium-throughput screening strategy, based on integration of polymerase chain reaction (PCR) and colony hybridization, was developed that allows in parallel screening of a cDNA library for FL clones corresponding to incomplete cDNAs. The method was applied to screen for the FL open reading frames of a selection of 163 cDNA-AFLP tags from three different medicinal plants, leading to the identification of 109 (67%) FL clones. Furthermore, the protocol allows for the use of multiple probes in a single hybridization event, thus significantly increasing the throughput when screening for rare transcripts. The presented strategy offers an efficient method for the conversion of incomplete expressed sequence tags (ESTs), such as cDNA-AFLP tags, to FL-coding sequences.
Serial analysis of gene expression (SAGE) in normal human trabecular meshwork.

PubMed

Liu, Yutao; Munro, Drew; Layfield, David; Dellinger, Andrew; Walter, Jeffrey; Peterson, Katherine; Rickman, Catherine Bowes; Allingham, R Rand; Hauser, Michael A

2011-04-08

To identify the genes expressed in normal human trabecular meshwork tissue, a tissue critical to the pathogenesis of glaucoma. Total RNA was extracted from human trabecular meshwork (HTM) harvested from 3 different donors. Extracted RNA was used to synthesize individual SAGE (serial analysis of gene expression) libraries using the I-SAGE Long kit from Invitrogen. Libraries were analyzed using SAGE 2000 software to extract the 17 base pair sequence tags. The extracted sequence tags were mapped to the genome using SAGE Genie map. A total of 298,834 SAGE tags were identified from all HTM libraries (96,842, 88,126, and 113,866 tags, respectively). Collectively, there were 107,325 unique tags. There were 10,329 unique tags with a minimum of 2 counts from a single library. These tags were mapped to known unique Unigene clusters. Approximately 29% of the tags (orphan tags) did not map to a known Unigene cluster. Thirteen percent of the tags mapped to at least 2 Unigene clusters. Sequence tags from many glaucoma-related genes, including myocilin, optineurin, and WD repeat domain 36, were identified. This is the first time SAGE analysis has been used to characterize the gene expression profile in normal HTM. SAGE analysis provides an unbiased sampling of gene expression of the target tissue. These data will provide new and valuable information to improve understanding of the biology of human aqueous outflow.
Preparation of next-generation sequencing libraries using Nextera™ technology: simultaneous DNA fragmentation and adaptor tagging by in vitro transposition.

PubMed

Caruccio, Nicholas

2011-01-01

DNA library preparation is a common entry point and bottleneck for next-generation sequencing. Current methods generally consist of distinct steps that often involve significant sample loss and hands-on time: DNA fragmentation, end-polishing, and adaptor-ligation. In vitro transposition with Nextera™ Transposomes simultaneously fragments and covalently tags the target DNA, thereby combining these three distinct steps into a single reaction. Platform-specific sequencing adaptors can be added, and the sample can be enriched and bar-coded using limited-cycle PCR to prepare di-tagged DNA fragment libraries. Nextera technology offers a streamlined, efficient, and high-throughput method for generating bar-coded libraries compatible with multiple next-generation sequencing platforms.
The Use of Affinity Tags to Overcome Obstacles in Recombinant Protein Expression and Purification.

PubMed

Amarasinghe, Chinthaka; Jin, Jian-Ping

2015-01-01

Research and industrial demands for recombinant proteins continue to increase over time for their broad applications in structural and functional studies and as therapeutic agents. These applications often require large quantities of recombinant protein at desirable purity, which highlights the importance of developing and improving production approaches that provide high level expression and readily achievable purity of recombinant protein. E. coli is the most widely used host for the expression of a diverse range of proteins at low cost. However, there are common pitfalls that can severely limit the expression of exogenous proteins, such as stability, low solubility and toxicity to the host cell. To overcome these obstacles, one strategy that has found to be promising is the use of affinity tags or carrier peptide to aid in the folding of the target protein, increase solubility, lower toxicity and increase the level of expression. In the meantime, the tags and fusion proteins can be designed to facilitate affinity purification. Since the fusion protein may not exhibit the native conformation of the target protein, various strategies have been developed to remove the tag during or after purification to avoid potential complications in structural and functional studies and to obtain native biological activities. Despite extensive research and rapid development along these lines, there are unsolved problems and imperfect applications. This focused review compares and contrasts various strategies that employ affinity tags to improve bacterial expression and to facilitate purification of recombinant proteins. The pros and cons of the approaches are discussed for more effective applications and new directions of future improvement.
Genetically encoded fluorescent tags

PubMed Central

Thorn, Kurt

2017-01-01

Genetically encoded fluorescent tags are protein sequences that can be fused to a protein of interest to render it fluorescent. These tags have revolutionized cell biology by allowing nearly any protein to be imaged by light microscopy at submicrometer spatial resolution and subsecond time resolution in a live cell or organism. They can also be used to measure protein abundance in thousands to millions of cells using flow cytometry. Here I provide an introduction to the different genetic tags available, including both intrinsically fluorescent proteins and proteins that derive their fluorescence from binding of either endogenous or exogenous fluorophores. I discuss their optical and biological properties and guidelines for choosing appropriate tags for an experiment. Tools for tagging nucleic acid sequences and reporter molecules that detect the presence of different biomolecules are also briefly discussed. PMID:28360214
Primer and platform effects on 16S rRNA tag sequencing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tremblay, Julien; Singh, Kanwar; Fern, Alison

Sequencing of 16S rRNA gene tags is a popular method for profiling and comparing microbial communities. The protocols and methods used, however, vary considerably with regard to amplification primers, sequencing primers, sequencing technologies; as well as quality filtering and clustering. How results are affected by these choices, and whether data produced with different protocols can be meaningfully compared, is often unknown. Here we compare results obtained using three different amplification primer sets (targeting V4, V6–V8, and V7–V8) and two sequencing technologies (454 pyrosequencing and Illumina MiSeq) using DNA from a mock community containing a known number of species as wellmore » as complex environmental samples whose PCR-independent profiles were estimated using shotgun sequencing. We find that paired-end MiSeq reads produce higher quality data and enabled the use of more aggressive quality control parameters over 454, resulting in a higher retention rate of high quality reads for downstream data analysis. While primer choice considerably influences quantitative abundance estimations, sequencing platform has relatively minor effects when matched primers are used. In conclusion, beta diversity metrics are surprisingly robust to both primer and sequencing platform biases.« less
Primer and platform effects on 16S rRNA tag sequencing

DOE PAGES

Tremblay, Julien; Singh, Kanwar; Fern, Alison; ...

2015-08-04

Sequencing of 16S rRNA gene tags is a popular method for profiling and comparing microbial communities. The protocols and methods used, however, vary considerably with regard to amplification primers, sequencing primers, sequencing technologies; as well as quality filtering and clustering. How results are affected by these choices, and whether data produced with different protocols can be meaningfully compared, is often unknown. Here we compare results obtained using three different amplification primer sets (targeting V4, V6–V8, and V7–V8) and two sequencing technologies (454 pyrosequencing and Illumina MiSeq) using DNA from a mock community containing a known number of species as wellmore » as complex environmental samples whose PCR-independent profiles were estimated using shotgun sequencing. We find that paired-end MiSeq reads produce higher quality data and enabled the use of more aggressive quality control parameters over 454, resulting in a higher retention rate of high quality reads for downstream data analysis. While primer choice considerably influences quantitative abundance estimations, sequencing platform has relatively minor effects when matched primers are used. In conclusion, beta diversity metrics are surprisingly robust to both primer and sequencing platform biases.« less
Analysis and functional annotation of expressed sequence tags from in vitro cell lines of elasmobranchs: spiny dogfish shark (Squalus acanthias) and little skate (Leucoraja erinacea)

PubMed Central

Parton, Angela; Bayne, Christopher J.; Barnes, David W.

2010-01-01

Elasmobranchs are the most commonly used experimental models among the jawed, cartilaginous fish (Chondrichthyes). Previously we developed cell lines from embryos of two elasmobranchs, Squalus acanthias the spiny dogfish shark (SAE line), and Leucoraja erinacea the little skate (LEE-1 line). From these lines cDNA libraries were derived and expressed sequence tags (ESTs) generated. From the SAE cell line 4303 unique transcripts were identified, with 1848 of these representing unknown sequences (showing no BLASTX identification). From the LEE-1 cell line, 3660 unique transcripts were identified, and unknown, unique sequences totaled 1333. Gene Ontology (GO) annotation showed that GO assignments for the two cell lines were in general similar. These results suggest that the procedures used to derive the cell lines led to isolation of cell types of the same general embryonic origin from both species. The LEE-1 transcripts included GO categories “envelope” and “oxidoreductase activity” but the SAE transcripts did not. GO analysis of SAE transcripts identified the category “anatomical structure formation” that was not present in LEE-1 cells. Increased organelle compartments may exist within LEE-1 cells compared to SAE cells, and the higher oxidoreductase activity in LEE-1 cells may indicate a role for these cells in responses associated with innate immunity or in steroidogenesis. These EST libraries from elasmobranch cell lines provide information for assembly of genomic sequences and are useful in revealing gene diversity, new genes and molecular markers, as well as in providing means for elucidation of full-length cDNAs and probes for gene array analyses. This is the first study of this type with members of the Chondrichthyes. PMID:20471924
Analysis and functional annotation of expressed sequence tags from in vitro cell lines of elasmobranchs: Spiny dogfish shark (Squalus acanthias) and little skate (Leucoraja erinacea).

PubMed

Parton, Angela; Bayne, Christopher J; Barnes, David W

2010-09-01

Elasmobranchs are the most commonly used experimental models among the jawed, cartilaginous fish (Chondrichthyes). Previously we developed cell lines from embryos of two elasmobranchs, Squalus acanthias the spiny dogfish shark (SAE line), and Leucoraja erinacea the little skate (LEE-1 line). From these lines cDNA libraries were derived and expressed sequence tags (ESTs) generated. From the SAE cell line 4303 unique transcripts were identified, with 1848 of these representing unknown sequences (showing no BLASTX identification). From the LEE-1 cell line, 3660 unique transcripts were identified, and unknown, unique sequences totaled 1333. Gene Ontology (GO) annotation showed that GO assignments for the two cell lines were in general similar. These results suggest that the procedures used to derive the cell lines led to isolation of cell types of the same general embryonic origin from both species. The LEE-1 transcripts included GO categories "envelope" and "oxidoreductase activity" but the SAE transcripts did not. GO analysis of SAE transcripts identified the category "anatomical structure formation" that was not present in LEE-1 cells. Increased organelle compartments may exist within LEE-1 cells compared to SAE cells, and the higher oxidoreductase activity in LEE-1 cells may indicate a role for these cells in responses associated with innate immunity or in steroidogenesis. These EST libraries from elasmobranch cell lines provide information for assembly of genomic sequences and are useful in revealing gene diversity, new genes and molecular markers, as well as in providing means for elucidation of full-length cDNAs and probes for gene array analyses. This is the first study of this type with members of the Chondrichthyes. Copyright 2010 Elsevier Inc. All rights reserved.
Preparing and Analyzing Expressed Sequence Tags (ESTs) Library for the Mammary Tissue of Local Turkish Kivircik Sheep

PubMed Central

Omeroglu Ulu, Zehra; Ulu, Salih; Un, Cemal; Ozdem Oztabak, Kemal; Altunatmaz, Kemal

2017-01-01

Kivircik sheep is an important local Turkish sheep according to its meat quality and milk productivity. The aim of this study was to analyze gene expression profiles of both prenatal and postnatal stages for the Kivircik sheep. Therefore, two different cDNA libraries, which were taken from the same Kivircik sheep mammary gland tissue at prenatal and postnatal stages, were constructed. Total 3072 colonies which were randomly selected from the two libraries were sequenced for developing a sheep ESTs collection. We used Phred/Phrap computer programs for analysis of the raw EST and readable EST sequences were assembled with the CAP3 software. Putative functions of all unique sequences and statistical analysis were determined by Geneious software. Total 422 ESTs have over 80% similarity to known sequences of other organisms in NCBI classified by Panther database for the Gene Ontology (GO) category. By comparing gene expression profiles, we observed some putative genes that may be relative to reproductive performance or play important roles in milk synthesis and secretion. A total of 2414 ESTs have been deposited to the NCBI GenBank database (GW996847–GW999260). EST data in this study have provided a new source of information to functional genome studies of sheep. PMID:28239610
Expression of fluorescently tagged connexins: a novel approach to rescue function of oligomeric DsRed-tagged proteins.

PubMed

Lauf, U; Lopez, P; Falk, M M

2001-06-01

A novel, brilliantly red fluorescent protein, DsRed has become available recently opening up a wide variety of experimental opportunities for double labeling and fluorescence resonance electron transfer experiments in combination with green fluorescent protein (GFP). Unlike in the case of GFP, proteins tagged with DsRed were often found to aggregate within the cell. Here we report a simple method that allows rescuing the function of an oligomeric protein tagged with DsRed. We demonstrate the feasibility of this approach on the subunit proteins of an oligomeric membrane channel, gap junction connexins. Additionally, DsRed fluorescence was easily detected 12-16 h post transfection, much earlier than previously reported, and could readily be differentiated from co-expressed GFP. Thus, this approach can eliminate the major drawbacks of this highly attractive autofluorescent protein.
Versatile Gene-Specific Sequence Tags for Arabidopsis Functional Genomics: Transcript Profiling and Reverse Genetics Applications

PubMed Central

Hilson, Pierre; Allemeersch, Joke; Altmann, Thomas; Aubourg, Sébastien; Avon, Alexandra; Beynon, Jim; Bhalerao, Rishikesh P.; Bitton, Frédérique; Caboche, Michel; Cannoot, Bernard; Chardakov, Vasil; Cognet-Holliger, Cécile; Colot, Vincent; Crowe, Mark; Darimont, Caroline; Durinck, Steffen; Eickhoff, Holger; de Longevialle, Andéol Falcon; Farmer, Edward E.; Grant, Murray; Kuiper, Martin T.R.; Lehrach, Hans; Léon, Céline; Leyva, Antonio; Lundeberg, Joakim; Lurin, Claire; Moreau, Yves; Nietfeld, Wilfried; Paz-Ares, Javier; Reymond, Philippe; Rouzé, Pierre; Sandberg, Goran; Segura, Maria Dolores; Serizet, Carine; Tabrett, Alexandra; Taconnat, Ludivine; Thareau, Vincent; Van Hummelen, Paul; Vercruysse, Steven; Vuylsteke, Marnik; Weingartner, Magdalena; Weisbeek, Peter J.; Wirta, Valtteri; Wittink, Floyd R.A.; Zabeau, Marc; Small, Ian

2004-01-01

Microarray transcript profiling and RNA interference are two new technologies crucial for large-scale gene function studies in multicellular eukaryotes. Both rely on sequence-specific hybridization between complementary nucleic acid strands, inciting us to create a collection of gene-specific sequence tags (GSTs) representing at least 21,500 Arabidopsis genes and which are compatible with both approaches. The GSTs were carefully selected to ensure that each of them shared no significant similarity with any other region in the Arabidopsis genome. They were synthesized by PCR amplification from genomic DNA. Spotted microarrays fabricated from the GSTs show good dynamic range, specificity, and sensitivity in transcript profiling experiments. The GSTs have also been transferred to bacterial plasmid vectors via recombinational cloning protocols. These cloned GSTs constitute the ideal starting point for a variety of functional approaches, including reverse genetics. We have subcloned GSTs on a large scale into vectors designed for gene silencing in plant cells. We show that in planta expression of GST hairpin RNA results in the expected phenotypes in silenced Arabidopsis lines. These versatile GST resources provide novel and powerful tools for functional genomics. PMID:15489341
Recombinant protein expression and purification: a comprehensive review of affinity tags and microbial applications.

PubMed

Young, Carissa L; Britton, Zachary T; Robinson, Anne S

2012-05-01

Protein fusion tags are indispensible tools used to improve recombinant protein expression yields, enable protein purification, and accelerate the characterization of protein structure and function. Solubility-enhancing tags, genetically engineered epitopes, and recombinant endoproteases have resulted in a versatile array of combinatorial elements that facilitate protein detection and purification in microbial hosts. In this comprehensive review, we evaluate the most frequently used solubility-enhancing and affinity tags. Furthermore, we provide summaries of well-characterized purification strategies that have been used to increase product yields and have widespread application in many areas of biotechnology including drug discovery, therapeutics, and pharmacology. This review serves as an excellent literature reference for those working on protein fusion tags. Copyright © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
New ligation independent cloning vectors for expression of recombinant proteins with a self-cleaving CPD/6xHis-tag.

PubMed

Biancucci, Marco; Dolores, Jazel S; Wong, Jennifer; Grimshaw, Sarah; Anderson, Wayne F; Satchell, Karla J F; Kwon, Keehwan

2017-01-05

Recombinant protein purification is a crucial step for biochemistry and structural biology fields. Rapid robust purification methods utilize various peptide or protein tags fused to the target protein for affinity purification using corresponding matrices and to enhance solubility. However, affinity/solubility-tags often need to be removed in order to conduct functional and structural studies, adding complexities to purification protocols. In this work, the Vibrio cholerae MARTX toxin Cysteine Protease Domain (CPD) was inserted in a ligation-independent cloning (LIC) vector to create a C-terminal 6xHis-tagged inducible autoprocessing enzyme tag, called "the CPD-tag". The pCPD and alternative pCPD/ccdB cloning vectors allow for easy insertion of DNA and expression of the target protein fused to the CPD-tag, which is removed at the end of the purification step by addition of the inexpensive small molecule inositol hexakisphosphate to induce CPD autoprocessing. This process is demonstrated using a small bacterial membrane localization domain and for high yield purification of the eukaryotic small GTPase KRas. Subsequently, pCPD was tested with 40 proteins or sub-domains selected from a high throughput crystallization pipeline. pCPD vectors are easily used LIC compatible vectors for expression of recombinant proteins with a C-terminal CPD/6xHis-tag. Although intended only as a strategy for rapid tag removal, this pilot study revealed the CPD-tag may also increase expression and solubility of some recombinant proteins.
The characterisation of novel secreted Ly-6 proteins from rat urine by the combined use of two-dimensional gel electrophoresis, microbore high performance liquid chromatography and expressed sequence tag data.

PubMed

Southan, Christopher; Cutler, Paul; Birrell, Helen; Connell, John; Fantom, Kenneth G M; Sims, Matthew; Shaikh, Narjis; Schneider, Klaus

2002-02-01

A proteomic study of rat urine was undertaken using two-dimensional gel electrophoresis, microbore high performance liquid chromatography, mass spectrometry and N-terminal sequencing. Five known urinary proteins were identified but two novel peptide fragments matched a large number of rat expressed sequence tags (ESTs) from a liver library. By combining protein chemical and nucleotide data, two 101-residue open reading frames with 90% amino acid identity were determined, rat urinary protein 1 (RUP-1) and RUP-2. The data established signal peptide removal and provided evidence for N-glycosylation. A third related sequence, rat spleen protein (RSP-1) was confirmed from EST searches. These three proteins have been submitted to SWISS-PROT as P81827, P81828 and Q9QXN2, respectively. A fourth novel homologue was found in porcine and bovine ESTs from embryo libraries. Alignment with known homologues showed conserved cysteine positions characteristic of a secreted subfamily of Ly-6 proteins. In two cases, antineoplastic urinary protein and caltrin, these homologues have unverified functional annotations. The RUP sequences showed high scoring matches to three unrelated rat mRNAs subsequently established to be chimeric. Two of these share extended sectional identity to RUP-1 but the third may represent another novel Ly-6 homologue. These chimeras have caused serious annotation errors in secondary databases.
Identification of Abundantly Expressed Novel and Conserved Genes from the Infective Larval Stage of Toxocara canis by an Expressed Sequence Tag Strategy

PubMed Central

Tetteh, Kevin K. A.; Loukas, Alex; Tripp, Cindy; Maizels, Rick M.

1999-01-01

Larvae of Toxocara canis, a nematode parasite of dogs, infect humans, causing visceral and ocular larva migrans. In noncanid hosts, larvae neither grow nor differentiate but endure in a state of arrested development. Reasoning that parasite protein production is orientated to immune evasion, we undertook a random sequencing project from a larval cDNA library to characterize the most highly expressed transcripts. In all, 266 clones were sequenced, most from both 3′ and 5′ ends, and similarity searches against GenBank protein and dbEST nucleotide databases were conducted. Cluster analyses showed that 128 distinct gene products had been found, all but 3 of which represented newly identified genes. Ninety-five genes were represented by a single clone, but seven transcripts were present at high frequencies, each composing >2% of all clones sequenced. These high-abundance transcripts include a mucin and a C-type lectin, which are both major excretory-secretory antigens released by parasites. Four highly expressed novel gene transcripts, termed ant (abundant novel transcript) genes, were found. Together, these four genes comprised 18% of all cDNA clones isolated, but no similar sequences occur in the Caenorhabditis elegans genome. While the coding regions of the four genes are dissimilar, their 3′ untranslated tracts have significant homology in nucleotide sequence. The discovery of these abundant, parasite-specific genes of newly identified lectins and mucins, as well as a range of conserved and novel proteins, provides defined candidates for future analysis of the molecular basis of immune evasion by T. canis. PMID:10456930
Analysis of expressed sequence tags generated from full-length enriched cDNA libraries of melon

PubMed Central

2011-01-01

Background Melon (Cucumis melo), an economically important vegetable crop, belongs to the Cucurbitaceae family which includes several other important crops such as watermelon, cucumber, and pumpkin. It has served as a model system for sex determination and vascular biology studies. However, genomic resources currently available for melon are limited. Result We constructed eleven full-length enriched and four standard cDNA libraries from fruits, flowers, leaves, roots, cotyledons, and calluses of four different melon genotypes, and generated 71,577 and 22,179 ESTs from full-length enriched and standard cDNA libraries, respectively. These ESTs, together with ~35,000 ESTs available in public domains, were assembled into 24,444 unigenes, which were extensively annotated by comparing their sequences to different protein and functional domain databases, assigning them Gene Ontology (GO) terms, and mapping them onto metabolic pathways. Comparative analysis of melon unigenes and other plant genomes revealed that 75% to 85% of melon unigenes had homologs in other dicot plants, while approximately 70% had homologs in monocot plants. The analysis also identified 6,972 gene families that were conserved across dicot and monocot plants, and 181, 1,192, and 220 gene families specific to fleshy fruit-bearing plants, the Cucurbitaceae family, and melon, respectively. Digital expression analysis identified a total of 175 tissue-specific genes, which provides a valuable gene sequence resource for future genomics and functional studies. Furthermore, we identified 4,068 simple sequence repeats (SSRs) and 3,073 single nucleotide polymorphisms (SNPs) in the melon EST collection. Finally, we obtained a total of 1,382 melon full-length transcripts through the analysis of full-length enriched cDNA clones that were sequenced from both ends. Analysis of these full-length transcripts indicated that sizes of melon 5' and 3' UTRs were similar to those of tomato, but longer than many other dicot
Expression and purification of recombinant proteins in Escherichia coli tagged with a small metal-binding protein from Nitrosomonas europaea.

PubMed

Vargas-Cortez, Teresa; Morones-Ramirez, Jose Ruben; Balderas-Renteria, Isaias; Zarate, Xristo

2016-02-01

Escherichia coli is still the preferred organism for large-scale production of recombinant proteins. The use of fusion proteins has helped considerably in enhancing the solubility of heterologous proteins and their purification with affinity chromatography. Here, the use of a small metal-binding protein (SmbP) from Nitrosomonas europaea is described as a new fusion protein for protein expression and purification in E. coli. Fluorescent proteins tagged at the N-terminal with SmbP showed high levels of solubility, compared with those of maltose-binding protein and glutathione S-transferase, and low formation of inclusion bodies. Using commercially available IMAC resins charged with Ni(II), highly pure recombinant proteins were obtained after just one chromatography step. Proteins may be purified from the periplasm of E. coli if SmbP contains the signal sequence at the N-terminal. After removal of the SmbP tag from the protein of interest, high-yields are obtained since SmbP is a protein of just 9.9 kDa. The results here obtained suggest that SmbP is a good alternative as a fusion protein/affinity tag for the production of soluble recombinant proteins in E. coli. Copyright © 2015 Elsevier Inc. All rights reserved.
Tryptophan tags and de novo designed complementary affinity ligands for the expression and purification of recombinant proteins.

PubMed

Pina, Ana Sofia; Carvalho, Sara; Dias, Ana Margarida G C; Guilherme, Márcia; Pereira, Alice S; Caraça, Luciana T; Coroadinha, Ana Sofia; Lowe, Christopher R; Roque, A Cecília A

2016-11-11

A common strategy for the production and purification of recombinant proteins is to fuse a tag to the protein terminal residues and employ a "tag-specific" ligand for fusion protein capture and purification. In this work, we explored the effect of two tryptophan-based tags, NWNWNW and WFWFWF, on the expression and purification of Green Fluorescence Protein (GFP) used as a model fusion protein. The titers obtained with the expression of these fusion proteins in soluble form were 0.11mgml -1 and 0.48mgml -1 for WFWFWF and NWNWNW, respectively. A combinatorial library comprising 64 ligands based on the Ugi reaction was prepared and screened for binding GFP-tagged and non-tagged proteins. Complementary ligands A2C2 and A3C1 were selected for the effective capture of NWNWNW and WFWFWF tagged proteins, respectively, in soluble forms. These affinity pairs displayed 10 6 M -1 affinity constants and Qmax values of 19.11±2.60ugg -1 and 79.39ugg -1 for the systems WFWFWF AND NWNWNW, respectively. GFP fused to the WFWFWF affinity tag was also produced as inclusion bodies, and a refolding-on column strategy was explored using the ligand A4C8, selected from the combinatorial library of ligands but in presence of denaturant agents. Copyright © 2016 Elsevier B.V. All rights reserved.

Parallel gene analysis with allele-specific padlock probes and tag microarrays

PubMed Central

Banér, Johan; Isaksson, Anders; Waldenström, Erik; Jarvius, Jonas; Landegren, Ulf; Nilsson, Mats

2003-01-01

Parallel, highly specific analysis methods are required to take advantage of the extensive information about DNA sequence variation and of expressed sequences. We present a scalable laboratory technique suitable to analyze numerous target sequences in multiplexed assays. Sets of padlock probes were applied to analyze single nucleotide variation directly in total genomic DNA or cDNA for parallel genotyping or gene expression analysis. All reacted probes were then co-amplified and identified by hybridization to a standard tag oligonucleotide array. The technique was illustrated by analyzing normal and pathogenic variation within the Wilson disease-related ATP7B gene, both at the level of DNA and RNA, using allele-specific padlock probes. PMID:12930977
Expressed sequences tags of the anther smut fungus, Microbotryum violaceum, identify mating and pathogenicity genes

PubMed Central

Yockteng, Roxana; Marthey, Sylvain; Chiapello, Hélène; Gendrault, Annie; Hood, Michael E; Rodolphe, François; Devier, Benjamin; Wincker, Patrick; Dossat, Carole; Giraud, Tatiana

2007-01-01

Background The basidiomycete fungus Microbotryum violaceum is responsible for the anther-smut disease in many plants of the Caryophyllaceae family and is a model in genetics and evolutionary biology. Infection is initiated by dikaryotic hyphae produced after the conjugation of two haploid sporidia of opposite mating type. This study describes M. violaceum ESTs corresponding to nuclear genes expressed during conjugation and early hyphal production. Results A normalized cDNA library generated 24,128 sequences, which were assembled into 7,765 unique genes; 25.2% of them displayed significant similarity to annotated proteins from other organisms, 74.3% a weak similarity to the same set of known proteins, and 0.5% were orphans. We identified putative pheromone receptors and genes that in other fungi are involved in the mating process. We also identified many sequences similar to genes known to be involved in pathogenicity in other fungi. The M. violaceum EST database, MICROBASE, is available on the Web and provides access to the sequences, assembled contigs, annotations and programs to compare similarities against MICROBASE. Conclusion This study provides a basis for cloning the mating type locus, for further investigation of pathogenicity genes in the anther smut fungi, and for comparative genomics. PMID:17692127
Frequency tagging to track the neural processing of contrast in fast, continuous sound sequences.

PubMed

Nozaradan, Sylvie; Mouraux, André; Cousineau, Marion

2017-07-01

The human auditory system presents a remarkable ability to detect rapid changes in fast, continuous acoustic sequences, as best illustrated in speech and music. However, the neural processing of rapid auditory contrast remains largely unclear, probably due to the lack of methods to objectively dissociate the response components specifically related to the contrast from the other components in response to the sequence of fast continuous sounds. To overcome this issue, we tested a novel use of the frequency-tagging approach allowing contrast-specific neural responses to be tracked based on their expected frequencies. The EEG was recorded while participants listened to 40-s sequences of sounds presented at 8Hz. A tone or interaural time contrast was embedded every fifth sound (AAAAB), such that a response observed in the EEG at exactly 8 Hz/5 (1.6 Hz) or harmonics should be the signature of contrast processing by neural populations. Contrast-related responses were successfully identified, even in the case of very fine contrasts. Moreover, analysis of the time course of the responses revealed a stable amplitude over repetitions of the AAAAB patterns in the sequence, except for the response to perceptually salient contrasts that showed a buildup and decay across repetitions of the sounds. Overall, this new combination of frequency-tagging with an oddball design provides a valuable complement to the classic, transient, evoked potentials approach, especially in the context of rapid auditory information. Specifically, we provide objective evidence on the neural processing of contrast embedded in fast, continuous sound sequences. NEW & NOTEWORTHY Recent theories suggest that the basis of neurodevelopmental auditory disorders such as dyslexia might be an impaired processing of fast auditory changes, highlighting how the encoding of rapid acoustic information is critical for auditory communication. Here, we present a novel electrophysiological approach to capture in humans
ScanRanker: Quality Assessment of Tandem Mass Spectra via Sequence Tagging

PubMed Central

Ma, Ze-Qiang; Chambers, Matthew C.; Ham, Amy-Joan L.; Cheek, Kristin L.; Whitwell, Corbin W.; Aerni, Hans-Rudolf; Schilling, Birgit; Miller, Aaron W.; Caprioli, Richard M.; Tabb, David L.

2011-01-01

In shotgun proteomics, protein identification by tandem mass spectrometry relies on bioinformatics tools. Despite recent improvements in identification algorithms, a significant number of high quality spectra remain unidentified for various reasons. Here we present ScanRanker, an open-source tool that evaluates the quality of tandem mass spectra via sequence tagging with reliable performance in data from different instruments. The superior performance of ScanRanker enables it not only to find unassigned high quality spectra that evade identification through database search, but also to select spectra for de novo sequencing and cross-linking analysis. In addition, we demonstrate that the distribution of ScanRanker scores predicts the richness of identifiable spectra among multiple LC-MS/MS runs in an experiment, and ScanRanker scores assist the process of peptide assignment validation to increase confident spectrum identifications. The source code and executable versions of ScanRanker are available from http://fenchurch.mc.vanderbilt.edu. PMID:21520941
Poly(A)-tag deep sequencing data processing to extract poly(A) sites.

PubMed

Wu, Xiaohui; Ji, Guoli; Li, Qingshun Quinn

2015-01-01

Polyadenylation [poly(A)] is an essential posttranscriptional processing step in the maturation of eukaryotic mRNA. The advent of next-generation sequencing (NGS) technology has offered feasible means to generate large-scale data and new opportunities for intensive study of polyadenylation, particularly deep sequencing of the transcriptome targeting the junction of 3'-UTR and the poly(A) tail of the transcript. To take advantage of this unprecedented amount of data, we present an automated workflow to identify polyadenylation sites by integrating NGS data cleaning, processing, mapping, normalizing, and clustering. In this pipeline, a series of Perl scripts are seamlessly integrated to iteratively map the single- or paired-end sequences to the reference genome. After mapping, the poly(A) tags (PATs) at the same genome coordinate are grouped into one cleavage site, and the internal priming artifacts removed. Then the ambiguous region is introduced to parse the genome annotation for cleavage site clustering. Finally, cleavage sites within a close range of 24 nucleotides and from different samples can be clustered into poly(A) clusters. This procedure could be used to identify thousands of reliable poly(A) clusters from millions of NGS sequences in different tissues or treatments.
The synchronous TAG production with the growth by the expression of chloroplast transit peptide-fused ScPDAT in Chlamydomonas reinhardtii.

PubMed

Zhu, Zhen; Yuan, Guangze; Fan, Xuran; Fan, Yan; Yang, Miao; Yin, Yalei; Liu, Jiao; Liu, Yang; Cao, Xupeng; Tian, Jing; Xue, Song

2018-01-01

The synchronous triacylglycerol (TAG) production with the growth is a key step to lower the cost of the microalgae-based biofuel production. Phospholipid: diacylglycerol acyltransferase (PDAT) has been identified recently and catalyzes the phospholipid contributing acyl group to diacylglycerol to synthesize TAG, and is considered as the important source of TAG in Chlamydomonas reinhardtii . Using a chimeric Hsp70A-RbcS2 promoter, exogenous PDAT form Saccharomyces cerevisiae fused with a chloroplast transit peptide was expressed in C. reinhardtii CC-137. Proved by western blot, the expression of ScPDAT showed a synchronous trend to the growth in the exponential phase. Compared to the wild type, the strain of Scpdat achieved 22% increase in the content of total fatty acids and 32% increase in TAG content. In addition, the fluctuation of C16 series fatty acid in monogalactosyldiacylglycerol, diacylglyceryltrimethylhomoserine and TAG indicated an enhancement in the TAG accumulation pathway. The TAG production was enhanced in the regular cultivation without the nutrient stress by strengthening the conversion of polar lipid to TAG in C. reinhardtii and the findings provide a candidate strategy for rational engineered strain to overcome the decline in the growth during the TAG accumulation triggered by nitrogen starvation.
Improved serial analysis of V1 ribosomal sequence tags (SARST-V1) provides a rapid, comprehensive, sequence-based characterization of bacterial diversity and community composition.

PubMed

Yu, Zhongtang; Yu, Marie; Morrison, Mark

2006-04-01

Serial analysis of ribosomal sequence tags (SARST) is a recently developed technology that can generate large 16S rRNA gene (rrs) sequence data sets from microbiomes, but there are numerous enzymatic and purification steps required to construct the ribosomal sequence tag (RST) clone libraries. We report here an improved SARST method, which still targets the V1 hypervariable region of rrs genes, but reduces the number of enzymes, oligonucleotides, reagents, and technical steps needed to produce the RST clone libraries. The new method, hereafter referred to as SARST-V1, was used to examine the eubacterial diversity present in community DNA recovered from the microbiome resident in the ovine rumen. The 190 sequenced clones contained 1055 RSTs and no less than 236 unique phylotypes (based on > or = 95% sequence identity) that were assigned to eight different eubacterial phyla. Rarefaction and monomolecular curve analyses predicted that the complete RST clone library contains 99% of the 353 unique phylotypes predicted to exist in this microbiome. When compared with ribosomal intergenic spacer analysis (RISA) of the same community DNA sample, as well as a compilation of nine previously published conventional rrs clone libraries prepared from the same type of samples, the RST clone library provided a more comprehensive characterization of the eubacterial diversity present in rumen microbiomes. As such, SARST-V1 should be a useful tool applicable to comprehensive examination of diversity and composition in microbiomes and offers an affordable, sequence-based method for diversity analysis.
Increasing ecological inference from high throughput sequencing of fungi in the environment through a tagging approach

Treesearch

D. Lee Taylor; Michael G. Booth; Jack W. McFarland; Ian C. Herriott; Niall J. Lennon; Chad Nusbaum; Thomas G. Marr

2008-01-01

High throughput sequencing methods are widely used in analyses of microbial diversity but are generally applied to small numbers of samples, which precludes charaterization of patterns of microbial diversity across space and time. We have designed a primer-tagging approach that allows pooling and subsequent sorting of numerous samples, which is directed to...
In silico identification and characterization of conserved miRNAs and their target genes in sweet potato (Ipomoea batatas L.) Expressed Sequence Tags (ESTs)

PubMed Central

Dehury, Budheswar; Panda, Debashis; Sahu, Jagajjit; Sahu, Mousumi; Sarma, Kishore; Barooah, Madhumita; Sen, Priyabrata; Modi, Mahendra Kumar

2013-01-01

The endogenous small non-coding micro RNAs (miRNAs), which are typically ~21–24 nt nucleotides, play a crucial role in regulating the intrinsic normal growth of cells and development of the plants as well as in maintaining the integrity of genomes. These small non-coding RNAs function as the universal specificity factors in post-transcriptional gene silencing. Discovering miRNAs, identifying their targets, and further inferring miRNA functions is a routine process to understand normal biological processes of miRNAs and their roles in the development of plants. Comparative genomics based approach using expressed sequence tags (EST) and genome survey sequences (GSS) offer a cost-effective platform for identification and characterization of miRNAs and their target genes in plants. Despite the fact that sweet potato (Ipomoea batatas L.) is an important staple food source for poor small farmers throughout the world, the role of miRNA in various developmental processes remains largely unknown. In this paper, we report the computational identification of miRNAs and their target genes in sweet potato from their ESTs. Using comparative genomics-based approach, 8 potential miRNA candidates belonging to miR168, miR2911, and miR156 families were identified from 23 406 ESTs in sweet potato. A total of 42 target genes were predicted and their probable functions were illustrated. Most of the newly identified miRNAs target transcription factors as well as genes involved in plant growth and development, signal transduction, metabolism, defense, and stress response. The identification of miRNAs and their targets is expected to accelerate the pace of miRNA discovery, leading to an improved understanding of the role of miRNA in development and physiology of sweet potato, as well as stress response. PMID:24067297
An efficient annotation and gene-expression derivation tool for Illumina Solexa datasets.

PubMed

Hosseini, Parsa; Tremblay, Arianne; Matthews, Benjamin F; Alkharouf, Nadim W

2010-07-02

The data produced by an Illumina flow cell with all eight lanes occupied, produces well over a terabyte worth of images with gigabytes of reads following sequence alignment. The ability to translate such reads into meaningful annotation is therefore of great concern and importance. Very easily, one can get flooded with such a great volume of textual, unannotated data irrespective of read quality or size. CASAVA, a optional analysis tool for Illumina sequencing experiments, enables the ability to understand INDEL detection, SNP information, and allele calling. To not only extract from such analysis, a measure of gene expression in the form of tag-counts, but furthermore to annotate such reads is therefore of significant value. We developed TASE (Tag counting and Analysis of Solexa Experiments), a rapid tag-counting and annotation software tool specifically designed for Illumina CASAVA sequencing datasets. Developed in Java and deployed using jTDS JDBC driver and a SQL Server backend, TASE provides an extremely fast means of calculating gene expression through tag-counts while annotating sequenced reads with the gene's presumed function, from any given CASAVA-build. Such a build is generated for both DNA and RNA sequencing. Analysis is broken into two distinct components: DNA sequence or read concatenation, followed by tag-counting and annotation. The end result produces output containing the homology-based functional annotation and respective gene expression measure signifying how many times sequenced reads were found within the genomic ranges of functional annotations. TASE is a powerful tool to facilitate the process of annotating a given Illumina Solexa sequencing dataset. Our results indicate that both homology-based annotation and tag-count analysis are achieved in very efficient times, providing researchers to delve deep in a given CASAVA-build and maximize information extraction from a sequencing dataset. TASE is specially designed to translate sequence data
An efficient annotation and gene-expression derivation tool for Illumina Solexa datasets

PubMed Central

2010-01-01

Background The data produced by an Illumina flow cell with all eight lanes occupied, produces well over a terabyte worth of images with gigabytes of reads following sequence alignment. The ability to translate such reads into meaningful annotation is therefore of great concern and importance. Very easily, one can get flooded with such a great volume of textual, unannotated data irrespective of read quality or size. CASAVA, a optional analysis tool for Illumina sequencing experiments, enables the ability to understand INDEL detection, SNP information, and allele calling. To not only extract from such analysis, a measure of gene expression in the form of tag-counts, but furthermore to annotate such reads is therefore of significant value. Findings We developed TASE (Tag counting and Analysis of Solexa Experiments), a rapid tag-counting and annotation software tool specifically designed for Illumina CASAVA sequencing datasets. Developed in Java and deployed using jTDS JDBC driver and a SQL Server backend, TASE provides an extremely fast means of calculating gene expression through tag-counts while annotating sequenced reads with the gene's presumed function, from any given CASAVA-build. Such a build is generated for both DNA and RNA sequencing. Analysis is broken into two distinct components: DNA sequence or read concatenation, followed by tag-counting and annotation. The end result produces output containing the homology-based functional annotation and respective gene expression measure signifying how many times sequenced reads were found within the genomic ranges of functional annotations. Conclusions TASE is a powerful tool to facilitate the process of annotating a given Illumina Solexa sequencing dataset. Our results indicate that both homology-based annotation and tag-count analysis are achieved in very efficient times, providing researchers to delve deep in a given CASAVA-build and maximize information extraction from a sequencing dataset. TASE is specially
Heparin-binding peptide as a novel affinity tag for purification of recombinant proteins.

PubMed

Morris, Jacqueline; Jayanthi, Srinivas; Langston, Rebekah; Daily, Anna; Kight, Alicia; McNabb, David S; Henry, Ralph; Kumar, Thallapuranam Krishnaswamy Suresh

2016-10-01

Purification of recombinant proteins constitutes a significant part of the downstream processing in biopharmaceutical industries. Major costs involved in the production of bio-therapeutics mainly depend on the number of purification steps used during the downstream process. Affinity chromatography is a widely used method for the purification of recombinant proteins expressed in different expression host platforms. Recombinant protein purification is achieved by fusing appropriate affinity tags to either N- or C- terminus of the target recombinant proteins. Currently available protein/peptide affinity tags have proved quite useful in the purification of recombinant proteins. However, these affinity tags suffer from specific limitations in their use under different conditions of purification. In this study, we have designed a novel 34-amino acid heparin-binding affinity tag (HB-tag) for the purification of recombinant proteins expressed in Escherichia coli (E. coli) cells. HB-tag fused recombinant proteins were overexpressed in E. coli in high yields. A one-step heparin-Sepharose-based affinity chromatography protocol was developed to purify HB-fused recombinant proteins to homogeneity using a simple sodium chloride step gradient elution. The HB-tag has also been shown to facilitate the purification of target recombinant proteins from their 8 M urea denatured state(s). The HB-tag has been demonstrated to be successfully released from the fusion protein by an appropriate protease treatment to obtain the recombinant target protein(s) in high yields. Results of the two-dimensional NMR spectroscopy experiments indicate that the purified recombinant target protein(s) exist in the native conformation. Polyclonal antibodies raised against the HB-peptide sequence, exhibited high binding specificity and sensitivity to the HB-fused recombinant proteins (∼10 ng) in different crude cell extracts obtained from diverse expression hosts. In our opinion, the HB-tag provides a
Transferable green fluorescence-tagged pEI2 in Edwardsiella ictaluri

USDA-ARS?s Scientific Manuscript database

The pEI2 plasmid of Edwardsiella ictaluri isolate, I49, was tagged using a Tn10-GFP-kan cassette to create the green fluorescence-expressing derivative I49-gfp. The Tn10-GFP-kan insertion site was mapped by plasmid sequencing to 663 bp upstream of orf2 and appeared to be at a neutral site in the pla...
OSIRIS-REx Touch-And-Go (TAG) Navigation Performance

NASA Technical Reports Server (NTRS)

Berry, Kevin; Antreasian, Peter; Moreau, Michael C.; May, Alex; Sutter, Brian

2015-01-01

The Origins Spectral Interpretation Resource identification Security Regolith Explorer (OSIRIS-REx) mission is a NASA New Frontiers mission launching in 2016 to rendezvous with the near-Earth asteroid (101955) Bennu in late 2018. Following an extensive campaign of proximity operations activities to characterize the properties of Bennu and select a suitable sample site, OSIRIES-REx will fly a Touch-And-Go (TAG) trajectory to the asteroid's surface to obtain a regolith sample. The paper summarizes the mission design of the TAG sequence, the propulsive required to achieve the trajectory, and the sequence of events leading up to the TAG event. The paper will summarize the Monte-Carlo simulation of the TAG sequence and present analysis results that demonstrate the ability to conduct the TAG within 25 meters of the selected sample site and +-2 cms of the targeted contact velocity. The paper will describe some of the challenges associated with conducting precision navigation operations and ultimately contacting a very small asteroid.
OSIRI-REx Touch and Go (TAG) Navigation Performance

NASA Technical Reports Server (NTRS)

Berry, Kevin; Antreasian, Peter; Moreau, Michael C.; May, Alex; Sutter, Brian

2015-01-01

The Origins Spectral Interpretation Resource Identification Security Regolith Explorer (OSIRIS-REx) mission is a NASA New Frontiers mission launching in 2016 to rendezvous with the near-Earth asteroid (101955) Bennu in late 2018. Following an extensive campaign of proximity operations activities to characterize the properties of Bennu and select a suitable sample site, OSIRIS-REx will fly a Touch-And-Go (TAG) trajectory to the asteroid's surface to obtain a regolith sample. The paper summarizes the mission design of the TAG sequence, the propulsive maneuvers required to achieve the trajectory, and the sequence of events leading up to the TAG event. The paper also summarizes the Monte-Carlo simulation of the TAG sequence and presents analysis results that demonstrate the ability to conduct the TAG within 25 meters of the selected sample site and 2 cm/s of the targeted contact velocity. The paper describes some of the challenges associated with conducting precision navigation operations and ultimately contacting a very small asteroid.
p53 elevation in human cells halt SV40 infection by inhibiting T-ag expression

PubMed Central

Drayman, Nir; Ben-nun-Shaul, Orly; Butin-Israeli, Veronika; Srivastava, Rohit; Rubinstein, Ariel M.; Mock, Caroline S.; Elyada, Ela; Ben-Neriah, Yinon; Lahav, Galit; Oppenheim, Ariella

2016-01-01

SV40 large T-antigen (T-ag) has been known for decades to inactivate the tumor suppressor p53 by sequestration and additional mechanisms. Our present study revealed that the struggle between p53 and T-ag begins very early in the infection cycle. We found that p53 is activated early after SV40 infection and defends the host against the infection. Using live cell imaging and single cell analyses we found that p53 dynamics are variable among individual cells, with only a subset of cells activating p53 immediately after SV40 infection. This cell-to-cell variabilty had clear consequences on the outcome of the infection. None of the cells with elevated p53 at the beginning of the infection proceeded to express T-ag, suggesting a p53-dependent decision between abortive and productive infection. In addition, we show that artificial elevation of p53 levels prior to the infection reduces infection efficiency, supporting a role for p53 in defending against SV40. We further found that the p53-mediated host defense mechanism against SV40 is not facilitated by apoptosis nor via interferon-stimulated genes. Instead p53 binds to the viral DNA at the T-ag promoter region, prevents its transcriptional activation by Sp1, and halts the progress of the infection. These findings shed new light on the long studied struggle between SV40 T-ag and p53, as developed during virus-host coevolution. Our studies indicate that the fate of SV40 infection is determined as soon as the viral DNA enters the nucleus, before the onset of viral gene expression. PMID:27462916
SEAN: SNP prediction and display program utilizing EST sequence clusters.

PubMed

Huntley, Derek; Baldo, Angela; Johri, Saurabh; Sergot, Marek

2006-02-15

SEAN is an application that predicts single nucleotide polymorphisms (SNPs) using multiple sequence alignments produced from expressed sequence tag (EST) clusters. The algorithm uses rules of sequence identity and SNP abundance to determine the quality of the prediction. A Java viewer is provided to display the EST alignments and predicted SNPs.
Analysis of common bean expressed sequence tags identifies sulfur metabolic pathways active in seed and sulfur-rich proteins highly expressed in the absence of phaseolin and major lectins

PubMed Central

2011-01-01

Background A deficiency in phaseolin and phytohemagglutinin is associated with a near doubling of sulfur amino acid content in genetically related lines of common bean (Phaseolus vulgaris), particularly cysteine, elevated by 70%, and methionine, elevated by 10%. This mostly takes place at the expense of an abundant non-protein amino acid, S-methyl-cysteine. The deficiency in phaseolin and phytohemagglutinin is mainly compensated by increased levels of the 11S globulin legumin and residual lectins. Legumin, albumin-2, defensin and albumin-1 were previously identified as contributing to the increased sulfur amino acid content in the mutant line, on the basis of similarity to proteins from other legumes. Results Profiling of free amino acid in developing seeds of the BAT93 reference genotype revealed a biphasic accumulation of gamma-glutamyl-S-methyl-cysteine, the main soluble form of S-methyl-cysteine, with a lag phase occurring during storage protein accumulation. A collection of 30,147 expressed sequence tags (ESTs) was generated from four developmental stages, corresponding to distinct phases of gamma-glutamyl-S-methyl-cysteine accumulation, and covering the transitions to reserve accumulation and dessication. Analysis of gene ontology categories indicated the occurrence of multiple sulfur metabolic pathways, including all enzymatic activities responsible for sulfate assimilation, de novo cysteine and methionine biosynthesis. Integration of genomic and proteomic data enabled the identification and isolation of cDNAs coding for legumin, albumin-2, defensin D1 and albumin-1A and -B induced in the absence of phaseolin and phytohemagglutinin. Their deduced amino acid sequences have a higher content of cysteine than methionine, providing an explanation for the preferential increase of cysteine in the mutant line. Conclusion The EST collection provides a foundation to further investigate sulfur metabolism and the differential accumulation of sulfur amino acids in seed
Needles in the EST Haystack: Large-Scale Identification and Analysis of Excretory-Secretory (ES) Proteins in Parasitic Nematodes Using Expressed Sequence Tags (ESTs)

PubMed Central

Nagaraj, Shivashankar H.; Gasser, Robin B.; Ranganathan, Shoba

2008-01-01

Background Parasitic nematodes of humans, other animals and plants continue to impose a significant public health and economic burden worldwide, due to the diseases they cause. Promising antiparasitic drug and vaccine candidates have been discovered from excreted or secreted (ES) proteins released from the parasite and exposed to the immune system of the host. Mining the entire expressed sequence tag (EST) data available from parasitic nematodes represents an approach to discover such ES targets. Methods and Findings In this study, we predicted, using EST2Secretome, a novel, high-throughput, computational workflow system, 4,710 ES proteins from 452,134 ESTs derived from 39 different species of nematodes, parasitic in animals (including humans) or plants. In total, 2,632, 786, and 1,292 ES proteins were predicted for animal-, human-, and plant-parasitic nematodes. Subsequently, we systematically analysed ES proteins using computational methods. Of these 4,710 proteins, 2,490 (52.8%) had orthologues in Caenorhabditis elegans, whereas 621 (13.8%) appeared to be novel, currently having no significant match to any molecule available in public databases. Of the C. elegans homologues, 267 had strong “loss-of-function” phenotypes by RNA interference (RNAi) in this nematode. We could functionally classify 1,948 (41.3%) sequences using the Gene Ontology (GO) terms, establish pathway associations for 573 (12.2%) sequences using Kyoto Encyclopaedia of Genes and Genomes (KEGG), and identify protein interaction partners for 1,774 (37.6%) molecules. We also mapped 758 (16.1%) proteins to protein domains including the nematode-specific protein family “transthyretin-like” and “chromadorea ALT,” considered as vaccine candidates against filariasis in humans. Conclusions We report the large-scale analysis of ES proteins inferred from EST data for a range of parasitic nematodes. This set of ES proteins provides an inventory of known and novel members of ES proteins as a
Application safety evaluation of the radio frequency identification tag under magnetic resonance imaging.

PubMed

Fei, Xiaolu; Li, Shanshan; Gao, Shan; Wei, Lan; Wang, Lihong

2014-09-04

Radio Frequency Identification(RFID) has been widely used in healthcare facilities, but it has been paid little attention whether RFID applications are safe enough under healthcare environment. The purpose of this study is to assess the effects of RFID tags on Magnetic Resonance (MR) imaging in a typical electromagnetic environment in hospitals, and to evaluate the safety of their applications. A Magphan phantom was used to simulate the imaging objects, while active RFID tags were placed at different distances (0, 4, 8, 10 cm) from the phantom border. The phantom was scanned by using three typical sequences including spin-echo (SE) sequence, gradient-echo (GRE) sequence and inversion-recovery (IR) sequence. The quality of the image was quantitatively evaluated by using signal-to-noise ratio (SNR), uniformity, high-contrast resolution, and geometric distortion. RFID tags were read by an RFID reader to calculate their usable rate. RFID tags can be read properly after being placed in high magnetic field for up to 30 minutes. SNR: There were no differences between the group with RFID tags and the group without RFID tags using SE and IR sequence, but it was lower when using GRE sequence.Uniformity: There was a significant difference between the group with RFID tags and the group without RFID tags using SE and GRE sequence. Geometric distortion and high-contrast resolution: There were no obvious differences found. Active RFID tags can affect MR imaging quality, especially using the GRE sequence. Increasing the distance from the RFID tags to the imaging objects can reduce that influence. When the distance was longer than 8 cm, MR imaging quality were almost unaffected. However, the Gradient Echo related sequence is not recommended when patients wear a RFID wristband.

Identification and Validation of Expressed Sequence Tags from Pigeonpea (Cajanus cajan L.) Root

PubMed Central

Kumar, Ravi Ranjan; Yadav, Shailesh; Joshi, Shourabh; Bhandare, Prithviraj P.; Patil, Vinod Kumar; Kulkarni, Pramod B.; Sonkawade, Swati; Naik, G. R.

2014-01-01

Pigeonpea (Cajanus cajan (L) Millsp.) is an important food legume crop of rain fed agriculture in the arid and semiarid tropics of the world. It has deep and extensive root system which serves a number of important physiological and metabolic functions in plant development and growth. In order to identify genes associated with pigeonpea root, ESTs were generated from the root tissues of pigeonpea (GRG-295 genotype) by normalized cDNA library. A total of 105 high quality ESTs were generated by sequencing of 250 random clones which resulted in 72 unigenes comprising 25 contigs and 47 singlets. The ESTs were assigned to 9 functional categories on the basis of their putative function. In order to validate the possible expression of transcripts, four genes, namely, S-adenosylmethionine synthetase, phosphoglycerate kinase, serine carboxypeptidase, and methionine aminopeptidase, were further analyzed by reverse transcriptase PCR. The possible role of the identified transcripts and their functions associated with root will also be a valuable resource for the functional genomics study in legume crop. PMID:24895494
High-resolution melt analysis to identify and map sequence-tagged site anchor points onto linkage maps: a white lupin (Lupinus albus) map as an exemplar.

PubMed

Croxford, Adam E; Rogers, Tom; Caligari, Peter D S; Wilkinson, Michael J

2008-01-01

* The provision of sequence-tagged site (STS) anchor points allows meaningful comparisons between mapping studies but can be a time-consuming process for nonmodel species or orphan crops. * Here, the first use of high-resolution melt analysis (HRM) to generate STS markers for use in linkage mapping is described. This strategy is rapid and low-cost, and circumvents the need for labelled primers or amplicon fractionation. * Using white lupin (Lupinus albus, x = 25) as a case study, HRM analysis was applied to identify 91 polymorphic markers from expressed sequence tag (EST)-derived and genomic libraries. Of these, 77 generated STS anchor points in the first fully resolved linkage map of the species. The map also included 230 amplified fragment length polymorphisms (AFLP) loci, spanned 1916 cM (84.2% coverage) and divided into the expected 25 linkage groups. * Quantitative trait loci (QTL) analyses performed on the population revealed genomic regions associated with several traits, including the agronomically important time to flowering (tf), alkaloid synthesis and stem height (Ph). Use of HRM-STS markers also allowed us to make direct comparisons between our map and that of the related crop, Lupinus angustifolius, based on the conversion of RFLP, microsatellite and single nucleotide polymorphism (SNP) markers into HRM markers.
Generation and analysis of expression sequence tags from haustoria of the wheat stripe rust fungus Puccinia striiformis f. sp. Tritici

PubMed Central

2009-01-01

Background Stripe rust, caused by Puccinia striiformis f. sp. tritici (Pst), is one of the most destructive diseases of wheat (Triticum aestivum L.) worldwide. In spite of its agricultural importance, the genomics and genetics of the pathogen are poorly characterized. Pst transcripts from urediniospores and germinated urediniospores have been examined previously, but little is known about genes expressed during host infection. Some genes involved in virulence in other rust fungi have been found to be specifically expressed in haustoria. Therefore, the objective of this study was to generate a cDNA library to characterize genes expressed in haustoria of Pst. Results A total of 5,126 EST sequences of high quality were generated from haustoria of Pst, from which 287 contigs and 847 singletons were derived. Approximately 10% and 26% of the 1,134 unique sequences were homologous to proteins with known functions and hypothetical proteins, respectively. The remaining 64% of the unique sequences had no significant similarities in GenBank. Fifteen genes were predicted to be proteins secreted from Pst haustoria. Analysis of ten genes, including six secreted protein genes, using quantitative RT-PCR revealed changes in transcript levels in different developmental and infection stages of the pathogen. Conclusions The haustorial cDNA library was useful in identifying genes of the stripe rust fungus expressed during the infection process. From the library, we identified 15 genes encoding putative secreted proteins and six genes induced during the infection process. These genes are candidates for further studies to determine their functions in wheat-Pst interactions. PMID:20028560
Myocardial tagging by Cardiovascular Magnetic Resonance: evolution of techniques--pulse sequences, analysis algorithms, and applications

PubMed Central

2011-01-01

Cardiovascular magnetic resonance (CMR) tagging has been established as an essential technique for measuring regional myocardial function. It allows quantification of local intramyocardial motion measures, e.g. strain and strain rate. The invention of CMR tagging came in the late eighties, where the technique allowed for the first time for visualizing transmural myocardial movement without having to implant physical markers. This new idea opened the door for a series of developments and improvements that continue up to the present time. Different tagging techniques are currently available that are more extensive, improved, and sophisticated than they were twenty years ago. Each of these techniques has different versions for improved resolution, signal-to-noise ratio (SNR), scan time, anatomical coverage, three-dimensional capability, and image quality. The tagging techniques covered in this article can be broadly divided into two main categories: 1) Basic techniques, which include magnetization saturation, spatial modulation of magnetization (SPAMM), delay alternating with nutations for tailored excitation (DANTE), and complementary SPAMM (CSPAMM); and 2) Advanced techniques, which include harmonic phase (HARP), displacement encoding with stimulated echoes (DENSE), and strain encoding (SENC). Although most of these techniques were developed by separate groups and evolved from different backgrounds, they are in fact closely related to each other, and they can be interpreted from more than one perspective. Some of these techniques even followed parallel paths of developments, as illustrated in the article. As each technique has its own advantages, some efforts have been made to combine different techniques together for improved image quality or composite information acquisition. In this review, different developments in pulse sequences and related image processing techniques are described along with the necessities that led to their invention, which makes this
A scalable strategy for high-throughput GFP tagging of endogenous human proteins.

PubMed

Leonetti, Manuel D; Sekine, Sayaka; Kamiyama, Daichi; Weissman, Jonathan S; Huang, Bo

2016-06-21

A central challenge of the postgenomic era is to comprehensively characterize the cellular role of the ∼20,000 proteins encoded in the human genome. To systematically study protein function in a native cellular background, libraries of human cell lines expressing proteins tagged with a functional sequence at their endogenous loci would be very valuable. Here, using electroporation of Cas9 nuclease/single-guide RNA ribonucleoproteins and taking advantage of a split-GFP system, we describe a scalable method for the robust, scarless, and specific tagging of endogenous human genes with GFP. Our approach requires no molecular cloning and allows a large number of cell lines to be processed in parallel. We demonstrate the scalability of our method by targeting 48 human genes and show that the resulting GFP fluorescence correlates with protein expression levels. We next present how our protocols can be easily adapted for the tagging of a given target with GFP repeats, critically enabling the study of low-abundance proteins. Finally, we show that our GFP tagging approach allows the biochemical isolation of native protein complexes for proteomic studies. Taken together, our results pave the way for the large-scale generation of endogenously tagged human cell lines for the proteome-wide analysis of protein localization and interaction networks in a native cellular context.
Methyl-CpG island-associated genome signature tags

DOEpatents

Dunn, John J

2014-05-20

Disclosed is a method for analyzing the organismic complexity of a sample through analysis of the nucleic acid in the sample. In the disclosed method, through a series of steps, including digestion with a type II restriction enzyme, ligation of capture adapters and linkers and digestion with a type IIS restriction enzyme, genome signature tags are produced. The sequences of a statistically significant number of the signature tags are determined and the sequences are used to identify and quantify the organisms in the sample. Various embodiments of the invention described herein include methods for using single point genome signature tags to analyze the related families present in a sample, methods for analyzing sequences associated with hyper- and hypo-methylated CpG islands, methods for visualizing organismic complexity change in a sampling location over time and methods for generating the genome signature tag profile of a sample of fragmented DNA.
Sequence analysis reveals genomic factors affecting EST-SSR primer performance and polymorphism

USDA-ARS?s Scientific Manuscript database

Search for simple sequence repeat (SSR) motifs and design of flanking primers in expressed sequence tag (EST) sequences can be easily done at a large scale using bioinformatics programs. However, failed amplification and/or detection, along with lack of polymorphism, is often seen among randomly sel...
DSAP: deep-sequencing small RNA analysis pipeline.

PubMed

Huang, Po-Jung; Liu, Yi-Chung; Lee, Chi-Ching; Lin, Wei-Chen; Gan, Richie Ruei-Chi; Lyu, Ping-Chiang; Tang, Petrus

2010-07-01

DSAP is an automated multiple-task web service designed to provide a total solution to analyzing deep-sequencing small RNA datasets generated by next-generation sequencing technology. DSAP uses a tab-delimited file as an input format, which holds the unique sequence reads (tags) and their corresponding number of copies generated by the Solexa sequencing platform. The input data will go through four analysis steps in DSAP: (i) cleanup: removal of adaptors and poly-A/T/C/G/N nucleotides; (ii) clustering: grouping of cleaned sequence tags into unique sequence clusters; (iii) non-coding RNA (ncRNA) matching: sequence homology mapping against a transcribed sequence library from the ncRNA database Rfam (http://rfam.sanger.ac.uk/); and (iv) known miRNA matching: detection of known miRNAs in miRBase (http://www.mirbase.org/) based on sequence homology. The expression levels corresponding to matched ncRNAs and miRNAs are summarized in multi-color clickable bar charts linked to external databases. DSAP is also capable of displaying miRNA expression levels from different jobs using a log(2)-scaled color matrix. Furthermore, a cross-species comparative function is also provided to show the distribution of identified miRNAs in different species as deposited in miRBase. DSAP is available at http://dsap.cgu.edu.tw.
Sequencing degraded DNA from non-destructively sampled museum specimens for RAD-tagging and low-coverage shotgun phylogenetics.

PubMed

Tin, Mandy Man-Ying; Economo, Evan Philip; Mikheyev, Alexander Sergeyevich

2014-01-01

Ancient and archival DNA samples are valuable resources for the study of diverse historical processes. In particular, museum specimens provide access to biotas distant in time and space, and can provide insights into ecological and evolutionary changes over time. However, archival specimens are difficult to handle; they are often fragile and irreplaceable, and typically contain only short segments of denatured DNA. Here we present a set of tools for processing such samples for state-of-the-art genetic analysis. First, we report a protocol for minimally destructive DNA extraction of insect museum specimens, which produced sequenceable DNA from all of the samples assayed. The 11 specimens analyzed had fragmented DNA, rarely exceeding 100 bp in length, and could not be amplified by conventional PCR targeting the mitochondrial cytochrome oxidase I gene. Our approach made these samples amenable to analysis with commonly used next-generation sequencing-based molecular analytic tools, including RAD-tagging and shotgun genome re-sequencing. First, we used museum ant specimens from three species, each with its own reference genome, for RAD-tag mapping. Were able to use the degraded DNA sequences, which were sequenced in full, to identify duplicate reads and filter them prior to base calling. Second, we re-sequenced six Hawaiian Drosophila species, with millions of years of divergence, but with only a single available reference genome. Despite a shallow coverage of 0.37 ± 0.42 per base, we could recover a sufficient number of overlapping SNPs to fully resolve the species tree, which was consistent with earlier karyotypic studies, and previous molecular studies, at least in the regions of the tree that these studies could resolve. Although developed for use with degraded DNA, all of these techniques are readily applicable to more recent tissue, and are suitable for liquid handling automation.
Tail proteins of phage T5: investigation of the effect of the His6-tag position, from expression to crystallisation.

PubMed

Noirclerc-Savoye, Marjolaine; Flayhan, Ali; Pereira, Cindy; Gallet, Benoit; Gans, Pierre; Ebel, Christine; Breyton, Cécile

2015-05-01

Upon binding to its bacterial host receptor, the tail tip of phage T5 perforates, by an unknown mechanism, the heavily armoured cell wall of the host. This allows the injection of phage DNA into the cytoplasm to hijack the cell machinery and enable the production of new virions. In the perspective of a structural study of the phage tail, we have systematically overproduced eight of the eleven T5 tail proteins, with or without a N- or a C-terminal His6-tag. The widely used Hi6-tag is very convenient to purify recombinant proteins using immobilised-metal affinity chromatography. The presence of a tag however is not always innocuous. We combined automated gene cloning and expression tests to rapidly identify the most promising constructs for proteins of phage T5 tail, and performed biochemical and biophysical characterisation and crystallisation screening on available proteins. Automated small-scale purification was adapted for two highly expressed proteins. We obtained structural information for three of the proteins. We showed that the presence of a His6-tag can have drastic effect on protein expression, solubility, oligomerisation propensity and crystal quality. Copyright © 2015 Elsevier Inc. All rights reserved.
Analysis of expressed sequence tags from Actinidia: applications of a cross species EST database for gene discovery in the areas of flavor, health, color and ripening

PubMed Central

Crowhurst, Ross N; Gleave, Andrew P; MacRae, Elspeth A; Ampomah-Dwamena, Charles; Atkinson, Ross G; Beuning, Lesley L; Bulley, Sean M; Chagne, David; Marsh, Ken B; Matich, Adam J; Montefiori, Mirco; Newcomb, Richard D; Schaffer, Robert J; Usadel, Björn; Allan, Andrew C; Boldingh, Helen L; Bowen, Judith H; Davy, Marcus W; Eckloff, Rheinhart; Ferguson, A Ross; Fraser, Lena G; Gera, Emma; Hellens, Roger P; Janssen, Bart J; Klages, Karin; Lo, Kim R; MacDiarmid, Robin M; Nain, Bhawana; McNeilage, Mark A; Rassam, Maysoon; Richardson, Annette C; Rikkerink, Erik HA; Ross, Gavin S; Schröder, Roswitha; Snowden, Kimberley C; Souleyre, Edwige JF; Templeton, Matt D; Walton, Eric F; Wang, Daisy; Wang, Mindy Y; Wang, Yanming Y; Wood, Marion; Wu, Rongmei; Yauk, Yar-Khing; Laing, William A

2008-01-01

Background Kiwifruit (Actinidia spp.) are a relatively new, but economically important crop grown in many different parts of the world. Commercial success is driven by the development of new cultivars with novel consumer traits including flavor, appearance, healthful components and convenience. To increase our understanding of the genetic diversity and gene-based control of these key traits in Actinidia, we have produced a collection of 132,577 expressed sequence tags (ESTs). Results The ESTs were derived mainly from four Actinidia species (A. chinensis, A. deliciosa, A. arguta and A. eriantha) and fell into 41,858 non redundant clusters (18,070 tentative consensus sequences and 23,788 EST singletons). Analysis of flavor and fragrance-related gene families (acyltransferases and carboxylesterases) and pathways (terpenoid biosynthesis) is presented in comparison with a chemical analysis of the compounds present in Actinidia including esters, acids, alcohols and terpenes. ESTs are identified for most genes in color pathways controlling chlorophyll degradation and carotenoid biosynthesis. In the health area, data are presented on the ESTs involved in ascorbic acid and quinic acid biosynthesis showing not only that genes for many of the steps in these pathways are represented in the database, but that genes encoding some critical steps are absent. In the convenience area, genes related to different stages of fruit softening are identified. Conclusion This large EST resource will allow researchers to undertake the tremendous challenge of understanding the molecular basis of genetic diversity in the Actinidia genus as well as provide an EST resource for comparative fruit genomics. The various bioinformatics analyses we have undertaken demonstrates the extent of coverage of ESTs for genes encoding different biochemical pathways in Actinidia. PMID:18655731
Microsatellite DNA in genomic survey sequences and UniGenes of loblolly pine

Treesearch

Craig S Echt; Surya Saha; Dennis L Deemer; C Dana Nelson

2011-01-01

Genomic DNA sequence databases are a potential and growing resource for simple sequence repeat (SSR) marker development in loblolly pine (Pinus taeda L.). Loblolly pine also has many expressed sequence tags (ESTs) available for microsatellite (SSR) marker development. We compared loblolly pine SSR densities in genome survey sequences (GSSs) to those in non-redundant...
Generation and Analysis of Expressed Sequence Tags (ESTs) from Halophyte Atriplex canescens to Explore Salt-Responsive Related Genes

PubMed Central

Li, Jingtao; Sun, Xinhua; Yu, Gang; Jia, Chengguo; Liu, Jinliang; Pan, Hongyu

2014-01-01

Little information is available on gene expression profiling of halophyte A. canescens. To elucidate the molecular mechanism for stress tolerance in A. canescens, a full-length complementary DNA library was generated from A. canescens exposed to 400 mM NaCl, and provided 343 high-quality ESTs. In an evaluation of 343 valid EST sequences in the cDNA library, 197 unigenes were assembled, among which 190 unigenes (83.1% ESTs) were identified according to their significant similarities with proteins of known functions. All the 343 EST sequences have been deposited in the dbEST GenBank under accession numbers JZ535802 to JZ536144. According to Arabidopsis MIPS functional category and GO classifications, we identified 193 unigenes of the 311 annotations EST, representing 72 non-redundant unigenes sharing similarities with genes related to the defense response. The sets of ESTs obtained provide a rich genetic resource and 17 up-regulated genes related to salt stress resistance were identified by qRT-PCR. Six of these genes may contribute crucially to earlier and later stage salt stress resistance. Additionally, among the 343 unigenes sequences, 22 simple sequence repeats (SSRs) were also identified contributing to the study of A. canescens resources. PMID:24960361
Strep-Tagged Protein Purification.

PubMed

Maertens, Barbara; Spriestersbach, Anne; Kubicek, Jan; Schäfer, Frank

2015-01-01

The Strep-tag system can be used to purify recombinant proteins from any expression system. Here, protocols for lysis and affinity purification of Strep-tagged proteins from E. coli, baculovirus-infected insect cells, and transfected mammalian cells are given. Depending on the amount of Strep-tagged protein in the lysate, a protocol for batch binding and subsequent washing and eluting by gravity flow can be used. Agarose-based matrices with the coupled Strep-Tactin ligand are the resins of choice, with a binding capacity of up to 9 mg ml(-1). For purification of lower amounts of Strep-tagged proteins, the use of Strep-Tactin magnetic beads is suitable. In addition, Strep-tagged protein purification can also be automated using prepacked columns for FPLC or other liquid-handling chromatography instrumentation, but automated purification is not discussed in this protocol. The protocols described here can be regarded as an update of the Strep-Tag Protein Handbook (Qiagen, 2009). © 2015 Elsevier Inc. All rights reserved.
Linear reduction methods for tag SNP selection.

PubMed

He, Jingwu; Zelikovsky, Alex

2004-01-01

It is widely hoped that constructing a complete human haplotype map will help to associate complex diseases with certain SNP's. Unfortunately, the number of SNP's is huge and it is very costly to sequence many individuals. Therefore, it is desirable to reduce the number of SNP's that should be sequenced to considerably small number of informative representatives, so called tag SNP's. In this paper, we propose a new linear algebra based method for selecting and using tag SNP's. Our method is purely combinatorial and can be combined with linkage disequilibrium (LD) and block based methods. We measure the quality of our tag SNP selection algorithm by comparing actual SNP's with SNP's linearly predicted from linearly chosen tag SNP's. We obtain an extremely good compression and prediction rates. For example, for long haplotypes (>25000 SNP's), knowing only 0.4% of all SNP's we predict the entire unknown haplotype with 2% accuracy while the prediction method is based on a 10% sample of the population.
Lipopolysaccharide-induced innate immune factors in the bottlenose dolphin (Tursiops truncatus) detected in expression sequence tag analysis.

PubMed

Ohishi, Kazue; Shishido, Reiko; Iwata, Yasunao; Saitoh, Masafumi; Takenaka, Ryota; Ohtsu, Dai; Okutsu, Kenji; Maruyama, Tadashi

2011-11-01

EST analysis based on the megaclone-megasorting method was performed using leukocytes from the bottlenose dolphin (Tursiops truncatus) with or without LPS stimulation. A total of 849 upregulated and 384 downregulated EST clones were sequenced, annotated, and functionally classified. Ferritin heavy peptide I was the most abundant upregulated transcript, suggesting that LPS stimulation induced high production of reactive oxygen species, which were sequestered in ferritin. Among the immune factors, the transcripts coding for an IL-1Ra, homologs to bovine serum amyloid A3, and canine intercellular adhesion molecule-1 were highly expressed. Markedly downregulated transcripts of immune factors were those for homologs of calcium-binding proteins belonging to the S100 family, S100A12, S100A8, and S100A6. Time-course experiments on the expression of some immune factors including IL-1Ra suggested that these factors interact and control cetacean innate immunity. © 2011 The Societies and Blackwell Publishing Asia Pty Ltd.
One-step affinity tag purification of full-length recombinant human AP-1 complexes from bacterial inclusion bodies using a polycistronic expression system

PubMed Central

Wang, Wei-Ming; Lee, A-Young; Chiang, Cheng-Ming

2008-01-01

The AP-1 transcription factor is a dimeric protein complex formed primarily between Jun (c-Jun, JunB, JunD) and Fos (c-Fos, FosB, Fra-1, Fra-2) family members. These distinct AP-1 complexes are expressed in many cell types and modulate target gene expression implicated in cell proliferation, differentiation, and stress responses. Although the importance of AP-1 has long been recognized, the biochemical characterization of AP-1 remains limited in part due to the difficulty in purifying full-length, reconstituted dimers with active DNA-binding and transcriptional activity. Using a combination of bacterial coexpression and epitope-tagging methods, we successfully purified all 12 heterodimers (3 Jun × 4 Fos) of full-length human AP-1 complexes as well as c-Jun/c-Jun, JunD/JunD, and c-Jun/JunD dimers from bacterial inclusion bodies using one-step nickel-NTA affinity tag purification following denaturation and renaturation of coexpressed AP-1 subunits. Coexpression of two constitutive components in a dimeric AP-1 complex helps stabilize the proteins when compared with individual protein expression in bacteria. Purified dimeric AP-1 complexes are functional in sequence-specific DNA binding, as illustrated by electrophoretic mobility shift assays and DNase I footprinting, and are also active in transcription with in vitro-reconstituted human papillomavirus (HPV) chromatin containing AP-1-binding sites in the native configuration of HPV nucleosomes. The availability of these recombinant full-length human AP-1 complexes has greatly facilitated mechanistic studies of AP-1-regulated gene transcription in many biological systems. PMID:18329890
Tab2, a novel recombinant polypeptide tag offering sensitive and specific protein detection and reliable affinity purification.

PubMed

Crusius, Kerstin; Finster, Silke; McClary, John; Xia, Wei; Larsen, Brent; Schneider, Douglas; Lu, Hong-Tao; Biancalana, Sara; Xuan, Jian-Ai; Newton, Alicia; Allen, Debbie; Bringmann, Peter; Cobb, Ronald R

2006-10-01

The detection and purification of proteins are often time-consuming and frequently involve complicated protocols. The addition of a peptide tag to recombinant proteins can make this process more efficient. Many of the commonly used tags, such as Flagtrade mark, Myc, HA and V5 are recognized by specific monoclonal antibodies and therefore, allow immunoaffinity-based purification. Enhancing the current scope of flexibility in using diverse peptide tags, we report here the development of a novel, short polypeptide tag (Tab2) for detection and purification of recombinant proteins. The Tab2 epitope corresponds to the NH2-terminal seven amino acid residues of human TGFalpha. A monoclonal anti-Tab2 antibody was raised and characterized. To investigate the potential of this peptide sequence as a novel tag for recombinant proteins, we expressed several different recombinant proteins containing this tag in E. coli, baculovirus, and mammalian cells. The data presented demonstrates the Tab2 tag-anti-Tab2 antibody combination is a reliable tool enabling specific Western blot detection, FACS analysis, and immunoprecipitation as well as non-denaturing protein affinity purification.
Identification of reproduction-related genes and SSR-markers through expressed sequence tags analysis of a monsoon breeding carp rohu, Labeo rohita (Hamilton).

PubMed

Sahu, Dinesh K; Panda, Soumya P; Panda, Sujata; Das, Paramananda; Meher, Prem K; Hazra, Rupenangshu K; Peatman, Eric; Liu, Zhanjiang J; Eknath, Ambekar E; Nandi, Samiran

2013-07-15

Labeo rohita (Ham.) also called rohu is the most important freshwater aquaculture species on the Indian sub continent. Monsoon dependent breeding restricts its seed production beyond season indicating a strong genetic control about which very limited information is available. Additionally, few genomic resources are publicly available for this species. Here we sought to identify reproduction-relevant genes from normalized cDNA libraries of the brain-pituitary-gonad-liver (BPGL-axis) tissues of adult L. rohita collected during post preparatory phase. 6161 random clones sequenced (Sanger-based) from these libraries produced 4642 (75.34%) high-quality sequences. They were assembled into 3631 (78.22%) unique sequences composed of 709 contigs and 2922 singletons. A total of 182 unique sequences were found to be associated with reproduction-related genes, mainly under the GO term categories of reproduction, neuro-peptide hormone activity, hormone and receptor binding, receptor activity, signal transduction, embryonic development, cell-cell signaling, cell death and anti-apoptosis process. Several important reproduction-related genes reported here for the first time in L. rohita are zona pellucida sperm-binding protein 3, aquaporin-12, spermine oxidase, sperm associated antigen 7, testis expressed 261, progesterone receptor membrane component, Neuropeptide Y and Pro-opiomelanocortin. Quantitative RT-PCR-based analyses of 8 known and 8 unknown transcripts during preparatory and post-spawning phase showed increased expression level of most of the transcripts during preparatory phase (except Neuropeptide Y) in comparison to post-spawning phase indicating possible roles in initiation of gonad maturation. Expression of unknown transcripts was also found in prolific breeder common carp and tilapia, but levels of expression were much higher in seasonal breeder rohu. 3631 unique sequences contained 236 (6.49%) putative microsatellites with the AG (28.16%) repeat as the most
Transposon tagging and the study of root development in Arabidopsis

NASA Technical Reports Server (NTRS)

Tsugeki, R.; Olson, M. L.; Fedoroff, N. V.

1998-01-01

The maize Ac-Ds transposable element family has been used as the basis of transposon mutagenesis systems that function in a variety of plants, including Arabidopsis. We have developed modified transposons and methods which simplify the detection, cloning and analysis of insertion mutations. We have identified and are analyzing two plant lines in which genes expressed either in the root cap cells or in the quiescent cells, cortex/endodermal initial cells and columella cells of the root cap have been tagged with a transposon carrying a reporter gene. A gene expressed in root cap cells tagged with an enhancer-trap Ds was isolated and its corresponding EST cDNA was identified. Nucleotide and deduced amino acid sequences of the gene show no significant similarity to other genes in the database. Genetic ablation experiments have been done by fusing a root cap-specific promoter to the diphtheria toxin A-chain gene and introducing the fusion construct into Arabidopsis plants. We find that in addition to eliminating gravitropism, root cap ablation inhibits elongation of roots by lowering root meristematic activities.

openSputnik--a database to ESTablish comparative plant genomics using unsaturated sequence collections.

PubMed

Rudd, Stephen

2005-01-01

The public expressed sequence tag collections are continually being enriched with high-quality sequences that represent an ever-expanding range of taxonomically diverse plant species. While these sequence collections provide biased insight into the populations of expressed genes available within individual species and their associated tissues, the information is conceivably of wider relevance in a comparative context. When we consider the available expressed sequence tag (EST) collections of summer 2004, most of the major plant taxonomic clades are at least superficially represented. Investigation of the five million available plant ESTs provides a wealth of information that has applications in modelling the routes of plant genome evolution and the identification of lineage-specific genes and gene families. Over four million ESTs from over 50 distinct plant species have been collated within an EST analysis pipeline called openSputnik. The ESTs were resolved down into approximately one million unigene sequences. These have been annotated using orthology-based annotation transfer from reference plant genomes and using a variety of contemporary bioinformatics methods to assign peptide, structural and functional attributes. The openSputnik database is available at http://sputnik.btk.fi.
An Expressed Sequence Tag (EST)-enriched genetic map of turbot (Scophthalmus maximus): a useful framework for comparative genomics across model and farmed teleosts

PubMed Central

2012-01-01

Background The turbot (Scophthalmus maximus) is a relevant species in European aquaculture. The small turbot genome provides a source for genomics strategies to use in order to understand the genetic basis of productive traits, particularly those related to sex, growth and pathogen resistance. Genetic maps represent essential genomic screening tools allowing to localize quantitative trait loci (QTL) and to identify candidate genes through comparative mapping. This information is the backbone to develop marker-assisted selection (MAS) programs in aquaculture. Expressed sequenced tag (EST) resources have largely increased in turbot, thus supplying numerous type I markers suitable for extending the previous linkage map, which was mostly based on anonymous loci. The aim of this study was to construct a higher-resolution turbot genetic map using EST-linked markers, which will turn out to be useful for comparative mapping studies. Results A consensus gene-enriched genetic map of the turbot was constructed using 463 SNP and microsatellite markers in nine reference families. This map contains 438 markers, 180 EST-linked, clustered at 24 linkage groups. Linkage and comparative genomics evidences suggested additional linkage group fusions toward the consolidation of turbot map according to karyotype information. The linkage map showed a total length of 1402.7 cM with low average intermarker distance (3.7 cM; ~2 Mb). A global 1.6:1 female-to-male recombination frequency (RF) ratio was observed, although largely variable among linkage groups and chromosome regions. Comparative sequence analysis revealed large macrosyntenic patterns against model teleost genomes, significant hits decreasing from stickleback (54%) to zebrafish (20%). Comparative mapping supported particular chromosome rearrangements within Acanthopterygii and aided to assign unallocated markers to specific turbot linkage groups. Conclusions The new gene-enriched high-resolution turbot map represents a
The use of sequence-based SSR mining for the development of a vast collection of microsatellites in Aquilegia Formosa

Treesearch

Brandon Schlautman; Vera Pfeiffer; Juan Zalapa; Johanne Brunet

2014-01-01

Numerous microsatellite markers were developed for Aquilegia formosafrom sequences deposited within the Expressed Sequence Tag (EST), Genomic Survey Sequence (GSS), and Nucleotide databases in NCBI. Microsatellites (SSRs) were identified and primers were designed for 9 SSR containing sequences in the Nucleotide database, 3803 sequences in the EST...
Expression of proteins in Escherichia coli as fusions with maltose-binding protein to rescue non-expressed targets in a high-throughput protein-expression and purification pipeline

PubMed Central

Hewitt, Stephen N.; Choi, Ryan; Kelley, Angela; Crowther, Gregory J.; Napuli, Alberto J.; Van Voorhis, Wesley C.

2011-01-01

Despite recent advances, the expression of heterologous proteins in Escherichia coli for crystallization remains a nontrivial challenge. The present study investigates the efficacy of maltose-binding protein (MBP) fusion as a general strategy for rescuing the expression of target proteins. From a group of sequence-verified clones with undetectable levels of protein expression in an E. coli T7 expression system, 95 clones representing 16 phylogenetically diverse organisms were selected for recloning into a chimeric expression vector with an N-terminal histidine-tagged MBP. PCR-amplified inserts were annealed into an identical ligation-independent cloning region in an MBP-fusion vector and were analyzed for expression and solubility by high-throughput nickel-affinity binding. This approach yielded detectable expression of 72% of the clones; soluble expression was visible in 62%. However, the solubility of most proteins was marginal to poor upon cleavage of the MBP tag. This study offers large-scale evidence that MBP can improve the soluble expression of previously non-expressing proteins from a variety of eukaryotic and prokaryotic organisms. While the behavior of the cleaved proteins was disappointing, further refinements in MBP tagging may permit the more widespread use of MBP-fusion proteins in crystallographic studies. PMID:21904041
Dramatic secretion of recombinant protein expressed in tobacco cells with a designer glycopeptide tag is highly impacted by medium composition.

PubMed

Zhang, Ningning; Dolan, Maureen; Wu, Di; Phillips, Gregory C; Xu, Jianfeng

2016-12-01

Cell growth medium composition has profound impacts on the O -glycosylation of a "designer" arabinogalactan protein-based module; full glycosylation is essential in directing efficient extracellular secretion of the tagged recombinant protein. Expression of recombinant proteins in plant cells as fusion with a de novo designed hydroxyproline (Hyp)-O-glycosylated peptide (HypGP) tag, termed HypGP engineering technology, resulted in dramatically increased secreted protein yields. This is due to the function of the HypGP tag as a molecular carrier in promoting efficient transport of conjoined proteins into culture media. To optimize the cell culture to achieve the best secreted protein yields, the medium effects on the cell growth and protein secretion were investigated using as a model system the tobacco BY-2 cell expressing enhanced green fluorescence protein (EGFP) fused with a (SP) 32 tag (32 tandem repeats of "Ser-Pro" motif). The (SP) 32 tag was found to undergo two-stage Hyp-O-glycosylation in plant cells with the dramatic secretion of the conjoined EGFP correlating with the triggering of the second-stage glycosylation. The BY-2 cell culture in SH medium generated a high secreted protein yield (125 mg/L) with a low cell biomass accumulation (~7.5 gDW/L). In contrast, very low secreted protein yields (~1.5 mg/L) with a high cell biomass accumulation (13.5 gDW/L) were obtained in MS medium. The macronutrients, specifically, the nitrogen supply greatly impacted the glycosylation of the (SP) 32 tag and subsequent protein secretion. Modified MS medium with reduced nitrogen levels boosted the secreted EGFP yields to 168 mg/L. This study demonstrates the profound impacts of medium composition on the secreted yields of a HypGP-tagged protein, and provides a basis for medium design to achieve the highest productivity of the HypGP engineering technology.
Molecular characterization of human ABHD2 as TAG lipase and ester hydrolase

PubMed Central

M., Naresh Kumar; V.B.S.C., Thunuguntla; G.K., Veeramachaneni; B., Chandra Sekhar; Guntupalli, Swapna; J.S., Bondili

2016-01-01

Alterations in lipid metabolism have been progressively documented as a characteristic property of cancer cells. Though, human ABHD2 gene was found to be highly expressed in breast and lung cancers, its biochemical functionality is yet uncharacterized. In the present study we report, human ABHD2 as triacylglycerol (TAG) lipase along with ester hydrolysing capacity. Sequence analysis of ABHD2 revealed the presence of conserved motifs G205XS207XG209 and H120XXXXD125. Phylogenetic analysis showed homology to known lipases, Drosophila melanogaster CG3488. To evaluate the biochemical role, recombinant ABHD2 was expressed in Saccharomyces cerevisiae using pYES2/CT vector and His-tag purified protein showed TAG lipase activity. Ester hydrolase activity was confirmed with pNP acetate, butyrate and palmitate substrates respectively. Further, the ABHD2 homology model was built and the modelled protein was analysed based on the RMSD and root mean square fluctuation (RMSF) of the 100 ns simulation trajectory. Docking the acetate, butyrate and palmitate ligands with the model confirmed covalent binding of ligands with the Ser207 of the GXSXG motif. The model was validated with a mutant ABHD2 developed with alanine in place of Ser207 and the docking studies revealed loss of interaction between selected ligands and the mutant protein active site. Based on the above results, human ABHD2 was identified as a novel TAG lipase and ester hydrolase. PMID:27247428
Molecular characterization of human ABHD2 as TAG lipase and ester hydrolase.

PubMed

M, Naresh Kumar; V B S C, Thunuguntla; G K, Veeramachaneni; B, Chandra Sekhar; Guntupalli, Swapna; J S, Bondili

2016-08-01

Alterations in lipid metabolism have been progressively documented as a characteristic property of cancer cells. Though, human ABHD2 gene was found to be highly expressed in breast and lung cancers, its biochemical functionality is yet uncharacterized. In the present study we report, human ABHD2 as triacylglycerol (TAG) lipase along with ester hydrolysing capacity. Sequence analysis of ABHD2 revealed the presence of conserved motifs G(205)XS(207)XG(209) and H(120)XXXXD(125) Phylogenetic analysis showed homology to known lipases, Drosophila melanogaster CG3488. To evaluate the biochemical role, recombinant ABHD2 was expressed in Saccharomyces cerevisiae using pYES2/CT vector and His-tag purified protein showed TAG lipase activity. Ester hydrolase activity was confirmed with pNP acetate, butyrate and palmitate substrates respectively. Further, the ABHD2 homology model was built and the modelled protein was analysed based on the RMSD and root mean square fluctuation (RMSF) of the 100 ns simulation trajectory. Docking the acetate, butyrate and palmitate ligands with the model confirmed covalent binding of ligands with the Ser(207) of the GXSXG motif. The model was validated with a mutant ABHD2 developed with alanine in place of Ser(207) and the docking studies revealed loss of interaction between selected ligands and the mutant protein active site. Based on the above results, human ABHD2 was identified as a novel TAG lipase and ester hydrolase. © 2016 The Author(s).
A combination of LongSAGE with Solexa sequencing is well suited to explore the depth and the complexity of transcriptome

PubMed Central

Hanriot, Lucie; Keime, Céline; Gay, Nadine; Faure, Claudine; Dossat, Carole; Wincker, Patrick; Scoté-Blachon, Céline; Peyron, Christelle; Gandrillon, Olivier

2008-01-01

Background "Open" transcriptome analysis methods allow to study gene expression without a priori knowledge of the transcript sequences. As of now, SAGE (Serial Analysis of Gene Expression), LongSAGE and MPSS (Massively Parallel Signature Sequencing) are the mostly used methods for "open" transcriptome analysis. Both LongSAGE and MPSS rely on the isolation of 21 pb tag sequences from each transcript. In contrast to LongSAGE, the high throughput sequencing method used in MPSS enables the rapid sequencing of very large libraries containing several millions of tags, allowing deep transcriptome analysis. However, a bias in the complexity of the transcriptome representation obtained by MPSS was recently uncovered. Results In order to make a deep analysis of mouse hypothalamus transcriptome avoiding the limitation introduced by MPSS, we combined LongSAGE with the Solexa sequencing technology and obtained a library of more than 11 millions of tags. We then compared it to a LongSAGE library of mouse hypothalamus sequenced with the Sanger method. Conclusion We found that Solexa sequencing technology combined with LongSAGE is perfectly suited for deep transcriptome analysis. In contrast to MPSS, it gives a complex representation of transcriptome as reliable as a LongSAGE library sequenced by the Sanger method. PMID:18796152
Isolation of centromeric-tandem repetitive DNA sequences by chromatin affinity purification using a HaloTag7-fused centromere-specific histone H3 in tobacco.

PubMed

Nagaki, Kiyotaka; Shibata, Fukashi; Kanatani, Asaka; Kashihara, Kazunari; Murata, Minoru

2012-04-01

The centromere is a multi-functional complex comprising centromeric DNA and a number of proteins. To isolate unidentified centromeric DNA sequences, centromere-specific histone H3 variants (CENH3) and chromatin immunoprecipitation (ChIP) have been utilized in some plant species. However, anti-CENH3 antibody for ChIP must be raised in each species because of its species specificity. Production of the antibodies is time-consuming and costly, and it is not easy to produce ChIP-grade antibodies. In this study, we applied a HaloTag7-based chromatin affinity purification system to isolate centromeric DNA sequences in tobacco. This system required no specific antibody, and made it possible to apply a highly stringent wash to remove contaminated DNA. As a result, we succeeded in isolating five tandem repetitive DNA sequences in addition to the centromeric retrotransposons that were previously identified by ChIP. Three of the tandem repeats were centromere-specific sequences located on different chromosomes. These results confirm the validity of the HaloTag7-based chromatin affinity purification system as an alternative method to ChIP for isolating unknown centromeric DNA sequences. The discovery of more than two chromosome-specific centromeric DNA sequences indicates the mosaic structure of tobacco centromeres. © Springer-Verlag 2011
Characterization and isolation of a T-DNA tagged banana promoter active during in vitro culture and low temperature stress.

PubMed

Santos, Efrén; Remy, Serge; Thiry, Els; Windelinckx, Saskia; Swennen, Rony; Sági, László

2009-06-24

Next-generation transgenic plants will require a more precise regulation of transgene expression, preferably under the control of native promoters. A genome-wide T-DNA tagging strategy was therefore performed for the identification and characterization of novel banana promoters. Embryogenic cell suspensions of a plantain-type banana were transformed with a promoterless, codon-optimized luciferase (luc+) gene and low temperature-responsive luciferase activation was monitored in real time. Around 16,000 transgenic cell colonies were screened for baseline luciferase activity at room temperature 2 months after transformation. After discarding positive colonies, cultures were re-screened in real-time at 26 degrees C followed by a gradual decrease to 8 degrees C. The baseline activation frequency was 0.98%, while the frequency of low temperature-responsive luciferase activity was 0.61% in the same population of cell cultures. Transgenic colonies with luciferase activity responsive to low temperature were regenerated to plantlets and luciferase expression patterns monitored during different regeneration stages. Twenty four banana DNA sequences flanking the right T-DNA borders in seven independent lines were cloned via PCR walking. RT-PCR analysis in one line containing five inserts allowed the identification of the sequence that had activated luciferase expression under low temperature stress in a developmentally regulated manner. This activating sequence was fused to the uidA reporter gene and back-transformed into a commercial dessert banana cultivar, in which its original expression pattern was confirmed. This promoter tagging and real-time screening platform proved valuable for the identification of novel promoters and genes in banana and for monitoring expression patterns throughout in vitro development and low temperature treatment. Combination of PCR walking techniques was efficient for the isolation of candidate promoters even in a multicopy T-DNA line
Overview of Fusion Tags for Recombinant Proteins.

PubMed

Kosobokova, E N; Skrypnik, K A; Kosorukov, V S

2016-03-01

Virtually all recombinant proteins are now prepared using fusion domains also known as "tags". The use of tags helps to solve some serious problems: to simplify procedures of protein isolation, to increase expression and solubility of the desired protein, to simplify protein refolding and increase its efficiency, and to prevent proteolysis. In this review, advantages and disadvantages of such fusion tags are analyzed and data on both well-known and new tags are generalized. The authors own data are also presented.
An Expressed Sequence Tag collection from the male antennae of the Noctuid moth Spodoptera littoralis: a resource for olfactory and pheromone detection research

PubMed Central

2011-01-01

Background Nocturnal insects such as moths are ideal models to study the molecular bases of olfaction that they use, among examples, for the detection of mating partners and host plants. Knowing how an odour generates a neuronal signal in insect antennae is crucial for understanding the physiological bases of olfaction, and also could lead to the identification of original targets for the development of olfactory-based control strategies against herbivorous moth pests. Here, we describe an Expressed Sequence Tag (EST) project to characterize the antennal transcriptome of the noctuid pest model, Spodoptera littoralis, and to identify candidate genes involved in odour/pheromone detection. Results By targeting cDNAs from male antennae, we biased gene discovery towards genes potentially involved in male olfaction, including pheromone reception. A total of 20760 ESTs were obtained from a normalized library and were assembled in 9033 unigenes. 6530 were annotated based on BLAST analyses and gene prediction software identified 6738 ORFs. The unigenes were compared to the Bombyx mori proteome and to ESTs derived from Lepidoptera transcriptome projects. We identified a large number of candidate genes involved in odour and pheromone detection and turnover, including 31 candidate chemosensory receptor genes, but also genes potentially involved in olfactory modulation. Conclusions Our project has generated a large collection of antennal transcripts from a Lepidoptera. The normalization process, allowing enrichment in low abundant genes, proved to be particularly relevant to identify chemosensory receptors in a species for which no genomic data are available. Our results also suggest that olfactory modulation can take place at the level of the antennae itself. These EST resources will be invaluable for exploring the mechanisms of olfaction and pheromone detection in S. littoralis, and for ultimately identifying original targets to fight against moth herbivorous pests. PMID
Linear reduction method for predictive and informative tag SNP selection.

PubMed

He, Jingwu; Westbrooks, Kelly; Zelikovsky, Alexander

2005-01-01

Constructing a complete human haplotype map is helpful when associating complex diseases with their related SNPs. Unfortunately, the number of SNPs is very large and it is costly to sequence many individuals. Therefore, it is desirable to reduce the number of SNPs that should be sequenced to a small number of informative representatives called tag SNPs. In this paper, we propose a new linear algebra-based method for selecting and using tag SNPs. We measure the quality of our tag SNP selection algorithm by comparing actual SNPs with SNPs predicted from selected linearly independent tag SNPs. Our experiments show that for sufficiently long haplotypes, knowing only 0.4% of all SNPs the proposed linear reduction method predicts an unknown haplotype with the error rate below 2% based on 10% of the population.
Serial analysis of gene expression in the silkworm, Bombyx mori.

PubMed

Huang, Jianhua; Miao, Xuexia; Jin, Weirong; Couble, Pierre; Mita, Kasuei; Zhang, Yong; Liu, Wenbin; Zhuang, Leijun; Shen, Yan; Keime, Celine; Gandrillon, Olivier; Brouilly, Patrick; Briolay, Jerome; Zhao, Guoping; Huang, Yongping

2005-08-01

The silkworm Bombyx mori is one of the most economically important insects and serves as a model for Lepidoptera insects. We used serial analysis of gene expression (SAGE) to derive profiles of expressed genes during the developmental life cycle of the silkworm and to create a reference for understanding silkworm metamorphosis. We generated four SAGE libraries, one from each of the four developmental stages of the silkworm. In total we obtained 257,964 SAGE tags, of which 39,485 were unique tags. Sorted by copy number, 14.1% of the unique tags were detected at a median to high level (five or more copies), 24.2% at lower levels (two to four copies), and 61.7% as single copies. Using a basic local alignment search tool on the EST database, 35% of the tags matched known silkworm expressed sequence tags. SAGE demonstrated that a number of the genes were up- or down-regulated during the four developmental phases of the egg, larva, pupa, and adult. Furthermore, we found that the generation of longer cDNA fragments from SAGE tags constituted the most efficient method of gene identification, which facilitated the analysis of a large number of unknown genes.
Differentially displayed expressed sequence tags in Melipona scutellaris (Hymenoptera, Apidae, Meliponini) development.

PubMed

Santana, Flávia A; Nunes, Francis M F; Vieira, Carlos U; Machado, Maria Alice M S; Kerr, Warwick E; Silva, Wilson A; Bonetti, Ana Maria

2006-03-01

We have compared gene expression, using the Differential Display Reverse Transcriptase-Polymerase Chain Reaction (DDRT-PCR) technique, by means of mRNA profile in Melipona scutellaris during ontogenetic postembryonic development, in adult worker, and in both Natural and Juvenile Hormone III-induced adult queen. Six, out of the nine ESTs described here, presented differentially expressed in the phases L1 or L2, or even in both of them, suggesting that key mechanisms to the development of Melipona scutellaris are regulated in these stages. The combination HT11G-AP05 revealed in L1 and L2 a product which matches to thioredoxin reductase protein domain in the Clostridium sporogenes, an important protein during cellular oxidoreduction processes. This study represents the first molecular evidence of differential gene expression profiles toward a description of the genetic developmental traits in the genus Melipona.
Transcriptome Analysis of Gene Expression during Chinese Water Chestnut Storage Organ Formation

PubMed Central

Chen, Sainan; Wang, Yan; Yu, Meizhen; Chen, Xuehao; Li, Liangjun; Yin, Jingjing

2016-01-01

The product organ (storage organ; corm) of the Chinese water chestnut has become a very popular food in Asian countries because of its unique nutritional value. Corm formation is a complex biological process, and extensive whole genome analysis of transcripts during corm development has not been carried out. In this study, four corm libraries at different developmental stages were constructed, and gene expression was identified using a high-throughput tag sequencing technique. Approximately 4.9 million tags were sequenced, and 4,371,386, 4,372,602, 4,782,494, and 5,276,540 clean tags, including 119,676, 110,701, 100,089, and 101,239 distinct tags, respectively, were obtained after removal of low-quality tags from each library. More than 39% of the distinct tags were unambiguous and could be mapped to reference genes, while 40% were unambiguous tag-mapped genes. After mapping their functions in existing databases, a total of 11,592, 10,949, 10,585, and 7,111 genes were annotated from the B1, B2, B3, and B4 libraries, respectively. Analysis of the differentially expressed genes (DEGs) in B1/B2, B2/B3, and B3/B4 libraries showed that most of the DEGs at the B1/B2 stages were involved in carbohydrate and hormone metabolism, while the majority of DEGs were involved in energy metabolism and carbohydrate metabolism at the B2/B3 and B3/B4 stages. All of the upregulated transcription factors and 9 important genes related to product organ formation in the above four stages were also identified. The expression changes of nine of the identified DEGs were validated using a quantitative PCR approach. This study provides a comprehensive understanding of gene expression during corm formation in the Chinese water chestnut. PMID:27716802
Notes on SAW Tag Interrogation Techniques

NASA Technical Reports Server (NTRS)

Barton, Richard J.

2010-01-01

We consider the problem of interrogating a single SAW RFID tag with a known ID and known range in the presence of multiple interfering tags under the following assumptions: (1) The RF propagation environment is well approximated as a simple delay channel with geometric power-decay constant alpha >/= 2. (2) The interfering tag IDs are unknown but well approximated as independent, identically distributed random samples from a probability distribution of tag ID waveforms with known second-order properties, and the tag of interest is drawn independently from the same distribution. (3) The ranges of the interfering tags are unknown but well approximated as independent, identically distributed realizations of a random variable rho with a known probability distribution f(sub rho) , and the tag ranges are independent of the tag ID waveforms. In particular, we model the tag waveforms as random impulse responses from a wide-sense-stationary, uncorrelated-scattering (WSSUS) fading channel with known bandwidth and scattering function. A brief discussion of the properties of such channels and the notation used to describe them in this document is given in the Appendix. Under these assumptions, we derive the expression for the output signal-to-noise ratio (SNR) for an arbitrary combination of transmitted interrogation signal and linear receiver filter. Based on this expression, we derive the optimal interrogator configuration (i.e., transmitted signal/receiver filter combination) in the two extreme noise/interference regimes, i.e., noise-limited and interference-limited, under the additional assumption that the coherence bandwidth of the tags is much smaller than the total tag bandwidth. Finally, we evaluate the performance of both optimal interrogators over a broad range of operating scenarios using both numerical simulation based on the assumed model and Monte Carlo simulation based on a small sample of measured tag waveforms. The performance evaluation results not only
Lactation-induced WAP-SV40 Tag transgene expression in C57BL/6J mice leads to mammary carcinoma.

PubMed

Hüsler, M R; Kotopoulis, K A; Sundberg, J P; Tennent, B J; Kunig, S V; Knowles, B B

1998-07-01

Two transgenic lineages were generated by directing the expression of SV40 T antigen to the mammary gland of inbred C57BL/6J mice using the whey acidic protein (WAP) promoter. In one lineage, WAPTag 1, multiparous female mice developed mammary adenocarcinoma with an average latency period of 13 months. The histopathological phenotype was heterogeneous, tumours occurred in a stochastic fashion, normal tissue was located next to neoplastic tissue, the mammary tumours usually developed and were remarkably similar to that observed in human cases. In addition, male and virgin females developed a poorly differentiated SV40 T antigen-positive soft tissue sarcoma, also at 13 months of age. In the other lineage, WAPTag 3, some parous females developed mammary tumours, but most mice succumbed to osteosarcomas arising from the os petrosum at 5.5 to 6 months of age and on necropsy, renal adenocarcinomas were also found. Appearance of these unexpected tumour types demonstrates the non-specific expression of SV40 Tag under the control of the WAP promoter. The expression of SV40 Tag in mammary glands at different stages of development was also examined, and only actively lactating glands were positive. This suggests that the abundant cyclic synthesis of SV40 Tag associated with pregnancy is required for mammary tumorigenesis in these lineages.
Simultaneously achieve soluble expression and biomimetic immobilization of Candida antarctica lipase B by introducing polyamine tags.

PubMed

Zhou, Xiaoxue; Han, Yu; Lv, Zheng; Tian, Xuemei; Li, Han; Xie, Panpan; Zheng, Liangyu

2017-05-10

Polyamine tags fused in Candida antarctica lipase B (CalB) can help achieve high soluble expression of CalB in E. coli and can directly mediate silicification, which leads to rapid formation of a CalB-silica particle complex through a one-step approach. After optimization experiments, the fused lipase CalB tagged with 6-histidine at the N terminal and 10-lysine at the C terminal (6His-CalB-10Lys) is effectively expressed with high solubility (0.1mg/mL) and specific activity (10.1U/mg), and easily cross-linked in silica particles with a high immobilization efficiency of 96.8% and activity recovery of 81.5%. The immobilized lipase 6His-CalB-10Lys exhibits excellent performance at broad temperature ranges, high thermal and storage stabilities, and superior reusability. Michaelis-Menten kinetics indicates that the affinity and enantioselectivity of the free and immobilized 6His-CalB-10Lys toward the substrate are better than that of commercial Novozym 435 in enantioselective resolution of (S)-N-(2-ethyl-6-methylphenyl) alanine ((S)-NEMPA). The strategies described in this paper are useful for the facile expression and construction of diverse enzyme systems with high efficiency and excellent recyclability. Copyright © 2017 Elsevier B.V. All rights reserved.
Sequence diversity and differential expression of major phenylpropanoid-flavonoid biosynthetic genes among three mango varieties.

PubMed

Hoang, Van L T; Innes, David J; Shaw, P Nicholas; Monteith, Gregory R; Gidley, Michael J; Dietzgen, Ralf G

2015-07-30

Mango fruits contain a broad spectrum of phenolic compounds which impart potential health benefits; their biosynthesis is catalysed by enzymes in the phenylpropanoid-flavonoid (PF) pathway. The aim of this study was to reveal the variability in genes involved in the PF pathway in three different mango varieties Mangifera indica L., a member of the family Anacardiaceae: Kensington Pride (KP), Irwin (IW) and Nam Doc Mai (NDM) and to determine associations with gene expression and mango flavonoid profiles. A close evolutionary relationship between mango genes and those from the woody species poplar of the Salicaceae family (Populus trichocarpa) and grape of the Vitaceae family (Vitis vinifera), was revealed through phylogenetic analysis of PF pathway genes. We discovered 145 SNPs in total within coding sequences with an average frequency of one SNP every 316 bp. Variety IW had the highest SNP frequency (one SNP every 258 bp) while KP and NDM had similar frequencies (one SNP every 369 bp and 360 bp, respectively). The position in the PF pathway appeared to influence the extent of genetic diversity of the encoded enzymes. The entry point enzymes phenylalanine lyase (PAL), cinnamate 4-mono-oxygenase (C4H) and chalcone synthase (CHS) had low levels of SNP diversity in their coding sequences, whereas anthocyanidin reductase (ANR) showed the highest SNP frequency followed by flavonoid 3'-hydroxylase (F3'H). Quantitative PCR revealed characteristic patterns of gene expression that differed between mango peel and flesh, and between varieties. The combination of mango expressed sequence tags and availability of well-established reference PF biosynthetic genes from other plant species allowed the identification of coding sequences of genes that may lead to the formation of important flavonoid compounds in mango fruits and facilitated characterisation of single nucleotide polymorphisms between varieties. We discovered an association between the extent of sequence variation and

Generation and Analysis of a Large-Scale Expressed Sequence Tag Database from a Full-Length Enriched cDNA Library of Developing Leaves of Gossypium hirsutum L

PubMed Central

Pang, Chaoyou; Fan, Shuli; Song, Meizhen; Yu, Shuxun

2013-01-01

Background Cotton (Gossypium hirsutum L.) is one of the world’s most economically-important crops. However, its entire genome has not been sequenced, and limited resources are available in GenBank for understanding the molecular mechanisms underlying leaf development and senescence. Methodology/Principal Findings In this study, 9,874 high-quality ESTs were generated from a normalized, full-length cDNA library derived from pooled RNA isolated from throughout leaf development during the plant blooming stage. After clustering and assembly of these ESTs, 5,191 unique sequences, representative 1,652 contigs and 3,539 singletons, were obtained. The average unique sequence length was 682 bp. Annotation of these unique sequences revealed that 84.4% showed significant homology to sequences in the NCBI non-redundant protein database, and 57.3% had significant hits to known proteins in the Swiss-Prot database. Comparative analysis indicated that our library added 2,400 ESTs and 991 unique sequences to those known for cotton. The unigenes were functionally characterized by gene ontology annotation. We identified 1,339 and 200 unigenes as potential leaf senescence-related genes and transcription factors, respectively. Moreover, nine genes related to leaf senescence and eleven MYB transcription factors were randomly selected for quantitative real-time PCR (qRT-PCR), which revealed that these genes were regulated differentially during senescence. The qRT-PCR for three GhYLSs revealed that these genes express express preferentially in senescent leaves. Conclusions/Significance These EST resources will provide valuable sequence information for gene expression profiling analyses and functional genomics studies to elucidate their roles, as well as for studying the mechanisms of leaf development and senescence in cotton and discovering candidate genes related to important agronomic traits of cotton. These data will also facilitate future whole-genome sequence assembly and annotation
Identification of cellular MMP substrates using quantitative proteomics: isotope-coded affinity tags (ICAT) and isobaric tags for relative and absolute quantification (iTRAQ).

PubMed

Butler, Georgina S; Dean, Richard A; Morrison, Charlotte J; Overall, Christopher M

2010-01-01

Identification of protease substrates is essential to understand the functional consequences of normal proteolytic processing and dysregulated proteolysis in disease. Quantitative proteomics and mass spectrometry can be used to identify protease substrates in the cellular context. Here we describe the use of two protein labeling techniques, Isotope-Coded Affinity Tags (ICAT and Isobaric Tags for Relative and Absolute Quantification (iTRAQ), which we have used successfully to identify novel matrix metalloproteinase (MMP) substrates in cell culture systems (1-4). ICAT and iTRAQ can label proteins and protease cleavage products of secreted proteins, protein domains shed from the cell membrane or pericellular matrix of protease-transfected cells that have accumulated in conditioned medium, or cell surface proteins in membrane preparations; isotopically distinct labels are used for control cells. Tryptic digestion and tandem mass spectrometry of the generated fragments enable sequencing of differentially labeled but otherwise identical pooled peptides. The isotopic tag, which is unique for each label, identifies the peptides originating from each sample, for instance, protease-transfected or control cells, and comparison of the peak areas enables relative quantification of the peptide in each sample. Thus proteins present in altered amounts between protease-expressing and null cells are implicated as protease substrates and can be further validated as such.
Expressed sequence tags (ESTs) from immune tissues of turbot (Scophthalmus maximus) challenged with pathogens

PubMed Central

Pardo, Belén G; Fernández, Carlos; Millán, Adrián; Bouza, Carmen; Vázquez-López, Araceli; Vera, Manuel; Alvarez-Dios, José A; Calaza, Manuel; Gómez-Tato, Antonio; Vázquez, María; Cabaleiro, Santiago; Magariños, Beatriz; Lemos, Manuel L; Leiro, José M; Martínez, Paulino

2008-01-01

Background The turbot (Scophthalmus maximus; Scophthalmidae; Pleuronectiformes) is a flatfish species of great relevance for marine aquaculture in Europe. In contrast to other cultured flatfish, very few genomic resources are available in this species. Aeromonas salmonicida and Philasterides dicentrarchi are two pathogens that affect turbot culture causing serious economic losses to the turbot industry. Little is known about the molecular mechanisms for disease resistance and host-pathogen interactions in this species. In this work, thousands of ESTs for functional genomic studies and potential markers linked to ESTs for mapping (microsatellites and single nucleotide polymorphisms (SNPs)) are provided. This information enabled us to obtain a preliminary view of regulated genes in response to these pathogens and it constitutes the basis for subsequent and more accurate microarray analysis. Results A total of 12584 cDNAs partially sequenced from three different cDNA libraries of turbot (Scophthalmus maximus) infected with Aeromonas salmonicida, Philasterides dicentrarchi and from healthy fish were analyzed. Three immune-relevant tissues (liver, spleen and head kidney) were sampled at several time points in the infection process for library construction. The sequences were processed into 9256 high-quality sequences, which constituted the source for the turbot EST database. Clustering and assembly of these sequences, revealed 3482 different putative transcripts, 1073 contigs and 2409 singletons. BLAST searches with public databases detected significant similarity (e-value ≤ 1e-5) in 1766 (50.7%) sequences and 816 of them (23.4%) could be functionally annotated. Two hundred three of these genes (24.9%), encoding for defence/immune-related proteins, were mostly identified for the first time in turbot. Some ESTs showed significant differences in the number of transcripts when comparing the three libraries, suggesting regulation in response to these pathogens. A total of
Candidate Genes Expressed in Tolerant Common Wheat With Resistant to English Grain Aphid.

PubMed

Luo, Kun; Zhang, Gaisheng; Wang, Chunping; Ouellet, Thérèse; Wu, Jingjing; Zhu, Qidi; Zhao, Huiyan

2014-10-01

The English grain aphid, Sitobion avenae (F.) (Hemiptera: Aphididae), is a common worldwide pest of wheat (Triticum aestivum L.). The use of improved resistant cultivars by the farmers is the most effective and environmentally friendly method to control this aphid in the field. The winter wheat genotypes 98-10-35 and Amigo are resistant to S. avenae. To identify genes responsible for resistance to S. avenae in these genotypes, differential-display reverse transcription-polymerase chain reaction was used to identify the corresponding differentially expressed sequences in current study. Two backcross progenies were obtained by crossing the two resistant genotypes with the susceptible genotype 1376. Six potential expected-differential bands were sequenced. Lengths of the expressed sequence tags ranged from 128 to 532 bp. Although these expressed sequences were likely associated with S. avenae resistance, there was one expressed sequence tag located on 7DL chromosome, and its potential function may associate with the ability to maintain photosynthesis in wheat. That serves as an active way for tolerant common wheat with resistant to S. avenae. Cloning the full length of these sequences would help us thoroughly understand the mechanism of wheat resistance to S. avenae and be valuable for breeding cultivars with S. avenae resistance. © 2014 Entomological Society of America.
Parallel tagged next-generation sequencing on pooled samples - a new approach for population genetics in ecology and conservation.

PubMed

Zavodna, Monika; Grueber, Catherine E; Gemmell, Neil J

2013-01-01

Next-generation sequencing (NGS) on pooled samples has already been broadly applied in human medical diagnostics and plant and animal breeding. However, thus far it has been only sparingly employed in ecology and conservation, where it may serve as a useful diagnostic tool for rapid assessment of species genetic diversity and structure at the population level. Here we undertake a comprehensive evaluation of the accuracy, practicality and limitations of parallel tagged amplicon NGS on pooled population samples for estimating species population diversity and structure. We obtained 16S and Cyt b data from 20 populations of Leiopelma hochstetteri, a frog species of conservation concern in New Zealand, using two approaches - parallel tagged NGS on pooled population samples and individual Sanger sequenced samples. Data from each approach were then used to estimate two standard population genetic parameters, nucleotide diversity (π) and population differentiation (FST), that enable population genetic inference in a species conservation context. We found a positive correlation between our two approaches for population genetic estimates, showing that the pooled population NGS approach is a reliable, rapid and appropriate method for population genetic inference in an ecological and conservation context. Our experimental design also allowed us to identify both the strengths and weaknesses of the pooled population NGS approach and outline some guidelines and suggestions that might be considered when planning future projects.
Characterization of transcriptome dynamics during watermelon fruit development: sequencing, assembly, annotation and gene expression profiles.

PubMed

Guo, Shaogui; Liu, Jingan; Zheng, Yi; Huang, Mingyun; Zhang, Haiying; Gong, Guoyi; He, Hongju; Ren, Yi; Zhong, Silin; Fei, Zhangjun; Xu, Yong

2011-09-21

Cultivated watermelon [Citrullus lanatus (Thunb.) Matsum. & Nakai var. lanatus] is an important agriculture crop world-wide. The fruit of watermelon undergoes distinct stages of development with dramatic changes in its size, color, sweetness, texture and aroma. In order to better understand the genetic and molecular basis of these changes and significantly expand the watermelon transcript catalog, we have selected four critical stages of watermelon fruit development and used Roche/454 next-generation sequencing technology to generate a large expressed sequence tag (EST) dataset and a comprehensive transcriptome profile for watermelon fruit flesh tissues. We performed half Roche/454 GS-FLX run for each of the four watermelon fruit developmental stages (immature white, white-pink flesh, red flesh and over-ripe) and obtained 577,023 high quality ESTs with an average length of 302.8 bp. De novo assembly of these ESTs together with 11,786 watermelon ESTs collected from GenBank produced 75,068 unigenes with a total length of approximately 31.8 Mb. Overall 54.9% of the unigenes showed significant similarities to known sequences in GenBank non-redundant (nr) protein database and around two-thirds of them matched proteins of cucumber, the most closely-related species with a sequenced genome. The unigenes were further assigned with gene ontology (GO) terms and mapped to biochemical pathways. More than 5,000 SSRs were identified from the EST collection. Furthermore we carried out digital gene expression analysis of these ESTs and identified 3,023 genes that were differentially expressed during watermelon fruit development and ripening, which provided novel insights into watermelon fruit biology and a comprehensive resource of candidate genes for future functional analysis. We then generated profiles of several interesting metabolites that are important to fruit quality including pigmentation and sweetness. Integrative analysis of metabolite and digital gene expression
Transcriptome-wide analysis of WRKY transcription factors in wheat and their leaf rust responsive expression profiling.

PubMed

Satapathy, Lopamudra; Singh, Dharmendra; Ranjan, Prashant; Kumar, Dhananjay; Kumar, Manish; Prabhu, Kumble Vinod; Mukhopadhyay, Kunal

2014-12-01

WRKY, a plant-specific transcription factor family, has important roles in pathogen defense, abiotic cues and phytohormone signaling, yet little is known about their roles and molecular mechanism of function in response to rust diseases in wheat. We identified 100 TaWRKY sequences using wheat Expressed Sequence Tag database of which 22 WRKY sequences were novel. Identified proteins were characterized based on their zinc finger motifs and phylogenetic analysis clustered them into six clades consisting of class IIc and class III WRKY proteins. Functional annotation revealed major functions in metabolic and cellular processes in control plants; whereas response to stimuli, signaling and defense in pathogen inoculated plants, their major molecular function being binding to DNA. Tag-based expression analysis of the identified genes revealed differential expression between mock and Puccinia triticina inoculated wheat near isogenic lines. Gene expression was also performed with six rust-related microarray experiments at Gene Expression Omnibus database. TaWRKY10, 15, 17 and 56 were common in both tag-based and microarray-based differential expression analysis and could be representing rust specific WRKY genes. The obtained results will bestow insight into the functional characterization of WRKY transcription factors responsive to leaf rust pathogenesis that can be used as candidate genes in molecular breeding programs to improve biotic stress tolerance in wheat.
Transcriptome sequencing and de novo analysis of the copepod Calanus sinicus using 454 GS FLX.

PubMed

Ning, Juan; Wang, Minxiao; Li, Chaolun; Sun, Song

2013-01-01

Despite their species abundance and primary economic importance, genomic information about copepods is still limited. In particular, genomic resources are lacking for the copepod Calanus sinicus, which is a dominant species in the coastal waters of East Asia. In this study, we performed de novo transcriptome sequencing to produce a large number of expressed sequence tags for the copepod C. sinicus. Copepodid larvae and adults were used as the basic material for transcriptome sequencing. Using 454 pyrosequencing, a total of 1,470,799 reads were obtained, which were assembled into 56,809 high quality expressed sequence tags. Based on their sequence similarity to known proteins, about 14,000 different genes were identified, including members of all major conserved signaling pathways. Transcripts that were putatively involved with growth, lipid metabolism, molting, and diapause were also identified among these genes. Differentially expressed genes related to several processes were found in C. sinicus copepodid larvae and adults. We detected 284,154 single nucleotide polymorphisms (SNPs) that provide a resource for gene function studies. Our data provide the most comprehensive transcriptome resource available for C. sinicus. This resource allowed us to identify genes associated with primary physiological processes and SNPs in coding regions, which facilitated the quantitative analysis of differential gene expression. These data should provide foundation for future genetic and genomic studies of this and related species.
Digital gene expression for non-model organisms

PubMed Central

Hong, Lewis Z.; Li, Jun; Schmidt-Küntzel, Anne; Warren, Wesley C.; Barsh, Gregory S.

2011-01-01

Next-generation sequencing technologies offer new approaches for global measurements of gene expression but are mostly limited to organisms for which a high-quality assembled reference genome sequence is available. We present a method for gene expression profiling called EDGE, or EcoP15I-tagged Digital Gene Expression, based on ultra-high-throughput sequencing of 27-bp cDNA fragments that uniquely tag the corresponding gene, thereby allowing direct quantification of transcript abundance. We show that EDGE is capable of assaying for expression in >99% of genes in the genome and achieves saturation after 6–8 million reads. EDGE exhibits very little technical noise, reveals a large (106) dynamic range of gene expression, and is particularly suited for quantification of transcript abundance in non-model organisms where a high-quality annotated genome is not available. In a direct comparison with RNA-seq, both methods provide similar assessments of relative transcript abundance, but EDGE does better at detecting gene expression differences for poorly expressed genes and does not exhibit transcript length bias. Applying EDGE to laboratory mice, we show that a loss-of-function mutation in the melanocortin 1 receptor (Mc1r), recognized as a Mendelian determinant of yellow hair color in many different mammals, also causes reduced expression of genes involved in the interferon response. To illustrate the application of EDGE to a non-model organism, we examine skin biopsy samples from a cheetah (Acinonyx jubatus) and identify genes likely to control differences in the color of spotted versus non-spotted regions. PMID:21844123
Analysis of expressed sequence tags from Uromyces appendiculatus hyphae and haustoria and their comparison to sequences from other rust fungi.

PubMed

Puthoff, D P; Neelam, A; Ehrenfried, M L; Scheffler, B E; Ballard, L; Song, Q; Campbell, K B; Cooper, B; Tucker, M L

2008-10-01

Hyphae, 2 to 8 days postinoculation (dpi), and haustoria, 5 dpi, were isolated from Uromyces appendiculatus infected bean leaves (Phaseolus vulgaris cv. Pinto 111) and a separate cDNA library prepared for each fungal preparation. Approximately 10,000 hyphae and 2,700 haustoria clones were sequenced from both the 5' and 3' ends. Assembly of all of the fungal sequences yielded 3,359 contigs and 927 singletons. The U. appendiculatus sequences were compared with sequence data for other rust fungi, Phakopsora pachyrhizi, Uromyces fabae, and Puccinia graminis. The U. appendiculatus haustoria library included a large number of genes with unknown cellular function; however, summation of sequences of known cellular function suggested that haustoria at 5 dpi had fewer transcripts linked to protein synthesis in favor of energy metabolism and nutrient uptake. In addition, open reading frames in the U. appendiculatus data set with an N-terminal signal peptide were identified and compared with other proteins putatively secreted from rust fungi. In this regard, a small family of putatively secreted RTP1-like proteins was identified in U. appendiculatus and P. graminis.
Single-step affinity and cost-effective purification of recombinant proteins using the Sepharose-binding lectin-tag from the mushroom Laetiporus sulphureus as fusion partner.

PubMed

Li, Xiao-Jing; Liu, Jin-Ling; Gao, Dong-Sheng; Wan, Wen-Yan; Yang, Xia; Li, Yong-Tao; Chang, Hong-Tao; Chen, Lu; Wang, Chuan-Qing; Zhao, Jun

2016-03-01

Previous research showed that a lectin from the mushroom Laetiporus sulphureus, designed LSL, bound to Sepharose and could be eluted by lactose. In this study, by taking advantage of the strong affinity of LSL-tag for Sepharose, we developed a single-step purification method for LSL-tagged fusion proteins. We utilized unmodified Sepharose-4B as a specific adsorbent and 0.2 M lactose solution as an elution buffer. Fusion proteins of LSL-tag and porcine circovirus capsid protein, designated LSL-Cap was recovered with purity of 90 ± 4%, and yield of 87 ± 3% from crude extract of recombinant Escherichia coli. To enable the remove of LSL-tag, tobacco etch virus (TEV) protease recognition sequence was placed downstream of LSL-tag in the expression vector, and LSL-tagged TEV protease, designated LSL-TEV, was also expressed in E. coli., and was recovered with purity of 82 ± 5%, and yield of 85 ± 2% from crude extract of recombinant E. coli. After digestion of LSL-tagged recombinant proteins with LSL-TEV, the LSL tag and LSL-TEV can be easily removed by passing the digested products through the Sepharose column. It is of worthy noting that the Sepharose can be reused after washing with PBS. The LSL affinity purification method enables rapid and inexpensive purification of LSL-tagged fusion proteins and scale-up production of native proteins. Copyright © 2015 Elsevier Inc. All rights reserved.
Genes are differentially expressed at transcriptional level of Neocaridina denticulata following short-term exposure to nonylphenol.

PubMed

Liu, Chang-Lun; Sung, Hung-Hung

2011-09-01

To assess the toxicity of nonylphenol towards aquatic crustaceans, Neocaridina denticulata were exposed short-term to sublethal concentration (0.001-0.5 mg/L). Following treatment, differentially expressed genes were identified using suppression subtractive hybridization on samples prepared from whole specimens. There were 20 differentially expressed sequence tags that corresponded to known genes and could be divided into six functional classes: defence, translation, metabolism, ribosomal gene expression, respiration, and genes involved in the stress response. Using semi-quantitative RT-PCR, we found that 14 of the differentially expressed sequence tags significantly responded to nonylphenol, including six at a nominal concentration of 0.01 mg/L; among them, 12 genes were down-regulated. These results suggest that under non-lethal concentrations of nonylphenol, the polluted aquatic environment may still present a potential risk to N. denticulata.
Profiling of wheat class III peroxidase genes derived from powdery mildew-attacked epidermis reveals distinct sequence-associated expression patterns.

PubMed

Liu, Guosheng; Sheng, Xiaoyan; Greenshields, David L; Ogieglo, Adam; Kaminskyj, Susan; Selvaraj, Gopalan; Wei, Yangdou

2005-07-01

A cDNA library was constructed from leaf epidermis of diploid wheat (Triticum monococcum) infected with the powdery mildew fungus (Blumeria graminis f. sp. tritici) and was screened for genes encoding peroxidases. From 2,500 expressed sequence tags (ESTs), 36 cDNAs representing 10 peroxidase genes (designated TmPRX1 to TmPRX10) were isolated and further characterized. Alignment of the deduced amino acid sequences and phylogenetic clustering with peroxidases from other plant species demonstrated that these peroxidases fall into four distinct groups. Differential expression and tissue-specific localization among the members were observed during the B. graminis f. sp. tritici attack using Northern blots and reverse-transcriptase polymerase chain reaction analyses. Consistent with its abundance in the EST collection, TmPRX1 expression showed the highest induction during pathogen attack and fluctuated in response to the fungal parasitic stages. TmPRX1 to TmPRX6 were expressed predominantly in mesophyll cells, whereas TmPRX7 to TmPRX10, which feature a putative C-terminal propeptide, were detectable mainly in epidermal cells. Using TmPRX8 as a representative, we demonstrated that its C-terminal propeptide was sufficient to target a green fluorescent protein fusion protein to the vacuoles in onion cells. Finally, differential expression profiles of the TmPRXs after abiotic stresses and signal molecule treatments were used to dissect the potential role of these peroxidases in multiple stress and defense pathways.
Identification and expression of the tig gene coding for trigger factor from psychrophilic bacteria with no information of genome sequence available.

PubMed

Lee, Kyunghee; Choi, Hyojung; Im, Hana

2009-08-01

Trigger factor (TF) plays a key role as a molecular chaperone with a peptidyl-prolyl cis-trans isomerase (PPIase) activity by which cells promote folding of newly synthesized proteins coming out of ribosomes. Since psychrophilic bacteria grow at a quite low temperature, between 4 and 15 degrees C, TF from such bacteria was investigated and compared with that of mesophilic bacteria E. coli in order to offer an explanation of cold-adaptation at a molecular level. Using a combination of gradient PCRs with homologous primers and LA PCR in vitro cloning technology, the tig gene was fully identified from Psychromonas arctica, whose genome sequence is not yet available. The resulting amino acid sequence of the TF was compared with other homologous TFs using sequence alignments to search for common domains. In addition, we have developed a protein expression system, by which TF proteins from P. arctica (PaTF) were produced by IPTG induction upon cloning the tig gene on expression vectors, such as pAED4. We have further examined the role of expressed psychrophilic PaTF on survival against cold treatment at 4 degrees C. Finally, we have attempted the in vitro biochemical characterization of TF proteins with His-tags expressed in a pET system, such as the PPIase activity of PaTF protein. Our results demonstrate that the expressed PaTF proteins helped cells survive against cold environments in vivo and the purified PaTF in vitro display the functional PPIase activity in a concentration dependent manner.
Behavioral tagging of extinction learning.

PubMed

de Carvalho Myskiw, Jociane; Benetti, Fernando; Izquierdo, Iván

2013-01-15

Extinction of contextual fear in rats is enhanced by exposure to a novel environment at 1-2 h before or 1 h after extinction training. This effect is antagonized by administration of protein synthesis inhibitors anisomycin and rapamycin into the hippocampus, but not into the amygdala, immediately after either novelty or extinction training, as well as by the gene expression blocker 5,6-dichloro-1-beta-D-ribofuranosylbenzimidazole administered after novelty training, but not after extinction training. Thus, this effect can be attributed to a mechanism similar to synaptic tagging, through which long-term potentiation can be enhanced by other long-term potentiations or by exposure to a novel environment in a protein synthesis-dependent fashion. Extinction learning produces a tag at the appropriate synapses, whereas novelty learning causes the synthesis of plasticity-related proteins that are captured by the tag, strengthening the synapses that generated this tag.
mESAdb: microRNA Expression and Sequence Analysis Database

PubMed Central

Kaya, Koray D.; Karakülah, Gökhan; Yakıcıer, Cengiz M.; Acar, Aybar C.; Konu, Özlen

2011-01-01

microRNA expression and sequence analysis database (http://konulab.fen.bilkent.edu.tr/mirna/) (mESAdb) is a regularly updated database for the multivariate analysis of sequences and expression of microRNAs from multiple taxa. mESAdb is modular and has a user interface implemented in PHP and JavaScript and coupled with statistical analysis and visualization packages written for the R language. The database primarily comprises mature microRNA sequences and their target data, along with selected human, mouse and zebrafish expression data sets. mESAdb analysis modules allow (i) mining of microRNA expression data sets for subsets of microRNAs selected manually or by motif; (ii) pair-wise multivariate analysis of expression data sets within and between taxa; and (iii) association of microRNA subsets with annotation databases, HUGE Navigator, KEGG and GO. The use of existing and customized R packages facilitates future addition of data sets and analysis tools. Furthermore, the ability to upload and analyze user-specified data sets makes mESAdb an interactive and expandable analysis tool for microRNA sequence and expression data. PMID:21177657
mESAdb: microRNA expression and sequence analysis database.

PubMed

Kaya, Koray D; Karakülah, Gökhan; Yakicier, Cengiz M; Acar, Aybar C; Konu, Ozlen

2011-01-01

microRNA expression and sequence analysis database (http://konulab.fen.bilkent.edu.tr/mirna/) (mESAdb) is a regularly updated database for the multivariate analysis of sequences and expression of microRNAs from multiple taxa. mESAdb is modular and has a user interface implemented in PHP and JavaScript and coupled with statistical analysis and visualization packages written for the R language. The database primarily comprises mature microRNA sequences and their target data, along with selected human, mouse and zebrafish expression data sets. mESAdb analysis modules allow (i) mining of microRNA expression data sets for subsets of microRNAs selected manually or by motif; (ii) pair-wise multivariate analysis of expression data sets within and between taxa; and (iii) association of microRNA subsets with annotation databases, HUGE Navigator, KEGG and GO. The use of existing and customized R packages facilitates future addition of data sets and analysis tools. Furthermore, the ability to upload and analyze user-specified data sets makes mESAdb an interactive and expandable analysis tool for microRNA sequence and expression data.
Genome-Wide Analysis of Differentially Expressed Genes Relevant to Rhizome Formation in Lotus Root (Nelumbo nucifera Gaertn)

PubMed Central

Yin, Jingjing; Li, Liangjun; Chen, Xuehao

2013-01-01

Lotus root is a popular wetland vegetable which produces edible rhizome. At the molecular level, the regulation of rhizome formation is very complex, which has not been sufficiently addressed in research. In this study, to identify differentially expressed genes (DEGs) in lotus root, four libraries (L1 library: stolon stage, L2 library: initial swelling stage, L3 library: middle swelling stage, L4: later swelling stage) were constructed from the rhizome development stages. High-throughput tag-sequencing technique was used which is based on Solexa Genome Analyzer Platform. Approximately 5.0 million tags were sequenced, and 4542104, 4474755, 4777919, and 4750348 clean tags including 151282, 137476, 215872, and 166005 distinct tags were obtained after removal of low quality tags from each library respectively. More than 43% distinct tags were unambiguous tags mapping to the reference genes, and 40% were unambiguous tag-mapped genes. From L1, L2, L3, and L4, total 20471, 18785, 23448, and 21778 genes were annotated, after mapping their functions in existing databases. Profiling of gene expression in L1/L2, L2/L3, and L3/L4 libraries were different among most of the selected 20 DEGs. Most of the DEGs in L1/L2 libraries were relevant to fiber development and stress response, while in L2/L3 and L3/L4 libraries, major of the DEGs were involved in metabolism of energy and storage. All up-regulated transcriptional factors in four libraries and 14 important rhizome formation-related genes in four libraries were also identified. In addition, the expression of 9 genes from identified DEGs was performed by qRT-PCR method. In a summary, this study provides a comprehensive understanding of gene expression during the rhizome formation in lotus root. PMID:23840598
PET-Tool: a software suite for comprehensive processing and managing of Paired-End diTag (PET) sequence data.

PubMed

Chiu, Kuo Ping; Wong, Chee-Hong; Chen, Qiongyu; Ariyaratne, Pramila; Ooi, Hong Sain; Wei, Chia-Lin; Sung, Wing-Kin Ken; Ruan, Yijun

2006-08-25

We recently developed the Paired End diTag (PET) strategy for efficient characterization of mammalian transcriptomes and genomes. The paired end nature of short PET sequences derived from long DNA fragments raised a new set of bioinformatics challenges, including how to extract PETs from raw sequence reads, and correctly yet efficiently map PETs to reference genome sequences. To accommodate and streamline data analysis of the large volume PET sequences generated from each PET experiment, an automated PET data process pipeline is desirable. We designed an integrated computation program package, PET-Tool, to automatically process PET sequences and map them to the genome sequences. The Tool was implemented as a web-based application composed of four modules: the Extractor module for PET extraction; the Examiner module for analytic evaluation of PET sequence quality; the Mapper module for locating PET sequences in the genome sequences; and the Project Manager module for data organization. The performance of PET-Tool was evaluated through the analyses of 2.7 million PET sequences. It was demonstrated that PET-Tool is accurate and efficient in extracting PET sequences and removing artifacts from large volume dataset. Using optimized mapping criteria, over 70% of quality PET sequences were mapped specifically to the genome sequences. With a 2.4 GHz LINUX machine, it takes approximately six hours to process one million PETs from extraction to mapping. The speed, accuracy, and comprehensiveness have proved that PET-Tool is an important and useful component in PET experiments, and can be extended to accommodate other related analyses of paired-end sequences. The Tool also provides user-friendly functions for data quality check and system for multi-layer data management.
The use of coded PCR primers enables high-throughput sequencing of multiple homolog amplification products by 454 parallel sequencing.

PubMed

Binladen, Jonas; Gilbert, M Thomas P; Bollback, Jonathan P; Panitz, Frank; Bendixen, Christian; Nielsen, Rasmus; Willerslev, Eske

2007-02-14

The invention of the Genome Sequence 20 DNA Sequencing System (454 parallel sequencing platform) has enabled the rapid and high-volume production of sequence data. Until now, however, individual emulsion PCR (emPCR) reactions and subsequent sequencing runs have been unable to combine template DNA from multiple individuals, as homologous sequences cannot be subsequently assigned to their original sources. We use conventional PCR with 5'-nucleotide tagged primers to generate homologous DNA amplification products from multiple specimens, followed by sequencing through the high-throughput Genome Sequence 20 DNA Sequencing System (GS20, Roche/454 Life Sciences). Each DNA sequence is subsequently traced back to its individual source through 5'tag-analysis. We demonstrate that this new approach enables the assignment of virtually all the generated DNA sequences to the correct source once sequencing anomalies are accounted for (miss-assignment rate<0.4%). Therefore, the method enables accurate sequencing and assignment of homologous DNA sequences from multiple sources in single high-throughput GS20 run. We observe a bias in the distribution of the differently tagged primers that is dependent on the 5' nucleotide of the tag. In particular, primers 5' labelled with a cytosine are heavily overrepresented among the final sequences, while those 5' labelled with a thymine are strongly underrepresented. A weaker bias also exists with regards to the distribution of the sequences as sorted by the second nucleotide of the dinucleotide tags. As the results are based on a single GS20 run, the general applicability of the approach requires confirmation. However, our experiments demonstrate that 5'primer tagging is a useful method in which the sequencing power of the GS20 can be applied to PCR-based assays of multiple homologous PCR products. The new approach will be of value to a broad range of research areas, such as those of comparative genomics, complete mitochondrial analyses

Global Transcriptome Analysis of the Tentacle of the Jellyfish Cyanea capillata Using Deep Sequencing and Expressed Sequence Tags: Insight into the Toxin- and Degenerative Disease-Related Transcripts

PubMed Central

Liu, Dan; Wang, Qianqian; Ruan, Zengliang; He, Qian; Zhang, Liming

2015-01-01

Background Jellyfish contain diverse toxins and other bioactive components. However, large-scale identification of novel toxins and bioactive components from jellyfish has been hampered by the low efficiency of traditional isolation and purification methods. Results We performed de novo transcriptome sequencing of the tentacle tissue of the jellyfish Cyanea capillata. A total of 51,304,108 reads were obtained and assembled into 50,536 unigenes. Of these, 21,357 unigenes had homologues in public databases, but the remaining unigenes had no significant matches due to the limited sequence information available and species-specific novel sequences. Functional annotation of the unigenes also revealed general gene expression profile characteristics in the tentacle of C. capillata. A primary goal of this study was to identify putative toxin transcripts. As expected, we screened many transcripts encoding proteins similar to several well-known toxin families including phospholipases, metalloproteases, serine proteases and serine protease inhibitors. In addition, some transcripts also resembled molecules with potential toxic activities, including cnidarian CfTX-like toxins with hemolytic activity, plancitoxin-1, venom toxin-like peptide-6, histamine-releasing factor, neprilysin, dipeptidyl peptidase 4, vascular endothelial growth factor A, angiotensin-converting enzyme-like and endothelin-converting enzyme 1-like proteins. Most of these molecules have not been previously reported in jellyfish. Interestingly, we also characterized a number of transcripts with similarities to proteins relevant to several degenerative diseases, including Huntington’s, Alzheimer’s and Parkinson’s diseases. This is the first description of degenerative disease-associated genes in jellyfish. Conclusion We obtained a well-categorized and annotated transcriptome of C. capillata tentacle that will be an important and valuable resource for further understanding of jellyfish at the molecular
Global Transcriptome Analysis of the Tentacle of the Jellyfish Cyanea capillata Using Deep Sequencing and Expressed Sequence Tags: Insight into the Toxin- and Degenerative Disease-Related Transcripts.

PubMed

Liu, Guoyan; Zhou, Yonghong; Liu, Dan; Wang, Qianqian; Ruan, Zengliang; He, Qian; Zhang, Liming

2015-01-01

Jellyfish contain diverse toxins and other bioactive components. However, large-scale identification of novel toxins and bioactive components from jellyfish has been hampered by the low efficiency of traditional isolation and purification methods. We performed de novo transcriptome sequencing of the tentacle tissue of the jellyfish Cyanea capillata. A total of 51,304,108 reads were obtained and assembled into 50,536 unigenes. Of these, 21,357 unigenes had homologues in public databases, but the remaining unigenes had no significant matches due to the limited sequence information available and species-specific novel sequences. Functional annotation of the unigenes also revealed general gene expression profile characteristics in the tentacle of C. capillata. A primary goal of this study was to identify putative toxin transcripts. As expected, we screened many transcripts encoding proteins similar to several well-known toxin families including phospholipases, metalloproteases, serine proteases and serine protease inhibitors. In addition, some transcripts also resembled molecules with potential toxic activities, including cnidarian CfTX-like toxins with hemolytic activity, plancitoxin-1, venom toxin-like peptide-6, histamine-releasing factor, neprilysin, dipeptidyl peptidase 4, vascular endothelial growth factor A, angiotensin-converting enzyme-like and endothelin-converting enzyme 1-like proteins. Most of these molecules have not been previously reported in jellyfish. Interestingly, we also characterized a number of transcripts with similarities to proteins relevant to several degenerative diseases, including Huntington's, Alzheimer's and Parkinson's diseases. This is the first description of degenerative disease-associated genes in jellyfish. We obtained a well-categorized and annotated transcriptome of C. capillata tentacle that will be an important and valuable resource for further understanding of jellyfish at the molecular level and information on the underlying
Gambling on a shortcut to genome sequencing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Roberts, L.

1991-06-21

Almost from the start of the Human Genome Project, a debate has been raging over whether to sequence the entire human genome, all 3 billion bases, or just the genes - a mere 2% or 3% of the genome, and by far the most interesting part. In England, Sydney Brenner convinced the Medical Research Council (MRC) to start with the expressed genes, or complementary DNAs. But the US stance has been that the entire sequence is essential if we are to understand the blueprint of man. Craig Venter of the National Institute of Neurological Disorders and Stroke says that focusingmore » on the expressed genes may be even more useful than expected. His strategy involves randomly selecting clones from cDNA libraries which theoretically contain all the genes that are switched on at a particular time in a particular tissue. Then the researchers sequence just a short stretch of each clone, about 400 to 500 bases, to create can expressed sequence tag or EST. The sequences of these ESTs are then stored in a database. Using that information, other researchers can then recreate that EST by using polymerase chain reaction techniques.« less
Postprandial phase time influences the uptake of TAG from postprandial TAG-rich lipoproteins by THP-1 macrophages.

PubMed

Cabello-Moruno, Rosana; Sinausia, Laura; Botham, Kathleen M; Montero, Emilio; Avella, Michael; Perona, Javier S

2014-11-14

Postprandial TAG-rich lipoproteins (TRL) can be taken up by macrophages, leading to the formation of foam cells, probably via receptor-mediated pathways. The present study was conducted to investigate whether the postprandial time point at which TRL are collected modulates this process. A meal containing refined olive oil was given to nine healthy young men and TRL were isolated from their serum at 2, 4 and 6 h postprandially. The lipid class and apoB compositions of TRL were determined by HPLC and SDS-PAGE, respectively. The accumulation of lipids in macrophages was determined after the incubation of THP-1 macrophages with TRL. The gene expression of candidate receptors was measured by real-time PCR. The highest concentrations of TAG, apoB48 and apoB100 in TRL were observed at 2 h after the consumption of the test meal. However, excessive intracellular TAG accumulation in THP-1 macrophages was observed in response to incubation with TRL isolated at 4 h, when their particle size (estimated as the TAG:apoB ratio) was intermediate. The abundance of mRNA transcripts in macrophages in response to incubation with TRL was down-regulated for LDL receptor (LDLR), slightly up-regulated for VLDL receptor and remained unaltered for LDLR-related protein, but no effect of the postprandial time point was observed. In contrast, the mRNA expression of scavenger receptors SRB1, SRA2 and CD36 was higher when cells were incubated with TRL isolated at 4 h after the consumption of the test meal. In conclusion, TRL led to excessive intracellular TAG accumulation in THP-1 macrophages, which was greater when cells were incubated with intermediate-sized postprandial TRL isolated at 4 h and was associated with a significant increase in the mRNA expression of scavenger receptors.
Improvement of expression level of polysaccharide lyases with new tag GAPDH in E. coli.

PubMed

Chen, Zhenya; Li, Ye; Sun, Xinxiao; Yuan, Qipeng

2016-10-20

Escherichia coli (E. coli) is widely used to express a variety of heterologous proteins. Efforts have been made to enhance the expression level of the desired protein. However, problems still exist to regulate the level of protein expression and therefore, new strategies are needed to overcome those issues. Glyceraldehyde-3-phosphate dehydrogenase (GAPDH) which is properly expressed in E. coli might play a leading role and increase the expression levels of the target proteins. In this study, GAPDH was fused with a target enzyme, ChSase ABC I, an endoeliminase and polysaceharide lyase. Our results confirmed this hypothesis and indicated that GAPDH boosted the expression level of ChSase ABC I with an increase of 2.25 times, while the enzymatic activity with an increase of 2.99 times. The hypothesis were also supported by RT-PCR study and GAPDH was more effective in enhancing the expression level and enzymatic activity as compared to MBP, which is commonly used as fused tag and can improve the soluble expression of target protein. addition, the expression level and enzymatic activity of other polysaceharide lyases were also improved in the presence of GAPDH. The findings of this study prove that GAPDH has a strong effect on enhancing the expression level and enzymatic activity of the target proteins. Copyright © 2016 Elsevier B.V. All rights reserved.
Reference genome sequence of the model plant Setaria

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bennetzen, Jeffrey L; Schmutz, Jeremy; Wang, Hao

We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The ~400-Mb assembly covers ~80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species thatmore » demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).« less
Reference genome sequence of the model plant Setaria.

PubMed

Bennetzen, Jeffrey L; Schmutz, Jeremy; Wang, Hao; Percifield, Ryan; Hawkins, Jennifer; Pontaroli, Ana C; Estep, Matt; Feng, Liang; Vaughn, Justin N; Grimwood, Jane; Jenkins, Jerry; Barry, Kerrie; Lindquist, Erika; Hellsten, Uffe; Deshpande, Shweta; Wang, Xuewen; Wu, Xiaomei; Mitros, Therese; Triplett, Jimmy; Yang, Xiaohan; Ye, Chu-Yu; Mauro-Herrera, Margarita; Wang, Lin; Li, Pinghua; Sharma, Manoj; Sharma, Rita; Ronald, Pamela C; Panaud, Olivier; Kellogg, Elizabeth A; Brutnell, Thomas P; Doust, Andrew N; Tuskan, Gerald A; Rokhsar, Daniel; Devos, Katrien M

2012-05-13

We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The ∼400-Mb assembly covers ∼80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species that demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).
TCC: an R package for comparing tag count data with robust normalization strategies

PubMed Central

2013-01-01

Background Differential expression analysis based on “next-generation” sequencing technologies is a fundamental means of studying RNA expression. We recently developed a multi-step normalization method (called TbT) for two-group RNA-seq data with replicates and demonstrated that the statistical methods available in four R packages (edgeR, DESeq, baySeq, and NBPSeq) together with TbT can produce a well-ranked gene list in which true differentially expressed genes (DEGs) are top-ranked and non-DEGs are bottom ranked. However, the advantages of the current TbT method come at the cost of a huge computation time. Moreover, the R packages did not have normalization methods based on such a multi-step strategy. Results TCC (an acronym for Tag Count Comparison) is an R package that provides a series of functions for differential expression analysis of tag count data. The package incorporates multi-step normalization methods, whose strategy is to remove potential DEGs before performing the data normalization. The normalization function based on this DEG elimination strategy (DEGES) includes (i) the original TbT method based on DEGES for two-group data with or without replicates, (ii) much faster methods for two-group data with or without replicates, and (iii) methods for multi-group comparison. TCC provides a simple unified interface to perform such analyses with combinations of functions provided by edgeR, DESeq, and baySeq. Additionally, a function for generating simulation data under various conditions and alternative DEGES procedures consisting of functions in the existing packages are provided. Bioinformatics scientists can use TCC to evaluate their methods, and biologists familiar with other R packages can easily learn what is done in TCC. Conclusion DEGES in TCC is essential for accurate normalization of tag count data, especially when up- and down-regulated DEGs in one of the samples are extremely biased in their number. TCC is useful for analyzing tag count data
Construction of a Full-Length Enriched cDNA Library and Preliminary Analysis of Expressed Sequence Tags from Bengal Tiger Panthera tigris tigris

PubMed Central

Liu, Changqing; Liu, Dan; Guo, Yu; Lu, Taofeng; Li, Xiangchen; Zhang, Minghai; Ma, Jianzhang; Ma, Yuehui; Guan, Weijun

2013-01-01

In this study, a full-length enriched cDNA library was successfully constructed from Bengal tiger, Panthera tigris tigris, the most well-known wild Animal. Total RNA was extracted from cultured Bengal tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.28 × 106 pfu/mL and 1.56 × 109 pfu/mL respectively. The percentage of recombinants from unamplified library was 90.2% and average length of exogenous inserts was 0.98 kb. A total of 212 individual ESTs with sizes ranging from 356 to 1108 bps were then analyzed. The BLASTX score revealed that 48.1% of the sequences were classified as a strong match, 45.3% as nominal and 6.6% as a weak match. Among the ESTs with known putative function, 26.4% ESTs were found to be related to all kinds of metabolisms, 19.3% ESTs to information storage and processing, 11.3% ESTs to posttranslational modification, protein turnover, chaperones, 11.3% ESTs to transport, 9.9% ESTs to signal transducer/cell communication, 9.0% ESTs to structure protein, 3.8% ESTs to cell cycle, and only 6.6% ESTs classified as novel genes. By EST sequencing, a full-length gene coding ferritin was identified and characterized. The recombinant plasmid pET32a-TAT-Ferritin was constructed, coded for the TAT-Ferritin fusion protein with two 6× His-tags in N and C-terminal. After BCA assay, the concentration of soluble Trx-TAT-Ferritin recombinant protein was 2.32 ± 0.12 mg/mL. These results demonstrated that the reliability and representativeness of the cDNA library attained to the requirements of a standard cDNA library. This library provided a useful platform for the functional genome and transcriptome research of Bengal tigers. PMID:23708105
Construction of a full-length enriched cDNA library and preliminary analysis of expressed sequence tags from Bengal Tiger Panthera tigris tigris.

PubMed

Liu, Changqing; Liu, Dan; Guo, Yu; Lu, Taofeng; Li, Xiangchen; Zhang, Minghai; Ma, Jianzhang; Ma, Yuehui; Guan, Weijun

2013-05-24

In this study, a full-length enriched cDNA library was successfully constructed from Bengal tiger, Panthera tigris tigris, the most well-known wild Animal. Total RNA was extracted from cultured Bengal tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.28 × 106 pfu/mL and 1.56 × 109 pfu/mL respectively. The percentage of recombinants from unamplified library was 90.2% and average length of exogenous inserts was 0.98 kb. A total of 212 individual ESTs with sizes ranging from 356 to 1108 bps were then analyzed. The BLASTX score revealed that 48.1% of the sequences were classified as a strong match, 45.3% as nominal and 6.6% as a weak match. Among the ESTs with known putative function, 26.4% ESTs were found to be related to all kinds of metabolisms, 19.3% ESTs to information storage and processing, 11.3% ESTs to posttranslational modification, protein turnover, chaperones, 11.3% ESTs to transport, 9.9% ESTs to signal transducer/cell communication, 9.0% ESTs to structure protein, 3.8% ESTs to cell cycle, and only 6.6% ESTs classified as novel genes. By EST sequencing, a full-length gene coding ferritin was identified and characterized. The recombinant plasmid pET32a-TAT-Ferritin was constructed, coded for the TAT-Ferritin fusion protein with two 6× His-tags in N and C-terminal. After BCA assay, the concentration of soluble Trx-TAT-Ferritin recombinant protein was 2.32 ± 0.12 mg/mL. These results demonstrated that the reliability and representativeness of the cDNA library attained to the requirements of a standard cDNA library. This library provided a useful platform for the functional genome and transcriptome research of Bengal tigers.
Characterization of transcriptome dynamics during watermelon fruit development: sequencing, assembly, annotation and gene expression profiles

PubMed Central

2011-01-01

Background Cultivated watermelon [Citrullus lanatus (Thunb.) Matsum. & Nakai var. lanatus] is an important agriculture crop world-wide. The fruit of watermelon undergoes distinct stages of development with dramatic changes in its size, color, sweetness, texture and aroma. In order to better understand the genetic and molecular basis of these changes and significantly expand the watermelon transcript catalog, we have selected four critical stages of watermelon fruit development and used Roche/454 next-generation sequencing technology to generate a large expressed sequence tag (EST) dataset and a comprehensive transcriptome profile for watermelon fruit flesh tissues. Results We performed half Roche/454 GS-FLX run for each of the four watermelon fruit developmental stages (immature white, white-pink flesh, red flesh and over-ripe) and obtained 577,023 high quality ESTs with an average length of 302.8 bp. De novo assembly of these ESTs together with 11,786 watermelon ESTs collected from GenBank produced 75,068 unigenes with a total length of approximately 31.8 Mb. Overall 54.9% of the unigenes showed significant similarities to known sequences in GenBank non-redundant (nr) protein database and around two-thirds of them matched proteins of cucumber, the most closely-related species with a sequenced genome. The unigenes were further assigned with gene ontology (GO) terms and mapped to biochemical pathways. More than 5,000 SSRs were identified from the EST collection. Furthermore we carried out digital gene expression analysis of these ESTs and identified 3,023 genes that were differentially expressed during watermelon fruit development and ripening, which provided novel insights into watermelon fruit biology and a comprehensive resource of candidate genes for future functional analysis. We then generated profiles of several interesting metabolites that are important to fruit quality including pigmentation and sweetness. Integrative analysis of metabolite and digital gene
Tagging methyl-CpG-binding domain proteins reveals different spatiotemporal expression and supports distinct functions.

PubMed

Wood, Kathleen H; Johnson, Brian S; Welsh, Sarah A; Lee, Jun Y; Cui, Yue; Krizman, Elizabeth; Brodkin, Edward S; Blendy, Julie A; Robinson, Michael B; Bartolomei, Marisa S; Zhou, Zhaolan

2016-04-01

DNA methylation is recognized by methyl-CpG-binding domain (MBD) proteins. Multiple MBDs are linked to neurodevelopmental disorders in humans and mice. However, the functions of MBD2 are poorly understood. We characterized Mbd2 knockout mice and determined spatiotemporal expression of MBDs and MBD2-NuRD (nucleosome remodeling deacetylase) interactions. We analyzed behavioral phenotypes, generated biotin-tagged MBD1 and MBD2 knockin mice, and performed biochemical studies of MBD2-NuRD. Most behavioral measures are minimally affected in Mbd2 knockout mice. In contrast to other MBDs, MBD2 shows distinct expression patterns. Unlike most MBDs, MBD2 is ubiquitously expressed in all tissues examined and appears dispensable for brain functions measured in this study. We provide novel genetic tools and reveal new directions to investigate MBD2 functions in vivo.
EST-PAC a web package for EST annotation and protein sequence prediction

PubMed Central

Strahm, Yvan; Powell, David; Lefèvre, Christophe

2006-01-01

With the decreasing cost of DNA sequencing technology and the vast diversity of biological resources, researchers increasingly face the basic challenge of annotating a larger number of expressed sequences tags (EST) from a variety of species. This typically consists of a series of repetitive tasks, which should be automated and easy to use. The results of these annotation tasks need to be stored and organized in a consistent way. All these operations should be self-installing, platform independent, easy to customize and amenable to using distributed bioinformatics resources available on the Internet. In order to address these issues, we present EST-PAC a web oriented multi-platform software package for expressed sequences tag (EST) annotation. EST-PAC provides a solution for the administration of EST and protein sequence annotations accessible through a web interface. Three aspects of EST annotation are automated: 1) searching local or remote biological databases for sequence similarities using Blast services, 2) predicting protein coding sequence from EST data and, 3) annotating predicted protein sequences with functional domain predictions. In practice, EST-PAC integrates the BLASTALL suite, EST-Scan2 and HMMER in a relational database system accessible through a simple web interface. EST-PAC also takes advantage of the relational database to allow consistent storage, powerful queries of results and, management of the annotation process. The system allows users to customize annotation strategies and provides an open-source data-management environment for research and education in bioinformatics. PMID:17147782
Construction and Cloning of Reporter-Tagged Replicon cDNA for an In Vitro Replication Study of Murine Norovirus-1 (MNV-1)

PubMed Central

Ahmad, Muhammad Khairi; Tabana, Yasser M; Ahmed, Mowaffaq Adam; Sandai, Doblin Anak; Mohamed, Rafeezul; Ismail, Ida Shazrina; Zulkiflie, Nurulisa; Yunus, Muhammad Amir

2017-01-01

Background A norovirus maintains its viability, infectivity and virulence by its ability to replicate. However, the biological mechanisms of the process remain to be explored. In this work, the NanoLuc™ Luciferase gene was used to develop a reporter-tagged replicon system to study norovirus replication. Methods The NanoLuc™ Luciferase reporter protein was engineered to be expressed as a fusion protein for MNV-1 minor capsid protein, VP2. The foot-and-mouth disease virus 2A (FMDV2A) sequence was inserted between the 3′end of the reporter gene and the VP2 start sequence to allow co-translational ‘cleavage’ of fusion proteins during intracellular transcript expression. Amplification of the fusion gene was performed using a series of standard and overlapping polymerase chain reactions. The resulting amplicon was then cloned into three readily available backbones of MNV-1 cDNA clones. Results Restriction enzyme analysis indicated that the NanoLucTM Luciferase gene was successfully inserted into the parental MNV-1 cDNA clone. The insertion was further confirmed by using DNA sequencing. Conclusion NanoLuc™ Luciferase-tagged MNV-1 cDNA clones were successfully engineered. Such clones can be exploited to develop robust experimental assays for in vitro assessments of viral RNA replication. PMID:29379384
Construction and Cloning of Reporter-Tagged Replicon cDNA for an In Vitro Replication Study of Murine Norovirus-1 (MNV-1).

PubMed

Ahmad, Muhammad Khairi; Tabana, Yasser M; Ahmed, Mowaffaq Adam; Sandai, Doblin Anak; Mohamed, Rafeezul; Ismail, Ida Shazrina; Zulkiflie, Nurulisa; Yunus, Muhammad Amir

2017-12-01

A norovirus maintains its viability, infectivity and virulence by its ability to replicate. However, the biological mechanisms of the process remain to be explored. In this work, the NanoLuc™ Luciferase gene was used to develop a reporter-tagged replicon system to study norovirus replication. The NanoLuc™ Luciferase reporter protein was engineered to be expressed as a fusion protein for MNV-1 minor capsid protein, VP2. The foot-and-mouth disease virus 2A (FMDV2A) sequence was inserted between the 3'end of the reporter gene and the VP2 start sequence to allow co-translational 'cleavage' of fusion proteins during intracellular transcript expression. Amplification of the fusion gene was performed using a series of standard and overlapping polymerase chain reactions. The resulting amplicon was then cloned into three readily available backbones of MNV-1 cDNA clones. Restriction enzyme analysis indicated that the NanoLucTM Luciferase gene was successfully inserted into the parental MNV-1 cDNA clone. The insertion was further confirmed by using DNA sequencing. NanoLuc™ Luciferase-tagged MNV-1 cDNA clones were successfully engineered. Such clones can be exploited to develop robust experimental assays for in vitro assessments of viral RNA replication.
The catalytic activity of a recombinant single chain variable fragment nucleic acid-hydrolysing antibody varies with fusion tag and expression host.

PubMed

Lee, Joungmin; Kim, Minjae; Seo, Youngsil; Lee, Yeonjin; Park, Hyunjoon; Byun, Sung June; Kwon, Myung-Hee

2017-11-01

The antigen-binding properties of single chain Fv antibodies (scFvs) can vary depending on the position and type of fusion tag used, as well as the host cells used for expression. The issue is even more complicated with a catalytic scFv antibody that binds and hydrolyses a specific antigen. Herein, we investigated the antigen-binding and -hydrolysing activities of the catalytic anti-nucleic acid antibody 3D8 scFv expressed in Escherichia coli or HEK293f cells with or without additional amino acid residues at the N- and C-termini. DNA-binding activity was retained in all recombinant forms. However, the DNA-hydrolysing activity varied drastically between forms. The DNA-hydrolysing activity of E. coli-derived 3D8 scFvs was not affected by the presence of a C-terminal human influenza haemagglutinin (HA) or His tag. By contrast, the activity of HEK293f-derived 3D8 scFvs was completely lost when additional residues were included at the N-terminus and/or when a His tag was incorporated at the C-terminus, whereas a HA tag at the C-terminus did not diminish activity. Thus, we demonstrate that the antigen-binding and catalytic activities of a catalytic antibody can be separately affected by the presence of additional residues at the N- and C-termini, and by the host cell type. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Cyanine-based probe\\tag-peptide pair fluorescence protein imaging and fluorescence protein imaging methods

DOEpatents

Mayer-Cumblidge, M. Uljana; Cao, Haishi

2013-01-15

A molecular probe comprises two arsenic atoms and at least one cyanine based moiety. A method of producing a molecular probe includes providing a molecule having a first formula, treating the molecule with HgOAc, and subsequently transmetallizing with AsCl.sub.3. The As is liganded to ethanedithiol to produce a probe having a second formula. A method of labeling a peptide includes providing a peptide comprising a tag sequence and contacting the peptide with a biarsenical molecular probe. A complex is formed comprising the tag sequence and the molecular probe. A method of studying a peptide includes providing a mixture containing a peptide comprising a peptide tag sequence, adding a biarsenical probe to the mixture, and monitoring the fluorescence of the mixture.
Inferring the expression variability of human transposable element-derived exons by linear model analysis of deep RNA sequencing data.

PubMed

Zhang, Wensheng; Edwards, Andrea; Fan, Wei; Fang, Zhide; Deininger, Prescott; Zhang, Kun

2013-08-28

The exonization of transposable elements (TEs) has proven to be a significant mechanism for the creation of novel exons. Existing knowledge of the retention patterns of TE exons in mRNAs were mainly established by the analysis of Expressed Sequence Tag (EST) data and microarray data. This study seeks to validate and extend previous studies on the expression of TE exons by an integrative statistical analysis of high throughput RNA sequencing data. We collected 26 RNA-seq datasets spanning multiple tissues and cancer types. The exon-level digital expressions (indicating retention rates in mRNAs) were quantified by a double normalized measure, called the rescaled RPKM (Reads Per Kilobase of exon model per Million mapped reads). We analyzed the distribution profiles and the variability (across samples and between tissue/disease groups) of TE exon expressions, and compared them with those of other constitutive or cassette exons. We inferred the effects of four genomic factors, including the location, length, cognate TE family and TE nucleotide proportion (RTE, see Methods section) of a TE exon, on the exons' expression level and expression variability. We also investigated the biological implications of an assembly of highly-expressed TE exons. Our analysis confirmed prior studies from the following four aspects. First, with relatively high expression variability, most TE exons in mRNAs, especially those without exact counterparts in the UCSC RefSeq (Reference Sequence) gene tables, demonstrate low but still detectable expression levels in most tissue samples. Second, the TE exons in coding DNA sequences (CDSs) are less highly expressed than those in 3' (5') untranslated regions (UTRs). Third, the exons derived from chronologically ancient repeat elements, such as MIRs, tend to be highly expressed in comparison with those derived from younger TEs. Fourth, the previously observed negative relationship between the lengths of exons and the inclusion levels in transcripts
Single-Cell RNA-Sequencing: Assessment of Differential Expression Analysis Methods.

PubMed

Dal Molin, Alessandra; Baruzzo, Giacomo; Di Camillo, Barbara

2017-01-01

The sequencing of the transcriptomes of single-cells, or single-cell RNA-sequencing, has now become the dominant technology for the identification of novel cell types and for the study of stochastic gene expression. In recent years, various tools for analyzing single-cell RNA-sequencing data have been proposed, many of them with the purpose of performing differentially expression analysis. In this work, we compare four different tools for single-cell RNA-sequencing differential expression, together with two popular methods originally developed for the analysis of bulk RNA-sequencing data, but largely applied to single-cell data. We discuss results obtained on two real and one synthetic dataset, along with considerations about the perspectives of single-cell differential expression analysis. In particular, we explore the methods performance in four different scenarios, mimicking different unimodal or bimodal distributions of the data, as characteristic of single-cell transcriptomics. We observed marked differences between the selected methods in terms of precision and recall, the number of detected differentially expressed genes and the overall performance. Globally, the results obtained in our study suggest that is difficult to identify a best performing tool and that efforts are needed to improve the methodologies for single-cell RNA-sequencing data analysis and gain better accuracy of results.
Analysis of expressed sequence tags (ESTs) from cocoa (Theobroma cacao L) upon infection with Phytophthora megakarya.

PubMed

Naganeeswaran, Sudalaimuthu Asari; Subbian, Elain Apshara; Ramaswamy, Manimekalai

2012-01-01

Phytophthora megakarya, the causative agent of cacao black pod disease in West African countries causes an extensive loss of yield. In this study we have analyzed 4 libraries of ESTs derived from Phytophthora megakarya infected cocoa leaf and pod tissues. Totally 6379 redundant sequences were retrieved from ESTtik database and EST processing was performed using seqclean tool. Clustering and assembling using CAP3 generated 3333 non-redundant (907 contigs and 2426 singletons) sequences. The primary sequence analysis of 3333 non-redundant sequences showed that the GC percentage was 42.7 and the sequence length ranged from 101 - 2576 nucleotides. Further, functional analysis (Blast, Interproscan, Gene ontology and KEGG search) were executed and 1230 orthologous genes were annotated. Totally 272 enzymes corresponding to 114 metabolic pathways were identified. Functional annotation revealed that most of the sequences are related to molecular function, stress response and biological processes. The annotated enzymes are aldehyde dehydrogenase (E.C: 1.2.1.3), catalase (E.C: 1.11.1.6), acetyl-CoA C-acetyltransferase (E.C: 2.3.1.9), threonine ammonia-lyase (E.C: 4.3.1.19), acetolactate synthase (E.C: 2.2.1.6), O-methyltransferase (E.C: 2.1.1.68) which play an important role in amino acid biosynthesis and phenyl propanoid biosynthesis. All this information was stored in MySQL database management system to be used in future for reconstruction of biotic stress response pathway in cocoa.

DAMe: a toolkit for the initial processing of datasets with PCR replicates of double-tagged amplicons for DNA metabarcoding analyses.

PubMed

Zepeda-Mendoza, Marie Lisandra; Bohmann, Kristine; Carmona Baez, Aldo; Gilbert, M Thomas P

2016-05-03

DNA metabarcoding is an approach for identifying multiple taxa in an environmental sample using specific genetic loci and taxa-specific primers. When combined with high-throughput sequencing it enables the taxonomic characterization of large numbers of samples in a relatively time- and cost-efficient manner. One recent laboratory development is the addition of 5'-nucleotide tags to both primers producing double-tagged amplicons and the use of multiple PCR replicates to filter erroneous sequences. However, there is currently no available toolkit for the straightforward analysis of datasets produced in this way. We present DAMe, a toolkit for the processing of datasets generated by double-tagged amplicons from multiple PCR replicates derived from an unlimited number of samples. Specifically, DAMe can be used to (i) sort amplicons by tag combination, (ii) evaluate PCR replicates dissimilarity, and (iii) filter sequences derived from sequencing/PCR errors, chimeras, and contamination. This is attained by calculating the following parameters: (i) sequence content similarity between the PCR replicates from each sample, (ii) reproducibility of each unique sequence across the PCR replicates, and (iii) copy number of the unique sequences in each PCR replicate. We showcase the insights that can be obtained using DAMe prior to taxonomic assignment, by applying it to two real datasets that vary in their complexity regarding number of samples, sequencing libraries, PCR replicates, and used tag combinations. Finally, we use a third mock dataset to demonstrate the impact and importance of filtering the sequences with DAMe. DAMe allows the user-friendly manipulation of amplicons derived from multiple samples with PCR replicates built in a single or multiple sequencing libraries. It allows the user to: (i) collapse amplicons into unique sequences and sort them by tag combination while retaining the sample identifier and copy number information, (ii) identify sequences carrying
Finding similar nucleotide sequences using network BLAST searches.

PubMed

Ladunga, Istvan

2009-06-01

The Basic Local Alignment Search Tool (BLAST) is a keystone of bioinformatics due to its performance and user-friendliness. Beginner and intermediate users will learn how to design and submit blastn and Megablast searches on the Web pages at the National Center for Biotechnology Information. We map nucleic acid sequences to genomes, find identical or similar mRNA, expressed sequence tag, and noncoding RNA sequences, and run Megablast searches, which are much faster than blastn. Understanding results is assisted by taxonomy reports, genomic views, and multiple alignments. We interpret expected frequency thresholds, biological significance, and statistical significance. Weak hits provide no evidence, but hints for further analyses. We find genes that may code for homologous proteins by translated BLAST. We reduce false positives by filtering out low-complexity regions. Parsed BLAST results can be integrated into analysis pipelines. Links in the output connect to Entrez, PUBMED, structural, sequence, interaction, and expression databases. This facilitates integration with a wide spectrum of biological knowledge.
Somatodendritic surface expression of epitope-tagged and KChIP binding-deficient Kv4.2 channels in hippocampal neurons.

PubMed

Prechtel, Helena; Hartmann, Sven; Minge, Daniel; Bähring, Robert

2018-01-01

Kv4.2 channels mediate a subthreshold-activating somatodendritic A-type current (ISA) in hippocampal neurons. We examined the role of accessory Kv channel interacting protein (KChIP) binding in somatodendritic surface expression and activity-dependent decrease in the availability of Kv4.2 channels. For this purpose we transfected cultured hippocampal neurons with cDNA coding for Kv4.2 wild-type (wt) or KChIP binding-deficient Kv4.2 mutants. All channels were equipped with an externally accessible hemagglutinin (HA)-tag and an EGFP-tag, which was attached to the C-terminal end. Combined analyses of EGFP self-fluorescence, surface HA immunostaining and patch-clamp recordings demonstrated similar dendritic trafficking and functional surface expression for Kv4.2[wt]HA,EGFP and the KChIP binding-deficient Kv4.2[A14K]HA,EGFP. Coexpression of exogenous KChIP2 augmented the surface expression of Kv4.2[wt]HA,EGFP but not Kv4.2[A14K]HA,EGFP. Notably, activity-dependent decrease in availability was more pronounced in Kv4.2[wt]HA,EGFP + KChIP2 coexpressing than in Kv4.2[A14K]HA,EGFP + KChIP2 coexpressing neurons. Our results do not support the notion that accessory KChIP binding is a prerequisite for dendritic trafficking and functional surface expression of Kv4.2 channels, however, accessory KChIP binding may play a potential role in Kv4.2 modulation during intrinsic plasticity processes.
Gene expression profiling via LongSAGE in a non-model plant species: a case study in seeds of Brassica napus

PubMed Central

Obermeier, Christian; Hosseini, Bashir; Friedt, Wolfgang; Snowdon, Rod

2009-01-01

Background Serial analysis of gene expression (LongSAGE) was applied for gene expression profiling in seeds of oilseed rape (Brassica napus ssp. napus). The usefulness of this technique for detailed expression profiling in a non-model organism was demonstrated for the highly complex, neither fully sequenced nor annotated genome of B. napus by applying a tag-to-gene matching strategy based on Brassica ESTs and the annotated proteome of the closely related model crucifer A. thaliana. Results Transcripts from 3,094 genes were detected at two time-points of seed development, 23 days and 35 days after pollination (DAP). Differential expression showed a shift from gene expression involved in diverse developmental processes including cell proliferation and seed coat formation at 23 DAP to more focussed metabolic processes including storage protein accumulation and lipid deposition at 35 DAP. The most abundant transcripts at 23 DAP were coding for diverse protease inhibitor proteins and proteases, including cysteine proteases involved in seed coat formation and a number of lipid transfer proteins involved in embryo pattern formation. At 35 DAP, transcripts encoding napin, cruciferin and oleosin storage proteins were most abundant. Over both time-points, 18.6% of the detected genes were matched by Brassica ESTs identified by LongSAGE tags in antisense orientation. This suggests a strong involvement of antisense transcript expression in regulatory processes during B. napus seed development. Conclusion This study underlines the potential of transcript tagging approaches for gene expression profiling in Brassica crop species via EST matching to annotated A. thaliana genes. Limits of tag detection for low-abundance transcripts can today be overcome by ultra-high throughput sequencing approaches, so that tag-based gene expression profiling may soon become the method of choice for global expression profiling in non-model species. PMID:19575793
A Cleavable N-Terminal Signal Peptide Promotes Widespread Olfactory Receptor Surface Expression in HEK293T Cells

PubMed Central

Shepard, Blythe D.; Natarajan, Niranjana; Protzko, Ryan J.; Acres, Omar W.; Pluznick, Jennifer L.

2013-01-01

Olfactory receptors (ORs) are G protein-coupled receptors that detect odorants in the olfactory epithelium, and comprise the largest gene family in the genome. Identification of OR ligands typically requires OR surface expression in heterologous cells; however, ORs rarely traffic to the cell surface when exogenously expressed. Therefore, most ORs are orphan receptors with no known ligands. To date, studies have utilized non-cleavable rhodopsin (Rho) tags and/or chaperones (i.e. Receptor Transporting Protein, RTP1S, Ric8b and Gαolf) to improve surface expression. However, even with these tools, many ORs still fail to reach the cell surface. We used a test set of fifteen ORs to examine the effect of a cleavable leucine-rich signal peptide sequence (Lucy tag) on OR surface expression in HEK293T cells. We report here that the addition of the Lucy tag to the N-terminus increases the number of ORs reaching the cell surface to 7 of the 15 ORs (as compared to 3/15 without Rho or Lucy tags). Moreover, when ORs tagged with both Lucy and Rho were co-expressed with previously reported chaperones (RTP1S, Ric8b and Gαolf), we observed surface expression for all 15 receptors examined. In fact, two-thirds of Lucy-tagged ORs are able to reach the cell surface synergistically with chaperones even when the Rho tag is removed (10/15 ORs), allowing for the potential assessment of OR function with only an 8-amino acid Flag tag on the mature protein. As expected for a signal peptide, the Lucy tag was cleaved from the mature protein and did not alter OR-ligand binding and signaling. Our studies demonstrate that widespread surface expression of ORs can be achieved in HEK293T cells, providing promise for future large-scale deorphanization studies. PMID:23840901
Expression of CB2 cannabinoid receptor in Pichia pastoris.

PubMed

Feng, Wenke; Cai, Jian; Pierce, William M; Song, Zhao-Hui

2002-12-01

To facilitate purification and structural characterization, the CB2 cannabinoid receptor is expressed in methylotrophic yeast Pichia pastoris. The expression plasmids were constructed in which the CB2 gene is under the control of the highly inducible promoter of P. pastoris alcohol oxidase 1 gene. A c-myc epitope and a hexahistidine tag were introduced at the C-terminal of the CB2 to permit easy detection and purification. In membrane preparations of CB2 gene transformed yeast cells, Western blot analysis detected the expression of CB2 proteins. Radioligand binding assays demonstrated that the CB2 receptors expressed in P. pastoris have a pharmacological profile similar to that of the receptors expressed in mammalian systems. Furthermore, the epitope-tagged receptor was purified by metal chelating chromatography and the purified CB2 preparations were subjected to digestion by trypsin. MALDI/TOF mass spectrometry analysis of the peptides extracted from tryptic digestions detected 14 peptide fragments derived from the CB2 receptor. ESI mass spectrometry was used to sequence one of these peptide fragments, thus, further confirming the identity of the purified receptor. In conclusion, these data demonstrated for the first time that epitope-tagged, functional CB2 cannabinoid receptor can be expressed in P. pastoris for purification.
Cyanine-based probe\\tag-peptide pair for fluorescence protein imaging and fluorescence protein imaging methods

DOEpatents

Mayer-Cumblidge, M Uljana [Richland, WA; Cao, Haishi [Richland, WA

2010-08-17

A molecular probe comprises two arsenic atoms and at least one cyanine based moiety. A method of producing a molecular probe includes providing a molecule having a first formula, treating the molecule with HgOAc, and subsequently transmetallizing with AsCl.sub.3. The As is liganded to ethanedithiol to produce a probe having a second formula. A method of labeling a peptide includes providing a peptide comprising a tag sequence and contacting the peptide with a biarsenical molecular probe. A complex is formed comprising the tag sequence and the molecular probe. A method of studying a peptide includes providing a mixture containing a peptide comprising a peptide tag sequence, adding a biarsenical probe to the mixture, and monitoring the fluorescence of the mixture.
Differential expression of a novel gene during seed triacylglycerol accumulation in lupin species ( Lupinus angustifolius L. and L. mutabilis L.).

PubMed

Francki, Michael G; Whitaker, Peta; Smith, Penelope M; Atkins, Craig A

2002-11-01

Seed triacylglycerols (TAGs) are stored as energy reserves and extracted for various end-product uses. In lupins, seed oil content varies from 16% in Lupinus mutabilisto 8% in L. angustifolius. We have shown that TAGs rapidly accumulate during mid-stages of seed development in L. mutabilis compared to the lower seed oil species, L. angustifolius. In this study, we have targeted the key enzymes of the lipid biosynthetic pathway, acetyl-CoA carboxylase (ACCase) and diacylglycerol acyltransferase (DAGAT), to determine factors regulating TAG accumulation between two lupin species. A twofold increase in ACCase activity was observed in L. mutabilis relative to L. angustifolius and correlated with rapid TAG accumulation. No difference in DAGAT activity was detected. We have identified, cloned and partially characterised a novel gene differentially expressed during TAG accumulation between L. angustifolius and L. mutabilis. The gene has some identity to the glucose dehydrogenase family previously described in barley and bacteria and the significance of its expression levels during seed development in relation to TAG accumulation is discussed. DNA sequence analysis of the promoter in both L. angustifolius and L. mutabilis identified putative matrix attachment regions and recognition sequences for transcription binding sites similar to those found in the Adh1 gene from Arabidopsis. The identical promoter regions between species indicate that differential gene expression is controlled by alternative transcription factors, accessibility to binding sites or a combination of both.
Assembly of 500,000 inter-specific catfish expressed sequence tags and large scale gene-associated marker development for whole genome association studies

DOE Office of Scientific and Technical Information (OSTI.GOV)

Catfish Genome Consortium; Wang, Shaolin; Peatman, Eric

2010-03-23

Background-Through the Community Sequencing Program, a catfish EST sequencing project was carried out through a collaboration between the catfish research community and the Department of Energy's Joint Genome Institute. Prior to this project, only a limited EST resource from catfish was available for the purpose of SNP identification. Results-A total of 438,321 quality ESTs were generated from 8 channel catfish (Ictalurus punctatus) and 4 blue catfish (Ictalurus furcatus) libraries, bringing the number of catfish ESTs to nearly 500,000. Assembly of all catfish ESTs resulted in 45,306 contigs and 66,272 singletons. Over 35percent of the unique sequences had significant similarities tomore » known genes, allowing the identification of 14,776 unique genes in catfish. Over 300,000 putative SNPs have been identified, of which approximately 48,000 are high-quality SNPs identified from contigs with at least four sequences and the minor allele presence of at least two sequences in the contig. The EST resource should be valuable for identification of microsatellites, genome annotation, large-scale expression analysis, and comparative genome analysis. Conclusions-This project generated a large EST resource for catfish that captured the majority of the catfish transcriptome. The parallel analysis of ESTs from two closely related Ictalurid catfishes should also provide powerful means for the evaluation of ancient and recent gene duplications, and for the development of high-density microarrays in catfish. The inter- and intra-specific SNPs identified from all catfish EST dataset assembly will greatly benefit the catfish introgression breeding program and whole genome association studies.« less
Digital gene expression analysis of gene expression differences within Brassica diploids and allopolyploids.

PubMed

Jiang, Jinjin; Wang, Yue; Zhu, Bao; Fang, Tingting; Fang, Yujie; Wang, Youping

2015-01-27

Brassica includes many successfully cultivated crop species of polyploid origin, either by ancestral genome triplication or by hybridization between two diploid progenitors, displaying complex repetitive sequences and transposons. The U's triangle, which consists of three diploids and three amphidiploids, is optimal for the analysis of complicated genomes after polyploidization. Next-generation sequencing enables the transcriptome profiling of polyploids on a global scale. We examined the gene expression patterns of three diploids (Brassica rapa, B. nigra, and B. oleracea) and three amphidiploids (B. napus, B. juncea, and B. carinata) via digital gene expression analysis. In total, the libraries generated between 5.7 and 6.1 million raw reads, and the clean tags of each library were mapped to 18547-21995 genes of B. rapa genome. The unambiguous tag-mapped genes in the libraries were compared. Moreover, the majority of differentially expressed genes (DEGs) were explored among diploids as well as between diploids and amphidiploids. Gene ontological analysis was performed to functionally categorize these DEGs into different classes. The Kyoto Encyclopedia of Genes and Genomes analysis was performed to assign these DEGs into approximately 120 pathways, among which the metabolic pathway, biosynthesis of secondary metabolites, and peroxisomal pathway were enriched. The non-additive genes in Brassica amphidiploids were analyzed, and the results indicated that orthologous genes in polyploids are frequently expressed in a non-additive pattern. Methyltransferase genes showed differential expression pattern in Brassica species. Our results provided an understanding of the transcriptome complexity of natural Brassica species. The gene expression changes in diploids and allopolyploids may help elucidate the morphological and physiological differences among Brassica species.
A part toolbox to tune genetic expression in Bacillus subtilis

PubMed Central

Guiziou, Sarah; Sauveplane, Vincent; Chang, Hung-Ju; Clerté, Caroline; Declerck, Nathalie; Jules, Matthieu; Bonnet, Jerome

2016-01-01

Libraries of well-characterised components regulating gene expression levels are essential to many synthetic biology applications. While widely available for the Gram-negative model bacterium Escherichia coli, such libraries are lacking for the Gram-positive model Bacillus subtilis, a key organism for basic research and biotechnological applications. Here, we engineered a genetic toolbox comprising libraries of promoters, Ribosome Binding Sites (RBS), and protein degradation tags to precisely tune gene expression in B. subtilis. We first designed a modular Expression Operating Unit (EOU) facilitating parts assembly and modifications and providing a standard genetic context for gene circuits implementation. We then selected native, constitutive promoters of B. subtilis and efficient RBS sequences from which we engineered three promoters and three RBS sequence libraries exhibiting ∼14 000-fold dynamic range in gene expression levels. We also designed a collection of SsrA proteolysis tags of variable strength. Finally, by using fluorescence fluctuation methods coupled with two-photon microscopy, we quantified the absolute concentration of GFP in a subset of strains from the library. Our complete promoters and RBS sequences library comprising over 135 constructs enables tuning of GFP concentration over five orders of magnitude, from 0.05 to 700 μM. This toolbox of regulatory components will support many research and engineering applications in B. subtilis. PMID:27402159
Automated sample-preparation technologies in genome sequencing projects.

PubMed

Hilbert, H; Lauber, J; Lubenow, H; Düsterhöft, A

2000-01-01

A robotic workstation system (BioRobot 96OO, QIAGEN) and a 96-well UV spectrophotometer (Spectramax 250, Molecular Devices) were integrated in to the process of high-throughput automated sequencing of double-stranded plasmid DNA templates. An automated 96-well miniprep kit protocol (QIAprep Turbo, QIAGEN) provided high-quality plasmid DNA from shotgun clones. The DNA prepared by this procedure was used to generate more than two mega bases of final sequence data for two genomic projects (Arabidopsis thaliana and Schizosaccharomyces pombe), three thousand expressed sequence tags (ESTs) plus half a mega base of human full-length cDNA clones, and approximately 53,000 single reads for a whole genome shotgun project (Pseudomonas putida).
Simple purification method for a recombinantly expressed native His-tag-free aminopeptidase A from Lactobacillus delbrueckii.

PubMed

Stressler, Timo; Tanzer, Coralie; Ewert, Jacob; Claaßen, Wolfgang; Fischer, Lutz

2017-03-01

The aminopeptidase A (PepA; EC 3.4.11.7) is an intracellular exopeptidase present in lactic acid bacteria. The PepA cleaves glutamyl/aspartyl residues from the N-terminal end of peptides and can, therefore, be applied for the production of protein hydrolysates with an increased amount of these amino acids, which results in a savory taste (umami). The first PepA from a lactobacilli strain was recombinantly expressed in Escherichia coli in a recently published study and harbored a C-terminal His 6 -tag for easier purification. Due to the fact that a His-tag might influence the properties of an enzyme, a simple purification method for the non-His-tagged PepA was required. Surprisingly, the PepA precipitated at a very low ammonium sulfate concentration of 5%. Unusual for a precipitating step, the purity of PepA was over 95% and the obtained activity yield was 110%. The high purity allows biochemical characterization and kinetic investigation. As a result, the optimum pH (6.0-6.5) and temperature (60-65 °C) were comparable to the His 6 -tag harboring PepA; the K M value was at 0.79 mM slightly lower compared to 1.21 mM, respectively. Since PepA is a homo dodecamer, it has a high molecular mass of approximately 480 kDa. Therefore, a subsequent preparative size-exclusion chromatography (SEC) step seemed promising. The PepA after SEC was purified to homogeneity. In summary, the simple two-step purification method presented can be applied to purify high amounts of PepA that will allow the performance of experiments in the future to crystalize PepA for the first time. Copyright © 2016 Elsevier Inc. All rights reserved.
Generation and characterization of a recombinant Rift Valley fever virus expressing a V5 epitope-tagged RNA-dependent RNA polymerase.

PubMed

Brennan, Benjamin; Li, Ping; Elliott, Richard M

2011-12-01

The viral RNA-dependent RNA polymerase (RdRp; L protein) of Rift Valley fever virus (RVFV; family Bunyaviridae) is a 238 kDa protein that is crucial for the life cycle of the virus, as it catalyses both transcription of viral mRNAs and replication of the tripartite genome. Despite its importance, little is known about the intracellular distribution of the polymerase or its other roles during infection, primarily because of lack of specific antibodies that recognize L protein. To begin to address these questions we investigated whether the RVFV (MP12 strain) polymerase could tolerate insertion of the V5 epitope, as has been previously demonstrated for the Bunyamwera virus L protein. Insertion of the 14 aa epitope into the polymerase sequence at aa 1852 resulted in a polymerase that retained functionality in a minigenome assay, and we were able to rescue recombinant viruses that expressed the modified L protein by reverse genetics. The L protein could be detected in infected cells by Western blotting with anti-V5 antibodies. Examination of recombinant virus-infected cells by immunofluorescence revealed a punctate perinuclear or cytoplasmic distribution of the polymerase that co-localized with the nucleocapsid protein. The generation of RVFV expressing a tagged RdRp will allow detailed examination of the role of the viral polymerase in the virus life cycle.
Multiple tag labeling method for DNA sequencing

DOEpatents

Mathies, R.A.; Huang, X.C.; Quesada, M.A.

1995-07-25

A DNA sequencing method is described which uses single lane or channel electrophoresis. Sequencing fragments are separated in the lane and detected using a laser-excited, confocal fluorescence scanner. Each set of DNA sequencing fragments is separated in the same lane and then distinguished using a binary coding scheme employing only two different fluorescent labels. Also described is a method of using radioisotope labels. 5 figs.
Multiple tag labeling method for DNA sequencing

DOEpatents

Mathies, Richard A.; Huang, Xiaohua C.; Quesada, Mark A.

1995-01-01

A DNA sequencing method described which uses single lane or channel electrophoresis. Sequencing fragments are separated in said lane and detected using a laser-excited, confocal fluorescence scanner. Each set of DNA sequencing fragments is separated in the same lane and then distinguished using a binary coding scheme employing only two different fluorescent labels. Also described is a method of using radio-isotope labels.
Digital gene expression analysis with sample multiplexing and PCR duplicate detection: A straightforward protocol.

PubMed

Rozenberg, Andrey; Leese, Florian; Weiss, Linda C; Tollrian, Ralph

2016-01-01

Tag-Seq is a high-throughput approach used for discovering SNPs and characterizing gene expression. In comparison to RNA-Seq, Tag-Seq eases data processing and allows detection of rare mRNA species using only one tag per transcript molecule. However, reduced library complexity raises the issue of PCR duplicates, which distort gene expression levels. Here we present a novel Tag-Seq protocol that uses the least biased methods for RNA library preparation combined with a novel approach for joint PCR template and sample labeling. In our protocol, input RNA is fragmented by hydrolysis, and poly(A)-bearing RNAs are selected and directly ligated to mixed DNA-RNA P5 adapters. The P5 adapters contain i5 barcodes composed of sample-specific (moderately) degenerate base regions (mDBRs), which later allow detection of PCR duplicates. The P7 adapter is attached via reverse transcription with individual i7 barcodes added during the amplification step. The resulting libraries can be sequenced on an Illumina sequencer. After sample demultiplexing and PCR duplicate removal with a free software tool we designed, the data are ready for downstream analysis. Our protocol was tested on RNA samples from predator-induced and control Daphnia microcrustaceans.
Application of the High Resolution Melting analysis for genetic mapping of Sequence Tagged Site markers in narrow-leafed lupin (Lupinus angustifolius L.).

PubMed

Kamel, Katarzyna A; Kroc, Magdalena; Święcicki, Wojciech

2015-01-01

Sequence tagged site (STS) markers are valuable tools for genetic and physical mapping that can be successfully used in comparative analyses among related species. Current challenges for molecular markers genotyping in plants include the lack of fast, sensitive and inexpensive methods suitable for sequence variant detection. In contrast, high resolution melting (HRM) is a simple and high-throughput assay, which has been widely applied in sequence polymorphism identification as well as in the studies of genetic variability and genotyping. The present study is the first attempt to use the HRM analysis to genotype STS markers in narrow-leafed lupin (Lupinus angustifolius L.). The sensitivity and utility of this method was confirmed by the sequence polymorphism detection based on melting curve profiles in the parental genotypes and progeny of the narrow-leafed lupin mapping population. Application of different approaches, including amplicon size and a simulated heterozygote analysis, has allowed for successful genetic mapping of 16 new STS markers in the narrow-leafed lupin genome.
TagDigger: user-friendly extraction of read counts from GBS and RAD-seq data.

PubMed

Clark, Lindsay V; Sacks, Erik J

2016-01-01

In genotyping-by-sequencing (GBS) and restriction site-associated DNA sequencing (RAD-seq), read depth is important for assessing the quality of genotype calls and estimating allele dosage in polyploids. However, existing pipelines for GBS and RAD-seq do not provide read counts in formats that are both accurate and easy to access. Additionally, although existing pipelines allow previously-mined SNPs to be genotyped on new samples, they do not allow the user to manually specify a subset of loci to examine. Pipelines that do not use a reference genome assign arbitrary names to SNPs, making meta-analysis across projects difficult. We created the software TagDigger, which includes three programs for analyzing GBS and RAD-seq data. The first script, tagdigger_interactive.py, rapidly extracts read counts and genotypes from FASTQ files using user-supplied sets of barcodes and tags. Input and output is in CSV format so that it can be opened by spreadsheet software. Tag sequences can also be imported from the Stacks, TASSEL-GBSv2, TASSEL-UNEAK, or pyRAD pipelines, and a separate file can be imported listing the names of markers to retain. A second script, tag_manager.py, consolidates marker names and sequences across multiple projects. A third script, barcode_splitter.py, assists with preparing FASTQ data for deposit in a public archive by splitting FASTQ files by barcode and generating MD5 checksums for the resulting files. TagDigger is open-source and freely available software written in Python 3. It uses a scalable, rapid search algorithm that can process over 100 million FASTQ reads per hour. TagDigger will run on a laptop with any operating system, does not consume hard drive space with intermediate files, and does not require programming skill to use.
N-terminal processing of affinity-tagged recombinant proteins purified by IMAC procedures.

PubMed

Mooney, Jane T; Fredericks, Dale P; Christensen, Thorkild; Bruun Schiødt, Christine; Hearn, Milton T W

2015-07-01

The ability of a new class of metal binding tags to facilitate the purification of recombinant proteins, exemplified by the tagged glutathione S-transferase and human growth hormone, from Escherichia coli fermentation broths and lysates has been further investigated. These histidine-containing tags exhibit high affinity for borderline metal ions chelated to the immobilised ligand, 1,4,7-triazacyclononane (tacn). The use of this tag-tacn immobilised metal ion affinity chromatography (IMAC) system engenders high selectivity with regard to host cell protein removal and permits facile tag removal from the E. coli-expressed recombinant protein. In particular, these tags were specifically designed to enable their efficient removal by the dipeptidyl aminopeptidase 1 (DAP-1), thus capturing the advantages of high substrate specificity and rates of cleavage. MALDI-TOF MS analysis of the cleaved products from the DAP-1 digestion of the recombinant N-terminally tagged proteins confirmed the complete removal of the tag within 4-12 h under mild experimental conditions. Overall, this study demonstrates that the use of tags specifically designed to target tacn-based IMAC resins offers a comprehensive and flexible approach for the purification of E. coli-expressed recombinant proteins, where complete removal of the tag is an essential prerequisite for subsequent application of the purified native proteins in studies aimed at delineating the molecular and cellular basis of specific biological processes. Copyright © 2015 John Wiley & Sons, Ltd.

De Novo Transcriptome Sequencing Reveals Important Molecular Networks and Metabolic Pathways of the Plant, Chlorophytum borivilianum

PubMed Central

Kalra, Shikha; Puniya, Bhanwar Lal; Kulshreshtha, Deepika; Kumar, Sunil; Kaur, Jagdeep; Ramachandran, Srinivasan; Singh, Kashmir

2013-01-01

Chlorophytum borivilianum, an endangered medicinal plant species is highly recognized for its aphrodisiac properties provided by saponins present in the plant. The transcriptome information of this species is limited and only few hundred expressed sequence tags (ESTs) are available in the public databases. To gain molecular insight of this plant, high throughput transcriptome sequencing of leaf RNA was carried out using Illumina's HiSeq 2000 sequencing platform. A total of 22,161,444 single end reads were retrieved after quality filtering. Available (e.g., De-Bruijn/Eulerian graph) and in-house developed bioinformatics tools were used for assembly and annotation of transcriptome. A total of 101,141 assembled transcripts were obtained, with coverage size of 22.42 Mb and average length of 221 bp. Guanine-cytosine (GC) content was found to be 44%. Bioinformatics analysis, using non-redundant proteins, gene ontology (GO), enzyme commission (EC) and kyoto encyclopedia of genes and genomes (KEGG) databases, extracted all the known enzymes involved in saponin and flavonoid biosynthesis. Few genes of the alkaloid biosynthesis, along with anticancer and plant defense genes, were also discovered. Additionally, several cytochrome P450 (CYP450) and glycosyltransferase unique sequences were also found. We identified simple sequence repeat motifs in transcripts with an abundance of di-nucleotide simple sequence repeat (SSR; 43.1%) markers. Large scale expression profiling through Reads per Kilobase per Million mapped reads (RPKM) showed major genes involved in different metabolic pathways of the plant. Genes, expressed sequence tags (ESTs) and unique sequences from this study provide an important resource for the scientific community, interested in the molecular genetics and functional genomics of C. borivilianum. PMID:24376689
De Novo transcriptome sequencing reveals important molecular networks and metabolic pathways of the plant, Chlorophytum borivilianum.

PubMed

Kalra, Shikha; Puniya, Bhanwar Lal; Kulshreshtha, Deepika; Kumar, Sunil; Kaur, Jagdeep; Ramachandran, Srinivasan; Singh, Kashmir

2013-01-01

Chlorophytum borivilianum, an endangered medicinal plant species is highly recognized for its aphrodisiac properties provided by saponins present in the plant. The transcriptome information of this species is limited and only few hundred expressed sequence tags (ESTs) are available in the public databases. To gain molecular insight of this plant, high throughput transcriptome sequencing of leaf RNA was carried out using Illumina's HiSeq 2000 sequencing platform. A total of 22,161,444 single end reads were retrieved after quality filtering. Available (e.g., De-Bruijn/Eulerian graph) and in-house developed bioinformatics tools were used for assembly and annotation of transcriptome. A total of 101,141 assembled transcripts were obtained, with coverage size of 22.42 Mb and average length of 221 bp. Guanine-cytosine (GC) content was found to be 44%. Bioinformatics analysis, using non-redundant proteins, gene ontology (GO), enzyme commission (EC) and kyoto encyclopedia of genes and genomes (KEGG) databases, extracted all the known enzymes involved in saponin and flavonoid biosynthesis. Few genes of the alkaloid biosynthesis, along with anticancer and plant defense genes, were also discovered. Additionally, several cytochrome P450 (CYP450) and glycosyltransferase unique sequences were also found. We identified simple sequence repeat motifs in transcripts with an abundance of di-nucleotide simple sequence repeat (SSR; 43.1%) markers. Large scale expression profiling through Reads per Kilobase per Million mapped reads (RPKM) showed major genes involved in different metabolic pathways of the plant. Genes, expressed sequence tags (ESTs) and unique sequences from this study provide an important resource for the scientific community, interested in the molecular genetics and functional genomics of C. borivilianum.
Tandem SUMO fusion vectors for improving soluble protein expression and purification.

PubMed

Guerrero, Fernando; Ciragan, Annika; Iwaï, Hideo

2015-12-01

Availability of highly purified proteins in quantity is crucial for detailed biochemical and structural investigations. Fusion tags are versatile tools to facilitate efficient protein purification and to improve soluble overexpression of proteins. Various purification and fusion tags have been widely used for overexpression in Escherichia coli. However, these tags might interfere with biological functions and/or structural investigations of the protein of interest. Therefore, an additional purification step to remove fusion tags by proteolytic digestion might be required. Here, we describe a set of new vectors in which yeast SUMO (SMT3) was used as the highly specific recognition sequence of ubiquitin-like protease 1, together with other commonly used solubility enhancing proteins, such as glutathione S-transferase, maltose binding protein, thioredoxin and trigger factor for optimizing soluble expression of protein of interest. This tandem SUMO (T-SUMO) fusion system was tested for soluble expression of the C-terminal domain of TonB from different organisms and for the antiviral protein scytovirin. Copyright © 2015 Elsevier Inc. All rights reserved.
Optimal use of tandem biotin and V5 tags in ChIP assays

PubMed Central

Kolodziej, Katarzyna E; Pourfarzad, Farzin; de Boer, Ernie; Krpic, Sanja; Grosveld, Frank; Strouboulis, John

2009-01-01

Background Chromatin immunoprecipitation (ChIP) assays coupled to genome arrays (Chip-on-chip) or massive parallel sequencing (ChIP-seq) lead to the genome wide identification of binding sites of chromatin associated proteins. However, the highly variable quality of antibodies and the availability of epitopes in crosslinked chromatin can compromise genomic ChIP outcomes. Epitope tags have often been used as more reliable alternatives. In addition, we have employed protein in vivo biotinylation tagging as a very high affinity alternative to antibodies. In this paper we describe the optimization of biotinylation tagging for ChIP and its coupling to a known epitope tag in providing a reliable and efficient alternative to antibodies. Results Using the biotin tagged erythroid transcription factor GATA-1 as example, we describe several optimization steps for the application of the high affinity biotin streptavidin system in ChIP. We find that the omission of SDS during sonication, the use of fish skin gelatin as blocking agent and choice of streptavidin beads can lead to significantly improved ChIP enrichments and lower background compared to antibodies. We also show that the V5 epitope tag performs equally well under the conditions worked out for streptavidin ChIP and that it may suffer less from the effects of formaldehyde crosslinking. Conclusion The combined use of the very high affinity biotin tag with the less sensitive to crosslinking V5 tag provides for a flexible ChIP platform with potential implications in ChIP sequencing outcomes. PMID:19196479
Rapid large-scale purification of myofilament proteins using a cleavable His6-tag

PubMed Central

Zhang, Mengjie; Martin, Jody L.; Kumar, Mohit; de Tombe, Pieter P.

2015-01-01

With the advent of high-throughput DNA sequencing, the number of identified cardiomyopathy-causing mutations has increased tremendously. As the majority of these mutations affect myofilament proteins, there is a need to understand their functional consequence on contraction. Permeabilized myofilament preparations coupled with protein exchange protocols are a common method for examining into contractile mechanics. However, producing large quantities of myofilament proteins can be time consuming and requires different approaches for each protein of interest. In the present study, we describe a unified automated method to produce troponin C, troponin T, and troponin I as well as myosin light chain 2 fused to a His6-tag followed by a tobacco etch virus (TEV) protease site. TEV protease has the advantage of a relaxed P1′ cleavage site specificity, allowing for no residues left after proteolysis and preservation of the native sequence of the protein of interest. After expression in Esherichia coli, cells were lysed by sonication in imidazole-containing buffer. The His6-tagged protein was then purified using a HisTrap nickel metal affinity column, and the His6-tag was removed by His6-TEV protease digestion for 4 h at 30°C. The protease was then removed using a HisTrap column, and complex assembly was performed via column-assisted sequential desalting. This mostly automated method allows for the purification of protein in 1 day and can be adapted to most soluble proteins. It has the advantage of greatly increasing yield while reducing the time and cost of purification. Therefore, production and purification of mutant proteins can be accelerated and functional data collected in a faster, less expensive manner. PMID:26386113
Application of Strep-Tactin XT for affinity purification of Twin-Strep-tagged CB2, a G protein-coupled cannabinoid receptor.

PubMed

Yeliseev, Alexei; Zoubak, Lioudmila; Schmidt, Thomas G M

2017-03-01

Human cannabinoid receptor CB 2 belongs to the class A of G protein-coupled receptor (GPCR). CB 2 is predominantly expressed in membranes of cells of immune origin and is implicated in regulation of metabolic pathways of inflammation, neurodegenerative disorders and pain sensing. High resolution structural studies of CB 2 require milligram quantities of purified, structurally intact protein. While we previously reported on the methodology for expression of the recombinant CB 2 and its stabilization in a functional state, here we describe an efficient protocol for purification of this protein using the Twin-Strep-tag/Strep-Tactin XT system. To improve the affinity of interaction of the recombinant CB 2 with the resin, the double repeat of the Strep-tag (a sequence of eight amino acids WSHPQFEK), named the Twin-Strep-tag was attached either to the N- or C-terminus of CB 2 via a short linker, and the recombinant protein was expressed in cytoplasmic membranes of E. coli as a fusion with the N-terminal maltose binding protein (MBP). The CB 2 was isolated at high purity from dilute solutions containing high concentrations of detergents, glycerol and salts, by capturing onto the Strep-Tactin XT resin, and was eluted from the resin under mild conditions upon addition of biotin. Surface plasmon resonance studies performed on the purified protein demonstrate the high affinity of interaction between the Twin-Strep-tag fused to the CB 2 and Strep-Tactin XT with an estimated Kd in the low nanomolar range. The affinity of binding did not vary significantly in response to the position of the tag at either N- or C-termini of the fusion. The binding capacity of the resin was several-fold higher for the tag located at the N-terminus of the protein as opposed to the C-terminus- or middle of the fusion. The variation in the length of the linker between the double repeats of the Strep-tag from 6 to 12 amino acid residues did not significantly affect the binding. The novel purification
Construction of an Ostrea edulis database from genomic and expressed sequence tags (ESTs) obtained from Bonamia ostreae infected haemocytes: Development of an immune-enriched oligo-microarray.

PubMed

Pardo, Belén G; Álvarez-Dios, José Antonio; Cao, Asunción; Ramilo, Andrea; Gómez-Tato, Antonio; Planas, Josep V; Villalba, Antonio; Martínez, Paulino

2016-12-01

The flat oyster, Ostrea edulis, is one of the main farmed oysters, not only in Europe but also in the United States and Canada. Bonamiosis due to the parasite Bonamia ostreae has been associated with high mortality episodes in this species. This parasite is an intracellular protozoan that infects haemocytes, the main cells involved in oyster defence. Due to the economical and ecological importance of flat oyster, genomic data are badly needed for genetic improvement of the species, but they are still very scarce. The objective of this study is to develop a sequence database, OedulisDB, with new genomic and transcriptomic resources, providing new data and convenient tools to improve our knowledge of the oyster's immune mechanisms. Transcriptomic and genomic sequences were obtained using 454 pyrosequencing and compiled into an O. edulis database, OedulisDB, consisting of two sets of 10,318 and 7159 unique sequences that represent the oyster's genome (WG) and de novo haemocyte transcriptome (HT), respectively. The flat oyster transcriptome was obtained from two strains (naïve and tolerant) challenged with B. ostreae, and from their corresponding non-challenged controls. Approximately 78.5% of 5619 HT unique sequences were successfully annotated by Blast search using public databases. A total of 984 sequences were identified as being related to immune response and several key immune genes were identified for the first time in flat oyster. Additionally, transcriptome information was used to design and validate the first oligo-microarray in flat oyster enriched with immune sequences from haemocytes. Our transcriptomic and genomic sequencing and subsequent annotation have largely increased the scarce resources available for this economically important species and have enabled us to develop an OedulisDB database and accompanying tools for gene expression analysis. This study represents the first attempt to characterize in depth the O. edulis haemocyte transcriptome in
A universal TagModule collection for parallel genetic analysis of microorganisms

PubMed Central

Oh, Julia; Fung, Eula; Price, Morgan N.; Dehal, Paramvir S.; Davis, Ronald W.; Giaever, Guri; Nislow, Corey; Arkin, Adam P.; Deutschbauer, Adam

2010-01-01

Systems-level analyses of non-model microorganisms are limited by the existence of numerous uncharacterized genes and a corresponding over-reliance on automated computational annotations. One solution to this challenge is to disrupt gene function using DNA tag technology, which has been highly successful in parallelizing reverse genetics in Saccharomyces cerevisiae and has led to discoveries in gene function, genetic interactions and drug mechanism of action. To extend the yeast DNA tag methodology to a wide variety of microorganisms and applications, we have created a universal, sequence-verified TagModule collection. A hallmark of the 4280 TagModules is that they are cloned into a Gateway entry vector, thus facilitating rapid transfer to any compatible genetic system. Here, we describe the application of the TagModules to rapidly generate tagged mutants by transposon mutagenesis in the metal-reducing bacterium Shewanella oneidensis MR-1 and the pathogenic yeast Candida albicans. Our results demonstrate the optimal hybridization properties of the TagModule collection, the flexibility in applying the strategy to diverse microorganisms and the biological insights that can be gained from fitness profiling tagged mutant collections. The publicly available TagModule collection is a platform-independent resource for the functional genomics of a wide range of microbial systems in the post-genome era. PMID:20494978
A tag-based approach for high-throughput analysis of CCWGG methylation.

PubMed

Denisova, Oksana V; Chernov, Andrei V; Koledachkina, Tatyana Y; Matvienko, Nicholas I

2007-10-15

Non-CpG methylation occurring in the context of CNG sequences is found in plants at a large number of genomic loci. However, there is still little information available about non-CpG methylation in mammals. Efficient methods that would allow detection of scarcely localized methylated sites in small quantities of DNA are required to elucidate the biological role of non-CpG methylation in both plants and animals. In this study, we tested a new whole genome approach to identify sites of CCWGG methylation (W is A or T), a particular case of CNG methylation, in genomic DNA. This technique is based on digestion of DNAs with methylation-sensitive restriction endonucleases EcoRII-C and AjnI. Short DNAs flanking methylated CCWGG sites (tags) are selectively purified and assembled in tandem arrays of up to nine tags. This allows high-throughput sequencing of tags, identification of flanking regions, and their exact positions in the genome. In this study, we tested specificity and efficiency of the approach.
Sequences 5' to translation start regulate expression of petunia rbcS genes.

PubMed Central

Dean, C; Favreau, M; Bedbrook, J; Dunsmuir, P

1989-01-01

The promoter sequences that contribute to quantitative differences in expression of the petunia genes (rbcS) encoding the small subunit of ribulose bisphosphate carboxylase have been characterized. The promoter regions of the two most abundantly expressed petunia rbcS genes, SSU301 and SSU611, show sequence similarity not present in other rbcS genes. We investigated the significance of these and other sequences by adding specific regions from the SSU301 promoter (the most strongly expressed gene) to equivalent regions in the SSU911 promoter (the least strongly expressed gene) and assaying the expression of the fusions in transgenic tobacco plants. In this way, we characterized an SSU301 promoter region (either from -285 to -178 or -291 to -204) which, when added to SSU911, in either orientation, increased SSU911 expression 25-fold. This increase was equivalent to that caused by addition of the entire SSU301 5'-flanking region. Replacement of SSU911 promoter sequences between -198 and the start codon with sequences from the equivalent region of SSU301 did not increase SSU911 expression significantly. The -291 to -204 SSU301 promoter fragment contributes significantly to quantitative differences in expression between the petunia rbcS genes. PMID:2535543
Genome-wide identification, classification, and expression analysis of the arabinogalactan protein gene family in rice (Oryza sativa L.)

PubMed Central

Zhao, Jie

2010-01-01

Arabinogalactan proteins (AGPs) comprise a family of hydroxyproline-rich glycoproteins that are implicated in plant growth and development. In this study, 69 AGPs are identified from the rice genome, including 13 classical AGPs, 15 arabinogalactan (AG) peptides, three non-classical AGPs, three early nodulin-like AGPs (eNod-like AGPs), eight non-specific lipid transfer protein-like AGPs (nsLTP-like AGPs), and 27 fasciclin-like AGPs (FLAs). The results from expressed sequence tags, microarrays, and massively parallel signature sequencing tags are used to analyse the expression of AGP-encoding genes, which is confirmed by real-time PCR. The results reveal that several rice AGP-encoding genes are predominantly expressed in anthers and display differential expression patterns in response to abscisic acid, gibberellic acid, and abiotic stresses. Based on the results obtained from this analysis, an attempt has been made to link the protein structures and expression patterns of rice AGP-encoding genes to their functions. Taken together, the genome-wide identification and expression analysis of the rice AGP gene family might facilitate further functional studies of rice AGPs. PMID:20423940
A simple and effective strategy for solving the problem of inclusion bodies in recombinant protein technology: His-tag deletions enhance soluble expression.

PubMed

Zhu, Shaozhou; Gong, Cuiyu; Ren, Lu; Li, Xingzhou; Song, Dawei; Zheng, Guojun

2013-01-01

The formation of inclusion bodies (IBs) in recombinant protein biotechnology has become one of the most frequent undesirable occurrences in both research and industrial applications. So far, the pET System is the most powerful system developed for the production of recombinant proteins when Escherichia coli is used as the microbial cell factory. Also, using fusion tags to facilitate detection and purification of the target protein is a commonly used tactic. However, there is still a large fraction of proteins that cannot be produced in E. coli in a soluble (and hence functional) form. Intensive research efforts have tried to address this issue, and numerous parameters have been modulated to avoid the formation of inclusion bodies. However, hardly anyone has noticed that adding fusion tags to the recombinant protein to facilitate purification is a key factor that affects the formation of inclusion bodies. To test this idea, the industrial biocatalysts uridine phosphorylase from Aeropyrum pernix K1 and (+)-γ-lactamase and (-)-γ-lactamase from Bradyrhizobium japonicum USDA 6 were expressed in E. coli by using the pET System and then examined. We found that using a histidine tag as a fusion partner for protein expression did affect the formation of inclusion bodies in these examples, suggesting that removing the fusion tag can promote the solubility of heterologous proteins. The production of soluble and highly active uridine phosphorylase, (+)-γ-lactamase, and (-)-γ-lactamase in our results shows that the traditional process needs to be reconsidered. Accordingly, a simple and efficient structure-based strategy for the production of valuable soluble recombinant proteins in E. coli is proposed.
Bag3-Induced Autophagy Is Associated with Degradation of JCV Oncoprotein, T-Ag

PubMed Central

Sariyer, Ilker Kudret; Merabova, Nana; Patel, Prem Kumer; Knezevic, Tijana; Rosati, Alessandra; Turco, Maria C.; Khalili, Kamel

2012-01-01

JC virus, JCV, is a human neurotropic polyomavirus whose replication in glial cells causes the fatal demyelinating disease progressive multifocal leukoencephalopathy (PML). In addition, JCV possesses oncogenic activity and expression of its transforming protein, large T-antigen (T-Ag), in several experimental animals induces tumors of neural origin. Further, the presence of JCV DNA and T-Ag have been repeatedly observed in several human malignant tissues including primitive neuroectodermal tumors and glioblastomas. Earlier studies have demonstrated that Bag3, a member of the Bcl-2-associated athanogene (Bag) family of proteins, which is implicated in autophagy and apoptosis, is downregulated upon JCV infection of glial cells and that JCV T-Ag is responsible for suppressing the activity of the BAG3 promoter. Here, we investigated the possible impact of Bag3 on T-Ag expression in JCV-infected human primary glial cells as well as in cells derived from T-Ag-induced medulloblastoma in transgenic animals. Results from these studies revealed that overexpression of Bag3 drastically decreases the level of T-Ag expression by inducing the autophagic degradation of the viral protein. Interestingly, this event leads to the inhibition of JCV infection of glial cells, suggesting that the reduced levels of T-antigen seen upon the overexpression of Bag3 has a biological impact on the viral lytic cycle. Results from protein-protein interaction studies showed that T-Ag and Bag3 physically interact with each other through the zinc-finger of T-Ag and the proline rich domains of Bag3, and this interaction is important for the autophagic degradation of T-Ag. Our observations open a new avenue of research for better understanding of virus-host interaction by investigating the interplay between T-Ag and Bag3, and their impact on the development of JCV-associated diseases. PMID:22984599
Bag3-induced autophagy is associated with degradation of JCV oncoprotein, T-Ag.

PubMed

Sariyer, Ilker Kudret; Merabova, Nana; Patel, Prem Kumer; Knezevic, Tijana; Rosati, Alessandra; Turco, Maria C; Khalili, Kamel

2012-01-01

JC virus, JCV, is a human neurotropic polyomavirus whose replication in glial cells causes the fatal demyelinating disease progressive multifocal leukoencephalopathy (PML). In addition, JCV possesses oncogenic activity and expression of its transforming protein, large T-antigen (T-Ag), in several experimental animals induces tumors of neural origin. Further, the presence of JCV DNA and T-Ag have been repeatedly observed in several human malignant tissues including primitive neuroectodermal tumors and glioblastomas. Earlier studies have demonstrated that Bag3, a member of the Bcl-2-associated athanogene (Bag) family of proteins, which is implicated in autophagy and apoptosis, is downregulated upon JCV infection of glial cells and that JCV T-Ag is responsible for suppressing the activity of the BAG3 promoter. Here, we investigated the possible impact of Bag3 on T-Ag expression in JCV-infected human primary glial cells as well as in cells derived from T-Ag-induced medulloblastoma in transgenic animals. Results from these studies revealed that overexpression of Bag3 drastically decreases the level of T-Ag expression by inducing the autophagic degradation of the viral protein. Interestingly, this event leads to the inhibition of JCV infection of glial cells, suggesting that the reduced levels of T-antigen seen upon the overexpression of Bag3 has a biological impact on the viral lytic cycle. Results from protein-protein interaction studies showed that T-Ag and Bag3 physically interact with each other through the zinc-finger of T-Ag and the proline rich domains of Bag3, and this interaction is important for the autophagic degradation of T-Ag. Our observations open a new avenue of research for better understanding of virus-host interaction by investigating the interplay between T-Ag and Bag3, and their impact on the development of JCV-associated diseases.
Radio tag retention and tag-related mortality among adult sockeye salmon

USGS Publications Warehouse

Ramstad, Kristina M.; Woody, Carol Ann

2003-01-01

Tag retention and tag-related mortality are concerns for any tagging study but are rarely estimated. We assessed retention and mortality rates for esophageal radio tag implants in adult sockeye salmon Oncorhynchus nerka. Migrating sockeye salmon captured at the outlet of Lake Clark, Alaska, were implanted with one of four different radio tags (14.5 × 43 mm (diameter × length), 14.5 × 49 mm, 16 × 46 mm, and 19 × 51 mm). Fish were observed for 15 to 35 d after tagging to determine retention and mortality rates. The overall tag retention rate was high (0.98; 95% confidence interval (CI), 0.92-1.00; minimum, 33 d), with one loss of a 19-mm × 51- mm tag. Mortality of tagged sockeye salmon (0.02; 95% CI, 0-0.08) was similar to that of untagged controls (0.03 (0-0.15)). Sockeye salmon with body lengths (mid-eye to tail fork) of 585-649 mm retained tags as large as 19 × 51 mm and those with body lengths of 499-628 mm retained tags as small as 14.5 × 43 mm for a minimum of 33 d with no increase in mortality. The tags used in this study represent a suite of radio tags that vary in size, operational life, and cost but that are effective in tracking adult anadromous salmon with little tag loss or increase in fish mortality.
Sequence analysis of diacylglycerol acyltransferases

USDA-ARS?s Scientific Manuscript database

Diacylglycerol acyltransferases (DGATs) catalyze the final step of triacylglycerol (TAG) biosynthesis in eukaryotes. DGATs esterify sn-1,2-diacylglycerol with a long-chain fatty acyl-CoA. Plants and animals deficient in DGATs accumulate less TAG and over-expression of DGATs increases TAG. DGAT knock...
Molecular cloning, sequence characterization and recombinant expression of Nanog gene in goat fibroblast cells using lentiviral based expression system.

PubMed

Singhal, Dinesh K; Singhal, Raxita; Malik, Hruda N; Kumar, Surender; Kumar, Sudarshan; Mohanty, Ashok K; Kaushik, Jai K; Malakar, Dhruba

2014-01-01

Nanog is a homeodomain containing protein which plays important roles in regulation of signaling pathways for maintenance and induction of pluripotency in stem cells. Because of its unique expression in stem cells it is also regarded as pluripotency marker. In this study goat Nanog (gNanog) gene has been amplified, cloned and characterized at sequence level with successful over-expression in CHO-K1 cell line using a lentiviral based system. gNanog ORF is 903 bp long which codes for Nanog protein of size 300 amino acids (aas). Complete nucleotide sequence shows some evolutionary mutation in goat in comparision to other species. Protein sequence of goat is highly similar to other species. Overall, gNanog nucleotide sequence and predicted protein sequence showed high similarity and minimum divergence with cattle (96 % identity/4 % divergence) and buffalo (94/5 %) while low similarity and high divergence with pig (84/15 %), human (81/23 %) and mouse (69/40 %) indicating evolutionary closeness of gNanog to cattle and buffalo. gNanog lentiviral expression construct was prepared for over-expression of Nanog gene in adult goat fibroblast cells. Lentiviral expression construct of Nanog enabled continuous protein expression for induction and maintenance of pluripotency. Western blotting revealed the expression of Nanog gene at protein level which supported that the lentiviral expression system is highly promising for Nanog protein expression in differentiated goat cell.
Expression, purification, and functional analysis of the C-terminal domain of Herbaspirillum seropedicae NifA protein.

PubMed

Monteiro, Rose A; Souza, Emanuel M; Geoffrey Yates, M; Steffens, M Berenice R; Pedrosa, Fábio O; Chubatsu, Leda S

2003-02-01

The Herbaspirillum seropedicae NifA protein is responsible for nif gene expression. The C-terminal domain of the H. seropedicae NifA protein, fused to a His-Tag sequence (His-Tag-C-terminal), was over-expressed and purified by metal-affinity chromatography to yield a highly purified and active protein. Band-shift assays showed that the NifA His-Tag-C-terminal bound specifically to the H. seropedicae nifB promoter region in vitro. In vivo analysis showed that this protein inhibited the Central + C-terminal domains of NifA protein from activating the nifH promoter of K. pneumoniae in Escherichia coli, indicating that the protein must be bound to the NifA-binding site (UAS site) at the nifH promoter region to activate transcription. Copyright 2002 Elsevier Science (USA)
Extracting tag hierarchies.

PubMed

Tibély, Gergely; Pollner, Péter; Vicsek, Tamás; Palla, Gergely

2013-01-01

Tagging items with descriptive annotations or keywords is a very natural way to compress and highlight information about the properties of the given entity. Over the years several methods have been proposed for extracting a hierarchy between the tags for systems with a "flat", egalitarian organization of the tags, which is very common when the tags correspond to free words given by numerous independent people. Here we present a complete framework for automated tag hierarchy extraction based on tag occurrence statistics. Along with proposing new algorithms, we are also introducing different quality measures enabling the detailed comparison of competing approaches from different aspects. Furthermore, we set up a synthetic, computer generated benchmark providing a versatile tool for testing, with a couple of tunable parameters capable of generating a wide range of test beds. Beside the computer generated input we also use real data in our studies, including a biological example with a pre-defined hierarchy between the tags. The encouraging similarity between the pre-defined and reconstructed hierarchy, as well as the seemingly meaningful hierarchies obtained for other real systems indicate that tag hierarchy extraction is a very promising direction for further research with a great potential for practical applications. Tags have become very prevalent nowadays in various online platforms ranging from blogs through scientific publications to protein databases. Furthermore, tagging systems dedicated for voluntary tagging of photos, films, books, etc. with free words are also becoming popular. The emerging large collections of tags associated with different objects are often referred to as folksonomies, highlighting their collaborative origin and the "flat" organization of the tags opposed to traditional hierarchical categorization. Adding a tag hierarchy corresponding to a given folksonomy can very effectively help narrowing or broadening the scope of search. Moreover
Extracting Tag Hierarchies

PubMed Central

Tibély, Gergely; Pollner, Péter; Vicsek, Tamás; Palla, Gergely

2013-01-01

Tagging items with descriptive annotations or keywords is a very natural way to compress and highlight information about the properties of the given entity. Over the years several methods have been proposed for extracting a hierarchy between the tags for systems with a "flat", egalitarian organization of the tags, which is very common when the tags correspond to free words given by numerous independent people. Here we present a complete framework for automated tag hierarchy extraction based on tag occurrence statistics. Along with proposing new algorithms, we are also introducing different quality measures enabling the detailed comparison of competing approaches from different aspects. Furthermore, we set up a synthetic, computer generated benchmark providing a versatile tool for testing, with a couple of tunable parameters capable of generating a wide range of test beds. Beside the computer generated input we also use real data in our studies, including a biological example with a pre-defined hierarchy between the tags. The encouraging similarity between the pre-defined and reconstructed hierarchy, as well as the seemingly meaningful hierarchies obtained for other real systems indicate that tag hierarchy extraction is a very promising direction for further research with a great potential for practical applications. Tags have become very prevalent nowadays in various online platforms ranging from blogs through scientific publications to protein databases. Furthermore, tagging systems dedicated for voluntary tagging of photos, films, books, etc. with free words are also becoming popular. The emerging large collections of tags associated with different objects are often referred to as folksonomies, highlighting their collaborative origin and the “flat” organization of the tags opposed to traditional hierarchical categorization. Adding a tag hierarchy corresponding to a given folksonomy can very effectively help narrowing or broadening the scope of search

Activation tagging in indica rice identifies ribosomal proteins as potential targets for manipulation of water-use efficiency and abiotic stress tolerance in plants.

PubMed

Moin, Mazahar; Bakshi, Achala; Saha, Anusree; Udaya Kumar, M; Reddy, Attipalli R; Rao, K V; Siddiq, E A; Kirti, P B

2016-11-01

We have generated 3900 enhancer-based activation-tagged plants, in addition to 1030 stable Dissociator-enhancer plants in a widely cultivated indica rice variety, BPT-5204. Of them, 3000 were screened for water-use efficiency (WUE) by analysing photosynthetic quantum efficiency and yield-related attributes under water-limiting conditions that identified 200 activation-tagged mutants, which were analysed for flanking sequences at the site of enhancer integration in the genome. We have further selected five plants with low Δ 13 C, high quantum efficiency and increased plant yield compared with wild type for a detailed investigation. Expression studies of 18 genes in these mutants revealed that in four plants one of the three to four tagged genes became activated, while two genes were concurrently up-regulated in the fifth plant. Two genes coding for proteins involved in 60S ribosomal assembly, RPL6 and RPL23A, were among those that became activated by enhancers. Quantitative expression analysis of these two genes also corroborated the results on activating-tagging. The high up-regulation of RPL6 and RPL23A in various stress treatments and the presence of significant cis-regulatory elements in their promoter regions along with the high up-regulation of several of RPL genes in various stress treatments indicate that they are potential targets for manipulating WUE/abiotic stress tolerance. © 2016 John Wiley & Sons Ltd.
Isolation, sequencing and expression of RED, a novel human gene encoding an acidic-basic dipeptide repeat.

PubMed

Assier, E; Bouzinba-Segard, H; Stolzenberg, M C; Stephens, R; Bardos, J; Freemont, P; Charron, D; Trowsdale, J; Rich, T

1999-04-16

A novel human gene RED, and the murine homologue, MuRED, were cloned. These genes were named after the extensive stretch of alternating arginine (R) and glutamic acid (E) or aspartic acid (D) residues that they contain. We term this the 'RED' repeat. The genes of both species were expressed in a wide range of tissues and we have mapped the human gene to chromosome 5q22-24. MuRED and RED shared 98% sequence identity at the amino acid level. The open reading frame of both genes encodes a 557 amino acid protein. RED fused to a fluorescent tag was expressed in nuclei of transfected cells and localised to nuclear dots. Co-localisation studies showed that these nuclear dots did not contain either PML or Coilin, which are commonly found in the POD or coiled body nuclear compartments. Deletion of the amino terminal 265 amino acids resulted in a failure to sort efficiently to the nucleus, though nuclear dots were formed. Deletion of a further 50 amino acids from the amino terminus generates a protein that can sort to the nucleus but is unable to generate nuclear dots. Neither construct localised to the nucleolus. The characteristics of RED and its nuclear localisation implicate it as a regulatory protein, possibly involved in transcription.
Method to produce acetyldiacylglycerols (ac-TAGs) by expression of an acetyltransferase gene isolated from Euonymus alatus (burning bush)

DOEpatents

Durrett, Timothy; Ohlrogge, John; Pollard, Michael

2016-05-03

The present invention relates to novel diacylglycerol acyltransferase genes and proteins, and methods of their use. In particular, the invention describes genes encoding proteins having diacylglycerol acetyltransferase activity, specifically for transferring an acetyl group to a diacylglycerol substrate to form acetyl-Triacylglycerols (ac-TAGS), for example, a 3-acetyl-1,2-diacyl-sn-glycerol. The present invention encompasses both native and recombinant wild-type forms of the transferase, as well as mutants and variant forms. The present invention also relates to methods of using novel diacylglycerol acyltransferase genes and proteins, including their expression in transgenic organisms at commercially viable levels, for increasing production of 3-acetyl-1,2-diacyl-sn-glycerols in plant oils and altering the composition of oils produced by microorganisms, such as yeast, by increasing ac-TAG production. Additionally, oils produced by methods of the present inventions comprising genes and proteins are contemplated for use as biodiesel fuel, in polymer production and as naturally produced food oils with reduced calories.
In vivo phosphorylation of a peptide tag for protein purification.

PubMed

Goux, Marine; Fateh, Amina; Defontaine, Alain; Cinier, Mathieu; Tellier, Charles

2016-05-01

To design a new system for the in vivo phosphorylation of proteins in Escherichia coli using the co-expression of the α-subunit of casein kinase II (CKIIα) and a target protein, (Nanofitin) fused with a phosphorylatable tag. The level of the co-expressed CKIIα was controlled by the arabinose promoter and optimal phosphorylation was obtained with 2 % (w/v) arabinose as inductor. The effectiveness of the phosphorylation system was demonstrated by electrophoretic mobility shift assay (NUT-PAGE) and staining with a specific phosphoprotein-staining gel. The resulting phosphorylated tag was also used to purify the phosphoprotein by immobilized metal affinity chromatography, which relies on the specific interaction of phosphate moieties with Fe(III). The use of a single tag for both the purification and protein array anchoring provides a simple and straightforward system for protein analysis.
Shark Tagging Activities.

ERIC Educational Resources Information Center

Current: The Journal of Marine Education, 1998

1998-01-01

In this group activity, children learn about the purpose of tagging and how scientists tag a shark. Using a cut-out of a shark, students identify, measure, record data, read coordinates, and tag a shark. Includes introductory information about the purpose of tagging and the procedure, a data sheet showing original tagging data from Tampa Bay, and…
Characterization by Suppression Subtractive Hybridization of Transcripts That Are Differentially Expressed in Leaves of Anthracnose-Resistant Ramie Cultivar.

PubMed

Xuxia, Wang; Jie, Chen; Bo, Wang; Lijun, Liu; Hui, Jiang; Diluo, Tang; Dingxiang, Peng

2012-01-01

For the purpose of screening putative anthracnose resistance-related genes of ramie ( Boehmeria nivea L. Gaud), a cDNA library was constructed by suppression subtractive hybridization using anthracnose-resistant cultivar Huazhu no. 4. The cDNAs from Huazhu no. 4, which were infected with Colletotrichum gloeosporioides , were used as the tester and cDNAs from uninfected Huazhu no. 4 as the driver. Sequencing analysis and homology searching showed that these clones represented 132 single genes, which were assigned to functional categories, including 14 putative cellular functions, according to categories established for Arabidopsis . These 132 genes included 35 disease resistance and stress tolerance-related genes including putative heat-shock protein 90, metallothionein, PR-1.2 protein, catalase gene, WRKY family genes, and proteinase inhibitor-like protein. Partial disease-related genes were further analyzed by reverse transcription PCR and RNA gel blot. These expressed sequence tags are the first anthracnose resistance-related expressed sequence tags reported in ramie.
Cloning, sequencing, and expression of cDNA for human. beta. -glucuronidase

DOE Office of Scientific and Technical Information (OSTI.GOV)

Oshima, A.; Kyle, J.W.; Miller, R.D.

1987-02-01

The authors report here the cDNA sequence for human placental ..beta..-glucuronidase (..beta..-D-glucuronoside glucuronosohydrolase, EC 3.2.1.31) and demonstrate expression of the human enzyme in transfected COS cells. They also sequenced a partial cDNA clone from human fibroblasts that contained a 153-base-pair deletion within the coding sequence and found a second type of cDNA clone from placenta that contained the same deletion. Nuclease S1 mapping studies demonstrated two types of mRNAs in human placenta that corresponded to the two types of cDNA clones isolated. The NH/sub 2/-terminal amino acid sequence determined for human spleen ..beta..-glucuronidase agreed with that inferred from the DNAmore » sequence of the two placental clones, beginning at amino acid 23, suggesting a cleaved signal sequence of 22 amino acids. When transfected into COS cells, plasmids containing either placental clone expressed an immunoprecipitable protein that contained N-linked oligosaccharides as evidenced by sensitivity to endoglycosidase F. However, only transfection with the clone containing the 153-base-pair segment led to expression of human ..beta..-glucuronidase activity. These studies provide the sequence for the full-length cDNA for human ..beta..-glucuronidase, demonstrate the existence of two populations of mRNA for ..beta..-glucuronidase in human placenta, only one of which specifies a catalytically active enzyme, and illustrate the importance of expression studies in verifying that a cDNA is functionally full-length.« less
Quantum tagging for tags containing secret classical data

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kent, Adrian

Various authors have considered schemes for quantum tagging, that is, authenticating the classical location of a classical tagging device by sending and receiving quantum signals from suitably located distant sites, in an environment controlled by an adversary whose quantum information processing and transmitting power is potentially unbounded. All of the schemes proposed elsewhere in the literature assume that the adversary is able to inspect the interior of the tagging device. All of these schemes have been shown to be breakable if the adversary has unbounded predistributed entanglement. We consider here the case in which the tagging device contains a finitemore » key string shared with distant sites but kept secret from the adversary, and show this allows the location of the tagging device to be authenticated securely and indefinitely. Our protocol relies on quantum key distribution between the tagging device and at least one distant site, and demonstrates a new practical application of quantum key distribution. It also illustrates that the attainable security in position-based cryptography can depend crucially on apparently subtle details in the security scenario considered.« less
Rapid large-scale purification of myofilament proteins using a cleavable His6-tag.

PubMed

Zhang, Mengjie; Martin, Jody L; Kumar, Mohit; Khairallah, Ramzi J; de Tombe, Pieter P

2015-11-01

With the advent of high-throughput DNA sequencing, the number of identified cardiomyopathy-causing mutations has increased tremendously. As the majority of these mutations affect myofilament proteins, there is a need to understand their functional consequence on contraction. Permeabilized myofilament preparations coupled with protein exchange protocols are a common method for examining into contractile mechanics. However, producing large quantities of myofilament proteins can be time consuming and requires different approaches for each protein of interest. In the present study, we describe a unified automated method to produce troponin C, troponin T, and troponin I as well as myosin light chain 2 fused to a His6-tag followed by a tobacco etch virus (TEV) protease site. TEV protease has the advantage of a relaxed P1' cleavage site specificity, allowing for no residues left after proteolysis and preservation of the native sequence of the protein of interest. After expression in Esherichia coli, cells were lysed by sonication in imidazole-containing buffer. The His6-tagged protein was then purified using a HisTrap nickel metal affinity column, and the His6-tag was removed by His6-TEV protease digestion for 4 h at 30°C. The protease was then removed using a HisTrap column, and complex assembly was performed via column-assisted sequential desalting. This mostly automated method allows for the purification of protein in 1 day and can be adapted to most soluble proteins. It has the advantage of greatly increasing yield while reducing the time and cost of purification. Therefore, production and purification of mutant proteins can be accelerated and functional data collected in a faster, less expensive manner. Copyright © 2015 the American Physiological Society.
Systematic Localization and Identification of SUMOylation Substrates in Knock-In Mice Expressing Affinity-Tagged SUMO1.

PubMed

Tirard, Marilyn; Brose, Nils

2016-01-01

Protein SUMOylation is a posttranslational protein modification that is emerging as a key regulatory process in neurobiology. To date, however, SUMOylation in vivo has only been studied cursorily. Knock-in mice expressing His6-HA-SUMO1 from the Sumo1 locus allow for the highly specific localization and identification of endogenous SUMO1 substrates under physiological and pathophysiological conditions. By making use of the HA-tag and using wild-type mice for highly stringent negative control samples, SUMO1 targets can be specifically localized in and purified from cultured mouse nerve cells and mouse tissues.
Changes in rat spinal cord gene expression after inflammatory hyperalgesia of the joint and manual therapy.

PubMed

Ruhlen, Rachel L; Singh, Vineet K; Pazdernik, Vanessa K; Towns, Lex C; Snider, Eric J; Sargentini, Neil J; Degenhardt, Brian F

2014-10-01

Mobilization of a joint affects local tissue directly but may also have other effects that are mediated through the central nervous system. To identify differential gene expression in the spinal cords of rats with or without inflammatory joint injury after manual therapy or no treatment. Rats were randomly assigned to 1 of 4 treatment groups: no injury and no touch (NI/NT), injury and no touch (I/NT), no injury and manual therapy (NI/MT), and injury and manual therapy (I/MT). We induced acute inflammatory joint injury in the rats by injecting carrageenan into an ankle. Rats in the no-injury groups did not receive carrageenan injection. One day after injury, rats received manual therapy to the knee of the injured limb. Rats in the no-touch groups were anesthetized without receiving manual therapy. Spinal cords were harvested 30 minutes after therapy or no touch, and spinal cord gene expression was analyzed by microarray for 3 comparisons: NI/NT vs I/NT, I/MT vs I/NT, and NI/NT vs NI/MT. Three rats were assigned to each group. Of 38,875 expressed sequence tags, 755 were differentially expressed in the NI/NT vs I/NT comparison. For the other comparisons, no expressed sequence tags were differentially expressed. Cluster analysis revealed that the differentially expressed sequence tags were over-represented in several categories, including ion homeostasis (enrichment score, 2.29), transmembrane (enrichment score, 1.55), and disulfide bond (enrichment score, 2.04). An inflammatory injury to the ankle of rats caused differential expression of genes in the spinal cord. Consistent with other studies, genes involved in ion transport were among those affected. However, manual therapy to the knees of injured limbs or to rats without injury did not alter gene expression in the spinal cord. Thus, evidence for central nervous system mediation of manual therapy was not observed. © 2014 The American Osteopathic Association.
A statistical method for assessing peptide identification confidence in accurate mass and time tag proteomics

PubMed Central

Stanley, Jeffrey R.; Adkins, Joshua N.; Slysz, Gordon W.; Monroe, Matthew E.; Purvine, Samuel O.; Karpievitch, Yuliya V.; Anderson, Gordon A.; Smith, Richard D.; Dabney, Alan R.

2011-01-01

Current algorithms for quantifying peptide identification confidence in the accurate mass and time (AMT) tag approach assume that the AMT tags themselves have been correctly identified. However, there is uncertainty in the identification of AMT tags, as this is based on matching LC-MS/MS fragmentation spectra to peptide sequences. In this paper, we incorporate confidence measures for the AMT tag identifications into the calculation of probabilities for correct matches to an AMT tag database, resulting in a more accurate overall measure of identification confidence for the AMT tag approach. The method is referred to as Statistical Tools for AMT tag Confidence (STAC). STAC additionally provides a Uniqueness Probability (UP) to help distinguish between multiple matches to an AMT tag and a method to calculate an overall false discovery rate (FDR). STAC is freely available for download as both a command line and a Windows graphical application. PMID:21692516
Comparative Performance of Acoustic-tagged and PIT-tagged Juvenile Salmonids

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hockersmith, Eric E.; Brown, Richard S.; Liedtke, Theresa L.

2008-02-01

Numerous research tools and technologies are currently being used to evaluate fish passage and survival to determine the impacts of the Federal Columbia River Power System (FCRPS) on endangered and threatened juvenile salmonids, including PIT tags, balloon tags, hydroacoustic evaluations, radio telemetry, and acoustic telemetry. Each has advantages and disadvantages, but options are restricted in some situations because of limited capabilities of a specific technology, lack of detection capability downstream, or availability of adequate numbers of fish. However, there remains concern about the comparative effects of the tag or the tagging procedure on fish performance. The recently developed Juvenile Salmonidmore » Acoustic Telemetry System (JSATS) acoustic transmitter is the smallest active acoustic tag currently available. The goal of this study was to determine whether fish tagged with the JSATS acoustic-telemetry tag can provide unbiased estimates of passage behavior and survival within the performance life of the tag. We conducted both field and laboratory studies to assess tag effects. For the field evaluation we released a total of 996 acoustic-tagged fish in conjunction with 21,026 PIT-tagged fish into the tailrace of Lower Granite Dam on 6 and 13 May. Travel times between release and downstream dams were not significantly different for the majority of the reaches between acoustic-tagged and PIT-tagged fish. In addition to the field evaluation, a series of laboratory experiments were conducted to determine if growth and survival of juvenile Chinook salmon surgically implanted with acoustic transmitters is different than untagged or PIT tagged juvenile Chinook salmon. Only yearling fish with integrated and non-integrated transmitters experienced mortalities, and these were low (<4.5%). Mortality among sub-yearling control and PIT-tag treatments ranged up to 7.7% while integrated and non-integrated treatments had slightly higher rates (up to 8.3% and 7
Probing the potential of CnaB-type domains for the design of tag/catcher systems

PubMed Central

Pröschel, Marlene; Kraner, Max E.; Horn, Anselm H. C.; Schäfer, Lena; Sonnewald, Uwe

2017-01-01

Building proteins into larger, post-translational assemblies in a defined and stable way is still a challenging task. A promising approach relies on so-called tag/catcher systems that are fused to the proteins of interest and allow a durable linkage via covalent intermolecular bonds. Tags and catchers are generated by splitting protein domains that contain intramolecular isopeptide or ester bonds that form autocatalytically under physiological conditions. There are already numerous biotechnological and medical applications that demonstrate the usefulness of covalent linkages mediated by these systems. Additional covalent tag/catcher systems would allow creating more complex and ultra-stable protein architectures and networks. Two of the presently available tag/catcher systems were derived from closely related CnaB-domains of Streptococcus pyogenes and Streptococcus dysgalactiae proteins. However, it is unclear whether domain splitting is generally tolerated within the CnaB-family or only by a small subset of these domains. To address this point, we have selected a set of four CnaB domains of low sequence similarity and characterized the resulting tag/catcher systems by computational and experimental methods. Experimental testing for intermolecular isopeptide bond formation demonstrated two of the four systems to be functional. For these two systems length and sequence variations of the peptide tags were investigated revealing only a relatively small effect on the efficiency of the reaction. Our study suggests that splitting into tag and catcher moieties is tolerated by a significant portion of the naturally occurring CnaB-domains, thus providing a large reservoir for the design of novel tag/catcher systems. PMID:28654665
Separation efficiency of free-solution conjugated electrophoresis with drag-tags incorporating a synthetic amino acid.

PubMed

Seo, Kyung-Ho; Chu, Hun-Su; Yoo, Tae Hyeon; Lee, Sun-Gu; Won, Jong-In

2016-03-01

DNA sequencing or separation by conventional capillary electrophoresis with a polymer matrix has some inherent drawbacks, such as the expense of polymer matrix and limitations in sequencing read length. As DNA fragments have a linear charge-to-friction ratio in free solution, DNA fragments cannot be separated by size. However, size-based separation of DNA is possible in free-solution conjugate electrophoresis (FSCE) if a "drag-tag" is attached to DNA fragments because the tag breaks the linear charge-to-friction scaling. Although several previous studies have demonstrated the feasibility of DNA separation by free-solution conjugated electrophoresis, generation of a monodisperse drag-tag and identification of a strong, site-specific conjugation method between a DNA fragment and a drag-tag are challenges that still remain. In this study, we demonstrate an efficient FSCE method by conjugating a biologically synthesized elastin-like polypeptide (ELP) and green fluorescent protein (GFP) to DNA fragments. In addition, to produce strong and site-specific conjugation, a methionine residue in drag-tags is replaced with homopropargylglycine (Hpg), which can be conjugated specifically to a DNA fragment with an azide site. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Ontologies and tag-statistics

NASA Astrophysics Data System (ADS)

Tibély, Gergely; Pollner, Péter; Vicsek, Tamás; Palla, Gergely

2012-05-01

Due to the increasing popularity of collaborative tagging systems, the research on tagged networks, hypergraphs, ontologies, folksonomies and other related concepts is becoming an important interdisciplinary area with great potential and relevance for practical applications. In most collaborative tagging systems the tagging by the users is completely ‘flat’, while in some cases they are allowed to define a shallow hierarchy for their own tags. However, usually no overall hierarchical organization of the tags is given, and one of the interesting challenges of this area is to provide an algorithm generating the ontology of the tags from the available data. In contrast, there are also other types of tagged networks available for research, where the tags are already organized into a directed acyclic graph (DAG), encapsulating the ‘is a sub-category of’ type of hierarchy between each other. In this paper, we study how this DAG affects the statistical distribution of tags on the nodes marked by the tags in various real networks. The motivation for this research was the fact that understanding the tagging based on a known hierarchy can help in revealing the hidden hierarchy of tags in collaborative tagging systems. We analyse the relation between the tag-frequency and the position of the tag in the DAG in two large sub-networks of the English Wikipedia and a protein-protein interaction network. We also study the tag co-occurrence statistics by introducing a two-dimensional (2D) tag-distance distribution preserving both the difference in the levels and the absolute distance in the DAG for the co-occurring pairs of tags. Our most interesting finding is that the local relevance of tags in the DAG (i.e. their rank or significance as characterized by, e.g., the length of the branches starting from them) is much more important than their global distance from the root. Furthermore, we also introduce a simple tagging model based on random walks on the DAG, capable of
Passive wireless tags for tongue controlled assistive technology interfaces.

PubMed

Rakibet, Osman O; Horne, Robert J; Kelly, Stephen W; Batchelor, John C

2016-03-01

Tongue control with low profile, passive mouth tags is demonstrated as a human-device interface by communicating values of tongue-tag separation over a wireless link. Confusion matrices are provided to demonstrate user accuracy in targeting by tongue position. Accuracy is found to increase dramatically after short training sequences with errors falling close to 1% in magnitude with zero missed targets. The rate at which users are able to learn accurate targeting with high accuracy indicates that this is an intuitive device to operate. The significance of the work is that innovative very unobtrusive, wireless tags can be used to provide intuitive human-computer interfaces based on low cost and disposable mouth mounted technology. With the development of an appropriate reading system, control of assistive devices such as computer mice or wheelchairs could be possible for tetraplegics and others who retain fine motor control capability of their tongues. The tags contain no battery and are intended to fit directly on the hard palate, detecting tongue position in the mouth with no need for tongue piercings.
Gene expression profiling of the plant pathogenic basidiomycetous fungus Rhizoctonia solani AG 4 reveals putative virulence factors

USDA-ARS?s Scientific Manuscript database

Rhizoctonia solani is a ubiquitous basidiomycetous soilborne fungal pathogen causing damping off of seedlings, aerial blights and postharvest diseases. To gain insight into the molecular mechanisms of pathogenesis a global approach based on analysis of expressed sequence tags (ESTs) was undertaken. ...
Large-Scale Concatenation cDNA Sequencing

PubMed Central

Yu, Wei; Andersson, Björn; Worley, Kim C.; Muzny, Donna M.; Ding, Yan; Liu, Wen; Ricafrente, Jennifer Y.; Wentland, Meredith A.; Lennon, Greg; Gibbs, Richard A.

1997-01-01

A total of 100 kb of DNA derived from 69 individual human brain cDNA clones of 0.7–2.0 kb were sequenced by concatenated cDNA sequencing (CCS), whereby multiple individual DNA fragments are sequenced simultaneously in a single shotgun library. The method yielded accurate sequences and a similar efficiency compared with other shotgun libraries constructed from single DNA fragments (>20 kb). Computer analyses were carried out on 65 cDNA clone sequences and their corresponding end sequences to examine both nucleic acid and amino acid sequence similarities in the databases. Thirty-seven clones revealed no DNA database matches, 12 clones generated exact matches (≥98% identity), and 16 clones generated nonexact matches (57%–97% identity) to either known human or other species genes. Of those 28 matched clones, 8 had corresponding end sequences that failed to identify similarities. In a protein similarity search, 27 clone sequences displayed significant matches, whereas only 20 of the end sequences had matches to known protein sequences. Our data indicate that full-length cDNA insert sequences provide significantly more nucleic acid and protein sequence similarity matches than expressed sequence tags (ESTs) for database searching. [All 65 cDNA clone sequences described in this paper have been submitted to the GenBank data library under accession nos. U79240–U79304.] PMID:9110174
Live single-cell laser tag.

PubMed

Binan, Loïc; Mazzaferri, Javier; Choquet, Karine; Lorenzo, Louis-Etienne; Wang, Yu Chang; Affar, El Bachir; De Koninck, Yves; Ragoussis, Jiannis; Kleinman, Claudia L; Costantino, Santiago

2016-05-20

The ability to conduct image-based, non-invasive cell tagging, independent of genetic engineering, is key to cell biology applications. Here we introduce cell labelling via photobleaching (CLaP), a method that enables instant, specific tagging of individual cells based on a wide array of criteria such as shape, behaviour or positional information. CLaP uses laser illumination to crosslink biotin onto the plasma membrane, coupled with streptavidin conjugates to label individual cells for genomic, cell-tracking, flow cytometry or ultra-microscopy applications. We show that the incorporated mark is stable, non-toxic, retained for several days, and transferred by cell division but not to adjacent cells in culture. To demonstrate the potential of CLaP for genomic applications, we combine CLaP with microfluidics-based single-cell capture followed by transcriptome-wide next-generation sequencing. Finally, we show that CLaP can also be exploited for inducing transient cell adhesion to substrates for microengineering cultures with spatially patterned cell types.

Computational identification of conserved microRNAs and their targets from expression sequence tags of blueberry (Vaccinium corybosum)

PubMed Central

Li, Xuyan; Hou, Yanming; Zhang, Li; Zhang, Wenhao; Quan, Chen; Cui, Yuhai; Bian, Shaomin

2014-01-01

MicroRNAs (miRNAs) are a class of endogenous, approximately 21nt in length, non-coding RNA, which mediate the expression of target genes primarily at post-transcriptional levels. miRNAs play critical roles in almost all plant cellular and metabolic processes. Although numerous miRNAs have been identified in the plant kingdom, the miRNAs in blueberry, which is an economically important small fruit crop, still remain totally unknown. In this study, we reported a computational identification of miRNAs and their targets in blueberry. By conducting an EST-based comparative genomics approach, 9 potential vco-miRNAs were discovered from 22,402 blueberry ESTs according to a series of filtering criteria, designated as vco-miR156–5p, vco-miR156–3p, vco-miR1436, vco-miR1522, vco-miR4495, vco-miR5120, vco-miR5658, vco-miR5783, and vco-miR5986. Based on sequence complementarity between miRNA and its target transcript, 34 target ESTs from blueberry and 70 targets from other species were identified for the vco-miRNAs. The targets were found to be involved in transcription, RNA splicing and binding, DNA duplication, signal transduction, transport and trafficking, stress response, as well as synthesis and metabolic process. These findings will greatly contribute to future research in regard to functions and regulatory mechanisms of blueberry miRNAs. PMID:25763692
Computational identification of conserved microRNAs and their targets from expression sequence tags of blueberry (Vaccinium corybosum).

PubMed

Li, Xuyan; Hou, Yanming; Zhang, Li; Zhang, Wenhao; Quan, Chen; Cui, Yuhai; Bian, Shaomin

2014-01-01

MicroRNAs (miRNAs) are a class of endogenous, approximately 21nt in length, non-coding RNA, which mediate the expression of target genes primarily at post-transcriptional levels. miRNAs play critical roles in almost all plant cellular and metabolic processes. Although numerous miRNAs have been identified in the plant kingdom, the miRNAs in blueberry, which is an economically important small fruit crop, still remain totally unknown. In this study, we reported a computational identification of miRNAs and their targets in blueberry. By conducting an EST-based comparative genomics approach, 9 potential vco-miRNAs were discovered from 22,402 blueberry ESTs according to a series of filtering criteria, designated as vco-miR156-5p, vco-miR156-3p, vco-miR1436, vco-miR1522, vco-miR4495, vco-miR5120, vco-miR5658, vco-miR5783, and vco-miR5986. Based on sequence complementarity between miRNA and its target transcript, 34 target ESTs from blueberry and 70 targets from other species were identified for the vco-miRNAs. The targets were found to be involved in transcription, RNA splicing and binding, DNA duplication, signal transduction, transport and trafficking, stress response, as well as synthesis and metabolic process. These findings will greatly contribute to future research in regard to functions and regulatory mechanisms of blueberry miRNAs.
Transcriptome analysis of Schistosoma mansoni larval development using serial analysis of gene expression (SAGE).

PubMed

Taft, A S; Vermeire, J J; Bernier, J; Birkeland, S R; Cipriano, M J; Papa, A R; McArthur, A G; Yoshino, T P

2009-04-01

Infection of the snail, Biomphalaria glabrata, by the free-swimming miracidial stage of the human blood fluke, Schistosoma mansoni, and its subsequent development to the parasitic sporocyst stage is critical to establishment of viable infections and continued human transmission. We performed a genome-wide expression analysis of the S. mansoni miracidia and developing sporocyst using Long Serial Analysis of Gene Expression (LongSAGE). Five cDNA libraries were constructed from miracidia and in vitro cultured 6- and 20-day-old sporocysts maintained in sporocyst medium (SM) or in SM conditioned by previous cultivation with cells of the B. glabrata embryonic (Bge) cell line. We generated 21 440 SAGE tags and mapped 13 381 to the S. mansoni gene predictions (v4.0e) either by estimating theoretical 3' UTR lengths or using existing 3' EST sequence data. Overall, 432 transcripts were found to be differentially expressed amongst all 5 libraries. In total, 172 tags were differentially expressed between miracidia and 6-day conditioned sporocysts and 152 were differentially expressed between miracidia and 6-day unconditioned sporocysts. In addition, 53 and 45 tags, respectively, were differentially expressed in 6-day and 20-day cultured sporocysts, due to the effects of exposure to Bge cell-conditioned medium.
Inferring gene expression from ribosomal promoter sequences, a crowdsourcing approach

PubMed Central

Meyer, Pablo; Siwo, Geoffrey; Zeevi, Danny; Sharon, Eilon; Norel, Raquel; Segal, Eran; Stolovitzky, Gustavo; Siwo, Geoffrey; Rider, Andrew K.; Tan, Asako; Pinapati, Richard S.; Emrich, Scott; Chawla, Nitesh; Ferdig, Michael T.; Tung, Yi-An; Chen, Yong-Syuan; Chen, Mei-Ju May; Chen, Chien-Yu; Knight, Jason M.; Sahraeian, Sayed Mohammad Ebrahim; Esfahani, Mohammad Shahrokh; Dreos, Rene; Bucher, Philipp; Maier, Ezekiel; Saeys, Yvan; Szczurek, Ewa; Myšičková, Alena; Vingron, Martin; Klein, Holger; Kiełbasa, Szymon M.; Knisley, Jeff; Bonnell, Jeff; Knisley, Debra; Kursa, Miron B.; Rudnicki, Witold R.; Bhattacharjee, Madhuchhanda; Sillanpää, Mikko J.; Yeung, James; Meysman, Pieter; Rodríguez, Aminael Sánchez; Engelen, Kristof; Marchal, Kathleen; Huang, Yezhou; Mordelet, Fantine; Hartemink, Alexander; Pinello, Luca; Yuan, Guo-Cheng

2013-01-01

The Gene Promoter Expression Prediction challenge consisted of predicting gene expression from promoter sequences in a previously unknown experimentally generated data set. The challenge was presented to the community in the framework of the sixth Dialogue for Reverse Engineering Assessments and Methods (DREAM6), a community effort to evaluate the status of systems biology modeling methodologies. Nucleotide-specific promoter activity was obtained by measuring fluorescence from promoter sequences fused upstream of a gene for yellow fluorescence protein and inserted in the same genomic site of yeast Saccharomyces cerevisiae. Twenty-one teams submitted results predicting the expression levels of 53 different promoters from yeast ribosomal protein genes. Analysis of participant predictions shows that accurate values for low-expressed and mutated promoters were difficult to obtain, although in the latter case, only when the mutation induced a large change in promoter activity compared to the wild-type sequence. As in previous DREAM challenges, we found that aggregation of participant predictions provided robust results, but did not fare better than the three best algorithms. Finally, this study not only provides a benchmark for the assessment of methods predicting activity of a specific set of promoters from their sequence, but it also shows that the top performing algorithm, which used machine-learning approaches, can be improved by the addition of biological features such as transcription factor binding sites. PMID:23950146
Expressed Sequence Reference Standards for Evaluating Stage-specific Gene Expression in Southern Green Lacewings, Chrysoperla rufilabris

USDA-ARS?s Scientific Manuscript database

Five developmental stages of Chrysoperla rufilabris were tested using nine primer pairs. Three sequences were highly expressed at all life stages and six were differentially expressed. These primer pairs may be used as standards to quantitate functional gene expression associated with physiological ...
Too much data, but little inter-changeability: a lesson learned from mining public data on tissue specificity of gene expression.

PubMed

Li, Shuyu; Li, Yiqun Helen; Wei, Tao; Su, Eric Wen; Duffin, Kevin; Liao, Birong

2006-10-25

The tissue expression pattern of a gene often provides an important clue to its potential role in a biological process. A vast amount of gene expression data have been and are being accumulated in public repository through different technology platforms. However, exploitations of these rich data sources remain limited in part due to issues of technology standardization. Our objective is to test the data comparability between SAGE and microarray technologies, through examining the expression pattern of genes under normal physiological states across variety of tissues. There are 42-54% of genes showing significant correlations in tissue expression patterns between SAGE and GeneChip, with 30-40% of genes whose expression patterns are positively correlated and 10-15% of genes whose expression patterns are negatively correlated at a statistically significant level (p = 0.05). Our analysis suggests that the discrepancy on the expression patterns derived from technology platforms is not likely from the heterogeneity of tissues used in these technologies, or other spurious correlations resulting from microarray probe design, abundance of genes, or gene function. The discrepancy can be partially explained by errors in the original assignment of SAGE tags to genes due to the evolution of sequence databases. In addition, sequence analysis has indicated that many SAGE tags and Affymetrix array probe sets are mapped to different splice variants or different sequence regions although they represent the same gene, which also contributes to the observed discrepancies between SAGE and array expression data. To our knowledge, this is the first report attempting to mine gene expression patterns across tissues using public data from different technology platforms. Unlike previous similar studies that only demonstrated the discrepancies between the two gene expression platforms, we carried out in-depth analysis to further investigate the cause for such discrepancies. Our study shows
Isoform-level gene expression patterns in single-cell RNA-sequencing data.

PubMed

Vu, Trung Nghia; Wills, Quin F; Kalari, Krishna R; Niu, Nifang; Wang, Liewei; Pawitan, Yudi; Rantalainen, Mattias

2018-02-27

RNA sequencing of single cells enables characterization of transcriptional heterogeneity in seemingly homogeneous cell populations. Single-cell sequencing has been applied in a wide range of researches fields. However, few studies have focus on characterization of isoform-level expression patterns at the single-cell level. In this study we propose and apply a novel method, ISOform-Patterns (ISOP), based on mixture modeling, to characterize the expression patterns of isoform pairs from the same gene in single-cell isoform-level expression data. We define six principal patterns of isoform expression relationships and describe a method for differential-pattern analysis. We demonstrate ISOP through analysis of single-cell RNA-sequencing data from a breast cancer cell line, with replication in three independent datasets. We assigned the pattern types to each of 16,562 isoform-pairs from 4,929 genes. Among those, 26% of the discovered patterns were significant (p<0.05), while remaining patterns are possibly effects of transcriptional bursting, drop-out and stochastic biological heterogeneity. Furthermore, 32% of genes discovered through differential-pattern analysis were not detected by differential-expression analysis. The effect of drop-out events, mean expression level, and properties of the expression distribution on the performances of ISOP were also investigated through simulated datasets. To conclude, ISOP provides a novel approach for characterization of isoformlevel preference, commitment and heterogeneity in single-cell RNA-sequencing data. The ISOP method has been implemented as a R package and is available at https://github.com/nghiavtr/ISOP under a GPL-3 license. mattias.rantalainen@ki.se. Supplementary data are available at Bioinformatics online.
Stable isotope, site-specific mass tagging for protein identification

DOEpatents

Chen, Xian

2006-10-24

Proteolytic peptide mass mapping as measured by mass spectrometry provides an important method for the identification of proteins, which are usually identified by matching the measured and calculated m/z values of the proteolytic peptides. A unique identification is, however, heavily dependent upon the mass accuracy and sequence coverage of the fragment ions generated by peptide ionization. The present invention describes a method for increasing the specificity, accuracy and efficiency of the assignments of particular proteolytic peptides and consequent protein identification, by the incorporation of selected amino acid residue(s) enriched with stable isotope(s) into the protein sequence without the need for ultrahigh instrumental accuracy. Selected amino acid(s) are labeled with .sup.13C/.sup.15N/.sup.2H and incorporated into proteins in a sequence-specific manner during cell culturing. Each of these labeled amino acids carries a defined mass change encoded in its monoisotopic distribution pattern. Through their characteristic patterns, the peptides with mass tag(s) can then be readily distinguished from other peptides in mass spectra. The present method of identifying unique proteins can also be extended to protein complexes and will significantly increase data search specificity, efficiency and accuracy for protein identifications.
Dynamic optical tags

NASA Astrophysics Data System (ADS)

Griggs, Steven P.; Mark, Martin B.; Feldman, Barry J.

2004-07-01

The goal of the DARPA Dynamic Optical Tags (DOTs) program is to develop a small, robust, persistent, 2-way tagging, tracking and locating device that also supports communications at data rates greater than 100 kbps and can be interrogated at significant range. These tags will allow for two-way data exchange and tagging operations in friendly and denied areas. The DOTs will be passive and non-RF. To accomplish this, the DOTs program will develop small, thin, retro-reflecting modulators. The tags will operate for long periods of time (greater than two months) in real-world environmental conditions (-40° to +70° C) and allow for a wide interrogation angle (+/-60°). The tags will be passive (in the sleep mode) for most of the time and only become active when interrogated by a laser with the correct code. Once correctly interrogated, the tags will begin to modulate and retro-reflect the incoming beam. The program will also develop two tag specific transceiver systems that are eye-safe, employ automated scanning algorithms, and are capable of short search and interrogate times.
Transcriptome dynamics through alternative polyadenylation in developmental and environmental responses in plants revealed by deep sequencing

PubMed Central

Shen, Yingjia; Venu, R.C.; Nobuta, Kan; Wu, Xiaohui; Notibala, Varun; Demirci, Caghan; Meyers, Blake C.; Wang, Guo-Liang; Ji, Guoli; Li, Qingshun Q.

2011-01-01

Polyadenylation sites mark the ends of mRNA transcripts. Alternative polyadenylation (APA) may alter sequence elements and/or the coding capacity of transcripts, a mechanism that has been demonstrated to regulate gene expression and transcriptome diversity. To study the role of APA in transcriptome dynamics, we analyzed a large-scale data set of RNA “tags” that signify poly(A) sites and expression levels of mRNA. These tags were derived from a wide range of tissues and developmental stages that were mutated or exposed to environmental treatments, and generated using digital gene expression (DGE)–based protocols of the massively parallel signature sequencing (MPSS-DGE) and the Illumina sequencing-by-synthesis (SBS-DGE) sequencing platforms. The data offer a global view of APA and how it contributes to transcriptome dynamics. Upon analysis of these data, we found that ∼60% of Arabidopsis genes have multiple poly(A) sites. Likewise, ∼47% and 82% of rice genes use APA, supported by MPSS-DGE and SBS-DGE tags, respectively. In both species, ∼49%–66% of APA events were mapped upstream of annotated stop codons. Interestingly, 10% of the transcriptomes are made up of APA transcripts that are differentially distributed among developmental stages and in tissues responding to environmental stresses, providing an additional level of transcriptome dynamics. Examples of pollen-specific APA switching and salicylic acid treatment-specific APA clearly demonstrated such dynamics. The significance of these APAs is more evident in the 3034 genes that have conserved APA events between rice and Arabidopsis. PMID:21813626
High-yield secretion of recombinant proteins expressed in tobacco cell culture with a designer glycopeptide tag: Process development.

PubMed

Zhang, Ningning; Gonzalez, Maria; Savary, Brett; Xu, Jianfeng

2016-03-01

Low-yield protein production remains the most significant economic hurdle with plant cell culture technology. Fusions of recombinant proteins with hydroxyproline-O-glycosylated designer glycopeptide tags have consistently boosted secreted protein yields. This prompted us to study the process development of this technology aiming to achieve productivity levels necessary for commercial viability. We used a tobacco BY-2 cell culture expressing EGFP as fusion with a glycopeptide tag comprised of 32 repeat of "Ser-Pro" dipeptide, or (SP)32 , to study cell growth and protein secretion, culture scale-up, and establishment of perfusion cultures for continuous production. The BY-2 cells accumulated low levels of cell biomass (~7.5 g DW/L) in Schenk & Hildebrandt medium, but secreted high yields of (SP)32 -tagged EGFP (125 mg/L). Protein productivity of the cell culture has been stable for 6.0 years. The BY-2 cells cultured in a 5-L bioreactor similarly produced high secreted protein yield at 131 mg/L. Successful operation of a cell perfusion culture for 30 days was achieved under the perfusion rate of 0.25 and 0.5 day(-1) , generating a protein volumetric productivity of 17.6 and 28.9 mg/day/L, respectively. This research demonstrates the great potential of the designer glycopeptide technology for use in commercial production of valuable proteins with plant cell cultures. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
High Throughput Biological Analysis Using Multi-bit Magnetic Digital Planar Tags

NASA Astrophysics Data System (ADS)

Hong, B.; Jeong, J.-R.; Llandro, J.; Hayward, T. J.; Ionescu, A.; Trypiniotis, T.; Mitrelias, T.; Kopper, K. P.; Steinmuller, S. J.; Bland, J. A. C.

2008-06-01

We report a new magnetic labelling technology for high-throughput biomolecular identification and DNA sequencing. Planar multi-bit magnetic tags have been designed and fabricated, which comprise a magnetic barcode formed by an ensemble of micron-sized thin film Ni80Fe20 bars encapsulated in SU8. We show that by using a globally applied magnetic field and magneto-optical Kerr microscopy the magnetic elements in the multi-bit magnetic tags can be addressed individually and encoded/decoded remotely. The critical steps needed to show the feasibility of this technology are demonstrated, including fabrication, flow transport, remote writing and reading, and successful functionalization of the tags as verified by fluorescence detection. This approach is ideal for encoding information on tags in microfluidic flow or suspension, for such applications as labelling of chemical precursors during drug synthesis and combinatorial library-based high-throughput multiplexed bioassays.
Sampling and pyrosequencing methods for characterizing bacterial communities in the human gut using 16S sequence tags

PubMed Central

2010-01-01

Intense interest centers on the role of the human gut microbiome in health and disease, but optimal methods for analysis are still under development. Here we present a study of methods for surveying bacterial communities in human feces using 454/Roche pyrosequencing of 16S rRNA gene tags. We analyzed fecal samples from 10 individuals and compared methods for storage, DNA purification and sequence acquisition. To assess reproducibility, we compared samples one cm apart on a single stool specimen for each individual. To analyze storage methods, we compared 1) immediate freezing at -80°C, 2) storage on ice for 24 or 3) 48 hours. For DNA purification methods, we tested three commercial kits and bead beating in hot phenol. Variations due to the different methodologies were compared to variation among individuals using two approaches--one based on presence-absence information for bacterial taxa (unweighted UniFrac) and the other taking into account their relative abundance (weighted UniFrac). In the unweighted analysis relatively little variation was associated with the different analytical procedures, and variation between individuals predominated. In the weighted analysis considerable variation was associated with the purification methods. Particularly notable was improved recovery of Firmicutes sequences using the hot phenol method. We also carried out surveys of the effects of different 454 sequencing methods (FLX versus Titanium) and amplification of different 16S rRNA variable gene segments. Based on our findings we present recommendations for protocols to collect, process and sequence bacterial 16S rDNA from fecal samples--some major points are 1) if feasible, bead-beating in hot phenol or use of the PSP kit improves recovery; 2) storage methods can be adjusted based on experimental convenience; 3) unweighted (presence-absence) comparisons are less affected by lysis method. PMID:20673359
Sampling and pyrosequencing methods for characterizing bacterial communities in the human gut using 16S sequence tags.

PubMed

Wu, Gary D; Lewis, James D; Hoffmann, Christian; Chen, Ying-Yu; Knight, Rob; Bittinger, Kyle; Hwang, Jennifer; Chen, Jun; Berkowsky, Ronald; Nessel, Lisa; Li, Hongzhe; Bushman, Frederic D

2010-07-30

Intense interest centers on the role of the human gut microbiome in health and disease, but optimal methods for analysis are still under development. Here we present a study of methods for surveying bacterial communities in human feces using 454/Roche pyrosequencing of 16S rRNA gene tags. We analyzed fecal samples from 10 individuals and compared methods for storage, DNA purification and sequence acquisition. To assess reproducibility, we compared samples one cm apart on a single stool specimen for each individual. To analyze storage methods, we compared 1) immediate freezing at -80 degrees C, 2) storage on ice for 24 or 3) 48 hours. For DNA purification methods, we tested three commercial kits and bead beating in hot phenol. Variations due to the different methodologies were compared to variation among individuals using two approaches--one based on presence-absence information for bacterial taxa (unweighted UniFrac) and the other taking into account their relative abundance (weighted UniFrac). In the unweighted analysis relatively little variation was associated with the different analytical procedures, and variation between individuals predominated. In the weighted analysis considerable variation was associated with the purification methods. Particularly notable was improved recovery of Firmicutes sequences using the hot phenol method. We also carried out surveys of the effects of different 454 sequencing methods (FLX versus Titanium) and amplification of different 16S rRNA variable gene segments. Based on our findings we present recommendations for protocols to collect, process and sequence bacterial 16S rDNA from fecal samples--some major points are 1) if feasible, bead-beating in hot phenol or use of the PSP kit improves recovery; 2) storage methods can be adjusted based on experimental convenience; 3) unweighted (presence-absence) comparisons are less affected by lysis method.
Application of Stochastic Labeling with Random-Sequence Barcodes for Simultaneous Quantification and Sequencing of Environmental 16S rRNA Genes.

PubMed

Hoshino, Tatsuhiko; Inagaki, Fumio

2017-01-01

Next-generation sequencing (NGS) is a powerful tool for analyzing environmental DNA and provides the comprehensive molecular view of microbial communities. For obtaining the copy number of particular sequences in the NGS library, however, additional quantitative analysis as quantitative PCR (qPCR) or digital PCR (dPCR) is required. Furthermore, number of sequences in a sequence library does not always reflect the original copy number of a target gene because of biases caused by PCR amplification, making it difficult to convert the proportion of particular sequences in the NGS library to the copy number using the mass of input DNA. To address this issue, we applied stochastic labeling approach with random-tag sequences and developed a NGS-based quantification protocol, which enables simultaneous sequencing and quantification of the targeted DNA. This quantitative sequencing (qSeq) is initiated from single-primer extension (SPE) using a primer with random tag adjacent to the 5' end of target-specific sequence. During SPE, each DNA molecule is stochastically labeled with the random tag. Subsequently, first-round PCR is conducted, specifically targeting the SPE product, followed by second-round PCR to index for NGS. The number of random tags is only determined during the SPE step and is therefore not affected by the two rounds of PCR that may introduce amplification biases. In the case of 16S rRNA genes, after NGS sequencing and taxonomic classification, the absolute number of target phylotypes 16S rRNA gene can be estimated by Poisson statistics by counting random tags incorporated at the end of sequence. To test the feasibility of this approach, the 16S rRNA gene of Sulfolobus tokodaii was subjected to qSeq, which resulted in accurate quantification of 5.0 × 103 to 5.0 × 104 copies of the 16S rRNA gene. Furthermore, qSeq was applied to mock microbial communities and environmental samples, and the results were comparable to those obtained using digital PCR and
Quantum-dot-tagged microbeads for multiplexed optical coding of biomolecules.

PubMed

Han, M; Gao, X; Su, J Z; Nie, S

2001-07-01

Multicolor optical coding for biological assays has been achieved by embedding different-sized quantum dots (zinc sulfide-capped cadmium selenide nanocrystals) into polymeric microbeads at precisely controlled ratios. Their novel optical properties (e.g., size-tunable emission and simultaneous excitation) render these highly luminescent quantum dots (QDs) ideal fluorophores for wavelength-and-intensity multiplexing. The use of 10 intensity levels and 6 colors could theoretically code one million nucleic acid or protein sequences. Imaging and spectroscopic measurements indicate that the QD-tagged beads are highly uniform and reproducible, yielding bead identification accuracies as high as 99.99% under favorable conditions. DNA hybridization studies demonstrate that the coding and target signals can be simultaneously read at the single-bead level. This spectral coding technology is expected to open new opportunities in gene expression studies, high-throughput screening, and medical diagnostics.
DeepSAGE Based Differential Gene Expression Analysis under Cold and Freeze Stress in Seabuckthorn (Hippophae rhamnoides L.)

PubMed Central

Chaudhary, Saurabh; Sharma, Prakash C.

2015-01-01

Seabuckthorn (Hippophae rhamnoides L.), an important plant species of Indian Himalayas, is well known for its immense medicinal and nutritional value. The plant has the ability to sustain growth in harsh environments of extreme temperatures, drought and salinity. We employed DeepSAGE, a tag based approach, to identify differentially expressed genes under cold and freeze stress in seabuckthorn. In total 36.2 million raw tags including 13.9 million distinct tags were generated using Illumina sequencing platform for three leaf tissue libraries including control (CON), cold stress (CS) and freeze stress (FS). After discarding low quality tags, 35.5 million clean tags including 7 million distinct clean tags were obtained. In all, 11922 differentially expressed genes (DEGs) including 6539 up regulated and 5383 down regulated genes were identified in three comparative setups i.e. CON vs CS, CON vs FS and CS vs FS. Gene ontology and KEGG pathway analysis were performed to assign gene ontology term to DEGs and ascertain their biological functions. DEGs were mapped back to our existing seabuckthorn transcriptome assembly comprising of 88,297 putative unigenes leading to the identification of 428 cold and freeze stress responsive genes. Expression of randomly selected 22 DEGs was validated using qRT-PCR that further supported our DeepSAGE results. The present study provided a comprehensive view of global gene expression profile of seabuckthorn under cold and freeze stresses. The DeepSAGE data could also serve as a valuable resource for further functional genomics studies aiming selection of candidate genes for development of abiotic stress tolerant transgenic plants. PMID:25803684
DeepSAGE based differential gene expression analysis under cold and freeze stress in seabuckthorn (Hippophae rhamnoides L.).

PubMed

Chaudhary, Saurabh; Sharma, Prakash C

2015-01-01

Seabuckthorn (Hippophae rhamnoides L.), an important plant species of Indian Himalayas, is well known for its immense medicinal and nutritional value. The plant has the ability to sustain growth in harsh environments of extreme temperatures, drought and salinity. We employed DeepSAGE, a tag based approach, to identify differentially expressed genes under cold and freeze stress in seabuckthorn. In total 36.2 million raw tags including 13.9 million distinct tags were generated using Illumina sequencing platform for three leaf tissue libraries including control (CON), cold stress (CS) and freeze stress (FS). After discarding low quality tags, 35.5 million clean tags including 7 million distinct clean tags were obtained. In all, 11922 differentially expressed genes (DEGs) including 6539 up regulated and 5383 down regulated genes were identified in three comparative setups i.e. CON vs CS, CON vs FS and CS vs FS. Gene ontology and KEGG pathway analysis were performed to assign gene ontology term to DEGs and ascertain their biological functions. DEGs were mapped back to our existing seabuckthorn transcriptome assembly comprising of 88,297 putative unigenes leading to the identification of 428 cold and freeze stress responsive genes. Expression of randomly selected 22 DEGs was validated using qRT-PCR that further supported our DeepSAGE results. The present study provided a comprehensive view of global gene expression profile of seabuckthorn under cold and freeze stresses. The DeepSAGE data could also serve as a valuable resource for further functional genomics studies aiming selection of candidate genes for development of abiotic stress tolerant transgenic plants.
An efficient procedure for the expression and purification of HIV-1 protease from inclusion bodies.

PubMed

Nguyen, Hong-Loan Thi; Nguyen, Thuy Thi; Vu, Quy Thi; Le, Hang Thi; Pham, Yen; Trinh, Phuong Le; Bui, Thuan Phuong; Phan, Tuan-Nghia

2015-12-01

Several studies have focused on HIV-1 protease for developing drugs for treating AIDS. Recombinant HIV-1 protease is used to screen new drugs from synthetic compounds or natural substances. However, large-scale expression and purification of this enzyme is difficult mainly because of its low expression and solubility. In this study, we constructed 9 recombinant plasmids containing a sequence encoding HIV-1 protease along with different fusion tags and examined the expression of the enzyme from these plasmids. Of the 9 plasmids, pET32a(+) plasmid containing the HIV-1 protease-encoding sequence along with sequences encoding an autocleavage site GTVSFNF at the N-terminus and TEV plus 6× His tag at the C-terminus showed the highest expression of the enzyme and was selected for further analysis. The recombinant protein was isolated from inclusion bodies by using 2 tandem Q- and Ni-Sepharose columns. SDS-PAGE of the obtained HIV-1 protease produced a single band of approximately 13 kDa. The enzyme was recovered efficiently (4 mg protein/L of cell culture) and had high specific activity of 1190 nmol min(-1) mg(-1) at an optimal pH of 4.7 and optimal temperature of 37 °C. This procedure for expressing and purifying HIV-1 protease is now being scaled up to produce the enzyme on a large scale for its application. Copyright © 2015 Elsevier Inc. All rights reserved.
Specific In Vivo Labeling of Tyrosinated α-Tubulin and Measurement of Microtubule Dynamics Using a GFP Tagged, Cytoplasmically Expressed Recombinant Antibody

PubMed Central

Cassimeris, Lynne; Guglielmi, Laurence; Denis, Vincent; Larroque, Christian; Martineau, Pierre

2013-01-01

GFP-tagged proteins are used extensively as biosensors for protein localization and function, but the GFP moiety can interfere with protein properties. An alternative is to indirectly label proteins using intracellular recombinant antibodies (scFvs), but most antibody fragments are insoluble in the reducing environment of the cytosol. From a synthetic hyperstable human scFv library we isolated an anti-tubulin scFv, 2G4, which is soluble in mammalian cells when expressed as a GFP-fusion protein. Here we report the use of this GFP-tagged scFv to label microtubules in fixed and living cells. We found that 2G4-GFP localized uniformly along microtubules and did not disrupt binding of EB1, a protein that binds microtubule ends and serves as a platform for binding by a complex of proteins regulating MT polymerization. TOGp and CLIP-170 also bound microtubule ends in cells expressing 2G4-GFP. Microtubule dynamic instability, measured by tracking 2G4-GFP labeled microtubules, was nearly identical to that measured in cells expressing GFP-α-tubulin. Fluorescence recovery after photobleaching demonstrated that 2G4-GFP turns over rapidly on microtubules, similar to the turnover rates of fluorescently tagged microtubule-associated proteins. These data indicate that 2G4-GFP binds relatively weakly to microtubules, and this conclusion was confirmed in vitro. Purified 2G4 partially co-pelleted with microtubules, but a significant fraction remained in the soluble fraction, while a second anti-tubulin scFv, 2F12, was almost completely co-pelleted with microtubules. In cells, 2G4-GFP localized to most microtubules, but did not co-localize with those composed of detyrosinated α-tubulin, a post-translational modification associated with non-dynamic, more stable microtubules. Immunoblots probing bacterially expressed tubulins confirmed that 2G4 recognized α-tubulin and required tubulin’s C-terminal tyrosine residue for binding. Thus, a recombinant antibody with weak affinity for its

A laboratory evaluation of tagging-related mortality and tag loss in juvenile humpback chub

USGS Publications Warehouse

Ward, David L.; Persons, William R.; Young, Kirk; Stone, Dennis M.; Van Haverbeke, Randy; Knight, William R.

2015-01-01

We quantified tag retention, survival, and growth in juvenile, captive-reared Humpback Chub Gila cypha marked with three different tag types: (1) Biomark 12.5-mm, 134.2-kHz, full duplex PIT tags injected into the body cavity with a 12-gauge needle; (2) Biomark 8.4-mm, 134.2-kHz, full duplex PIT tags injected with a 16-gauge needle; and (3) Northwest Marine Technology visible implant elastomer (VIE) tags injected under the skin with a 29-gauge needle. Estimates of tag loss, tagging-induced mortality, and growth were evaluated for 60 d with each tag type for four different size-groups of fish: 40–49 mm, 50–59 mm, 60–69 mm, and 70–79 mm TL. Total length was a significant predictor of the probability of PIT tag retention and mortality for both 8-mm and 12-mm PIT tags, and the smallest fish had the highest rates of tag loss (12.5–30.0%) and mortality (7.5–20.0%). Humpback Chub of sizes 40–49 mm TL and tagged with VIE tags had no mortality but did have a 17.5% tag loss. Growth rates of all tagged fish were similar to controls. Our data indicate Humpback Chub can be effectively tagged using either 8-mm or 12-mm PIT tags with little tag loss or mortality at sizes as low as 65 mm TL.
A review of recommendations for sequencing receptive and expressive language instruction.

PubMed

Petursdottir, Anna Ingeborg; Carr, James E

2011-01-01

We review recommendations for sequencing instruction in receptive and expressive language objectives in early and intensive behavioral intervention (EIBI) programs. Several books recommend completing receptive protocols before introducing corresponding expressive protocols. However, this recommendation has little empirical support, and some evidence exists that the reverse sequence may be more efficient. Alternative recommendations include teaching receptive and expressive skills simultaneously (M. L. Sundberg & Partington, 1998) and building learning histories that lead to acquisition of receptive and expressive skills without direct instruction (Greer & Ross, 2008). Empirical support for these recommendations also is limited. Future research should assess the relative efficiency of receptive-before-expressive, expressive-before-receptive, and simultaneous training with children who have diagnoses of autism spectrum disorders. In addition, further evaluation is needed of the potential benefits of multiple-exemplar training and other variables that may influence the efficiency of receptive and expressive instruction.
A REVIEW OF RECOMMENDATIONS FOR SEQUENCING RECEPTIVE AND EXPRESSIVE LANGUAGE INSTRUCTION

PubMed Central

Petursdottir, Anna Ingeborg; Carr, James E

2011-01-01

We review recommendations for sequencing instruction in receptive and expressive language objectives in early and intensive behavioral intervention (EIBI) programs. Several books recommend completing receptive protocols before introducing corresponding expressive protocols. However, this recommendation has little empirical support, and some evidence exists that the reverse sequence may be more efficient. Alternative recommendations include teaching receptive and expressive skills simultaneously (M. L. Sundberg & Partington, 1998) and building learning histories that lead to acquisition of receptive and expressive skills without direct instruction (Greer & Ross, 2008). Empirical support for these recommendations also is limited. Future research should assess the relative efficiency of receptive-before-expressive, expressive-before-receptive, and simultaneous training with children who have diagnoses of autism spectrum disorders. In addition, further evaluation is needed of the potential benefits of multiple-exemplar training and other variables that may influence the efficiency of receptive and expressive instruction. PMID:22219535
Expression, purification, and DNA-binding activity of the Herbaspirillum seropedicae RecX protein.

PubMed

Galvão, Carolina W; Pedrosa, Fábio O; Souza, Emanuel M; Yates, M Geoffrey; Chubatsu, Leda S; Steffens, Maria Berenice R

2004-06-01

The Herbaspirillum seropedicae RecX protein participates in the SOS response: a process in which the RecA protein plays a central role. The RecX protein of the H. seropedicae, fused to a His-tag sequence (RecX His-tagged), was over-expressed in Escherichia coli and purified by metal-affinity chromatography to yield a highly purified and active protein. DNA band-shift assays showed that the RecX His-tagged protein bound to both circular and linear double-stranded DNA and also to circular single-stranded DNA. The apparent affinity of RecX for DNA decreased in the presence of Mg(2+) ions. The ability of RecX to bind DNA may be relevant to its function in the SOS response.
Highly Selective End-Tagged Antimicrobial Peptides Derived from PRELP

PubMed Central

Malmsten, Martin; Kasetty, Gopinath; Pasupuleti, Mukesh; Alenfall, Jan; Schmidtchen, Artur

2011-01-01

Background Antimicrobial peptides (AMPs) are receiving increasing attention due to resistance development against conventional antibiotics. Pseudomonas aeruginosa and Staphylococcus aureus are two major pathogens involved in an array of infections such as ocular infections, cystic fibrosis, wound and post-surgery infections, and sepsis. The goal of the study was to design novel AMPs against these pathogens. Methodology and Principal Findings Antibacterial activity was determined by radial diffusion, viable count, and minimal inhibitory concentration assays, while toxicity was evaluated by hemolysis and effects on human epithelial cells. Liposome and fluorescence studies provided mechanistic information. Protease sensitivity was evaluated after subjection to human leukocyte elastase, staphylococcal aureolysin and V8 proteinase, as well as P. aeruginosa elastase. Highly active peptides were evaluated in ex vivo skin infection models. C-terminal end-tagging by W and F amino acid residues increased antimicrobial potency of the peptide sequences GRRPRPRPRP and RRPRPRPRP, derived from proline arginine-rich and leucine-rich repeat protein (PRELP). The optimized peptides were antimicrobial against a range of Gram-positive S. aureus and Gram-negative P. aeruginosa clinical isolates, also in the presence of human plasma and blood. Simultaneously, they showed low toxicity against mammalian cells. Particularly W-tagged peptides displayed stability against P. aeruginosa elastase, and S. aureus V8 proteinase and aureolysin, and the peptide RRPRPRPRPWWWW-NH2 was effective against various “superbugs” including vancomycin-resistant enterococci, multi-drug resistant P. aeruginosa, and methicillin-resistant S. aureus, as well as demonstrated efficiency in an ex vivo skin wound model of S. aureus and P. aeruginosa infection. Conclusions/Significance Hydrophobic C-terminal end-tagging of the cationic sequence RRPRPRPRP generates highly selective AMPs with potent activity against
A database of expressed genes from the new world screwworm, Cochliomyia hominivorax (Coquerel) (Diptera: Calliphoridae)

USDA-ARS?s Scientific Manuscript database

We used an expressed sequence tag and 454 pyrosequencing approach to initiate a study of the genome of the New World Screwworm, Cochliomyia hominivorax (Coquerel). Two normalized cDNA libraries were constructed from RNA isolated from embryos and 2nd instar larvae from the Panama 95 strain. Approxima...
Gene expression distribution deconvolution in single-cell RNA sequencing.

PubMed

Wang, Jingshu; Huang, Mo; Torre, Eduardo; Dueck, Hannah; Shaffer, Sydney; Murray, John; Raj, Arjun; Li, Mingyao; Zhang, Nancy R

2018-06-26

Single-cell RNA sequencing (scRNA-seq) enables the quantification of each gene's expression distribution across cells, thus allowing the assessment of the dispersion, nonzero fraction, and other aspects of its distribution beyond the mean. These statistical characterizations of the gene expression distribution are critical for understanding expression variation and for selecting marker genes for population heterogeneity. However, scRNA-seq data are noisy, with each cell typically sequenced at low coverage, thus making it difficult to infer properties of the gene expression distribution from raw counts. Based on a reexamination of nine public datasets, we propose a simple technical noise model for scRNA-seq data with unique molecular identifiers (UMI). We develop deconvolution of single-cell expression distribution (DESCEND), a method that deconvolves the true cross-cell gene expression distribution from observed scRNA-seq counts, leading to improved estimates of properties of the distribution such as dispersion and nonzero fraction. DESCEND can adjust for cell-level covariates such as cell size, cell cycle, and batch effects. DESCEND's noise model and estimation accuracy are further evaluated through comparisons to RNA FISH data, through data splitting and simulations and through its effectiveness in removing known batch effects. We demonstrate how DESCEND can clarify and improve downstream analyses such as finding differentially expressed genes, identifying cell types, and selecting differentiation markers. Copyright © 2018 the Author(s). Published by PNAS.
LlamaTags: A Versatile Tool to Image Transcription Factor Dynamics in Live Embryos.

PubMed

Bothma, Jacques P; Norstad, Matthew R; Alamos, Simon; Garcia, Hernan G

2018-06-14

Embryonic cell fates are defined by transcription factors that are rapidly deployed, yet attempts to visualize these factors in vivo often fail because of slow fluorescent protein maturation. Here, we pioneer a protein tag, LlamaTag, which circumvents this maturation limit by binding mature fluorescent proteins, making it possible to visualize transcription factor concentration dynamics in live embryos. Implementing this approach in the fruit fly Drosophila melanogaster, we discovered stochastic bursts in the concentration of transcription factors that are correlated with bursts in transcription. We further used LlamaTags to show that the concentration of protein in a given nucleus heavily depends on transcription of that gene in neighboring nuclei; we speculate that this inter-nuclear signaling is an important mechanism for coordinating gene expression to delineate straight and sharp boundaries of gene expression. Thus, LlamaTags now make it possible to visualize the flow of information along the central dogma in live embryos. Copyright © 2018 Elsevier Inc. All rights reserved.
Assessment of PIT tag retention and post-tagging survival in metamorphosing juvenile Sea Lamprey

USGS Publications Warehouse

Simard, Lee G.; Sotola, V. Alex; Marsden, J. Ellen; Miehls, Scott M.

2017-01-01

Background: Passive integrated transponder (PIT) tags have been used to document and monitor the movement or behavior of numerous species of fishes. Data on short-term and long-term survival and tag retention are needed before initiating studies using PIT tags on a new species or life stage. We evaluated the survival and tag retention of 153 metamorphosing juvenile Sea Lamprey Petromyzon marinus tagged with 12 mm PIT tags on three occasions using a simple surgical procedure. Results: Tag retention was 100% and 98.6% at 24 h and 28-105 d post-tagging. Of the lamprey that retained their tags, 87.3% had incisions sufficiently healed to prevent further loss. Survival was 100% and 92.7% at 24 h and 41-118 d post-tagging with no significant difference in survival between tagged and untagged control lamprey. Of the 11 lamprey that died, four had symptoms that indicated their death was directly related to tagging. Survival was positively correlated with Sea Lamprey length. Conclusions: Given the overall high level of survival and tag retention in this study, future studies can utilize 12 mm PIT tags to monitor metamorphosing juvenile Sea Lamprey movement and migration patterns.
Decision Tree Algorithm-Generated Single-Nucleotide Polymorphism Barcodes of rbcL Genes for 38 Brassicaceae Species Tagging.

PubMed

Yang, Cheng-Hong; Wu, Kuo-Chuan; Chuang, Li-Yeh; Chang, Hsueh-Wei

2018-01-01

DNA barcode sequences are accumulating in large data sets. A barcode is generally a sequence larger than 1000 base pairs and generates a computational burden. Although the DNA barcode was originally envisioned as straightforward species tags, the identification usage of barcode sequences is rarely emphasized currently. Single-nucleotide polymorphism (SNP) association studies provide us an idea that the SNPs may be the ideal target of feature selection to discriminate between different species. We hypothesize that SNP-based barcodes may be more effective than the full length of DNA barcode sequences for species discrimination. To address this issue, we tested a r ibulose diphosphate carboxylase ( rbcL ) S NP b arcoding (RSB) strategy using a decision tree algorithm. After alignment and trimming, 31 SNPs were discovered in the rbcL sequences from 38 Brassicaceae plant species. In the decision tree construction, these SNPs were computed to set up the decision rule to assign the sequences into 2 groups level by level. After algorithm processing, 37 nodes and 31 loci were required for discriminating 38 species. Finally, the sequence tags consisting of 31 rbcL SNP barcodes were identified for discriminating 38 Brassicaceae species based on the decision tree-selected SNP pattern using RSB method. Taken together, this study provides the rational that the SNP aspect of DNA barcode for rbcL gene is a useful and effective sequence for tagging 38 Brassicaceae species.
Exploiting EST databases for the development and characterisation of 3425 gene-tagged CISP markers in biofuel crop sugarcane and their transferability in cereals and orphan tropical grasses.

PubMed

Chandra, Amaresh; Jain, Radha; Solomon, Sushil; Shrivastava, Shiksha; Roy, Ajoy K

2013-02-04

Sugarcane is an important cash crop, providing 70% of the global raw sugar as well as raw material for biofuel production. Genetic analysis is hindered in sugarcane because of its large and complex polyploid genome and lack of sufficiently informative gene-tagged markers. Modern genomics has produced large amount of ESTs, which can be exploited to develop molecular markers based on comparative analysis with EST datasets of related crops and whole rice genome sequence, and accentuate their cross-technical functionality in orphan crops like tropical grasses. Utilising 246,180 Saccharum officinarum EST sequences vis-à-vis its comparative analysis with ESTs of sorghum and barley and the whole rice genome sequence, we have developed 3425 novel gene-tagged markers - namely, conserved-intron scanning primers (CISP) - using the web program GeMprospector. Rice orthologue annotation results indicated homology of 1096 sequences with expressed proteins, 491 with hypothetical proteins. The remaining 1838 were miscellaneous in nature. A total of 367 primer-pairs were tested in diverse panel of samples. The data indicate amplification of 41% polymorphic bands leading to 0.52 PIC and 3.50 MI with a set of sugarcane varieties and Saccharum species. In addition, a moderate technical functionality of a set of such markers with orphan tropical grasses (22%) and fodder cum cereal oat (33%) is observed. Developed gene-tagged CISP markers exhibited considerable technical functionality with varieties of sugarcane and unexplored species of tropical grasses. These markers would thus be particularly useful in identifying the economical traits in sugarcane and developing conservation strategies for orphan tropical grasses.
Molecular phenotype of zebrafish ovarian follicle by serial analysis of gene expression and proteomic profiling, and comparison with the transcriptomes of other animals

PubMed Central

Knoll-Gellida, Anja; André, Michèle; Gattegno, Tamar; Forgue, Jean; Admon, Arie; Babin, Patrick J

2006-01-01

Background The ability of an oocyte to develop into a viable embryo depends on the accumulation of specific maternal information and molecules, such as RNAs and proteins. A serial analysis of gene expression (SAGE) was carried out in parallel with proteomic analysis on fully-grown ovarian follicles from zebrafish (Danio rerio). The data obtained were compared with ovary/follicle/egg molecular phenotypes of other animals, published or available in public sequence databases. Results Sequencing of 27,486 SAGE tags identified 11,399 different ones, including 3,329 tags with an occurrence superior to one. Fifty-eight genes were expressed at over 0.15% of the total population and represented 17.34% of the mRNA population identified. The three most expressed transcripts were a rhamnose-binding lectin, beta-actin 2, and a transcribed locus similar to the H2B histone family. Comparison with the large-scale expressed sequence tags sequencing approach revealed highly expressed transcripts that were not previously known to be expressed at high levels in fish ovaries, like the short-sized polarized metallothionein 2 transcript. A higher sensitivity for the detection of transcripts with a characterized maternal genetic contribution was also demonstrated compared to large-scale sequencing of cDNA libraries. Ferritin heavy polypeptide 1, heat shock protein 90-beta, lactate dehydrogenase B4, beta-actin isoforms, tubulin beta 2, ATP synthase subunit 9, together with 40 S ribosomal protein S27a, were common highly-expressed transcripts of vertebrate ovary/unfertilized egg. Comparison of transcriptome and proteome data revealed that transcript levels provide little predictive value with respect to the extent of protein abundance. All the proteins identified by proteomic analysis of fully-grown zebrafish follicles had at least one transcript counterpart, with two exceptions: eosinophil chemotactic cytokine and nothepsin. Conclusion This study provides a complete sequence data set of
Control of total GFP expression by alterations to the 3′ region nucleotide sequence

PubMed Central

2013-01-01

Background Previously, we distinguished the Escherichia coli type II cytoplasmic membrane translocation pathways of Tat, Yid, and Sec for unfolded and folded soluble target proteins. The translocation of folded protein to the periplasm for soluble expression via the Tat pathway was controlled by an N-terminal hydrophilic leader sequence. In this study, we investigated the effect of the hydrophilic C-terminal end and its nucleotide sequence on total and soluble protein expression. Results The native hydrophilic C-terminal end of GFP was obtained by deleting the C-terminal peptide LeuGlu-6×His, derived from pET22b(+). The corresponding clones induced total and soluble GFP expression that was either slightly increased or dramatically reduced, apparently through reconstruction of the nucleotide sequence around the stop codon in the 3′ region. In the expression-induced clones, the hydrophilic C-terminus showed increased Tat pathway specificity for soluble expression. However, in the expression-reduced clone, after analyzing the role of the 5′ poly(A) coding sequence with a substituted synonymous codon, we proved that the longer 5′ poly(A) coding sequence interacted with the reconstructed 3′ region nucleotide sequence to create a new mRNA tertiary structure between the 5′ and 3′ regions, which resulted in reduced total GFP expression. Further, to recover the reduced expression by changing the 3′ nucleotide sequence, after replacing selected C-terminal 5′ codons and the stop codon in the ORF with synonymous codons, total GFP expression in most of the clones was recovered to the undeleted control level. The insertion of trinucleotides after the stop codon in the 3′-UTR recovered or reduced total GFP expression. RT-PCR revealed that the level of total protein expression was controlled by changes in translational or transcriptional regulation, which were induced or reduced by the substitution or insertion of 3′ region nucleotides. Conclusions We found
Generation, Analysis and Functional Annotation of Expressed Sequence Tags from the Sheepshead Minnow (Cyprinodon variegatus)

DTIC Science & Technology

2010-01-01

any other provision of law, no person shall be subject to a penalty for failing to comply with a collection of information if it does not display a...CA) were cloned using the pGEM-T Easy Vector System (Promega, Madison, WI) and Electromax DH10B T1 Phage Resistant Cells (Invitrogen, Carlsbad, CA...reactions were performed using 75ng of plasmid DNA template and a M13 (-40) forward primer according to the manufac- turer’s protocol for DNA sequencing of
An SSVEP-actuated brain computer interface using phase-tagged flickering sequences: a cursor system.

PubMed

Lee, Po-Lei; Sie, Jyun-Jie; Liu, Yu-Ju; Wu, Chi-Hsun; Lee, Ming-Huan; Shu, Chih-Hung; Li, Po-Hung; Sun, Chia-Wei; Shyu, Kuo-Kai

2010-07-01

This study presents a new steady-state visual evoked potential (SSVEP)-based brain computer interface (BCI). SSVEPs, induced by phase-tagged flashes in eight light emitting diodes (LEDs), were used to control four cursor movements (up, right, down, and left) and four button functions (on, off, right-, and left-clicks) on a screen menu. EEG signals were measured by one EEG electrode placed at Oz position, referring to the international EEG 10-20 system. Since SSVEPs are time-locked and phase-locked to the onsets of SSVEP flashes, EEG signals were bandpass-filtered and segmented into epochs, and then averaged across a number of epochs to sharpen the recorded SSVEPs. Phase lags between the measured SSVEPs and a reference SSVEP were measured, and targets were recognized based on these phase lags. The current design used eight LEDs to flicker at 31.25 Hz with 45 degrees phase margin between any two adjacent SSVEP flickers. The SSVEP responses were filtered within 29.25-33.25 Hz and then averaged over 60 epochs. Owing to the utilization of high-frequency flickers, the induced SSVEPs were away from low-frequency noises, 60 Hz electricity noise, and eye movement artifacts. As a consequence, we achieved a simple architecture that did not require eye movement monitoring or other artifact detection and removal. The high-frequency design also achieved a flicker fusion effect for better visualization. Seven subjects were recruited in this study to sequentially input a command sequence, consisting of a sequence of eight cursor functions, repeated three times. The accuracy and information transfer rate (mean +/- SD) over the seven subjects were 93.14 +/- 5.73% and 28.29 +/- 12.19 bits/min, respectively. The proposed system can provide a reliable channel for severely disabled patients to communicate with external environments.
Biotin-tagged proteins: Reagents for efficient ELISA-based serodiagnosis and phage display-based affinity selection

PubMed Central

Verma, Vaishali; Kaur, Charanpreet; Grover, Payal; Gupta, Amita

2018-01-01

The high-affinity interaction between biotin and streptavidin has opened avenues for using recombinant proteins with site-specific biotinylation to achieve efficient and directional immobilization. The site-specific biotinylation of proteins carrying a 15 amino acid long Biotin Acceptor Peptide tag (BAP; also known as AviTag) is effected on a specific lysine either by co-expressing the E. coli BirA enzyme in vivo or by using purified recombinant E. coli BirA enzyme in the presence of ATP and biotin in vitro. In this paper, we have designed a T7 promoter-lac operator-based expression vector for rapid and efficient cloning, and high-level cytosolic expression of proteins carrying a C-terminal BAP tag in E. coli with TEV protease cleavable N-terminal deca-histidine tag, useful for initial purification. Furthermore, a robust three-step purification pipeline integrated with well-optimized protocols for TEV protease-based H10 tag removal, and recombinant BirA enzyme-based site-specific in vitro biotinylation is described to obtain highly pure biotinylated proteins. Most importantly, the paper demonstrates superior sensitivities in indirect ELISA with directional and efficient immobilization of biotin-tagged proteins on streptavidin-coated surfaces in comparison to passive immobilization. The use of biotin-tagged proteins through specific immobilization also allows more efficient selection of binders from a phage-displayed naïve antibody library. In addition, for both these applications, specific immobilization requires much less amount of protein as compared to passive immobilization and can be easily multiplexed. The simplified strategy described here for the production of highly pure biotin-tagged proteins will find use in numerous applications, including those, which may require immobilization of multiple proteins simultaneously on a solid surface. PMID:29360877
Biotin-tagged proteins: Reagents for efficient ELISA-based serodiagnosis and phage display-based affinity selection.

PubMed

Verma, Vaishali; Kaur, Charanpreet; Grover, Payal; Gupta, Amita; Chaudhary, Vijay K

2018-01-01

The high-affinity interaction between biotin and streptavidin has opened avenues for using recombinant proteins with site-specific biotinylation to achieve efficient and directional immobilization. The site-specific biotinylation of proteins carrying a 15 amino acid long Biotin Acceptor Peptide tag (BAP; also known as AviTag) is effected on a specific lysine either by co-expressing the E. coli BirA enzyme in vivo or by using purified recombinant E. coli BirA enzyme in the presence of ATP and biotin in vitro. In this paper, we have designed a T7 promoter-lac operator-based expression vector for rapid and efficient cloning, and high-level cytosolic expression of proteins carrying a C-terminal BAP tag in E. coli with TEV protease cleavable N-terminal deca-histidine tag, useful for initial purification. Furthermore, a robust three-step purification pipeline integrated with well-optimized protocols for TEV protease-based H10 tag removal, and recombinant BirA enzyme-based site-specific in vitro biotinylation is described to obtain highly pure biotinylated proteins. Most importantly, the paper demonstrates superior sensitivities in indirect ELISA with directional and efficient immobilization of biotin-tagged proteins on streptavidin-coated surfaces in comparison to passive immobilization. The use of biotin-tagged proteins through specific immobilization also allows more efficient selection of binders from a phage-displayed naïve antibody library. In addition, for both these applications, specific immobilization requires much less amount of protein as compared to passive immobilization and can be easily multiplexed. The simplified strategy described here for the production of highly pure biotin-tagged proteins will find use in numerous applications, including those, which may require immobilization of multiple proteins simultaneously on a solid surface.
ASFinder: a tool for genome-wide identification of alternatively splicing transcripts from EST-derived sequences.

PubMed

Min, Xiang Jia

2013-01-01

Expressed Sequence Tags (ESTs) are a rich resource for identifying Alternatively Splicing (AS) genes. The ASFinder webserver is designed to identify AS isoforms from EST-derived sequences. Two approaches are implemented in ASFinder. If no genomic sequences are provided, the server performs a local BLASTN to identify AS isoforms from ESTs having both ends aligned but an internal segment unaligned. Otherwise, ASFinder uses SIM4 to map ESTs to the genome, then the overlapping ESTs that are mapped to the same genomic locus and have internal variable exon/intron boundaries are identified as AS isoforms. The tool is available at http://proteomics.ysu.edu/tools/ASFinder.html.
A dual affinity-tag strategy for the expression and purification of human linker histone H1.4 in Escherichia coli.

PubMed

Ryan, Daniel P; Tremethick, David J

2016-04-01

Linker histones are an abundant and critical component of the eukaryotic chromatin landscape. They play key roles in regulating the higher order structure of chromatin and many genetic processes. Higher eukaryotes possess a number of different linker histone subtypes and new data are consistently emerging that indicate these subtypes are functionally distinct. We were interested in studying one of the most abundant human linker histone subtypes, H1.4. We have produced recombinant full-length H1.4 in Escherichia coli. An N-terminal Glutathione-S-Transferase tag was used to promote soluble expression and was combined with a C-terminal hexahistidine tag to facilitate a simple non-denaturing two-step affinity chromatography procedure that results in highly pure full-length H1.4. The purified H1.4 was shown to be functional via in vitro chromatin assembly experiments and remains active after extended storage at -80 °C. Copyright © 2015 Elsevier Inc. All rights reserved.
PIT Tagging Anurans

USGS Publications Warehouse

McCreary, Brome

2008-01-01

The following video demonstrates a procedure to insert a passive integrated transponder (PIT) tag under the skin of an anuran (frog or toad) for research and monitoring purposes. Typically, a 12.5 mm tag (0.5 in.) is used to uniquely identify individual anurans as smal as 40 mm (1.6 in.) in length from snout to vent. Smaller tags are also available and allow smaller anurans to be tagged. The procedure does not differ for other sizes of tages or other sizes of anurans. Anyone using this procedure should ensure that the tag is small enough to fit easily behind the sacral hump of the anuran, as shown in this video.

Construction and characterization of 3A-epitope-tagged foot-and-mouth disease virus.

PubMed

Ma, Xueqing; Li, Pinghua; Sun, Pu; Bai, Xingwen; Bao, Huifang; Lu, Zengjun; Fu, Yuanfang; Cao, Yimei; Li, Dong; Chen, Yingli; Qiao, Zilin; Liu, Zaixin

2015-04-01

Nonstructural protein 3A of foot-and-mouth disease virus (FMDV) is a partially conserved protein of 153 amino acids (aa) in most FMDVs examined to date. Specific deletion in the FMDV 3A protein has been associated with the inability of FMDV to grow in primary bovine cells and cause disease in cattle. However, the aa residues playing key roles in these processes are poorly understood. In this study, we constructed epitope-tagged FMDVs containing an 8 aa FLAG epitope, a 9 aa haemagglutinin (HA) epitope, and a 10 aa c-Myc epitope to substitute residues 94-101, 93-101, and 93-102 of 3A protein, respectively, using a recently developed O/SEA/Mya-98 FMDV infectious cDNA clone. Immunofluorescence assay (IFA), Western blot and sequence analysis showed that the epitope-tagged viruses stably maintained and expressed the foreign epitopes even after 10 serial passages in BHK-21 cells. The epitope-tagged viruses displayed growth properties and plaque phenotypes similar to those of the parental virus in BHK-21 cells. However, the epitope-tagged viruses exhibited lower growth rates and smaller plaque size phenotypes than those of the parental virus in primary fetal bovine kidney (FBK) cells, but similar growth properties and plaque phenotypes to those of the recombinant viruses harboring 93-102 deletion in 3A. These results demonstrate that the decreased ability of FMDV to replicate in primary bovine cells was not associated with the length of 3A, and the genetic determinant thought to play key role in decreased ability to replicate in primary bovine cells could be reduced from 93-102 residues to 8 aa residues at positions 94-101 in 3A protein. Copyright © 2015 Elsevier B.V. All rights reserved.
Development of RAP Tag, a Novel Tagging System for Protein Detection and Purification.

PubMed

Fujii, Yuki; Kaneko, Mika K; Ogasawara, Satoshi; Yamada, Shinji; Yanaka, Miyuki; Nakamura, Takuro; Saidoh, Noriko; Yoshida, Kanae; Honma, Ryusuke; Kato, Yukinari

2017-04-01

Affinity tag systems, possessing high affinity and specificity, are useful for protein detection and purification. The most suitable tag for a particular purpose should be selected from many available affinity tag systems. In this study, we developed a novel affinity tag called the "RAP tag" system, which comprises a mouse antirat podoplanin monoclonal antibody (clone PMab-2) and the RAP tag (DMVNPGLEDRIE). This system is useful not only for protein detection in Western blotting, flow cytometry, and sandwich enzyme-linked immunosorbent assay, but also for protein purification.
ezTag: tagging biomedical concepts via interactive learning.

PubMed

Kwon, Dongseop; Kim, Sun; Wei, Chih-Hsuan; Leaman, Robert; Lu, Zhiyong

2018-05-18

Recently, advanced text-mining techniques have been shown to speed up manual data curation by providing human annotators with automated pre-annotations generated by rules or machine learning models. Due to the limited training data available, however, current annotation systems primarily focus only on common concept types such as genes or diseases. To support annotating a wide variety of biological concepts with or without pre-existing training data, we developed ezTag, a web-based annotation tool that allows curators to perform annotation and provide training data with humans in the loop. ezTag supports both abstracts in PubMed and full-text articles in PubMed Central. It also provides lexicon-based concept tagging as well as the state-of-the-art pre-trained taggers such as TaggerOne, GNormPlus and tmVar. ezTag is freely available at http://eztag.bioqrator.org.
The generation of knock-in mice expressing fluorescently tagged galanin receptors 1 and 2

PubMed Central

Kerr, Niall; Holmes, Fiona E.; Hobson, Sally-Ann; Vanderplank, Penny; Leard, Alan; Balthasar, Nina; Wynick, David

2015-01-01

The neuropeptide galanin has diverse roles in the central and peripheral nervous systems, by activating the G protein-coupled receptors Gal1, Gal2 and the less studied Gal3 (GalR1–3 gene products). There is a wealth of data on expression of Gal1–3 at the mRNA level, but not at the protein level due to the lack of specificity of currently available antibodies. Here we report the generation of knock-in mice expressing Gal1 or Gal2 receptor fluorescently tagged at the C-terminus with, respectively, mCherry or hrGFP (humanized Renilla green fluorescent protein). In dorsal root ganglia (DRG) neurons expressing the highest levels of Gal1-mCherry, localization to the somatic cell membrane was detected by live-cell fluorescence and immunohistochemistry, and that fluorescence decreased upon addition of galanin. In spinal cord, abundant Gal1-mCherry immunoreactive processes were detected in the superficial layers of the dorsal horn, and highly expressing intrinsic neurons of the lamina III/IV border showed both somatic cell membrane localization and outward transport of receptor from the cell body, detected as puncta within cell processes. In brain, high levels of Gal1-mCherry immunofluorescence were detected within thalamus, hypothalamus and amygdala, with a high density of nerve endings in the external zone of the median eminence, and regions with lesser immunoreactivity included the dorsal raphe nucleus. Gal2-hrGFP mRNA was detected in DRG, but live-cell fluorescence was at the limits of detection, drawing attention to both the much lower mRNA expression than to Gal1 in mice and the previously unrecognized potential for translational control by upstream open reading frames (uORFs). PMID:26292267
Cutaneous skin tag

MedlinePlus

Skin tag; Acrochordon; Fibroepithelial polyp ... have diabetes. They are thought to occur from skin rubbing against skin. ... The tag sticks out of the skin and may have a short, narrow stalk connecting it to the surface of the skin. Some skin tags are as long as ...
Production of recombinant proteins in Escherichia coli tagged with the fusion protein CusF3H.

PubMed

Vargas-Cortez, Teresa; Morones-Ramirez, Jose Ruben; Balderas-Renteria, Isaias; Zarate, Xristo

2017-04-01

Recombinant protein expression in the bacterium Escherichia coli still is the number one choice for large-scale protein production. Nevertheless, many complications can arise using this microorganism, such as low yields, the formation of inclusion bodies, and the requirement for difficult purification steps. Most of these problems can be solved with the use of fusion proteins. Here, the use of the metal-binding protein CusF3H+ is described as a new fusion protein for recombinant protein expression and purification in E. coli. We have previously shown that CusF produces large amounts of soluble protein, with low levels of formation of inclusion bodies, and that proteins can be purified using IMAC resins charged with Cu(II) ions. CusF3H+ is an enhanced variant of CusF, formed by the addition of three histidine residues at the N-terminus. These residues then can bind Ni(II) ions allowing improved purity after affinity chromatography. Expression and purification of Green Fluorescent Protein tagged with CusF3H+ showed that the mutation did not alter the capacity of the fusion protein to increase protein expression, and purity improved considerably after affinity chromatography with immobilized nickel ions; high yields are obtained after tag-removal since CusF3H+ is a small protein of just 10 kDa. Furthermore, the results of experiments involving expression of tagged proteins having medium to large molecular weights indicate that the presence of the CusF3H+ tag improves protein solubility, as compared to a His-tag. We therefore endorse CusF3H+ as a useful alternative fusion protein/affinity tag for production of recombinant proteins in E. coli. Copyright © 2017 Elsevier Inc. All rights reserved.
A novel gene, RSD-3/HSD-3.1, encodes a meiotic-related protein expressed in rat and human testis.

PubMed

Zhang, Xiaodong; Liu, Huixian; Zhang, Yan; Qiao, Yuan; Miao, Shiying; Wang, Linfang; Zhang, Jianchao; Zong, Shudong; Koide, S S

2003-06-01

The expression of stage-specific genes during spermatogenesis was determined by isolating two segments of rat seminiferous tubule at different stages of the germinal epithelium cycle delineated by transillumination-delineated microdissection, combined with differential display polymerase chain reaction to identify the differential transcripts formed. A total of 22 cDNAs were identified and accepted by GenBank as new expressed sequence tags. One of the expressed sequence tags was radiolabeled and used as a probe to screen a rat testis cDNA library. A novel full-length cDNA composed of 2228 bp, designated as RSD-3 (rat sperm DNA no.3, GenBank accession no. AF094609) was isolated and characterized. The reading frame encodes a polypeptide consisting of 526 amino acid residues, containing a number of DNA binding motifs and phosphorylation sites for PKC, CK-II, and p34cdc2. Northern blot of mRNA prepared from various tissues of adult rats showed that RSD-3 is expressed only in the testis. The initial expression of the RSD-3 gene was detected in the testis on the 30th postnatal day and attained adult level on the 60th postnatal day. Immunolocalization of RSD-3 in germ cells of rat testis showed that its expression is restricted to primary spermatocytes, undergoing meiosis division I. A human testis homologue of RSD-3 cDNA, designated as HSD-3.1 (GenBank accession no. AF144487) was isolated by screening the Human Testis Rapid-Screen arrayed cDNA library panels by RT-PCR. The exon-intron boundaries of HSD-3.1 gene were determined by aligning the cDNA sequence with the corresponding genome sequence. The cDNA consisted of 12 exons that span approximately 52.8 kb of the genome sequence and was mapped to chromosome 14q31.3.
Use of the Nanofitin Alternative Scaffold as a GFP-Ready Fusion Tag

PubMed Central

Huet, Simon; Gorre, Harmony; Perrocheau, Anaëlle; Picot, Justine; Cinier, Mathieu

2015-01-01

With the continuous diversification of recombinant DNA technologies, the possibilities for new tailor-made protein engineering have extended on an on-going basis. Among these strategies, the use of the green fluorescent protein (GFP) as a fusion domain has been widely adopted for cellular imaging and protein localization. Following the lead of the direct head-to-tail fusion of GFP, we proposed to provide additional features to recombinant proteins by genetic fusion of artificially derived binders. Thus, we reported a GFP-ready fusion tag consisting of a small and robust fusion-friendly anti-GFP Nanofitin binding domain as a proof-of-concept. While limiting steric effects on the carrier, the GFP-ready tag allows the capture of GFP or its blue (BFP), cyan (CFP) and yellow (YFP) alternatives. Here, we described the generation of the GFP-ready tag from the selection of a Nanofitin variant binding to the GFP and its spectral variants with a nanomolar affinity, while displaying a remarkable folding stability, as demonstrated by its full resistance upon thermal sterilization process or the full chemical synthesis of Nanofitins. To illustrate the potential of the Nanofitin-based tag as a fusion partner, we compared the expression level in Escherichia coli and activity profile of recombinant human tumor necrosis factor alpha (TNFα) constructs, fused to a SUMO or GFP-ready tag. Very similar expression levels were found with the two fusion technologies. Both domains of the GFP-ready tagged TNFα were proved fully active in ELISA and interferometry binding assays, allowing the simultaneous capture by an anti-TNFα antibody and binding to the GFP, and its spectral mutants. The GFP-ready tag was also shown inert in a L929 cell based assay, demonstrating the potent TNFα mediated apoptosis induction by the GFP-ready tagged TNFα. Eventually, we proposed the GFP-ready tag as a versatile capture and labeling system in addition to expected applications of anti-GFP Nanofitins (as
Use of the Nanofitin Alternative Scaffold as a GFP-Ready Fusion Tag.

PubMed

Huet, Simon; Gorre, Harmony; Perrocheau, Anaëlle; Picot, Justine; Cinier, Mathieu

2015-01-01

With the continuous diversification of recombinant DNA technologies, the possibilities for new tailor-made protein engineering have extended on an on-going basis. Among these strategies, the use of the green fluorescent protein (GFP) as a fusion domain has been widely adopted for cellular imaging and protein localization. Following the lead of the direct head-to-tail fusion of GFP, we proposed to provide additional features to recombinant proteins by genetic fusion of artificially derived binders. Thus, we reported a GFP-ready fusion tag consisting of a small and robust fusion-friendly anti-GFP Nanofitin binding domain as a proof-of-concept. While limiting steric effects on the carrier, the GFP-ready tag allows the capture of GFP or its blue (BFP), cyan (CFP) and yellow (YFP) alternatives. Here, we described the generation of the GFP-ready tag from the selection of a Nanofitin variant binding to the GFP and its spectral variants with a nanomolar affinity, while displaying a remarkable folding stability, as demonstrated by its full resistance upon thermal sterilization process or the full chemical synthesis of Nanofitins. To illustrate the potential of the Nanofitin-based tag as a fusion partner, we compared the expression level in Escherichia coli and activity profile of recombinant human tumor necrosis factor alpha (TNFα) constructs, fused to a SUMO or GFP-ready tag. Very similar expression levels were found with the two fusion technologies. Both domains of the GFP-ready tagged TNFα were proved fully active in ELISA and interferometry binding assays, allowing the simultaneous capture by an anti-TNFα antibody and binding to the GFP, and its spectral mutants. The GFP-ready tag was also shown inert in a L929 cell based assay, demonstrating the potent TNFα mediated apoptosis induction by the GFP-ready tagged TNFα. Eventually, we proposed the GFP-ready tag as a versatile capture and labeling system in addition to expected applications of anti-GFP Nanofitins (as
Lamprey Tagging

DOE Office of Scientific and Technical Information (OSTI.GOV)

Colotelo, Alison; Deters, Kate

2017-05-26

Pacific Northwest National Laboratory has developed a super-small acoustic tracking tag designed just for juvenile lamprey. In this video, PNNL researcher Alison Colotelo describes how she and her colleague Kate Deters inject young lamprey with the PNNL tag.
Tag-to-Tag Interference Suppression Technique Based on Time Division for RFID.

PubMed

Khadka, Grishma; Hwang, Suk-Seung

2017-01-01

Radio-frequency identification (RFID) is a tracking technology that enables immediate automatic object identification and rapid data sharing for a wide variety of modern applications using radio waves for data transmission from a tag to a reader. RFID is already well established in technical areas, and many companies have developed corresponding standards and measurement techniques. In the construction industry, effective monitoring of materials and equipment is an important task, and RFID helps to improve monitoring and controlling capabilities, in addition to enabling automation for construction projects. However, on construction sites, there are many tagged objects and multiple RFID tags that may interfere with each other's communications. This reduces the reliability and efficiency of the RFID system. In this paper, we propose an anti-collision algorithm for communication between multiple tags and a reader. In order to suppress interference signals from multiple neighboring tags, the proposed algorithm employs the time-division (TD) technique, where tags in the interrogation zone are assigned a specific time slot so that at every instance in time, a reader communicates with tags using the specific time slot. We present representative computer simulation examples to illustrate the performance of the proposed anti-collision technique for multiple RFID tags.
Photon-tagged and B-meson-tagged b-jet production at the LHC

DOE PAGES

Huang, Jinrui; Kang, Zhong -Bo; Vitev, Ivan; ...

2015-09-18

Tagged jet measurements in high energy hadronic and nuclear reactions provide constraints on the energy and parton flavor origin of the parton shower that recoils against the tagging particle. Such additional insight can be especially beneficial in illuminating the mechanisms of heavy flavor production in proton–proton collisions at the LHC and their modification in the heavy ion environment, which are not fully understood. With this motivation, we present theoretical results for isolated-photon-tagged and B-meson-tagged b-jet production at √s NN = 5.1 TeV for comparison to the upcoming lead–lead data. We find that photon-tagged b-jets exhibit smaller momentum imbalance shift inmore » nuclear matter, and correspondingly smaller energy loss, than photon-tagged light flavor jets. Our results show that B-meson tagging is most effective in ensuring that the dominant fraction of recoiling jets originate from prompt b-quarks. Furthermore, in this channel the large suppression of the cross section is not accompanied by a significant momentum imbalance shift.« less
A maize map standard with sequenced core markers, grass genome reference points and 932 expressed sequence tagged sites (ESTs) in a 1736-locus map.

PubMed Central

Davis, G L; McMullen, M D; Baysdorfer, C; Musket, T; Grant, D; Staebell, M; Xu, G; Polacco, M; Koster, L; Melia-Hancock, S; Houchins, K; Chao, S; Coe, E H

1999-01-01

We have constructed a 1736-locus maize genome map containing1156 loci probed by cDNAs, 545 probed by random genomic clones, 16 by simple sequence repeats (SSRs), 14 by isozymes, and 5 by anonymous clones. Sequence information is available for 56% of the loci with 66% of the sequenced loci assigned functions. A total of 596 new ESTs were mapped from a B73 library of 5-wk-old shoots. The map contains 237 loci probed by barley, oat, wheat, rice, or tripsacum clones, which serve as grass genome reference points in comparisons between maize and other grass maps. Ninety core markers selected for low copy number, high polymorphism, and even spacing along the chromosome delineate the 100 bins on the map. The average bin size is 17 cM. Use of bin assignments enables comparison among different maize mapping populations and experiments including those involving cytogenetic stocks, mutants, or quantitative trait loci. Integration of nonmaize markers in the map extends the resources available for gene discovery beyond the boundaries of maize mapping information into the expanse of map, sequence, and phenotype information from other grass species. This map provides a foundation for numerous basic and applied investigations including studies of gene organization, gene and genome evolution, targeted cloning, and dissection of complex traits. PMID:10388831
A Review of Recommendations for Sequencing Receptive and Expressive Language Instruction

ERIC Educational Resources Information Center

Petursdottir, Anna Ingeborg; Carr, James E.

2011-01-01

We review recommendations for sequencing instruction in receptive and expressive language objectives in early and intensive behavioral intervention (EIBI) programs. Several books recommend completing receptive protocols before introducing corresponding expressive protocols. However, this recommendation has little empirical support, and some…
MytiBase: a knowledgebase of mussel (M. galloprovincialis) transcribed sequences

PubMed Central

Venier, Paola; De Pittà, Cristiano; Bernante, Filippo; Varotto, Laura; De Nardi, Barbara; Bovo, Giuseppe; Roch, Philippe; Novoa, Beatriz; Figueras, Antonio; Pallavicini, Alberto; Lanfranchi, Gerolamo

2009-01-01

Background Although Bivalves are among the most studied marine organisms due to their ecological role, economic importance and use in pollution biomonitoring, very little information is available on the genome sequences of mussels. This study reports the functional analysis of a large-scale Expressed Sequence Tag (EST) sequencing from different tissues of Mytilus galloprovincialis (the Mediterranean mussel) challenged with toxic pollutants, temperature and potentially pathogenic bacteria. Results We have constructed and sequenced seventeen cDNA libraries from different Mediterranean mussel tissues: gills, digestive gland, foot, anterior and posterior adductor muscle, mantle and haemocytes. A total of 24,939 clones were sequenced from these libraries generating 18,788 high-quality ESTs which were assembled into 2,446 overlapping clusters and 4,666 singletons resulting in a total of 7,112 non-redundant sequences. In particular, a high-quality normalized cDNA library (Nor01) was constructed as determined by the high rate of gene discovery (65.6%). Bioinformatic screening of the non-redundant M. galloprovincialis sequences identified 159 microsatellite-containing ESTs. Clusters, consensuses, related similarities and gene ontology searches have been organized in a dedicated, searchable database . Conclusion We defined the first species-specific catalogue of M. galloprovincialis ESTs including 7,112 unique transcribed sequences. Putative microsatellite markers were identified. This annotated catalogue represents a valuable platform for expression studies, marker validation and genetic linkage analysis for investigations in the biology of Mediterranean mussels. PMID:19203376
Identification, Classification and Differential Expression of Oleosin Genes in Tung Tree (Vernicia fordii)

PubMed Central

Cao, Heping; Zhang, Lin; Tan, Xiaofeng; Long, Hongxu; Shockey, Jay M.

2014-01-01

Triacylglycerols (TAG) are the major molecules of energy storage in eukaryotes. TAG are packed in subcellular structures called oil bodies or lipid droplets. Oleosins (OLE) are the major proteins in plant oil bodies. Multiple isoforms of OLE are present in plants such as tung tree (Vernicia fordii), whose seeds are rich in novel TAG with a wide range of industrial applications. The objectives of this study were to identify OLE genes, classify OLE proteins and analyze OLE gene expression in tung trees. We identified five tung tree OLE genes coding for small hydrophobic proteins. Genome-wide phylogenetic analysis and multiple sequence alignment demonstrated that the five tung OLE genes represented the five OLE subfamilies and all contained the “proline knot” motif (PX5SPX3P) shared among 65 OLE from 19 tree species, including the sequenced genomes of Prunus persica (peach), Populus trichocarpa (poplar), Ricinus communis (castor bean), Theobroma cacao (cacao) and Vitis vinifera (grapevine). Tung OLE1, OLE2 and OLE3 belong to the S type and OLE4 and OLE5 belong to the SM type of Arabidopsis OLE. TaqMan and SYBR Green qPCR methods were used to study the differential expression of OLE genes in tung tree tissues. Expression results demonstrated that 1) All five OLE genes were expressed in developing tung seeds, leaves and flowers; 2) OLE mRNA levels were much higher in seeds than leaves or flowers; 3) OLE1, OLE2 and OLE3 genes were expressed in tung seeds at much higher levels than OLE4 and OLE5 genes; 4) OLE mRNA levels rapidly increased during seed development; and 5) OLE gene expression was well-coordinated with tung oil accumulation in the seeds. These results suggest that tung OLE genes 1–3 probably play major roles in tung oil accumulation and/or oil body development. Therefore, they might be preferred targets for tung oil engineering in transgenic plants. PMID:24516650
Identification, classification and differential expression of oleosin genes in tung tree (Vernicia fordii).

PubMed

Cao, Heping; Zhang, Lin; Tan, Xiaofeng; Long, Hongxu; Shockey, Jay M

2014-01-01

Triacylglycerols (TAG) are the major molecules of energy storage in eukaryotes. TAG are packed in subcellular structures called oil bodies or lipid droplets. Oleosins (OLE) are the major proteins in plant oil bodies. Multiple isoforms of OLE are present in plants such as tung tree (Vernicia fordii), whose seeds are rich in novel TAG with a wide range of industrial applications. The objectives of this study were to identify OLE genes, classify OLE proteins and analyze OLE gene expression in tung trees. We identified five tung tree OLE genes coding for small hydrophobic proteins. Genome-wide phylogenetic analysis and multiple sequence alignment demonstrated that the five tung OLE genes represented the five OLE subfamilies and all contained the "proline knot" motif (PX5SPX3P) shared among 65 OLE from 19 tree species, including the sequenced genomes of Prunus persica (peach), Populus trichocarpa (poplar), Ricinus communis (castor bean), Theobroma cacao (cacao) and Vitis vinifera (grapevine). Tung OLE1, OLE2 and OLE3 belong to the S type and OLE4 and OLE5 belong to the SM type of Arabidopsis OLE. TaqMan and SYBR Green qPCR methods were used to study the differential expression of OLE genes in tung tree tissues. Expression results demonstrated that 1) All five OLE genes were expressed in developing tung seeds, leaves and flowers; 2) OLE mRNA levels were much higher in seeds than leaves or flowers; 3) OLE1, OLE2 and OLE3 genes were expressed in tung seeds at much higher levels than OLE4 and OLE5 genes; 4) OLE mRNA levels rapidly increased during seed development; and 5) OLE gene expression was well-coordinated with tung oil accumulation in the seeds. These results suggest that tung OLE genes 1-3 probably play major roles in tung oil accumulation and/or oil body development. Therefore, they might be preferred targets for tung oil engineering in transgenic plants.
A tandem affinity purification tag of TGA2 for isolation of interacting proteins in Arabidopsis thaliana

PubMed Central

Stotz, Henrik U; Findling, Simone; Nukarinen, Ella; Weckwerth, Wolfram; Mueller, Martin J; Berger, Susanne

2014-01-01

Tandem affinity purification (TAP) tagging provides a powerful tool for isolating interacting proteins in vivo. TAP-tag purification offers particular advantages for the identification of stimulus-induced protein interactions. Type II bZIP transcription factors (TGA2, TGA5 and TGA6) play key roles in pathways that control salicylic acid, ethylene, xenobiotic and reactive oxylipin signaling. Although proteins interacting with these transcription factors have been identified through genetic and yeast 2-hybrid screening, others are still elusive. We have therefore generated a C-terminal TAP-tag of TGA2 to isolate additional proteins that interact with this transcription factor. Three lines most highly expressing TAP-tagged TGA2 were functional in that they partially complemented reactive oxylipin-responsive gene expression in a tga2 tga5 tga6 triple mutant. TAP-tagged TGA2 in the most strongly overexpressing line was proteolytically less stable than in the other 2 lines. Only this overexpressing line could be used in a 2-step purification process, resulting in isolation of co-purifying bands of larger molecular weight than TGA2. TAP-tagged TGA2 was used to pull down NPR1, a protein known to interact with this transcription factor. Mass spectrometry was used to identify peptides that co-purified with TAP-tagged TGA2. Having generated this TGA2 TAP-tag line will therefore be an asset to researchers interested in stimulus-induced signal transduction processes. PMID:25482810
A genome-wide analysis of the lysophosphatidate acyltransferase (LPAAT) gene family in cotton: organization, expression, sequence variation, and association with seed oil content and fiber quality.

PubMed

Wang, Nuohan; Ma, Jianjiang; Pei, Wenfeng; Wu, Man; Li, Haijing; Li, Xingli; Yu, Shuxun; Zhang, Jinfa; Yu, Jiwen

2017-03-01

Lysophosphatidic acid acyltransferase (LPAAT) encoded by a multigene family is a rate-limiting enzyme in the Kennedy pathway in higher plants. Cotton is the most important natural fiber crop and one of the most important oilseed crops. However, little is known on genes coding for LPAATs involved in oil biosynthesis with regard to its genome organization, diversity, expression, natural genetic variation, and association with fiber development and oil content in cotton. In this study, a comprehensive genome-wide analysis in four Gossypium species with genome sequences, i.e., tetraploid G. hirsutum- AD 1 and G. barbadense- AD 2 and its possible ancestral diploids G. raimondii- D 5 and G. arboreum- A 2 , identified 13, 10, 8, and 9 LPAAT genes, respectively, that were divided into four subfamilies. RNA-seq analyses of the LPAAT genes in the widely grown G. hirsutum suggest their differential expression at the transcriptional level in developing cottonseeds and fibers. Although 10 LPAAT genes were co-localised with quantitative trait loci (QTL) for cottonseed oil or protein content within a 25-cM region, only one single strand conformation polymorphic (SSCP) marker developed from a synonymous single nucleotide polymorphism (SNP) of the At-Gh13LPAAT5 gene was significantly correlated with cottonseed oil and protein contents in one of the three field tests. Moreover, transformed yeasts using the At-Gh13LPAAT5 gene with the two sequences for the SNP led to similar results, i.e., a 25-31% increase in palmitic acid and oleic acid, and a 16-29% increase in total triacylglycerol (TAG). The results in this study demonstrated that the natural variation in the LPAAT genes to improving cottonseed oil content and fiber quality is limited; therefore, traditional cross breeding should not expect much progress in improving cottonseed oil content or fiber quality through a marker-assisted selection for the LPAAT genes. However, enhancing the expression of one of the LPAAT genes such
Haplotag: Software for Haplotype-Based Genotyping-by-Sequencing Analysis

PubMed Central

Tinker, Nicholas A.; Bekele, Wubishet A.; Hattori, Jiro

2016-01-01

Genotyping-by-sequencing (GBS), and related methods, are based on high-throughput short-read sequencing of genomic complexity reductions followed by discovery of single nucleotide polymorphisms (SNPs) within sequence tags. This provides a powerful and economical approach to whole-genome genotyping, facilitating applications in genomics, diversity analysis, and molecular breeding. However, due to the complexity of analyzing large data sets, applications of GBS may require substantial time, expertise, and computational resources. Haplotag, the novel GBS software described here, is freely available, and operates with minimal user-investment on widely available computer platforms. Haplotag is unique in fulfilling the following set of criteria: (1) operates without a reference genome; (2) can be used in a polyploid species; (3) provides a discovery mode, and a production mode; (4) discovers polymorphisms based on a model of tag-level haplotypes within sequenced tags; (5) reports SNPs as well as haplotype-based genotypes; and (6) provides an intuitive visual “passport” for each inferred locus. Haplotag is optimized for use in a self-pollinating plant species. PMID:26818073

Intronic sequences are required for AINTEGUMENTA-LIKE6 expression in Arabidopsis flowers.

PubMed

Krizek, Beth A

2015-10-12

The AINTEGUMENTA-LIKE6/PLETHORA3 (AIL6/PLT3) gene of Arabidopsis thaliana is a key regulator of growth and patterning in both shoots and roots. AIL6 encodes an AINTEGUMENTA-LIKE/PLETHORA (AIL/PLT) transcription factor that is expressed in the root stem cell niche, the peripheral region of the shoot apical meristem and young lateral organ primordia. In flowers, AIL6 acts redundantly with AINTEGUMENTA (ANT) to regulate floral organ positioning, growth, identity and patterning. Experiments were undertaken to define the genomic regions required for AIL6 function and expression in flowers. Transgenic plants expressing a copy of the coding region of AIL6 in the context of 7.7 kb of 5' sequence and 919 bp of 3' sequence (AIL6:cAIL6-3') fail to fully complement AIL6 function when assayed in the ant-4 ail6-2 double mutant background. In contrast, a genomic copy of AIL6 with the same amount of 5' and 3' sequence (AIL6:gAIL6-3') can fully complement ant-4 ail6-2. In addition, a genomic copy of AIL6 with 590 bp of 5' sequence and 919 bp of 3' sequence (AIL6m:gAIL6-3') complements ant-4 ail6-2 and contains all regulatory elements needed to confer normal AIL6 expression in inflorescences. Efforts to map cis-regulatory elements reveal that the third intron of AIL6 contains enhancer elements that confer expression in young flowers but in a broader pattern than that of AIL6 mRNA in wild-type flowers. Some AIL6:gAIL6-3' and AIL6m:gAIL6-3' lines confer an over-rescue phenotype in the ant-4 ail6-2 background that is correlated with higher levels of AIL6 mRNA accumulation. The results presented here indicate that AIL6 intronic sequences serve as transcriptional enhancer elements. In addition, the results show that increased expression of AIL6 can partially compensate for loss of ANT function in flowers.
Understanding why users tag: A survey of tagging motivation literature and results from an empirical study.

PubMed

Strohmaier, Markus; Körner, Christian; Kern, Roman

2012-12-01

While recent progress has been achieved in understanding the structure and dynamics of social tagging systems, we know little about the underlying user motivations for tagging, and how they influence resulting folksonomies and tags. This paper addresses three issues related to this question. (1) What distinctions of user motivations are identified by previous research, and in what ways are the motivations of users amenable to quantitative analysis? (2) To what extent does tagging motivation vary across different social tagging systems? (3) How does variability in user motivation influence resulting tags and folksonomies? In this paper, we present measures to detect whether a tagger is primarily motivated by categorizing or describing resources, and apply these measures to datasets from seven different tagging systems. Our results show that (a) users' motivation for tagging varies not only across, but also within tagging systems, and that (b) tag agreement among users who are motivated by categorizing resources is significantly lower than among users who are motivated by describing resources . Our findings are relevant for (1) the development of tag-based user interfaces, (2) the analysis of tag semantics and (3) the design of search algorithms for social tagging systems.
Understanding why users tag: A survey of tagging motivation literature and results from an empirical study

PubMed Central

Strohmaier, Markus; Körner, Christian; Kern, Roman

2012-01-01

While recent progress has been achieved in understanding the structure and dynamics of social tagging systems, we know little about the underlying user motivations for tagging, and how they influence resulting folksonomies and tags. This paper addresses three issues related to this question. (1) What distinctions of user motivations are identified by previous research, and in what ways are the motivations of users amenable to quantitative analysis? (2) To what extent does tagging motivation vary across different social tagging systems? (3) How does variability in user motivation influence resulting tags and folksonomies? In this paper, we present measures to detect whether a tagger is primarily motivated by categorizing or describing resources, and apply these measures to datasets from seven different tagging systems. Our results show that (a) users’ motivation for tagging varies not only across, but also within tagging systems, and that (b) tag agreement among users who are motivated by categorizing resources is significantly lower than among users who are motivated by describing resources. Our findings are relevant for (1) the development of tag-based user interfaces, (2) the analysis of tag semantics and (3) the design of search algorithms for social tagging systems. PMID:23471473
Comparison of next generation sequencing technologies for transcriptome characterization

PubMed Central

2009-01-01

Background We have developed a simulation approach to help determine the optimal mixture of sequencing methods for most complete and cost effective transcriptome sequencing. We compared simulation results for traditional capillary sequencing with "Next Generation" (NG) ultra high-throughput technologies. The simulation model was parameterized using mappings of 130,000 cDNA sequence reads to the Arabidopsis genome (NCBI Accession SRA008180.19). We also generated 454-GS20 sequences and de novo assemblies for the basal eudicot California poppy (Eschscholzia californica) and the magnoliid avocado (Persea americana) using a variety of methods for cDNA synthesis. Results The Arabidopsis reads tagged more than 15,000 genes, including new splice variants and extended UTR regions. Of the total 134,791 reads (13.8 MB), 119,518 (88.7%) mapped exactly to known exons, while 1,117 (0.8%) mapped to introns, 11,524 (8.6%) spanned annotated intron/exon boundaries, and 3,066 (2.3%) extended beyond the end of annotated UTRs. Sequence-based inference of relative gene expression levels correlated significantly with microarray data. As expected, NG sequencing of normalized libraries tagged more genes than non-normalized libraries, although non-normalized libraries yielded more full-length cDNA sequences. The Arabidopsis data were used to simulate additional rounds of NG and traditional EST sequencing, and various combinations of each. Our simulations suggest a combination of FLX and Solexa sequencing for optimal transcriptome coverage at modest cost. We have also developed ESTcalc http://fgp.huck.psu.edu/NG_Sims/ngsim.pl, an online webtool, which allows users to explore the results of this study by specifying individualized costs and sequencing characteristics. Conclusion NG sequencing technologies are a highly flexible set of platforms that can be scaled to suit different project goals. In terms of sequence coverage alone, the NG sequencing is a dramatic advance over capillary
Tag retention, growth, and survival of red swamp crayfish marked with a visible implant tag

USGS Publications Warehouse

Isely, J.J.; Stockett, P.E.

2001-01-01

Eighty juvenile (means: 42.4 mm total length, 1.6 g) red swamp crayfish Procambarus clarkii were implanted with sequentially numbered visible implant tags and held in the laboratory. Tags were injected transversely into the musculature just beneath the exoskeleton of the third abdominal segment from the cephalothorax; tags were visible upon inspection. An additional 20 crayfish were left untagged and served as controls. After 150 d, tag retention was 80% and all tags were readable. No tagged crayfish died during the study, and no differences in total length or weight were detected between tagged and control crayfish. All individuals molted at least three times during the 150-d study, and some individuals molted up to six times, suggesting that most tags would be permanently retained. The readability in the field without specialized equipment makes the visible implant tag ideal for studies of crayfish ecology, management, and culture.
Expression profiling during arabidopsis/downy mildew interaction reveals a highly-expressed effector that attenuates responses to salicylic acid.

PubMed

Asai, Shuta; Rallapalli, Ghanasyam; Piquerez, Sophie J M; Caillaud, Marie-Cécile; Furzer, Oliver J; Ishaque, Naveed; Wirthmueller, Lennart; Fabro, Georgina; Shirasu, Ken; Jones, Jonathan D G

2014-10-01

Plants have evolved strong innate immunity mechanisms, but successful pathogens evade or suppress plant immunity via effectors delivered into the plant cell. Hyaloperonospora arabidopsidis (Hpa) causes downy mildew on Arabidopsis thaliana, and a genome sequence is available for isolate Emoy2. Here, we exploit the availability of genome sequences for Hpa and Arabidopsis to measure gene-expression changes in both Hpa and Arabidopsis simultaneously during infection. Using a high-throughput cDNA tag sequencing method, we reveal expression patterns of Hpa predicted effectors and Arabidopsis genes in compatible and incompatible interactions, and promoter elements associated with Hpa genes expressed during infection. By resequencing Hpa isolate Waco9, we found it evades Arabidopsis resistance gene RPP1 through deletion of the cognate recognized effector ATR1. Arabidopsis salicylic acid (SA)-responsive genes including PR1 were activated not only at early time points in the incompatible interaction but also at late time points in the compatible interaction. By histochemical analysis, we found that Hpa suppresses SA-inducible PR1 expression, specifically in the haustoriated cells into which host-translocated effectors are delivered, but not in non-haustoriated adjacent cells. Finally, we found a highly-expressed Hpa effector candidate that suppresses responsiveness to SA. As this approach can be easily applied to host-pathogen interactions for which both host and pathogen genome sequences are available, this work opens the door towards transcriptome studies in infection biology that should help unravel pathogen infection strategies and the mechanisms by which host defense responses are overcome.
Multi-Threaded DNA Tag/Anti-Tag Library Generator for Multi-Core Platforms

DTIC Science & Technology

2009-05-01

base pair) Watson ‐ Crick strand pairs that bind perfectly within pairs, but poorly across pairs. A variety of DNA strand hybridization metrics...AFRL-RI-RS-TR-2009-131 Final Technical Report May 2009 MULTI-THREADED DNA TAG/ANTI-TAG LIBRARY GENERATOR FOR MULTI-CORE PLATFORMS...TYPE Final 3. DATES COVERED (From - To) Jun 08 – Feb 09 4. TITLE AND SUBTITLE MULTI-THREADED DNA TAG/ANTI-TAG LIBRARY GENERATOR FOR MULTI-CORE
Systematic gene tagging using CRISPR/Cas9 in human stem cells to illuminate cell organization

PubMed Central

Roberts, Brock; Haupt, Amanda; Tucker, Andrew; Grancharova, Tanya; Arakaki, Joy; Fuqua, Margaret A.; Nelson, Angelique; Hookway, Caroline; Ludmann, Susan A.; Mueller, Irina A.; Yang, Ruian; Horwitz, Rick; Rafelski, Susanne M.; Gunawardane, Ruwanthi N.

2017-01-01

We present a CRISPR/Cas9 genome-editing strategy to systematically tag endogenous proteins with fluorescent tags in human induced pluripotent stem cells (hiPSC). To date, we have generated multiple hiPSC lines with monoallelic green fluorescent protein tags labeling 10 proteins representing major cellular structures. The tagged proteins include alpha tubulin, beta actin, desmoplakin, fibrillarin, nuclear lamin B1, nonmuscle myosin heavy chain IIB, paxillin, Sec61 beta, tight junction protein ZO1, and Tom20. Our genome-editing methodology using Cas9/crRNA ribonuclear protein and donor plasmid coelectroporation, followed by fluorescence-based enrichment of edited cells, typically resulted in <0.1–4% homology-directed repair (HDR). Twenty-five percent of clones generated from each edited population were precisely edited. Furthermore, 92% (36/39) of expanded clonal lines displayed robust morphology, genomic stability, expression and localization of the tagged protein to the appropriate subcellular structure, pluripotency-marker expression, and multilineage differentiation. It is our conclusion that, if cell lines are confirmed to harbor an appropriate gene edit, pluripotency, differentiation potential, and genomic stability are typically maintained during the clonal line–generation process. The data described here reveal general trends that emerged from this systematic gene-tagging approach. Final clonal lines corresponding to each of the 10 cellular structures are now available to the research community. PMID:28814507
The recombinant expression and activity detection of MAF-1 fusion protein.

PubMed

Fu, Ping; Wu, Jianwei; Gao, Song; Guo, Guo; Zhang, Yong; Liu, Jian

2015-10-01

This study establishes the recombinant expression system of MAF-1 (Musca domestica antifungal peptide-1) and demonstrates the antifungal activity of the expression product and shows the relationship between biological activity and structure. The gene segments on mature peptide part of MAF-1 were cloned, based on the primers designed according to the cDNA sequence of MAF-1. We constructed the recombinant prokaryotic expression plasmid using prokaryotic expression vector (pET-28a(+)) and converted it to the competent cell of BL21(DE3) to gain recombinant MAF-1 fusion protein with His tag sequence through purifying affinity chromatographic column of Ni-NTA. To conduct the Western Blotting test, recombinant MAF-1 fusion protein was used to produce the polyclonal antibody of rat. The antifungal activity of the expression product was detected using Candida albicans (ATCC10231) as the indicator. The MAF-1 recombinant fusion protein was purified to exhibit obvious antifungal activity, which lays the foundation for the further study of MAF-1 biological activity, the relationship between structure and function, as well as control of gene expression.
TaGS5-3A, a grain size gene selected during wheat improvement for larger kernel and yield.

PubMed

Ma, Lin; Li, Tian; Hao, Chenyang; Wang, Yuquan; Chen, Xinhong; Zhang, Xueyong

2016-05-01

Grain size is a dominant component of grain weight in cereals. Earlier studies have shown that OsGS5 plays a major role in regulating both grain size and weight in rice via promotion of cell division. In this study, we isolated TaGS5 homoeologues in wheat and mapped them on chromosomes 3A, 3B and 3D. Temporal and spatial expression analysis showed that TaGS5 homoeologues were preferentially expressed in young spikes and developing grains. Two alleles of TaGS5-3A, TaGS5-3A-T and TaGS5-3A-G were identified in wheat accessions, and a functional marker was developed to discriminate them. Association analysis revealed that TaGS5-3A-T was significantly correlated with larger grain size and higher thousand kernel weight. Biochemical assays showed that TaGS5-3A-T possesses a higher enzymatic activity than TaGS5-3A-G. Transgenic rice lines overexpressing TaGS5-3A-T also exhibited larger grain size and higher thousand kernel weight than TaGS5-3A-G lines, and the transcript levels of cell cycle-related genes in TaGS5-3A-T lines were higher than those in TaGS5-3A-G lines. Furthermore, systematic evolution analysis in diploid, tetraploid and hexaploid wheat showed that TaGS5-3A underwent strong artificial selection during wheat polyploidization events and the frequency changes of two alleles demonstrated that TaGS5-3A-T was favoured in global modern wheat cultivars. These results suggest that TaGS5-3A is a positive regulator of grain size and its favoured allele TaGS5-3A-T exhibits a larger potential application in wheat high-yield breeding. © 2015 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.
Compressing DNA sequence databases with coil.

PubMed

White, W Timothy J; Hendy, Michael D

2008-05-20

Publicly available DNA sequence databases such as GenBank are large, and are growing at an exponential rate. The sheer volume of data being dealt with presents serious storage and data communications problems. Currently, sequence data is usually kept in large "flat files," which are then compressed using standard Lempel-Ziv (gzip) compression - an approach which rarely achieves good compression ratios. While much research has been done on compressing individual DNA sequences, surprisingly little has focused on the compression of entire databases of such sequences. In this study we introduce the sequence database compression software coil. We have designed and implemented a portable software package, coil, for compressing and decompressing DNA sequence databases based on the idea of edit-tree coding. coil is geared towards achieving high compression ratios at the expense of execution time and memory usage during compression - the compression time represents a "one-off investment" whose cost is quickly amortised if the resulting compressed file is transmitted many times. Decompression requires little memory and is extremely fast. We demonstrate a 5% improvement in compression ratio over state-of-the-art general-purpose compression tools for a large GenBank database file containing Expressed Sequence Tag (EST) data. Finally, coil can efficiently encode incremental additions to a sequence database. coil presents a compelling alternative to conventional compression of flat files for the storage and distribution of DNA sequence databases having a narrow distribution of sequence lengths, such as EST data. Increasing compression levels for databases having a wide distribution of sequence lengths is a direction for future work.
Antenna for passive RFID tags

NASA Astrophysics Data System (ADS)

Schiopu, Paul; Manea, Adrian; Cristea, Ionica; Grosu, Neculai; Vladescu, Marian; Craciun, Anca-Ileana; Craciun, Alexandru

2015-02-01

Minuscule devices, called RFID tags are attached to objects and persons and emit information which positioned readers may capture wirelessly. Many methods of identification have been used, but that of most common is to use a unique serial number for identification of person or object. RFID tags can be characterized as either active or passive [1,2]. Traditional passive tags are typically in "sleep" state until awakened by the reader's emitted field. In passive tags, the reader's field acts to charge the capacitor that powers the badge and this can be a combination of antenna and barcodes obtained with SAW( Surface Acoustic Wave) devices [1,2,3] . The antenna in an RFID tag is a conductive element that permits the tag to exchange data with the reader. The paper contribution are targeted to antenna for passive RFID tags. The electromagnetic field generated by the reader is somehow oriented by the reader antenna and power is induced in the tag only if the orientation of the tag antenna is appropriate. A tag placed orthogonal to the reader yield field will not be read. This is the reason that guided manufacturers to build circular polarized antenna capable of propagating a field that is alternatively polarized on all planes passing on the diffusion axis. Passive RFID tags are operated at the UHF frequencies of 868MHz (Europe) and 915MHz (USA) and at the microwave frequencies of 2,45 GHz and 5,8 GHz . Because the tags are small dimensions, in paper, we present the possibility to use circular polarization microstrip antenna with fractal edge [2].
Forskolin-induced apical membrane insertion of virally expressed, epitope-tagged CFTR in polarized MDCK cells.

PubMed

Howard, M; Jiang, X; Stolz, D B; Hill, W G; Johnson, J A; Watkins, S C; Frizzell, R A; Bruton, C M; Robbins, P D; Weisz, O A

2000-08-01

Channel gating of the cystic fibrosis transmembrane conductance regulator (CFTR) is activated in response to cAMP stimulation. In addition, CFTR activation may also involve rapid insertion of a subapical pool of CFTR into the plasma membrane (PM). However, this issue has been controversial, in part because of the difficulty in distinguishing cell surface vs. intracellular CFTR. Recently, a fully functional, epitope-tagged form of CFTR (M2-901/CFTR) that can be detected immunologically in nonpermeabilized cells was characterized (Howard M, Duvall MD, Devor DC, Dong J-Y, Henze K, and Frizzell RA. Am J Physiol Cell Physiol 269: C1565-C1576, 1995; and Schultz BD, Takahashi A, Liu C, Frizzell RA, and Howard M. Am J Physiol Cell Physiol 273: C2080-C2089, 1997). We have developed replication-defective recombinant adenoviruses that express M2-901/CFTR and used them to probe cell surface CFTR in forskolin (FSK)-stimulated polarized Madin-Darby canine kidney (MDCK) cells. Virally expressed M2-901/CFTR was functional and was readily detected on the apical surface of FSK-stimulated polarized MDCK cells. Interestingly, at low multiplicity of infection, we observed FSK-stimulated insertion of M2901/CFTR into the apical PM, whereas at higher M2-901/CFTR expression levels, no increase in surface expression was detected using indirect immunofluorescence. Immunoelectron microscopy of unstimulated and FSK-stimulated cells confirmed the M2-901/CFTR redistribution to the PM upon FSK stimulation and demonstrates that the apically inserted M2-901/CFTR originates from a population of subapical vesicles. Our observations may reconcile previous conflicting reports regarding the effect of cAMP stimulation on CFTR trafficking.
Database-independent Protein Sequencing (DiPS) Enables Full-length de Novo Protein and Antibody Sequence Determination.

PubMed

Savidor, Alon; Barzilay, Rotem; Elinger, Dalia; Yarden, Yosef; Lindzen, Moshit; Gabashvili, Alexandra; Adiv Tal, Ophir; Levin, Yishai

2017-06-01

Traditional "bottom-up" proteomic approaches use proteolytic digestion, LC-MS/MS, and database searching to elucidate peptide identities and their parent proteins. Protein sequences absent from the database cannot be identified, and even if present in the database, complete sequence coverage is rarely achieved even for the most abundant proteins in the sample. Thus, sequencing of unknown proteins such as antibodies or constituents of metaproteomes remains a challenging problem. To date, there is no available method for full-length protein sequencing, independent of a reference database, in high throughput. Here, we present Database-independent Protein Sequencing, a method for unambiguous, rapid, database-independent, full-length protein sequencing. The method is a novel combination of non-enzymatic, semi-random cleavage of the protein, LC-MS/MS analysis, peptide de novo sequencing, extraction of peptide tags, and their assembly into a consensus sequence using an algorithm named "Peptide Tag Assembler." As proof-of-concept, the method was applied to samples of three known proteins representing three size classes and to a previously un-sequenced, clinically relevant monoclonal antibody. Excluding leucine/isoleucine and glutamic acid/deamidated glutamine ambiguities, end-to-end full-length de novo sequencing was achieved with 99-100% accuracy for all benchmarking proteins and the antibody light chain. Accuracy of the sequenced antibody heavy chain, including the entire variable region, was also 100%, but there was a 23-residue gap in the constant region sequence. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Expanding the versatility of phage display I: efficient display of peptide-tags on protein VII of the filamentous phage.

PubMed

Løset, Geir Åge; Bogen, Bjarne; Sandlie, Inger

2011-02-24

Phage display is a platform for selection of specific binding molecules and this is a clear-cut motivation for increasing its performance. Polypeptides are normally displayed as fusions to the major coat protein VIII (pVIII), or the minor coat protein III (pIII). Display on other coat proteins such as pVII allows for display of heterologous peptide sequences on the virions in addition to those displayed on pIII and pVIII. In addition, pVII display is an alternative to pIII or pVIII display. Here we demonstrate how standard pIII or pVIII display phagemids are complemented with a helper phage which supports production of virions that are tagged with octa FLAG, HIS(6) or AviTag on pVII. The periplasmic signal sequence required for pIII and pVIII display, and which has been added to pVII in earlier studies, is omitted altogether. Tagging on pVII is an important and very useful add-on feature to standard pIII and pVII display. Any phagemid bearing a protein of interest on either pIII or pVIII can be tagged with any of the tags depending simply on choice of helper phage. We show in this paper how such tags may be utilized for immobilization and separation as well as purification and detection of monoclonal and polyclonal phage populations.
Improving RNA-Seq expression estimation by modeling isoform- and exon-specific read sequencing rate.

PubMed

Liu, Xuejun; Shi, Xinxin; Chen, Chunlin; Zhang, Li

2015-10-16

The high-throughput sequencing technology, RNA-Seq, has been widely used to quantify gene and isoform expression in the study of transcriptome in recent years. Accurate expression measurement from the millions or billions of short generated reads is obstructed by difficulties. One is ambiguous mapping of reads to reference transcriptome caused by alternative splicing. This increases the uncertainty in estimating isoform expression. The other is non-uniformity of read distribution along the reference transcriptome due to positional, sequencing, mappability and other undiscovered sources of biases. This violates the uniform assumption of read distribution for many expression calculation approaches, such as the direct RPKM calculation and Poisson-based models. Many methods have been proposed to address these difficulties. Some approaches employ latent variable models to discover the underlying pattern of read sequencing. However, most of these methods make bias correction based on surrounding sequence contents and share the bias models by all genes. They therefore cannot estimate gene- and isoform-specific biases as revealed by recent studies. We propose a latent variable model, NLDMseq, to estimate gene and isoform expression. Our method adopts latent variables to model the unknown isoforms, from which reads originate, and the underlying percentage of multiple spliced variants. The isoform- and exon-specific read sequencing biases are modeled to account for the non-uniformity of read distribution, and are identified by utilizing the replicate information of multiple lanes of a single library run. We employ simulation and real data to verify the performance of our method in terms of accuracy in the calculation of gene and isoform expression. Results show that NLDMseq obtains competitive gene and isoform expression compared to popular alternatives. Finally, the proposed method is applied to the detection of differential expression (DE) to show its usefulness in the
GSyellow, a Multifaceted Tag for Functional Protein Analysis in Monocot and Dicot Plants.

PubMed

Besbrugge, Nienke; Van Leene, Jelle; Eeckhout, Dominique; Cannoot, Bernard; Kulkarni, Shubhada R; De Winne, Nancy; Persiau, Geert; Van De Slijke, Eveline; Bontinck, Michiel; Aesaert, Stijn; Impens, Francis; Gevaert, Kris; Van Damme, Daniel; Van Lijsebettens, Mieke; Inzé, Dirk; Vandepoele, Klaas; Nelissen, Hilde; De Jaeger, Geert

2018-06-01

The ability to tag proteins has boosted the emergence of generic molecular methods for protein functional analysis. Fluorescent protein tags are used to visualize protein localization, and affinity tags enable the mapping of molecular interactions by, for example, tandem affinity purification or chromatin immunoprecipitation. To apply these widely used molecular techniques on a single transgenic plant line, we developed a multifunctional tandem affinity purification tag, named GS yellow , which combines the streptavidin-binding peptide tag with citrine yellow fluorescent protein. We demonstrated the versatility of the GS yellow tag in the dicot Arabidopsis ( Arabidopsis thaliana ) using a set of benchmark proteins. For proof of concept in monocots, we assessed the localization and dynamic interaction profile of the leaf growth regulator ANGUSTIFOLIA3 (AN3), fused to the GS yellow tag, along the growth zone of the maize ( Zea mays ) leaf. To further explore the function of ZmAN3, we mapped its DNA-binding landscape in the growth zone of the maize leaf through chromatin immunoprecipitation sequencing. Comparison with AN3 target genes mapped in the developing maize tassel or in Arabidopsis cell cultures revealed strong conservation of AN3 target genes between different maize tissues and across monocots and dicots, respectively. In conclusion, the GS yellow tag offers a powerful molecular tool for distinct types of protein functional analyses in dicots and monocots. As this approach involves transforming a single construct, it is likely to accelerate both basic and translational plant research. © 2018 American Society of Plant Biologists. All rights reserved.
Expression and Stability of Foreign Epitopes Introduced into 3A Nonstructural Protein of Foot-and-Mouth Disease Virus

PubMed Central

Li, Pinghua; Bai, Xingwen; Cao, Yimei; Han, Chenghao; Lu, Zengjun; Sun, Pu; Yin, Hong; Liu, Zaixin

2012-01-01

Foot-and-mouth disease virus (FMDV) is an aphthovirus that belongs to the Picornaviridae family and causes one of the most important animal diseases worldwide. The capacity of other picornaviruses to express foreign antigens has been extensively reported, however, little is known about FMDV. To explore the potential of FMDV as a viral vector, an 11-amino-acid (aa) HSV epitope and an 8 aa FLAG epitope were introduced into the C-terminal different regions of 3A protein of FMDV full-length infectious cDNA clone. Recombinant viruses expressing the HSV or FLAG epitope were successfully rescued after transfection of both modified constructs. Immunofluorescence assay, Western blot and sequence analysis showed that the recombinant viruses stably maintained the foreign epitopes even after 11 serial passages in BHK-21 cells. The 3A-tagged viruses shared similar plaque phenotypes and replication kinetics to those of the parental virus. In addition, mice experimentally infected with the epitope-tagged viruses could induce tag-specific antibodies. Our results demonstrate that FMDV can be used effectively as a viral vector for the delivery of foreign tags. PMID:22848509
Gene expression profiling of adult female tissues in feeding Rhipicephalus microplus cattle ticks.

PubMed

Stutzer, Christian; van Zyl, Willem A; Olivier, Nicholas A; Richards, Sabine; Maritz-Olivier, Christine

2013-06-01

The southern cattle tick, Rhipicephalus microplus, is an economically important pest, especially for resource-poor countries, both as a highly adaptive invasive species and prominent vector of disease. The increasing prevalence of resistance to chemical acaricides and variable efficacy of current tick vaccine candidates highlight the need for more effective control methods. In the absence of a fully annotated genome, the wealth of available expressed sequence tag sequence data for this species presents a unique opportunity to study the genes that are expressed in tissues involved in blood meal acquisition, digestion and reproduction during feeding. Utilising a custom oligonucleotide microarray designed from available singletons (BmiGI Version 2.1) and expressed sequence tag sequences of R. microplus, the expression profiles in feeding adult female midgut, salivary glands and ovarian tissues were compared. From 13,456 assembled transcripts, 588 genes expressed in all three tissues were identified from fed adult females 20 days post infestation. The greatest complement of genes relate to translation and protein turnover. Additionally, a number of unique transcripts were identified for each tissue that relate well to their respective physiological/biological function/role(s). These transcripts include secreted anti-hemostatics and defense proteins from the salivary glands for acquisition of a blood meal, proteases as well as enzymes and transporters for digestion and nutrient acquisition from ingested blood in the midgut, and finally proteins and associated factors involved in DNA replication and cell-cycle control for oogenesis in the ovaries. Comparative analyses of adult female tissues during feeding enabled the identification of a catalogue of transcripts that may be essential for successful feeding and reproduction in the cattle tick, R. microplus. Future studies will increase our understanding of basic tick biology, allowing the identification of shared proteins
RAMTaB: Robust Alignment of Multi-Tag Bioimages

PubMed Central

Raza, Shan-e-Ahmed; Humayun, Ahmad; Abouna, Sylvie; Nattkemper, Tim W.; Epstein, David B. A.; Khan, Michael; Rajpoot, Nasir M.

2012-01-01

Background In recent years, new microscopic imaging techniques have evolved to allow us to visualize several different proteins (or other biomolecules) in a visual field. Analysis of protein co-localization becomes viable because molecules can interact only when they are located close to each other. We present a novel approach to align images in a multi-tag fluorescence image stack. The proposed approach is applicable to multi-tag bioimaging systems which (a) acquire fluorescence images by sequential staining and (b) simultaneously capture a phase contrast image corresponding to each of the fluorescence images. To the best of our knowledge, there is no existing method in the literature, which addresses simultaneous registration of multi-tag bioimages and selection of the reference image in order to maximize the overall overlap between the images. Methodology/Principal Findings We employ a block-based method for registration, which yields a confidence measure to indicate the accuracy of our registration results. We derive a shift metric in order to select the Reference Image with Maximal Overlap (RIMO), in turn minimizing the total amount of non-overlapping signal for a given number of tags. Experimental results show that the Robust Alignment of Multi-Tag Bioimages (RAMTaB) framework is robust to variations in contrast and illumination, yields sub-pixel accuracy, and successfully selects the reference image resulting in maximum overlap. The registration results are also shown to significantly improve any follow-up protein co-localization studies. Conclusions For the discovery of protein complexes and of functional protein networks within a cell, alignment of the tag images in a multi-tag fluorescence image stack is a key pre-processing step. The proposed framework is shown to produce accurate alignment results on both real and synthetic data. Our future work will use the aligned multi-channel fluorescence image data for normal and diseased tissue specimens to

Method for designing gas tag compositions

DOEpatents

Gross, Kenny C.

1995-01-01

For use in the manufacture of gas tags such as employed in a nuclear reactor gas tagging failure detection system, a method for designing gas tagging compositions utilizes an analytical approach wherein the final composition of a first canister of tag gas as measured by a mass spectrometer is designated as node #1. Lattice locations of tag nodes in multi-dimensional space are then used in calculating the compositions of a node #2 and each subsequent node so as to maximize the distance of each node from any combination of tag components which might be indistinguishable from another tag composition in a reactor fuel assembly. Alternatively, the measured compositions of tag gas numbers 1 and 2 may be used to fix the locations of nodes 1 and 2, with the locations of nodes 3-N then calculated for optimum tag gas composition. A single sphere defining the lattice locations of the tag nodes may be used to define approximately 20 tag nodes, while concentric spheres can extend the number of tag nodes to several hundred.
Method for designing gas tag compositions

DOEpatents

Gross, K.C.

1995-04-11

For use in the manufacture of gas tags such as employed in a nuclear reactor gas tagging failure detection system, a method for designing gas tagging compositions utilizes an analytical approach wherein the final composition of a first canister of tag gas as measured by a mass spectrometer is designated as node No. 1. Lattice locations of tag nodes in multi-dimensional space are then used in calculating the compositions of a node No. 2 and each subsequent node so as to maximize the distance of each node from any combination of tag components which might be indistinguishable from another tag composition in a reactor fuel assembly. Alternatively, the measured compositions of tag gas numbers 1 and 2 may be used to fix the locations of nodes 1 and 2, with the locations of nodes 3-N then calculated for optimum tag gas composition. A single sphere defining the lattice locations of the tag nodes may be used to define approximately 20 tag nodes, while concentric spheres can extend the number of tag nodes to several hundred. 5 figures.
Complementary DNA sequencing and identification of mRNAs from the venomous gland of Agkistrodon piscivorus leucostoma.

PubMed

Jia, Ying; Cantu, Bruno A; Sánchez, Elda E; Pérez, John C

2008-06-15

To advance our knowledge on the snake venom composition and transcripts expressed in venom gland at the molecular level, we constructed a cDNA library from the venom gland of Agkistrodon piscivorus leucostoma for the generation of expressed sequence tags (ESTs) database. From the randomly sequenced 2112 independent clones, we have obtained ESTs for 1309 (62%) cDNAs, which showed significant deduced amino acid sequence similarity (scores >80) to previously characterized proteins in National Center for Biotechnology Information (NCBI) database. Ribosomal proteins make up 47 clones (2%) and the remaining 756 (36%) cDNAs represent either unknown identity or show BLASTX sequence identity scores of <80 with known GenBank accessions. The most highly expressed gene encoding phospholipase A(2) (PLA(2)) accounting for 35% of A. p. leucostoma venom gland cDNAs was identified and further confirmed by crude venom applied to sodium dodecyl sulfate/polyacrylamide gel electrophoresis (SDS-PAGE) electrophoresis and protein sequencing. A total of 180 representative genes were obtained from the sequence assemblies and deposited to EST database. Clones showing sequence identity to disintegrins, thrombin-like enzymes, hemorrhagic toxins, fibrinogen clotting inhibitors and plasminogen activators were also identified in our EST database. These data can be used to develop a research program that will help us identify genes encoding proteins that are of medical importance or proteins involved in the mechanisms of the toxin venom.
WebTag: Web browsing into sensor tags over NFC.

PubMed

Echevarria, Juan Jose; Ruiz-de-Garibay, Jonathan; Legarda, Jon; Alvarez, Maite; Ayerbe, Ana; Vazquez, Juan Ignacio

2012-01-01

Information and Communication Technologies (ICTs) continue to overcome many of the challenges related to wireless sensor monitoring, such as for example the design of smarter embedded processors, the improvement of the network architectures, the development of efficient communication protocols or the maximization of the life cycle autonomy. This work tries to improve the communication link of the data transmission in wireless sensor monitoring. The upstream communication link is usually based on standard IP technologies, but the downstream side is always masked with the proprietary protocols used for the wireless link (like ZigBee, Bluetooth, RFID, etc.). This work presents a novel solution (WebTag) for a direct IP based access to a sensor tag over the Near Field Communication (NFC) technology for secure applications. WebTag allows a direct web access to the sensor tag by means of a standard web browser, it reads the sensor data, configures the sampling rate and implements IP based security policies. It is, definitely, a new step towards the evolution of the Internet of Things paradigm.
WebTag: Web Browsing into Sensor Tags over NFC

PubMed Central

Echevarria, Juan Jose; Ruiz-de-Garibay, Jonathan; Legarda, Jon; Álvarez, Maite; Ayerbe, Ana; Vazquez, Juan Ignacio

2012-01-01

Information and Communication Technologies (ICTs) continue to overcome many of the challenges related to wireless sensor monitoring, such as for example the design of smarter embedded processors, the improvement of the network architectures, the development of efficient communication protocols or the maximization of the life cycle autonomy. This work tries to improve the communication link of the data transmission in wireless sensor monitoring. The upstream communication link is usually based on standard IP technologies, but the downstream side is always masked with the proprietary protocols used for the wireless link (like ZigBee, Bluetooth, RFID, etc.). This work presents a novel solution (WebTag) for a direct IP based access to a sensor tag over the Near Field Communication (NFC) technology for secure applications. WebTag allows a direct web access to the sensor tag by means of a standard web browser, it reads the sensor data, configures the sampling rate and implements IP based security policies. It is, definitely, a new step towards the evolution of the Internet of Things paradigm. PMID:23012511
Efficient experimental design and analysis strategies for the detection of differential expression using RNA-Sequencing

PubMed Central

2012-01-01

Background RNA sequencing (RNA-Seq) has emerged as a powerful approach for the detection of differential gene expression with both high-throughput and high resolution capabilities possible depending upon the experimental design chosen. Multiplex experimental designs are now readily available, these can be utilised to increase the numbers of samples or replicates profiled at the cost of decreased sequencing depth generated per sample. These strategies impact on the power of the approach to accurately identify differential expression. This study presents a detailed analysis of the power to detect differential expression in a range of scenarios including simulated null and differential expression distributions with varying numbers of biological or technical replicates, sequencing depths and analysis methods. Results Differential and non-differential expression datasets were simulated using a combination of negative binomial and exponential distributions derived from real RNA-Seq data. These datasets were used to evaluate the performance of three commonly used differential expression analysis algorithms and to quantify the changes in power with respect to true and false positive rates when simulating variations in sequencing depth, biological replication and multiplex experimental design choices. Conclusions This work quantitatively explores comparisons between contemporary analysis tools and experimental design choices for the detection of differential expression using RNA-Seq. We found that the DESeq algorithm performs more conservatively than edgeR and NBPSeq. With regard to testing of various experimental designs, this work strongly suggests that greater power is gained through the use of biological replicates relative to library (technical) replicates and sequencing depth. Strikingly, sequencing depth could be reduced as low as 15% without substantial impacts on false positive or true positive rates. PMID:22985019
Efficient experimental design and analysis strategies for the detection of differential expression using RNA-Sequencing.

PubMed

Robles, José A; Qureshi, Sumaira E; Stephen, Stuart J; Wilson, Susan R; Burden, Conrad J; Taylor, Jennifer M

2012-09-17

RNA sequencing (RNA-Seq) has emerged as a powerful approach for the detection of differential gene expression with both high-throughput and high resolution capabilities possible depending upon the experimental design chosen. Multiplex experimental designs are now readily available, these can be utilised to increase the numbers of samples or replicates profiled at the cost of decreased sequencing depth generated per sample. These strategies impact on the power of the approach to accurately identify differential expression. This study presents a detailed analysis of the power to detect differential expression in a range of scenarios including simulated null and differential expression distributions with varying numbers of biological or technical replicates, sequencing depths and analysis methods. Differential and non-differential expression datasets were simulated using a combination of negative binomial and exponential distributions derived from real RNA-Seq data. These datasets were used to evaluate the performance of three commonly used differential expression analysis algorithms and to quantify the changes in power with respect to true and false positive rates when simulating variations in sequencing depth, biological replication and multiplex experimental design choices. This work quantitatively explores comparisons between contemporary analysis tools and experimental design choices for the detection of differential expression using RNA-Seq. We found that the DESeq algorithm performs more conservatively than edgeR and NBPSeq. With regard to testing of various experimental designs, this work strongly suggests that greater power is gained through the use of biological replicates relative to library (technical) replicates and sequencing depth. Strikingly, sequencing depth could be reduced as low as 15% without substantial impacts on false positive or true positive rates.
An Overview of Enzymatic Reagents for the Removal of Affinity Tags

PubMed Central

Waugh, David S.

2011-01-01

Although they are often exploited to facilitate the expression and purification of recombinant proteins, every affinity tag, whether large or small, has the potential to interfere with the structure and function of its fusion partner. For this reason, reliable methods for removing affinity tags are needed. Only enzymes have the requisite specificity to be generally useful reagents for this purpose. In this review, the advantages and disadvantages of some commonly used endo- and exoproteases are discussed in light of the latest information. PMID:21871965
Association of ESR1 gene tagging SNPs with breast cancer risk

PubMed Central

Dunning, Alison M.; Healey, Catherine S.; Baynes, Caroline; Maia, Ana-Teresa; Scollen, Serena; Vega, Ana; Rodríguez, Raquel; Barbosa-Morais, Nuno L.; Ponder, Bruce A.J.; Low, Yen-Ling; Bingham, Sheila; Haiman, Christopher A.; Le Marchand, Loic; Broeks, Annegien; Schmidt, Marjanka K.; Hopper, John; Southey, Melissa; Beckmann, Matthias W.; Fasching, Peter A.; Peto, Julian; Johnson, Nichola; Bojesen, Stig E.; Nordestgaard, Børge; Milne, Roger L.; Benitez, Javier; Hamann, Ute; Ko, Yon; Schmutzler, Rita K.; Burwinkel, Barbara; Schürmann, Peter; Dörk, Thilo; Heikkinen, Tuomas; Nevanlinna, Heli; Lindblom, Annika; Margolin, Sara; Mannermaa, Arto; Kosma, Veli-Matti; Chen, Xiaoqing; Spurdle, Amanda; Change-Claude, Jenny; Flesch-Janys, Dieter; Couch, Fergus J.; Olson, Janet E.; Severi, Gianluca; Baglietto, Laura; Børresen-Dale, Anne-Lise; Kristensen, Vessela; Hunter, David J.; Hankinson, Susan E.; Devilee, Peter; Vreeswijk, Maaike; Lissowska, Jolanta; Brinton, Louise; Liu, Jianjun; Hall, Per; Kang, Daehee; Yoo, Keun-Young; Shen, Chen-Yang; Yu, Jyh-Cherng; Anton-Culver, Hoda; Ziogoas, Argyrios; Sigurdson, Alice; Struewing, Jeff; Easton, Douglas F.; Garcia-Closas, Montserrat; Humphreys, Manjeet K.; Morrison, Jonathan; Pharoah, Paul D.P.; Pooley, Karen A.; Chenevix-Trench, Georgia

2009-01-01

We have conducted a three-stage, comprehensive single nucleotide polymorphism (SNP)-tagging association study of ESR1 gene variants (SNPs) in more than 55 000 breast cancer cases and controls from studies within the Breast Cancer Association Consortium (BCAC). No large risks or highly significant associations were revealed. SNP rs3020314, tagging a region of ESR1 intron 4, is associated with an increase in breast cancer susceptibility with a dominant mode of action in European populations. Carriers of the c-allele have an odds ratio (OR) of 1.05 [95% Confidence Intervals (CI) 1.02–1.09] relative to t-allele homozygotes, P = 0.004. There is significant heterogeneity between studies, P = 0.002. The increased risk appears largely confined to oestrogen receptor-positive tumour risk. The region tagged by SNP rs3020314 contains sequence that is more highly conserved across mammalian species than the rest of intron 4, and it may subtly alter the ratio of two mRNA splice forms. PMID:19126777
Evaluation of rice tetraticopeptide domain-containing thioredoxin as a novel solubility-enhancing fusion tag in Escherichia coli.

PubMed

Xiao, Wenjun; Jiang, Li; Wang, Weiyu; Wang, Ruyue; Fan, Jun

2018-02-01

Fusion of solubility-enhancing tag is frequently used for improving soluble production of target protein in Escherichia coli. The Arabidopsis tetraticopeptide domain-containing thioredoxin (TDX) has been documented to exhibit functions of disulfide reductase, foldase chaperone, and holdase chaperone. Here, we identified that fusion of rice TDX with the smaller size increased soluble expression levels of three fluorescent proteins with different fluorophores in the E. coli strain BL21(DE3) or the Rosetta (DE3) strain with coexpression of six rare tRNAs, but decreased conformational quality of certain fluorescent proteins, as comparison with the His6-tagged ones. Among five maize proteins, the rice TDX fusion carrier displayed higher solubility-enhancing activity than the yeast SUMO3 tag toward three proteins in both E. coli strains. Five fusion constructs were cleaved with the co-expressed TEV protease variant, but the released target proteins were partly insolubly aggregated in vivo. Attachment of the His6-tag to the TDX tagged proteins had little impact on protein solubility. After Ni-NTA purification, five His6-TDX tagged proteins displayed different apparent purities. Taken together, our work presents that rice TDX tag is a novel solubility enhancer. Copyright © 2017 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
Tag loss and short-term mortality associated with passive integrated transponder tagging of juvenile Lost River suckers

USGS Publications Warehouse

Burdick, Summer M.

2011-01-01

Passive integrated transponder (PIT) tags are commonly used to mark small catostomids, but tag loss and the effect of tagging on mortality have not been assessed for juveniles of the endangered Lost River sucker Deltistes luxatus. I evaluated tag loss and short-term (34-d) mortality associated with the PIT tagging of juvenile Lost River suckers in the laboratory by using a completely randomized design and three treatment groups (PIT tagged, positive control, and control). An empty needle was inserted into each positive control fish, whereas control fish were handled but not tagged. Only one fish expelled its PIT tag. Mortality rate averaged 9.8 ± 3.4% (mean ± SD) for tagged fish; mortality was 0% for control and positive control fish. All tagging mortalities occurred in fish with standard lengths of 71 mm or less, and most of the mortalities occurred within 48 h of tagging. My results indicate that 12.45- × 2.02-mm PIT tags provide a viable method of marking juvenile Lost River suckers that are 72 mm or larger.
Sarcocystis neurona merozoites express a family of immunogenic surface antigens that are orthologues of the Toxoplasma gondii surface antigens (SAGs) and SAG-related sequences.

PubMed

Howe, Daniel K; Gaji, Rajshekhar Y; Mroz-Barrett, Meaghan; Gubbels, Marc-Jan; Striepen, Boris; Stamper, Shelby

2005-02-01

Sarcocystis neurona is a member of the Apicomplexa that causes myelitis and encephalitis in horses but normally cycles between the opossum and small mammals. Analysis of an S. neurona expressed sequence tag (EST) database revealed four paralogous proteins that exhibit clear homology to the family of surface antigens (SAGs) and SAG-related sequences of Toxoplasma gondii. The primary peptide sequences of the S. neurona proteins are consistent with the two-domain structure that has been described for the T. gondii SAGs, and each was predicted to have an amino-terminal signal peptide and a carboxyl-terminal glycolipid anchor addition site, suggesting surface localization. All four proteins were confirmed to be membrane associated and displayed on the surface of S. neurona merozoites. Due to their surface localization and homology to T. gondii surface antigens, these S. neurona proteins were designated SnSAG1, SnSAG2, SnSAG3, and SnSAG4. Consistent with their homology, the SnSAGs elicited a robust immune response in infected and immunized animals, and their conserved structure further suggests that the SnSAGs similarly serve as adhesins for attachment to host cells. Whether the S. neurona SAG family is as extensive as the T. gondii SAG family remains unresolved, but it is probable that additional SnSAGs will be revealed as more S. neurona ESTs are generated. The existence of an SnSAG family in S. neurona indicates that expression of multiple related surface antigens is not unique to the ubiquitous organism T. gondii. Instead, the SAG gene family is a common trait that presumably has an essential, conserved function(s).
Expression and purification of diacylglycerol acyltransferases

USDA-ARS?s Scientific Manuscript database

Diacylglycerol acyltransferases (DGATs) are integral membrane proteins that catalyze the last step of triacylglycerol (TAG) biosynthesis in eukaryotic organisms. Plants and animals deficient in DGATs accumulate less TAG and over-expression of DGATs increases TAG. DGAT knockout mice are resistant to ...
Characterization and Amplification of Gene-Based Simple Sequence Repeat (SSR) Markers in Date Palm.

PubMed

Zhao, Yongli; Keremane, Manjunath; Prakash, Channapatna S; He, Guohao

2017-01-01

The paucity of molecular markers limits the application of genetic and genomic research in date palm (Phoenix dactylifera L.). Availability of expressed sequence tag (EST) sequences in date palm may provide a good resource for developing gene-based markers. This study characterizes a substantial fraction of transcriptome sequences containing simple sequence repeats (SSRs) from the EST sequences in date palm. The EST sequences studied are mainly homologous to those of Elaeis guineensis and Musa acuminata. A total of 911 gene-based SSR markers, characterized with functional annotations, have provided a useful basis not only for discovering candidate genes and understanding genetic basis of traits of interest but also for developing genetic and genomic tools for molecular research in date palm, such as diversity study, quantitative trait locus (QTL) mapping, and molecular breeding. The procedures of DNA extraction, polymerase chain reaction (PCR) amplification of these gene-based SSR markers, and gel electrophoresis of PCR products are described in this chapter.
OSIRIS-REx Touch-And-Go (TAG) Mission Design and Analysis

NASA Technical Reports Server (NTRS)

Berry, Kevin; Sutter, Brian; May, Alex; Williams, Ken; Barbee, Brent W.; Beckman, Mark; Williams, Bobby

2013-01-01

The Origins Spectral Interpretation Resource Identification Security Regolith Explorer (OSIRIS-REx) mission is a NASA New Frontiers mission launching in 2016 to rendezvous with the near-Earth asteroid (101955) 1999 RQ36 in late 2018. After several months in formation with and orbit about the asteroid, OSIRIS-REx will fly a Touch-And-Go (TAG) trajectory to the asteroid s surface to obtain a regolith sample. This paper describes the mission design of the TAG sequence and the propulsive maneuvers required to achieve the trajectory. This paper also shows preliminary results of orbit covariance analysis and Monte-Carlo analysis that demonstrate the ability to arrive at a targeted location on the surface of RQ36 within a 25 meter radius with 98.3% confidence.
Survival and tag loss in Moapa White River springfish implanted with passive integrated transponder tags

USGS Publications Warehouse

Dixon, Christopher J.; Mesa, Matthew G.

2011-01-01

We monitored survival and tag loss among Moapa White River springfish Crenichthys baileyi moapae that were surgically implanted with passive integrated transponder (PIT; 9 × 2 mm) tags. The fish used in the study ranged from 40 to 67 mm in total length and from 1.0 to 6.5 g in mass; the PIT tag: body weight ratios were 1.0–6.1%. Fish were held for 41 d in live cages within a small, warm desert stream. Survival did not differ between untagged control fish (94.5%) and tagged fish (95.6%). Survival did not appear to be influenced by fish size or PIT tag: body weight ratio, but the small number of fish that died precluded a detailed analysis. Tag retention was 100% among the 86 fish that survived over the 41 d. Our results suggest that surgically implanting 9-mm PIT tags into Moapa White River springfish as small as 40 mm is an effective method for marking them because it has minimal impacts on survival and tag retention is high. More work is needed on the effects of PIT tagging on growth and other performance metrics of springfish and other small desert fishes.
Cloning and expression of antibacterial goat lactoferricin from Escherichia coli AD494(DE3)pLysS expression system.

PubMed

Chen, Gen-Hung; Yin, Li-Jung; Chiang, I-Hua; Jiang, Shann-Tzong

2008-12-01

Goat lactoferricin (GLfcin), an antibacterial peptide, is released from the N terminus of goat lactoferrin by pepsin digestion. Two GLfcin-related cDNAs, GLfcin L and GLfcin S, encoding Ala20-Ser60 and Ser36-Ser60 of goat lactoferrin, respectively, were cloned into the pET-23a(+) expression vector upstream from (His)6-Tag gene and transformed into Escherichia coli AD494(DE3)pLysS expression host. After being induced by isopropyl-beta-D-thiogalactopyranoside (IPTG), two (His)6-Tag fused recombinant lactoferricins, GLfcin L-His*Tag and GLfcin S-His*Tag, were expressed in soluble form within the E. coli cytoplasm. The GLfcin L-His*Tag and GLfcin S-His*Tag were purified using HisTrap affinity chromatography. According to an antibacterial activity assay using the agar diffusion method, GLfcin L-His*Tag had antibacterial activity against E. coli BCRC 11549, Staphylococcus aureus BCRC 25923, and Propionibacterium acnes BCRC 10723, while GLfcin S-His*Tag was able to inhibit the growth of E. coli BCRC 11549 and P. acnes BCRC 10723. These two recombinant lactoferricins behaved as thermostable peptides, which could retain their activity for up to 30 min of exposure at 100 degrees C.
Optimization of human granulocyte macrophage-colony stimulating factor (hGM-CSF) expression using asparaginase and xylanase gene's signal sequences in Escherichia coli.

PubMed

Khasa, Yogender Pal; Khushoo, Amardeep; Tapryal, Suman; Mukherjee, K J

2011-09-01

The toxicity of the recombinant protein towards the expression host remains a significant deterrent for bioprocess development. In this study, the expression of human granulocyte macrophage-colony stimulating factor (hGM-CSF), which is known to be toxic to its host, was enhanced many folds using a combination of genetic and bioprocess strategies in Escherichia coli. The N terminus attachment of endoxylanase and asparaginase signal sequences from Bacillus subtilis and E. coli, respectively, in combination with and without His-tag, considerably improved expression levels. Induction and media optimization studies in shake flask cultures resulted in a maximal hGM-CSF concentration of 365 mg/L in the form of inclusion bodies (IBs) with a specific product yield (Y (P/X)) of 120 mg/g dry cell weight in case of the asparaginase signal. Culturing the cells in nutrient rich Terrific broth maintained the specific product yields (Y (P/X)) while a 6.6-fold higher volumetric concentration of both product and biomass was obtained. The purification and refolding steps were optimized resulting in a 95% pure protein with a fairly high refolding yield of 45%. The biological activity of the refolded protein was confirmed by a cell proliferation assay on hGM-CSF dependent human erythroleukemia TF-1 cells. This study demonstrated that this indeed is a viable route for the efficient production of hGM-CSF.
Cloning and characterization of a basic Cysteine-like protease (Cathespsin L1) expressed in the gut of larval Diaprepes abbreviatus L. (Coleoptera: Curculionidae)

USDA-ARS?s Scientific Manuscript database

Diaprepes abbreviatus is an important pest that causes extensive damage to citrus in the USA. Analysis of an expressed sequence tag (EST) library from the digestive tract of larvae and adult D. abbreviatus identified cathepsins as major putative digestive enzymes. One class, sharing amino acid seque...
Multi-targeted priming for genome-wide gene expression assays.

PubMed

Adomas, Aleksandra B; Lopez-Giraldez, Francesc; Clark, Travis A; Wang, Zheng; Townsend, Jeffrey P

2010-08-17

Complementary approaches to assaying global gene expression are needed to assess gene expression in regions that are poorly assayed by current methodologies. A key component of nearly all gene expression assays is the reverse transcription of transcribed sequences that has traditionally been performed by priming the poly-A tails on many of the transcribed genes in eukaryotes with oligo-dT, or by priming RNA indiscriminately with random hexamers. We designed an algorithm to find common sequence motifs that were present within most protein-coding genes of Saccharomyces cerevisiae and of Neurospora crassa, but that were not present within their ribosomal RNA or transfer RNA genes. We then experimentally tested whether degenerately priming these motifs with multi-targeted primers improved the accuracy and completeness of transcriptomic assays. We discovered two multi-targeted primers that would prime a preponderance of genes in the genomes of Saccharomyces cerevisiae and Neurospora crassa while avoiding priming ribosomal RNA or transfer RNA. Examining the response of Saccharomyces cerevisiae to nitrogen deficiency and profiling Neurospora crassa early sexual development, we demonstrated that using multi-targeted primers in reverse transcription led to superior performance of microarray profiling and next-generation RNA tag sequencing. Priming with multi-targeted primers in addition to oligo-dT resulted in higher sensitivity, a larger number of well-measured genes and greater power to detect differences in gene expression. Our results provide the most complete and detailed expression profiles of the yeast nitrogen starvation response and N. crassa early sexual development to date. Furthermore, our multi-targeting priming methodology for genome-wide gene expression assays provides selective targeting of multiple sequences and counter-selection against undesirable sequences, facilitating a more complete and precise assay of the transcribed sequences within the genome.

[Expression of Dengue virus type 2 nonstructural protein 3 and isolation of host proteins interacting with it].

PubMed

Weng, Daihui; Lei, Yingfeng; Dong, Yangchao; Han, Peijun; Ye, Chuantao; Yang, Jing; Wang, Yuan; Yin, Wen

2015-12-01

To construct the plasmid expressing the fusion protein of Dengue virus type 2 (DENV2) nonstructural protein 3 (NS3) with affinity tag, and isolate the cellular proteins interacting with NS3 protein using tandem affinity purification (TAP) assay. Primers for amplifying NS3 gene were designed according to the sequence of DENV2 genome and chemically synthesized. The NS3 fragments, after amplified by PCR with DENV2 cDNA as template, were digested and cloned into the mammalian eukaryotic expression vector pCI-SF with the tandem affinity tag (FLAG-StrepII). The recombinant pCI-NS3-SF was transiently transformed by Lipofectamine(TM) 2000 into HEK293T cells, and the expression of the fusion protein was confirmed by Western blotting. Cellular proteins that interacted with NS3 were isolated and purified by TAP assay. The eukaryotic expression vector expressing NS3 protein was successfully constructed. The host proteins interacting with NS3 protein were isolated by TAP system. TAP is an efficient method to isolate the cellular proteins interacting with DENV2 NS3.
High-throughput purification of recombinant proteins using self-cleaving intein tags.

PubMed

Coolbaugh, M J; Shakalli Tang, M J; Wood, D W

2017-01-01

High throughput methods for recombinant protein production using E. coli typically involve the use of affinity tags for simple purification of the protein of interest. One drawback of these techniques is the occasional need for tag removal before study, which can be hard to predict. In this work, we demonstrate two high throughput purification methods for untagged protein targets based on simple and cost-effective self-cleaving intein tags. Two model proteins, E. coli beta-galactosidase (βGal) and superfolder green fluorescent protein (sfGFP), were purified using self-cleaving versions of the conventional chitin-binding domain (CBD) affinity tag and the nonchromatographic elastin-like-polypeptide (ELP) precipitation tag in a 96-well filter plate format. Initial tests with shake flask cultures confirmed that the intein purification scheme could be scaled down, with >90% pure product generated in a single step using both methods. The scheme was then validated in a high throughput expression platform using 24-well plate cultures followed by purification in 96-well plates. For both tags and with both target proteins, the purified product was consistently obtained in a single-step, with low well-to-well and plate-to-plate variability. This simple method thus allows the reproducible production of highly pure untagged recombinant proteins in a convenient microtiter plate format. Copyright © 2016 Elsevier Inc. All rights reserved.
Single step purification of recombinant proteins using the metal ion-inducible autocleavage (MIIA) domain as linker for tag removal.

PubMed

Ibe, Susan; Schirrmeister, Jana; Zehner, Susanne

2015-08-20

For fast and easy purification, proteins are typically fused with an affinity tag, which often needs to be removed after purification. Here, we present a method for the removal of the affinity tag from the target protein in a single step protocol. The protein VIC_001052 of the coral pathogen Vibrio coralliilyticus ATCC BAA-450 contains a metal ion-inducible autocatalytic cleavage (MIIA) domain. Its coding sequence was inserted into an expression vector for the production of recombinant fusion proteins. Following, the target proteins MalE and mCherry were produced as MIIA-Strep fusion proteins in Escherichia coli. The target proteins could be separated from the MIIA-Strep part simply by the addition of calcium or manganese(II) ions within minutes. The cleavage is not affected in the pH range from 5.0 to 9.0 or at low temperatures (6°C). Autocleavage was also observed with immobilized protein on an affinity column. The protein yield was similar to that achieved with a conventional purification protocol. Copyright © 2015 Elsevier B.V. All rights reserved.
Social Tagging of Mission Data

NASA Technical Reports Server (NTRS)

Norris, Jeffrey S.; Wallick, Michael N.; Joswig, Joseph C.; Powell, Mark W.; Torres, Recaredo J.; Mittman, David S.; Abramyan, Lucy; Crockett, Thomas M.; Shams, Khawaja S.; Fox, Jason M.;

2010-01-01

Mars missions will generate a large amount of data in various forms, such as daily plans, images, and scientific information. Often, there is a semantic linkage between images that cannot be captured automatically. Software is needed that will provide a method for creating arbitrary tags for this mission data so that items with a similar tag can be related to each other. The tags should be visible and searchable for all users. A new routine was written to offer a new and more flexible search option over previous applications. This software allows users of the MSLICE program to apply any number of arbitrary tags to a piece of mission data through a MSLICE search interface. The application of tags creates relationships between data that did not previously exist. These tags can be easily removed and changed, and contain enough flexibility to be specifically configured for any mission. This gives users the ability to quickly recall or draw attention to particular pieces of mission data, for example: Give a semantic and meaningful description to mission data; for example, tag all images with a rock in them with the tag "rock." Rapidly recall specific and useful pieces of data; for example, tag a plan as"driving template." Call specific data to a user s attention; for example, tag a plan as "for:User." This software is part of the MSLICE release, which was written in Java. It will run on any current Windows, Macintosh, or Linux system.

Magnetic Resonance Arterial Spin Tagging for Non-Invasive Pharmacokinetic Analysis of Breast Cancer

DTIC Science & Technology

2000-10-01

sequence software that we had developed for this project. In addition, we revised the pulse sequences to utilize the high performance gradients (40 mT/ m ...peak, 150 mT/ m /ms rise) of the system. We believe these revised sequences will provide better arterial spin tagged data for perfusion measurement. All...U.... ...... ... -- v p I _1 i-:F~ ----- ! - .Ag Jig. H aI .. M e fI6lo 3 ~ ~ 2 0’,~- A.11. I 1 1 9 - HP ~ ~ IM I 15 L 1 1 8 = NIAt I C J1 5
Profiling cellular protein complexes by proximity ligation with dual tag microarray readout.

PubMed

Hammond, Maria; Nong, Rachel Yuan; Ericsson, Olle; Pardali, Katerina; Landegren, Ulf

2012-01-01

Patterns of protein interactions provide important insights in basic biology, and their analysis plays an increasing role in drug development and diagnostics of disease. We have established a scalable technique to compare two biological samples for the levels of all pairwise interactions among a set of targeted protein molecules. The technique is a combination of the proximity ligation assay with readout via dual tag microarrays. In the proximity ligation assay protein identities are encoded as DNA sequences by attaching DNA oligonucleotides to antibodies directed against the proteins of interest. Upon binding by pairs of antibodies to proteins present in the same molecular complexes, ligation reactions give rise to reporter DNA molecules that contain the combined sequence information from the two DNA strands. The ligation reactions also serve to incorporate a sample barcode in the reporter molecules to allow for direct comparison between pairs of samples. The samples are evaluated using a dual tag microarray where information is decoded, revealing which pairs of tags that have become joined. As a proof-of-concept we demonstrate that this approach can be used to detect a set of five proteins and their pairwise interactions both in cellular lysates and in fixed tissue culture cells. This paper provides a general strategy to analyze the extent of any pairwise interactions in large sets of molecules by decoding reporter DNA strands that identify the interacting molecules.
In silico mining and characterization of simple sequence repeats from gilthead sea bream (Sparus aurata) expressed sequence tags (EST-SSRs); PCR amplification, polymorphism evaluation and multiplexing and cross-species assays.

PubMed

Vogiatzi, Emmanouella; Lagnel, Jacques; Pakaki, Victoria; Louro, Bruno; Canario, Adelino V M; Reinhardt, Richard; Kotoulas, Georgios; Magoulas, Antonios; Tsigenopoulos, Costas S

2011-06-01

We screened for simple sequence repeats (SSRs) found in ESTs derived from an EST-database development project ('Marine Genomics Europe' Network of Excellence). Different motifs of di-, tri-, tetra-, penta- and hexanucleotide SSRs were evaluated for variation in length and position in the expressed sequences, relative abundance and distribution in gilthead sea bream (Sparus aurata). We found 899 ESTs that harbor 997 SSRs (4.94%). On average, one SSR was found per 2.95 kb of EST sequence and the dinucleotide SSRs are the most abundant accounting for 47.6% of the total number. EST-SSRs were used as template for primer design. 664 primer pairs could be successfully identified and a subset of 206 pairs of primers was synthesized, PCR-tested and visualized on ethidium bromide stained agarose gels. The main objective was to further assess the potential of EST-SSRs as informative markers and investigate their cross-species amplification in sixteen teleost fish species: seven sparid species and nine other species from different families. Approximately 78% of the primer pairs gave PCR products of expected size in gilthead sea bream, and as expected, the rate of successful amplification of sea bream EST-SSRs was higher in sparids, lower in other perciforms and even lower in species of the Clupeiform and Gadiform orders. We finally determined the polymorphism and the heterozygosity of 63 markers in a wild gilthead sea bream population; fifty-eight loci were found to be polymorphic with the expected heterozygosity and the number of alleles ranging from 0.089 to 0.946 and from 2 to 27, respectively. These tools and markers are expected to enhance the available genetic linkage map in gilthead sea bream, to assist comparative mapping and genome analyses for this species and further with other model fish species and finally to help advance genetic analysis for cultivated and wild populations and accelerate breeding programs. Copyright © 2011 Elsevier B.V. All rights reserved.
Analysis of bacterial and archaeal diversity in coastal microbial mats using massive parallel 16S rRNA gene tag sequencing.

PubMed

Bolhuis, Henk; Stal, Lucas J

2011-11-01

Coastal microbial mats are small-scale and largely closed ecosystems in which a plethora of different functional groups of microorganisms are responsible for the biogeochemical cycling of the elements. Coastal microbial mats play an important role in coastal protection and morphodynamics through stabilization of the sediments and by initiating the development of salt-marshes. Little is known about the bacterial and especially archaeal diversity and how it contributes to the ecological functioning of coastal microbial mats. Here, we analyzed three different types of coastal microbial mats that are located along a tidal gradient and can be characterized as marine (ST2), brackish (ST3) and freshwater (ST3) systems. The mats were sampled during three different seasons and subjected to massive parallel tag sequencing of the V6 region of the 16S rRNA genes of Bacteria and Archaea. Sequence analysis revealed that the mats are among the most diverse marine ecosystems studied so far and consist of several novel taxonomic levels ranging from classes to species. The diversity between the different mat types was far more pronounced than the changes between the different seasons at one location. The archaeal community for these mats have not been studied before and revealed a strong reaction on a short period of draught during summer resulting in a massive increase in halobacterial sequences, whereas the bacterial community was barely affected. We concluded that the community composition and the microbial diversity were intrinsic of the mat type and depend on the location along the tidal gradient indicating a relation with salinity.
Gene expression analysis of flax seed development

PubMed Central

2011-01-01

Background Flax, Linum usitatissimum L., is an important crop whose seed oil and stem fiber have multiple industrial applications. Flax seeds are also well-known for their nutritional attributes, viz., omega-3 fatty acids in the oil and lignans and mucilage from the seed coat. In spite of the importance of this crop, there are few molecular resources that can be utilized toward improving seed traits. Here, we describe flax embryo and seed development and generation of comprehensive genomic resources for the flax seed. Results We describe a large-scale generation and analysis of expressed sequences in various tissues. Collectively, the 13 libraries we have used provide a broad representation of genes active in developing embryos (globular, heart, torpedo, cotyledon and mature stages) seed coats (globular and torpedo stages) and endosperm (pooled globular to torpedo stages) and genes expressed in flowers, etiolated seedlings, leaves, and stem tissue. A total of 261,272 expressed sequence tags (EST) (GenBank accessions LIBEST_026995 to LIBEST_027011) were generated. These EST libraries included transcription factor genes that are typically expressed at low levels, indicating that the depth is adequate for in silico expression analysis. Assembly of the ESTs resulted in 30,640 unigenes and 82% of these could be identified on the basis of homology to known and hypothetical genes from other plants. When compared with fully sequenced plant genomes, the flax unigenes resembled poplar and castor bean more than grape, sorghum, rice or Arabidopsis. Nearly one-fifth of these (5,152) had no homologs in sequences reported for any organism, suggesting that this category represents genes that are likely unique to flax. Digital analyses revealed gene expression dynamics for the biosynthesis of a number of important seed constituents during seed development. Conclusions We have developed a foundational database of expressed sequences and collection of plasmid clones that comprise
Genomic integrity of the Y chromosome sequence-tagged-sites in infertile and Down syndrome Jordanian males.

PubMed

Yasin, S R; Tahtamouni, L H; Najeeb, N S; Issa, N M; Al-Mazaydeh, Z A; Alfaouri, A A

2014-09-01

The long arm of the Y chromosome contains nonoverlapping regions termed azoospermia factor (AZF) with great influence on male fertility. Microdeletions at these regions minimise the males' ability to father offsprings. In this preliminary study, we attempted to screen the presence or absence of twenty Y chromosome's sequence-tagged sites (STS) associated with fertility in infertile and Down syndrome (DS) males. Genomic DNA from 35 fertile, 74 infertile and 22 karyotyped DS males was extracted and amplified in multiplex polymerase chain reaction (PCR) containing 20 primer pairs that amplify Y-specific STS that cover functional regions associated with AZF and spermatogenesis-related genes. Our results indicated the integrity of the Y chromosome at the 20 fertility markers for both the fertile and Down syndrome males. However, the results of the infertile males showed the presence of microdeletions at these Y-specific STS. Three samples showed Y chromosome microdeletion when blood and seminal fluid genomic DNA were assayed, while two samples showed microdeletion only when seminal fluid genomic DNA was assayed. The current study demonstrated that the molecular genetic aspect of infertility should be given proper attention when dealing with infertility cases. Furthermore, our results indicate the importance of genetic counselling in managing infertility cases. © 2013 Blackwell Verlag GmbH.
Computational exploration of microRNAs from expressed sequence tags of Humulus lupulus, target predictions and expression analysis.

PubMed

Mishra, Ajay Kumar; Duraisamy, Ganesh Selvaraj; Týcová, Anna; Matoušek, Jaroslav

2015-12-01

Among computationally predicted and experimentally validated plant miRNAs, several are conserved across species boundaries in the plant kingdom. In this study, a combined experimental-in silico computational based approach was adopted for the identification and characterization of miRNAs in Humulus lupulus (hop), which is widely cultivated for use by the brewing industry and apart from, used as a medicinal herb. A total of 22 miRNAs belonging to 17 miRNA families were identified in hop following comparative computational approach and EST-based homology search according to a series of filtering criteria. Selected miRNAs were validated by end-point PCR and quantitative reverse transcription-polymerase chain reaction (qRT-PCR), confirmed the existence of conserved miRNAs in hop. Based on the characteristic that miRNAs exhibit perfect or nearly perfect complementarity with their targeted mRNA sequences, a total of 47 potential miRNA targets were identified in hop. Strikingly, the majority of predicted targets were belong to transcriptional factors which could regulate hop growth and development, including leaf, root and even cone development. Moreover, the identified miRNAs may also be involved in other cellular and metabolic processes, such as stress response, signal transduction, and other physiological processes. The cis-regulatory elements relevant to biotic and abiotic stress, plant hormone response, flavonoid biosynthesis were identified in the promoter regions of those miRNA genes. Overall, findings from this study will accelerate the way for further researches of miRNAs, their functions in hop and shows a path for the prediction and analysis of miRNAs to those species whose genomes are not available. Copyright © 2015 Elsevier Ltd. All rights reserved.
Snap-, CLIP- and Halo-Tag Labelling of Budding Yeast Cells

PubMed Central

Stagge, Franziska; Mitronova, Gyuzel Y.; Belov, Vladimir N.; Wurm, Christian A.; Jakobs, Stefan

2013-01-01

Fluorescence microscopy of the localization and the spatial and temporal dynamics of specifically labelled proteins is an indispensable tool in cell biology. Besides fluorescent proteins as tags, tag-mediated labelling utilizing self-labelling proteins as the SNAP-, CLIP-, or the Halo-tag are widely used, flexible labelling systems relying on exogenously supplied fluorophores. Unfortunately, labelling of live budding yeast cells proved to be challenging with these approaches because of the limited accessibility of the cell interior to the dyes. In this study we developed a fast and reliable electroporation-based labelling protocol for living budding yeast cells expressing SNAP-, CLIP-, or Halo-tagged fusion proteins. For the Halo-tag, we demonstrate that it is crucial to use the 6′-carboxy isomers and not the 5′-carboxy isomers of important dyes to ensure cell viability. We report on a simple rule for the analysis of 1H NMR spectra to discriminate between 6′- and 5′-carboxy isomers of fluorescein and rhodamine derivatives. We demonstrate the usability of the labelling protocol by imaging yeast cells with STED super-resolution microscopy and dual colour live cell microscopy. The large number of available fluorophores for these self-labelling proteins and the simplicity of the protocol described here expands the available toolbox for the model organism Saccharomyces cerevisiae. PMID:24205303
Review on SAW RFID tags.

PubMed

Plessky, Victor P; Reindl, Leonhard M

2010-03-01

SAW tags were invented more than 30 years ago, but only today are the conditions united for mass application of this technology. The devices in the 2.4-GHz ISM band can be routinely produced with optical lithography, high-resolution radar systems can be built up using highly sophisticated, but low-cost RF-chips, and the Internet is available for global access to the tag databases. The "Internet of Things," or I-o-T, will demand trillions of cheap tags and sensors. The SAW tags can overcome semiconductor-based analogs in many aspects: they can be read at a distance of a few meters with readers radiating power levels 2 to 3 orders lower, they are cheap, and they can operate in robust environments. Passive SAW tags are easily combined with sensors. Even the "anti-collision" problem (i.e., the simultaneous reading of many nearby tags) has adequate solutions for many practical applications. In this paper, we discuss the state-of-the-art in the development of SAW tags. The design approaches will be reviewed and optimal tag designs, as well as encoding methods, will be demonstrated. We discuss ways to reduce the size and cost of these devices. A few practical examples of tags using a time-position coding with 10(6) different codes will be demonstrated. Phase-coded devices can additionally increase the number of codes at the expense of a reduction of reading distance. We also discuss new and exciting perspectives of using ultra wide band (UWB) technology for SAW-tag systems. The wide frequency band available for this standard provides a great opportunity for SAW tags to be radically reduced in size to about 1 x 1 mm(2) while keeping a practically infinite number of possible different codes. Finally, the reader technology will be discussed, as well as detailed comparison made between SAW tags and IC-based semiconductor device.
Tag retention, growth, and survival of red swamp crayfish Procambarus clarkii marked with coded wire tags

USGS Publications Warehouse

Isely, J.J.; Eversole, A.G.

1998-01-01

Juvenile red swamp crayfish (or crawfish), Procambarus clarkii (20-41 mm in total length) were collected from a crayfish culture pond by dipnetting and tagged with sequentially numbered, standard length, binary-coded wire tags. Four replicates of 50 crayfish were impaled perpendicular to the long axis of the abdomen with a fixed needle. Tags were injected transversely into the ventral surface of the first or second abdominal segment and were imbedded in the musculature just beneath the abdominal sternum. Tags were visible upon inspection. Additionally, two replicates of 50 crayfish were not tagged and were used as controls. Growth, survival, and tag retention were evaluated after 7 d in individual containers, after 100 d in aquaria, and after 200 d in field cages. Tag retention during each sample period was 100%, and average mortality of tagged crayfish within 7 d of tagging was 1%. Mortality during the remainder of the study was high (75-91%) but was similar between treatment and control samples. Most of the deaths were probably due to cannibalism. Average total length increased threefold during the course of the study, and crayfish reached maturity. Because crayfish were mature by the end of the study, we concluded that the coded wire tag was retained through the life history of the crayfish.
Differential effects of simple repeating DNA sequences on gene expression from the SV40 early promoter.

PubMed

Amirhaeri, S; Wohlrab, F; Wells, R D

1995-02-17

The influence of simple repeat sequences, cloned into different positions relative to the SV40 early promoter/enhancer, on the transient expression of the chloramphenicol acetyltransferase (CAT) gene was investigated. Insertion of (G)29.(C)29 in either orientation into the 5'-untranslated region of the CAT gene reduced expression in CV-1 cells 50-100 fold when compared with controls with random sequence inserts. Analysis of CAT-specific mRNA levels demonstrated that the effect was due to a reduction of CAT mRNA production rather than to posttranscriptional events. In contrast, insertion of the same insert in either orientation upstream of the promoter-enhancer or downstream of the gene stimulated gene expression 2-3-fold. These effects could be reversed by cotransfection of a competitor plasmid carrying (G)25.(C)25 sequences. The results suggest that a G.C-binding transcription factor modulates gene expression in this system and that promoter strength can be regulated by providing protein-binding sites in trans. Although constructs containing longer tracts of alternating (C-G), (T-G), or (A-T) sequences inhibited CAT expression when inserted in the 5'-untranslated region of the CAT gene, the amount of CAT mRNA was unaffected. Hence, these inhibitions must be due to posttranscriptional events, presumably at the level of translation. These effects of microsatellite sequences on gene expression are discussed with respect to recent data on related simple repeat sequences which cause several human genetic diseases.
Compressing DNA sequence databases with coil

PubMed Central

White, W Timothy J; Hendy, Michael D

2008-01-01

Background Publicly available DNA sequence databases such as GenBank are large, and are growing at an exponential rate. The sheer volume of data being dealt with presents serious storage and data communications problems. Currently, sequence data is usually kept in large "flat files," which are then compressed using standard Lempel-Ziv (gzip) compression – an approach which rarely achieves good compression ratios. While much research has been done on compressing individual DNA sequences, surprisingly little has focused on the compression of entire databases of such sequences. In this study we introduce the sequence database compression software coil. Results We have designed and implemented a portable software package, coil, for compressing and decompressing DNA sequence databases based on the idea of edit-tree coding. coil is geared towards achieving high compression ratios at the expense of execution time and memory usage during compression – the compression time represents a "one-off investment" whose cost is quickly amortised if the resulting compressed file is transmitted many times. Decompression requires little memory and is extremely fast. We demonstrate a 5% improvement in compression ratio over state-of-the-art general-purpose compression tools for a large GenBank database file containing Expressed Sequence Tag (EST) data. Finally, coil can efficiently encode incremental additions to a sequence database. Conclusion coil presents a compelling alternative to conventional compression of flat files for the storage and distribution of DNA sequence databases having a narrow distribution of sequence lengths, such as EST data. Increasing compression levels for databases having a wide distribution of sequence lengths is a direction for future work. PMID:18489794
DNA sequence chromatogram browsing using JAVA and CORBA.

PubMed

Parsons, J D; Buehler, E; Hillier, L

1999-03-01

DNA sequence chromatograms (traces) are the primary data source for all large-scale genomic and expressed sequence tags (ESTs) sequencing projects. Access to the sequencing trace assists many later analyses, for example contig assembly and polymorphism detection, but obtaining and using traces is problematic. Traces are not collected and published centrally, they are much larger than the base calls derived from them, and viewing them requires the interactivity of a local graphical client with local data. To provide efficient global access to DNA traces, we developed a client/server system based on flexible Java components integrated into other applications including an applet for use in a WWW browser and a stand-alone trace viewer. Client/server interaction is facilitated by CORBA middleware which provides a well-defined interface, a naming service, and location independence. [The software is packaged as a Jar file available from the following URL: http://www.ebi.ac.uk/jparsons. Links to working examples of the trace viewers can be found at http://corba.ebi.ac.uk/EST. All the Washington University mouse EST traces are available for browsing at the same URL.
Draft Sequences of the Radish (Raphanus sativus L.) Genome

PubMed Central

Kitashiba, Hiroyasu; Li, Feng; Hirakawa, Hideki; Kawanabe, Takahiro; Zou, Zhongwei; Hasegawa, Yoichi; Tonosaki, Kaoru; Shirasawa, Sachiko; Fukushima, Aki; Yokoi, Shuji; Takahata, Yoshihito; Kakizaki, Tomohiro; Ishida, Masahiko; Okamoto, Shunsuke; Sakamoto, Koji; Shirasawa, Kenta; Tabata, Satoshi; Nishio, Takeshi

2014-01-01

Radish (Raphanus sativus L., n = 9) is one of the major vegetables in Asia. Since the genomes of Brassica and related species including radish underwent genome rearrangement, it is quite difficult to perform functional analysis based on the reported genomic sequence of Brassica rapa. Therefore, we performed genome sequencing of radish. Short reads of genomic sequences of 191.1 Gb were obtained by next-generation sequencing (NGS) for a radish inbred line, and 76,592 scaffolds of ≥300 bp were constructed along with the bacterial artificial chromosome-end sequences. Finally, the whole draft genomic sequence of 402 Mb spanning 75.9% of the estimated genomic size and containing 61,572 predicted genes was obtained. Subsequently, 221 single nucleotide polymorphism markers and 768 PCR-RFLP markers were used together with the 746 markers produced in our previous study for the construction of a linkage map. The map was combined further with another radish linkage map constructed mainly with expressed sequence tag-simple sequence repeat markers into a high-density integrated map of 1,166 cM with 2,553 DNA markers. A total of 1,345 scaffolds were assigned to the linkage map, spanning 116.0 Mb. Bulked PCR products amplified by 2,880 primer pairs were sequenced by NGS, and SNPs in eight inbred lines were identified. PMID:24848699
How did nature engineer the highest surface lipid accumulation among plants? Exceptional expression of acyl-lipid-associated genes for the assembly of extracellular triacylglycerol by Bayberry ( Myrica pensylvanica ) fruits

DOE PAGES

Simpson, Jeffrey P.; Thrower, Nicholas; Ohlrogge, John B.

2016-02-09

Bayberry (Myrica pensylvanica) fruits are covered with a remarkably thick layer of crystalline wax consisting of triacylglycerol (TAG) and diacylglycerol (DAG) esterified exclusively with saturated fatty acids. As the only plant known to accumulate soluble glycerolipids as a major component of surface waxes, Bayberry represents a novel system to investigate neutral lipid biosynthesis and lipid secretion by vegetative plant cells. The assembly of Bayberry wax is distinct from conventional TAG and other surface waxes, and instead proceeds through a pathway related to cutin synthesis (Simpson and Ohlrogge, 2016). In this study, microscopic examination revealed that the fruit tissue that producesmore » and secretes wax (Bayberry knobs) is fully developed before wax accumulates and that wax is secreted to the surface without cell disruption. Comparison of transcript expression to genetically related tissues (Bayberry leaves, M. rubra fruits), cutin-rich tomato and cherry fruit epidermis, and to oil-rich mesocarp and seeds, revealed exceptionally high expression of 13 transcripts for acyl-lipid metabolism together with down-regulation of fatty acid oxidases and desaturases. The predicted protein sequences of the most highly expressed lipid-related enzyme-encoding transcripts in Bayberry knobs are 100% identical to the sequences from Bayberry leaves,which do not produce surface DAG or TAG. Together, these results indicate that TAG biosynthesis and secretion in Bayberry is achieved by both up and down-regulation of a small subset of genes related to the biosynthesis of cutin and saturated fatty acids, and also implies that modifications in gene expression, rather than evolution of new gene functions, was the major mechanism by which Bayberry evolved its specialized lipid metabolism.« less
Differential Gene Expression of Longan Under Simulated Acid Rain Stress.

PubMed

Zheng, Shan; Pan, Tengfei; Ma, Cuilan; Qiu, Dongliang

2017-05-01

Differential gene expression profile was studied in Dimocarpus longan Lour. in response to treatments of simulated acid rain with pH 2.5, 3.5, and a control (pH 5.6) using differential display reverse transcription polymerase chain reaction (DDRT-PCR). Results showed that mRNA differential display conditions were optimized to find an expressed sequence tag (EST) related with acid rain stress. The potential encoding products had 80% similarity with a transcription initiation factor IIF of Gossypium raimondii and 81% similarity with a protein product of Theobroma cacao. This fragment is the transcription factor activated by second messenger substances in longan leaves after signal perception of acid rain.

Gene identification and analysis of transcripts differentially regulated in fracture healing by EST sequencing in the domestic sheep.

PubMed

Hecht, Jochen; Kuhl, Heiner; Haas, Stefan A; Bauer, Sebastian; Poustka, Albert J; Lienau, Jasmin; Schell, Hanna; Stiege, Asita C; Seitz, Volkhard; Reinhardt, Richard; Duda, Georg N; Mundlos, Stefan; Robinson, Peter N

2006-07-05

The sheep is an important model animal for testing novel fracture treatments and other medical applications. Despite these medical uses and the well known economic and cultural importance of the sheep, relatively little research has been performed into sheep genetics, and DNA sequences are available for only a small number of sheep genes. In this work we have sequenced over 47 thousand expressed sequence tags (ESTs) from libraries developed from healing bone in a sheep model of fracture healing. These ESTs were clustered with the previously available 10 thousand sheep ESTs to a total of 19087 contigs with an average length of 603 nucleotides. We used the newly identified sequences to develop RT-PCR assays for 78 sheep genes and measured differential expression during the course of fracture healing between days 7 and 42 postfracture. All genes showed significant shifts at one or more time points. 23 of the genes were differentially expressed between postfracture days 7 and 10, which could reflect an important role for these genes for the initiation of osteogenesis. The sequences we have identified in this work are a valuable resource for future studies on musculoskeletal healing and regeneration using sheep and represent an important head-start for genomic sequencing projects for Ovis aries, with partial or complete sequences being made available for over 5,800 previously unsequenced sheep genes.
Sonication-based isolation and enrichment of Chlorella protothecoides chloroplasts for illumina genome sequencing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Angelova, Angelina; Park, Sang-Hycuk; Kyndt, John

2013-09-01

With the increasing world demand for biofuel, a number of oleaginous algal species are being considered as renewable sources of oil. Chlorella protothecoides Krüger synthesizes triacylglycerols (TAGs) as storage compounds that can be converted into renewable fuel utilizing an anabolic pathway that is poorly understood. The paucity of algal chloroplast genome sequences has been an important constraint to chloroplast transformation and for studying gene expression in TAGs pathways. In this study, the intact chloroplasts were released from algal cells using sonication followed by sucrose gradient centrifugation, resulting in a 2.36-fold enrichment of chloroplasts from C. protothecoides, based on qPCR analysis.more » The C. protothecoides chloroplast genome (cpDNA) was determined using the Illumina HiSeq 2000 sequencing platform and found to be 84,576 Kb in size (8.57 Kb) in size, with a GC content of 30.8 %. This is the first report of an optimized protocol that uses a sonication step, followed by sucrose gradient centrifugation, to release and enrich intact chloroplasts from a microalga (C. prototheocoides) of sufficient quality to permit chloroplast genome sequencing with high coverage, while minimizing nuclear genome contamination. The approach is expected to guide chloroplast isolation from other oleaginous algal species for a variety of uses that benefit from enrichment of chloroplasts, ranging from biochemical analysis to genomics studies.« less
Directional Radio-Frequency Identification Tag Reader

NASA Technical Reports Server (NTRS)

Medelius, Pedro J.; Taylor, John D.; Henderson, John J.

2004-01-01

A directional radio-frequency identification (RFID) tag reader has been designed to facilitate finding a specific object among many objects in a crowded room. The device could be an adjunct to an electronic inventory system that tracks RFID-tagged objects as they move through reader-equipped doorways. Whereas commercial RFID-tag readers do not measure directions to tagged objects, the device is equipped with a phased-array antenna and a received signal-strength indicator (RSSI) circuit for measuring direction. At the beginning of operation, it is set to address only the RFID tag of interest. It then continuously transmits a signal to interrogate that tag while varying the radiation pattern of the antenna. It identifies the direction to the tag as the radiation pattern direction of peak strength of the signal returned by the tag. An approximate distance to the tag is calculated from the peak signal strength. The direction and distance can be displayed on a screen. A prototype containing a Yagi antenna was found to be capable of detecting a 915.5-MHz tag at a distance of approximately equal to 15 ft (approximately equal to 4.6 m).
Annotation and sequence diversity of transposable elements in common bean (Phaseolus vulgaris).

PubMed

Gao, Dongying; Abernathy, Brian; Rohksar, Daniel; Schmutz, Jeremy; Jackson, Scott A

2014-01-01

Common bean (Phaseolus vulgaris) is an important legume crop grown and consumed worldwide. With the availability of the common bean genome sequence, the next challenge is to annotate the genome and characterize functional DNA elements. Transposable elements (TEs) are the most abundant component of plant genomes and can dramatically affect genome evolution and genetic variation. Thus, it is pivotal to identify TEs in the common bean genome. In this study, we performed a genome-wide transposon annotation in common bean using a combination of homology and sequence structure-based methods. We developed a 2.12-Mb transposon database which includes 791 representative transposon sequences and is available upon request or from www.phytozome.org. Of note, nearly all transposons in the database are previously unrecognized TEs. More than 5,000 transposon-related expressed sequence tags (ESTs) were detected which indicates that some transposons may be transcriptionally active. Two Ty1-copia retrotransposon families were found to encode the envelope-like protein which has rarely been identified in plant genomes. Also, we identified an extra open reading frame (ORF) termed ORF2 from 15 Ty3-gypsy families that was located between the ORF encoding the retrotransposase and the 3'LTR. The ORF2 was in opposite transcriptional orientation to retrotransposase. Sequence homology searches and phylogenetic analysis suggested that the ORF2 may have an ancient origin, but its function is not clear. These transposon data provide a useful resource for understanding the genome organization and evolution and may be used to identify active TEs for developing transposon-tagging system in common bean and other related genomes.
ESTuber db: an online database for Tuber borchii EST sequences.

PubMed

Lazzari, Barbara; Caprera, Andrea; Cosentino, Cristian; Stella, Alessandra; Milanesi, Luciano; Viotti, Angelo

2007-03-08

The ESTuber database (http://www.itb.cnr.it/estuber) includes 3,271 Tuber borchii expressed sequence tags (EST). The dataset consists of 2,389 sequences from an in-house prepared cDNA library from truffle vegetative hyphae, and 882 sequences downloaded from GenBank and representing four libraries from white truffle mycelia and ascocarps at different developmental stages. An automated pipeline was prepared to process EST sequences using public software integrated by in-house developed Perl scripts. Data were collected in a MySQL database, which can be queried via a php-based web interface. Sequences included in the ESTuber db were clustered and annotated against three databases: the GenBank nr database, the UniProtKB database and a third in-house prepared database of fungi genomic sequences. An algorithm was implemented to infer statistical classification among Gene Ontology categories from the ontology occurrences deduced from the annotation procedure against the UniProtKB database. Ontologies were also deduced from the annotation of more than 130,000 EST sequences from five filamentous fungi, for intra-species comparison purposes. Further analyses were performed on the ESTuber db dataset, including tandem repeats search and comparison of the putative protein dataset inferred from the EST sequences to the PROSITE database for protein patterns identification. All the analyses were performed both on the complete sequence dataset and on the contig consensus sequences generated by the EST assembly procedure. The resulting web site is a resource of data and links related to truffle expressed genes. The Sequence Report and Contig Report pages are the web interface core structures which, together with the Text search utility and the Blast utility, allow easy access to the data stored in the database.
Evaluation of Intercontinental Transport of Ozone Using Full-tagged, Tagged-N and Sensitivity Methods

NASA Astrophysics Data System (ADS)

Guo, Y.; Liu, J.; Mauzerall, D. L.; Emmons, L. K.; Horowitz, L. W.; Fan, S.; Li, X.; Tao, S.

2014-12-01

Long-range transport of ozone is of great concern, yet the source-receptor relationships derived previously depend strongly on the source attribution techniques used. Here we describe a new tagged ozone mechanism (full-tagged), the design of which seeks to take into account the combined effects of emissions of ozone precursors, CO, NOx and VOCs, from a particular source, while keeping the current state of chemical equilibrium unchanged. We label emissions from the target source (A) and background (B). When two species from A and B sources react with each other, half of the resulting products are labeled A, and half B. Thus the impact of a given source on downwind regions is recorded through tagged chemistry. We then incorporate this mechanism into the Model for Ozone and Related chemical Tracers (MOZART-4) to examine the impact of anthropogenic emissions within North America, Europe, East Asia and South Asia on ground-level ozone downwind of source regions during 1999-2000. We compare our results with two previously used methods -- the sensitivity and tagged-N approaches. The ozone attributed to a given source by the full-tagged method is more widely distributed spatially, but has weaker seasonal variability than that estimated by the other methods. On a seasonal basis, for most source/receptor pairs, the full-tagged method estimates the largest amount of tagged ozone, followed by the sensitivity and tagged-N methods. In terms of trans-Pacific influence of ozone pollution, the full-tagged method estimates the strongest impact of East Asian (EA) emissions on the western U.S. (WUS) in MAM and JJA (~3 ppbv), which is substantially different in magnitude and seasonality from tagged-N and sensitivity studies. This difference results from the full-tagged method accounting for the maintenance of peroxy radicals (e.g., CH3O2, CH3CO3, and HO2), in addition to NOy, as effective reservoirs of EA source impact across the Pacific, allowing for a significant contribution to
miR-148a and miR-17-5p synergistically regulate milk TAG synthesis via PPARGC1A and PPARA in goat mammary epithelial cells.

PubMed

Chen, Zhi; Luo, Jun; Sun, Shuang; Cao, Duoyao; Shi, Huaiping; Loor, Juan J

2017-03-04

MicroRNA (miRNA) are a class of '18-25' nt RNA molecules which regulate gene expression and play an important role in several biologic processes including fatty acid metabolism. Here we used S-Poly (T) and high-throughput sequencing to evaluate the expression of miRNA and mRNA during early-lactation and in the non-lactating ("dry") period in goat mammary gland tissue. Results indicated that miR-148a, miR-17-5p, PPARGC1A and PPARA are highly expressed in the goat mammary gland in early-lactation and non-lactating periods. Utilizing a Luciferase reporter assay and Western Blot, PPARA, an important regulator of fatty acid oxidation, and PGC1a (PPARGC1A), a major regulator of fat metabolism, were demonstrated to be targets of miR-148a and miR-17-5p in goat mammary epithelial cells (GMECs). It was also revealed that miR-148a expression can regulate PPARA, and miR-17-5p represses PPARGC1A in GMECs. Furthermore, the overexpression of miR-148a and miR-17-5p promoted triacylglycerol (TAG) synthesis while the knockdown of miR-148a and miR-17-5p impaired TAG synthesis in GMEC. These findings underscore the importance of miR-148a and miR-17-5p as key components in the regulation of TAG synthesis. In addition, miR-148a cooperates with miR-17-5p to regulate fatty acid metabolism by repressing PPARGC1A and PPARA in GMECs. Further studies on the functional role of miRNAs in lipid metabolism of ruminant mammary cells seem warranted.
Expression and purification of short hydrophobic elastin-like polypeptides with maltose-binding protein as a solubility tag.

PubMed

Bataille, Laure; Dieryck, Wilfrid; Hocquellet, Agnès; Cabanne, Charlotte; Bathany, Katell; Lecommandoux, Sébastien; Garbay, Bertrand; Garanger, Elisabeth

2015-06-01

Elastin-like polypeptides (ELPs) are biodegradable polymers with interesting physico-chemical properties for biomedical and biotechnological applications. The recombinant expression of hydrophobic elastin-like polypeptides is often difficult because they possess low transition temperatures, and therefore form aggregates at sub-ambient temperatures. To circumvent this difficulty, we expressed in Escherichia coli three hydrophobic ELPs (VPGIG)n with variable lengths (n=20, 40, and 60) in fusion with the maltose-binding protein (MBP). Fusion proteins were soluble and yields of purified MBP-ELP ranged between 66 and 127mg/L culture. After digestion of the fusion proteins by enterokinase, the ELP moiety was purified by using inverse transition cycling. The purified fraction containing ELP40 was slightly contaminated by traces of undigested fusion protein. Purification of ELP60 was impaired because of co-purification of the MBP tag during inverse transition cycling. ELP20 was successfully purified to homogeneity, as assessed by gel electrophoresis and mass spectrometry analyses. The transition temperature of ELP20 was measured at 15.4°C in low salt buffer. In conclusion, this method can be used to produce hydrophobic ELP of low molecular mass. Copyright © 2015 Elsevier Inc. All rights reserved.
Leveraging algal omics to reveal potential targets for augmenting TAG accumulation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Arora, Neha; Pienkos, Philip T.; Pruthi, Vikas

Ongoing global efforts to commercialize microalgal biofuels have expedited the use of multi-omics techniques to gain insights into lipid biosynthetic pathways. Functional genomics analyses have recently been employed to complement existing sequence-level omics studies, shedding light on the dynamics of lipid synthesis and its interplay with other cellular metabolic pathways, thus revealing possible targets for metabolic engineering. Here, we review the current status of algal omics studies to reveal potential targets to augment TAG accumulation in various microalgae. Here, this review specifically aims to examine and catalog systems level data related to stress-induced TAG accumulation in oleaginous microalgae and informmore » future metabolic engineering strategies to develop strains with enhanced bioproductivity, which could pave a path for sustainable green energy.« less
Leveraging Algal Omics to Reveal Potential Targets for Augmenting TAG Accumulation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Guarnieri, Michael T; Pienkos, Philip T; Arora, Neha

2018-04-18

Ongoing global efforts to commercialize microalgal biofuels have expedited the use of multi-omics techniques to gain insights into lipid biosynthetic pathways. Functional genomics analyses have recently been employed to complement existing sequence-level omics studies, shedding light on the dynamics of lipid synthesis and its interplay with other cellular metabolic pathways, thus revealing possible targets for metabolic engineering. Here, we review the current status of algal omics studies to reveal potential targets to augment TAG accumulation in various microalgae. This review specifically aims to examine and catalog systems level data related to stress-induced TAG accumulation in oleaginous microalgae and inform futuremore » metabolic engineering strategies to develop strains with enhanced bioproductivity, which could pave a path for sustainable green energy.« less
Leveraging algal omics to reveal potential targets for augmenting TAG accumulation

DOE PAGES

Arora, Neha; Pienkos, Philip T.; Pruthi, Vikas; ...

2018-04-18

Ongoing global efforts to commercialize microalgal biofuels have expedited the use of multi-omics techniques to gain insights into lipid biosynthetic pathways. Functional genomics analyses have recently been employed to complement existing sequence-level omics studies, shedding light on the dynamics of lipid synthesis and its interplay with other cellular metabolic pathways, thus revealing possible targets for metabolic engineering. Here, we review the current status of algal omics studies to reveal potential targets to augment TAG accumulation in various microalgae. Here, this review specifically aims to examine and catalog systems level data related to stress-induced TAG accumulation in oleaginous microalgae and informmore » future metabolic engineering strategies to develop strains with enhanced bioproductivity, which could pave a path for sustainable green energy.« less
Expression and Purification of Recombinant Proteins in Escherichia coli with a His6 or Dual His6-MBP Tag.

PubMed

Raran-Kurussi, Sreejith; Waugh, David S

2017-01-01

Rapid advances in bioengineering and biotechnology over the past three decades have greatly facilitated the production of recombinant proteins in Escherichia coli. Affinity-based methods that employ protein or peptide based tags for protein purification have been instrumental in this progress. Yet insolubility of recombinant proteins in E. coli remains a persistent problem. One way around this problem is to fuse an aggregation-prone protein to a highly soluble partner. E. coli maltose-binding protein (MBP) is widely acknowledged as a highly effective solubilizing agent. In this chapter, we describe how to construct either a His 6 - or a dual His 6 -MBP tagged fusion protein by Gateway ® recombinational cloning and how to evaluate their yield and solubility. We also describe a simple and rapid procedure to test the solubility of proteins after removing their N-terminal fusion tags by tobacco etch virus (TEV) protease digestion. The choice of whether to use a His 6 tag or a His 6 -MBP tag can be made on the basis of this solubility test.
Mocr: A novel fusion tag for enhancing solubility that is compatible with structural biology applications

PubMed Central

DelProposto, James; Majmudar, Chinmay Y.; Smith, Janet L.; Brown, William Clay

2010-01-01

A persistent problem in heterologous protein production is insolubility of the target protein when expressed to high level in the host cell. A widely employed strategy for overcoming this problem is the use of fusion tags. The best fusion tags promote solubility, may function as purification handles and either do not interfere with downstream applications or may be removed from the passenger protein preparation. A novel fusion tag is identified that meets these criteria. This fusion tag is a monomeric mutant of the Ocr protein (0.3 gene product) of bacteriophage T7. This fusion tag displays solubilizing activity with a variety of different passenger proteins. We show that it may be used as a purification handle similar to other fusion tags. Its small size and compact structure are compatible with its use in downstream applications of the passenger protein or it may be removed and purified away from the passenger protein. The use of monomeric Ocr (Mocr) as a complement to other fusion tags such as maltose-binding protein will provide greater flexibility in protein production and processing for a wide variety of protein applications. PMID:18824232
Mocr: a novel fusion tag for enhancing solubility that is compatible with structural biology applications.

PubMed

DelProposto, James; Majmudar, Chinmay Y; Smith, Janet L; Brown, William Clay

2009-01-01

A persistent problem in heterologous protein production is insolubility of the target protein when expressed to high level in the host cell. A widely employed strategy for overcoming this problem is the use of fusion tags. The best fusion tags promote solubility, may function as purification handles and either do not interfere with downstream applications or may be removed from the passenger protein preparation. A novel fusion tag is identified that meets these criteria. This fusion tag is a monomeric mutant of the Ocr protein (0.3 gene product) of bacteriophage T7. This fusion tag displays solubilizing activity with a variety of different passenger proteins. We show that it may be used as a purification handle similar to other fusion tags. Its small size and compact structure are compatible with its use in downstream applications of the passenger protein or it may be removed and purified away from the passenger protein. The use of monomeric Ocr (Mocr) as a complement to other fusion tags such as maltose-binding protein will provide greater flexibility in protein production and processing for a wide variety of protein applications.
Quantitative evaluation of his-tag purification and immunoprecipitation of tristetraprolin and its mutant proteins from transfected human cells

USDA-ARS?s Scientific Manuscript database

Histidine (His)-tag is widely used for affinity purification of recombinant proteins, but the yield and purity of expressed proteins are quite different. Little information is available about quantitative evaluation of this procedure. The objective of the current study was to evaluate the His-tag pr...
Investigating the genetics of Bti resistance using mRNA tag sequencing: application on laboratory strains and natural populations of the dengue vector Aedes aegypti

PubMed Central

Paris, Margot; Marcombe, Sebastien; Coissac, Eric; Corbel, Vincent; David, Jean-Philippe; Després, Laurence

2013-01-01

Mosquito control is often the main method used to reduce mosquito-transmitted diseases. In order to investigate the genetic basis of resistance to the bio-insecticide Bacillus thuringiensis subsp. israelensis (Bti), we used information on polymorphism obtained from cDNA tag sequences from pooled larvae of laboratory Bti-resistant and susceptible Aedes aegypti mosquito strains to identify and analyse 1520 single nucleotide polymorphisms (SNPs). Of the 372 SNPs tested, 99.2% were validated using DNA Illumina GoldenGate® array, with a strong correlation between the allelic frequencies inferred from the pooled and individual data (r = 0.85). A total of 11 genomic regions and five candidate genes were detected using a genome scan approach. One of these candidate genes showed significant departures from neutrality in the resistant strain at sequence level. Six natural populations from Martinique Island were sequenced for the 372 tested SNPs with a high transferability (87%), and association mapping analyses detected 14 loci associated with Bti resistance, including one located in a putative receptor for Cry11 toxins. Three of these loci were also significantly differentiated between the laboratory strains, suggesting that most of the genes associated with resistance might differ between the two environments. It also suggests that common selected regions might harbour key genes for Bti resistance. PMID:24187584
Over-Expression, Purification and Crystallization of Human Dihydrolipoamide Dehydrogenase

NASA Technical Reports Server (NTRS)

Hong, Y. S.; Ciszak, Ewa; Patel, Mulchand

2000-01-01

Dehydrolipoamide dehydrogenase (E3; dihydrolipoan-tide:NAD+ oxidoreductase, EC 1.8.1.4) is a common catalytic component found in pyruvate dehydrogenase complex, alpha-ketoglutarate dehydrogenase complex, and branched-chain cc-keto acid dehydrogenase complex. E3 is also a component (referred to as L protein) of the glycine cleavage system in bacterial metabolism (2). Active E3 forms a homodimer with four distinctive subdomain structures (FAD binding, NAD+ binding, central and interface domains) with non-covalently but tightly bound FAD in the holoenzyme. Deduced amino acids from cloned full-length human E3 gene showed a total of 509 amino acids with a leader sequence (N-terminal 35 amino acids) that is excised (mature form) during transportation of expressed E3 into mitochondria membrane. So far, three-dimensional structure of human E3 has not been reported. Our effort to achieve the elucidation of the X-ray crystal structure of human E3 will be presented. Recombinant pPROEX-1 expression vector (from GIBCO BRL Life Technologies) having the human E3 gene without leader sequence was constructed by Polymerase Chain Reaction (PCR) and subsequent ligation, and cloned in E.coli XL1-Blue by transformation. Since pPROEX-1 vector has an internal His-tag (six histidine peptide) located at the upstream region of a multicloning site, one-step affinity purification of E3 using nickelnitriloacetic acid (Ni-NTA) agarose resin, which has a strong affinity to His-tag, was feasible. Also a seven-amino-acid spacer peptide and a recombinant tobacco etch virus protease recognition site (seven amino acids peptide) found between His-tag and first amino acid of expressed E3 facilitated the cleavage of His-tag from E3 after the affinity purification. By IPTG induction, ca. 15 mg of human E3 (mature form) was obtained from 1L LB culture with overnight incubation at 25C. Over 98% of purity of E3 from one-step Ni-NTA agarose affinity purification was confirmed by SDS-PAGE analysis. For
Expressed sequence tags from poplar wood tissues--a comparative analysis from multiple libraries.

PubMed

Déjardin, A; Leplé, J-C; Lesage-Descauses, M-C; Costa, G; Pilate, G

2004-01-01

Xylogenesis involves successive developmental processes--cambial division, cell expansion and differentiation, cell death--each occurring along a gradient from the cambium to the pith of the stem. Taking advantage of the high level of organisation of wood tissues, we isolated cambial zone (CZ), differentiating xylem (DX) and mature xylem (MX) from both tension wood (TW) and opposite wood (OW) of bent poplars. Four different cDNA libraries were then constructed and used to generate 10,062 EST, reflecting the genes expressed in the different wood tissues. For the most abundant clusters, the EST distributions were compared between libraries in order to identify genes specific or over-represented at some specific developmental stages. They clearly showed a developmental shift between CZ and DX, whereas there is a continuity of development between DX and MX. CZ was mainly characterized by clusters of genes involved in cell cycle, protein synthesis and fate. Interestingly, two clusters with no assigned function were found specific to the cambial zone. In DX and MX, clusters were mostly involved in methylation of lignin precursors and microtubule cytoskeleton. In addition, in DX, EST from TW and OW were compared: five clusters of arabinogalactan proteins, one for sucrose synthase and one for fructokinase were specific or over-represented in TW. Moreover, a putative transcription factor and a cluster of unknown function were also identified in DX-TW. The informative comparison of multiple libraries prepared from wood tissues led to the identification of genes--some with still unknown functions--putatively involved in xylogenesis and tension wood formation.
dictyExpress: a web-based platform for sequence data management and analytics in Dictyostelium and beyond.

PubMed

Stajdohar, Miha; Rosengarten, Rafael D; Kokosar, Janez; Jeran, Luka; Blenkus, Domen; Shaulsky, Gad; Zupan, Blaz

2017-06-02

Dictyostelium discoideum, a soil-dwelling social amoeba, is a model for the study of numerous biological processes. Research in the field has benefited mightily from the adoption of next-generation sequencing for genomics and transcriptomics. Dictyostelium biologists now face the widespread challenges of analyzing and exploring high dimensional data sets to generate hypotheses and discovering novel insights. We present dictyExpress (2.0), a web application designed for exploratory analysis of gene expression data, as well as data from related experiments such as Chromatin Immunoprecipitation sequencing (ChIP-Seq). The application features visualization modules that include time course expression profiles, clustering, gene ontology enrichment analysis, differential expression analysis and comparison of experiments. All visualizations are interactive and interconnected, such that the selection of genes in one module propagates instantly to visualizations in other modules. dictyExpress currently stores the data from over 800 Dictyostelium experiments and is embedded within a general-purpose software framework for management of next-generation sequencing data. dictyExpress allows users to explore their data in a broader context by reciprocal linking with dictyBase-a repository of Dictyostelium genomic data. In addition, we introduce a companion application called GenBoard, an intuitive graphic user interface for data management and bioinformatics analysis. dictyExpress and GenBoard enable broad adoption of next generation sequencing based inquiries by the Dictyostelium research community. Labs without the means to undertake deep sequencing projects can mine the data available to the public. The entire information flow, from raw sequence data to hypothesis testing, can be accomplished in an efficient workspace. The software framework is generalizable and represents a useful approach for any research community. To encourage more wide usage, the backend is open
Molecular cloning, overexpression, purification, and sequence analysis of the giant panda (Ailuropoda melanoleuca) ferritin light polypeptide.

PubMed

Fu, L; Hou, Y L; Ding, X; Du, Y J; Zhu, H Q; Zhang, N; Hou, W R

2016-08-30

The complementary DNA (cDNA) of the giant panda (Ailuropoda melanoleuca) ferritin light polypeptide (FTL) gene was successfully cloned using reverse transcription-polymerase chain reaction technology. We constructed a recombinant expression vector containing FTL cDNA and overexpressed it in Escherichia coli using pET28a plasmids. The expressed protein was then purified by nickel chelate affinity chromatography. The cloned cDNA fragment was 580 bp long and contained an open reading frame of 525 bp. The deduced protein sequence was composed of 175 amino acids and had an estimated molecular weight of 19.90 kDa, with an isoelectric point of 5.53. Topology prediction revealed one N-glycosylation site, two casein kinase II phosphorylation sites, one N-myristoylation site, two protein kinase C phosphorylation sites, and one cell attachment sequence. Alignment indicated that the nucleotide and deduced amino acid sequences are highly conserved across several mammals, including Homo sapiens, Cavia porcellus, Equus caballus, and Felis catus, among others. The FTL gene was readily expressed in E. coli, which gave rise to the accumulation of a polypeptide of the expected size (25.50 kDa, including an N-terminal polyhistidine tag).

Genetic validation of whole-transcriptome sequencing for mapping expression affected by cis-regulatory variation.

PubMed

Babak, Tomas; Garrett-Engele, Philip; Armour, Christopher D; Raymond, Christopher K; Keller, Mark P; Chen, Ronghua; Rohl, Carol A; Johnson, Jason M; Attie, Alan D; Fraser, Hunter B; Schadt, Eric E

2010-08-13

Identifying associations between genotypes and gene expression levels using microarrays has enabled systematic interrogation of regulatory variation underlying complex phenotypes. This approach has vast potential for functional characterization of disease states, but its prohibitive cost, given hundreds to thousands of individual samples from populations have to be genotyped and expression profiled, has limited its widespread application. Here we demonstrate that genomic regions with allele-specific expression (ASE) detected by sequencing cDNA are highly enriched for cis-acting expression quantitative trait loci (cis-eQTL) identified by profiling of 500 animals in parallel, with up to 90% agreement on the allele that is preferentially expressed. We also observed widespread noncoding and antisense ASE and identified several allele-specific alternative splicing variants. Monitoring ASE by sequencing cDNA from as little as one sample is a practical alternative to expression genetics for mapping cis-acting variation that regulates RNA transcription and processing.
Expression of recombinant CD59 with an N-terminal peptide epitope facilitates analysis of residues contributing to its complement-inhibitory function.

PubMed

Zhou, Q; Zhao, J; Hüsler, T; Sims, P J

1996-10-01

CD59 is a plasma membrane-anchored glycoprotein that serves to protect human cells from lysis by the C5b-9 complex of complement. The immunodominant epitopes of CD59 are known to be sensitive to disruption of native tertiary structure, complicating immunological measurement of expressed mutant constructs for structure function analysis. In order to quantify cell-surface expression of wild-type and mutant forms of this complement inhibitor, independent of CD59 antigen, an 11-residue peptide (TAG) recognized by monoclonal antibody (mAb) 9E10 was inserted before the N-terminal codon (L1) of mature CD59, in a pcDNA3 expression plasmid. SV-T2 cells were transfected with this plasmid, yielding cell lines expressing 0 to > 10(5) CD59/cell. The TAG-CD59 fusion protein was confirmed to be GPI-anchored, N-glycosylated and showed identical complement-inhibitory function to wild-type CD59, lacking the TAG peptide sequence. Using this construct, the contribution of each of four surface-localized aromatic residues (4Y, 47F, 61Y, and 62Y) to CD59's complement-inhibitory function was examined. These assays revealed normal surface expression with complete loss of complement-inhibitory function in the 4Y --> S, 47F --> G and 61Y --> S mutants. By contrast, 62Y --> S mutants retained approximately 40% of function of wild-type CD59. These studies confirmed the utility of the TAG-CD59 construct for quantifying CD59 surface expression and activity, and implicate surface aromatic residues 4Y, 47F, 61Y and 62Y as essential to maintenance of CD59's normal complement-regulatory function.
A resource of large-scale molecular markers for monitoring Agropyron cristatum chromatin introgression in wheat background based on transcriptome sequences.

PubMed

Zhang, Jinpeng; Liu, Weihua; Lu, Yuqing; Liu, Qunxing; Yang, Xinming; Li, Xiuquan; Li, Lihui

2017-09-20

Agropyron cristatum is a wild grass of the tribe Triticeae and serves as a gene donor for wheat improvement. However, very few markers can be used to monitor A. cristatum chromatin introgressions in wheat. Here, we reported a resource of large-scale molecular markers for tracking alien introgressions in wheat based on transcriptome sequences. By aligning A. cristatum unigenes with the Chinese Spring reference genome sequences, we designed 9602 A. cristatum expressed sequence tag-sequence-tagged site (EST-STS) markers for PCR amplification and experimental screening. As a result, 6063 polymorphic EST-STS markers were specific for the A. cristatum P genome in the single-receipt wheat background. A total of 4956 randomly selected polymorphic EST-STS markers were further tested in eight wheat variety backgrounds, and 3070 markers displaying stable and polymorphic amplification were validated. These markers covered more than 98% of the A. cristatum genome, and the marker distribution density was approximately 1.28 cM. An application case of all EST-STS markers was validated on the A. cristatum 6 P chromosome. These markers were successfully applied in the tracking of alien A. cristatum chromatin. Altogether, this study provided a universal method of large-scale molecular marker development to monitor wild relative chromatin in wheat.
A mutant sumo facilitates quick plasmid construction for expressing proteins with native N-termini after fusion tag removal

USDA-ARS?s Scientific Manuscript database

Sumo is one of the fusion tags commonly used to enhance the solubility and yield of recombinant proteins. One advantage of using sumo is that the removal of the sumo tag is highly specific because its recognition by the ULP sumo protease is determined by its structural characteristics, instead of th...
Buddy Tag CONOPS and Requirements.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Brotz, Jay Kristoffer; Deland, Sharon M.

2015-12-01

This document defines the concept of operations (CONOPS) and the requirements for the Buddy Tag, which is conceived and designed in collaboration between Sandia National Laboratories and Princeton University under the Department of State Key VerificationAssets Fund. The CONOPS describe how the tags are used to support verification of treaty limitations and is only defined to the extent necessary to support a tag design. The requirements define the necessary functions and desired non-functional features of the Buddy Tag at a high level
Microarray expression profiling identifies genes with altered expression in HDL-deficient mice

DOE Office of Scientific and Technical Information (OSTI.GOV)

Callow, Matthew J.; Dudoit, Sandrine; Gong, Elaine L.

2000-05-05

Based on the assumption that severe alterations in the expression of genes known to be involved in HDL metabolism may affect the expression of other genes we screened an array of over 5000 mouse expressed sequence tags (ESTs) for altered gene expression in the livers of two lines of mice with dramatic decreases in HDL plasma concentrations. Labeled cDNA from livers of apolipoprotein AI (apo AI) knockout mice, Scavenger Receptor BI (SR-BI) transgenic mice and control mice were co-hybridized to microarrays. Two-sample t-statistics were used to identify genes with altered expression levels in the knockout or transgenic mice compared withmore » the control mice. In the SR-BI group we found 9 array elements representing at least 5 genes to be significantly altered on the basis of an adjusted p value of less than 0.05. In the apo AI knockout group 8 array elements representing 4 genes were altered compared with the control group (p < 0.05). Several of the genes identified in the SR-BI transgenic suggest altered sterol metabolism and oxidative processes. These studies illustrate the use of multiple-testing methods for the identification of genes with altered expression in replicated microarray experiments of apo AI knockout and SR-BI transgenic mice.« less
Advanced colorectal adenoma related gene expression signature may predict prognostic for colorectal cancer patients with adenoma-carcinoma sequence.

PubMed

Li, Bing; Shi, Xiao-Yu; Liao, Dai-Xiang; Cao, Bang-Rong; Luo, Cheng-Hua; Cheng, Shu-Jun

2015-01-01

There are still no absolute parameters predicting progression of adenoma into cancer. The present study aimed to characterize functional differences on the multistep carcinogenetic process from the adenoma-carcinoma sequence. All samples were collected and mRNA expression profiling was performed by using Agilent Microarray high-throughput gene-chip technology. Then, the characteristics of mRNA expression profiles of adenoma-carcinoma sequence were described with bioinformatics software, and we analyzed the relationship between gene expression profiles of adenoma-adenocarcinoma sequence and clinical prognosis of colorectal cancer. The mRNA expressions of adenoma-carcinoma sequence were significantly different between high-grade intraepithelial neoplasia group and adenocarcinoma group. The biological process of gene ontology function enrichment analysis on differentially expressed genes between high-grade intraepithelial neoplasia group and adenocarcinoma group showed that genes enriched in the extracellular structure organization, skeletal system development, biological adhesion and itself regulated growth regulation, with the P value after FDR correction of less than 0.05. In addition, IPR-related protein mainly focused on the insulin-like growth factor binding proteins. The variable trends of gene expression profiles for adenoma-carcinoma sequence were mainly concentrated in high-grade intraepithelial neoplasia and adenocarcinoma. The differentially expressed genes are significantly correlated between high-grade intraepithelial neoplasia group and adenocarcinoma group. Bioinformatics analysis is an effective way to study the gene expression profiles in the adenoma-carcinoma sequence, and may provide an effective tool to involve colorectal cancer research strategy into colorectal adenoma or advanced adenoma.
Recurrence time statistics: versatile tools for genomic DNA sequence analysis.

PubMed

Cao, Yinhe; Tung, Wen-Wen; Gao, J B

2004-01-01

With the completion of the human and a few model organisms' genomes, and the genomes of many other organisms waiting to be sequenced, it has become increasingly important to develop faster computational tools which are capable of easily identifying the structures and extracting features from DNA sequences. One of the more important structures in a DNA sequence is repeat-related. Often they have to be masked before protein coding regions along a DNA sequence are to be identified or redundant expressed sequence tags (ESTs) are to be sequenced. Here we report a novel recurrence time based method for sequence analysis. The method can conveniently study all kinds of periodicity and exhaustively find all repeat-related features from a genomic DNA sequence. An efficient codon index is also derived from the recurrence time statistics, which has the salient features of being largely species-independent and working well on very short sequences. Efficient codon indices are key elements of successful gene finding algorithms, and are particularly useful for determining whether a suspected EST belongs to a coding or non-coding region. We illustrate the power of the method by studying the genomes of E. coli, the yeast S. cervisivae, the nematode worm C. elegans, and the human, Homo sapiens. Computationally, our method is very efficient. It allows us to carry out analysis of genomes on the whole genomic scale by a PC.
Generation and analysis of a barcode-tagged insertion mutant library in the fission yeast Schizosaccharomyces pombe

PubMed Central

2012-01-01

Background Barcodes are unique DNA sequence tags that can be used to specifically label individual mutants. The barcode-tagged open reading frame (ORF) haploid deletion mutant collections in the budding yeast Saccharomyces cerevisiae and the fission yeast Schizosaccharomyces pombe allow for high-throughput mutant phenotyping because the relative growth of mutants in a population can be determined by monitoring the proportions of their associated barcodes. While these mutant collections have greatly facilitated genome-wide studies, mutations in essential genes are not present, and the roles of these genes are not as easily studied. To further support genome-scale research in S. pombe, we generated a barcode-tagged fission yeast insertion mutant library that has the potential of generating viable mutations in both essential and non-essential genes and can be easily analyzed using standard molecular biological techniques. Results An insertion vector containing a selectable ura4+ marker and a random barcode was used to generate a collection of 10,000 fission yeast insertion mutants stored individually in 384-well plates and as six pools of mixed mutants. Individual barcodes are flanked by Sfi I recognition sites and can be oligomerized in a unique orientation to facilitate barcode sequencing. Independent genetic screens on a subset of mutants suggest that this library contains a diverse collection of single insertion mutations. We present several approaches to determine insertion sites. Conclusions This collection of S. pombe barcode-tagged insertion mutants is well-suited for genome-wide studies. Because insertion mutations may eliminate, reduce or alter the function of essential and non-essential genes, this library will contain strains with a wide range of phenotypes that can be assayed by their associated barcodes. The design of the barcodes in this library allows for barcode sequencing using next generation or standard benchtop cloning approaches. PMID:22554201
Generation and analysis of a barcode-tagged insertion mutant library in the fission yeast Schizosaccharomyces pombe.

PubMed

Chen, Bo-Ruei; Hale, Devin C; Ciolek, Peter J; Runge, Kurt W

2012-05-03

Barcodes are unique DNA sequence tags that can be used to specifically label individual mutants. The barcode-tagged open reading frame (ORF) haploid deletion mutant collections in the budding yeast Saccharomyces cerevisiae and the fission yeast Schizosaccharomyces pombe allow for high-throughput mutant phenotyping because the relative growth of mutants in a population can be determined by monitoring the proportions of their associated barcodes. While these mutant collections have greatly facilitated genome-wide studies, mutations in essential genes are not present, and the roles of these genes are not as easily studied. To further support genome-scale research in S. pombe, we generated a barcode-tagged fission yeast insertion mutant library that has the potential of generating viable mutations in both essential and non-essential genes and can be easily analyzed using standard molecular biological techniques. An insertion vector containing a selectable ura4+ marker and a random barcode was used to generate a collection of 10,000 fission yeast insertion mutants stored individually in 384-well plates and as six pools of mixed mutants. Individual barcodes are flanked by Sfi I recognition sites and can be oligomerized in a unique orientation to facilitate barcode sequencing. Independent genetic screens on a subset of mutants suggest that this library contains a diverse collection of single insertion mutations. We present several approaches to determine insertion sites. This collection of S. pombe barcode-tagged insertion mutants is well-suited for genome-wide studies. Because insertion mutations may eliminate, reduce or alter the function of essential and non-essential genes, this library will contain strains with a wide range of phenotypes that can be assayed by their associated barcodes. The design of the barcodes in this library allows for barcode sequencing using next generation or standard benchtop cloning approaches.
SparkClouds: visualizing trends in tag clouds.

PubMed

Lee, Bongshin; Riche, Nathalie Henry; Karlson, Amy K; Carpendale, Sheelash

2010-01-01

Tag clouds have proliferated over the web over the last decade. They provide a visual summary of a collection of texts by visually depicting the tag frequency by font size. In use, tag clouds can evolve as the associated data source changes over time. Interesting discussions around tag clouds often include a series of tag clouds and consider how they evolve over time. However, since tag clouds do not explicitly represent trends or support comparisons, the cognitive demands placed on the person for perceiving trends in multiple tag clouds are high. In this paper, we introduce SparkClouds, which integrate sparklines into a tag cloud to convey trends between multiple tag clouds. We present results from a controlled study that compares SparkClouds with two traditional trend visualizations—multiple line graphs and stacked bar charts—as well as Parallel Tag Clouds. Results show that SparkClouds ability to show trends compares favourably to the alternative visualizations.
An orange fluorescent protein tagging system for real-time pollen tracking.

PubMed

Rice, J Hollis; Millwood, Reginald J; Mundell, Richard E; Chambers, Orlando D; Abercrombie, Laura L; Davies, H Maelor; Stewart, C Neal

2013-09-27

Monitoring gene flow could be important for future transgenic crops, such as those producing plant-made-pharmaceuticals (PMPs) in open field production. A Nicotiana hybrid (Nicotiana. tabacum × Nicotiana glauca) shows limited male fertility and could be used as a bioconfined PMP platform. Effective assessment of gene flow from these plants is augmented with methods that utilize fluorescent proteins for transgenic pollen identification. We report the generation of a pollen tagging system utilizing an orange fluorescent protein to monitor pollen flow and as a visual assessment of transgene zygosity of the parent plant. This system was created to generate a tagged Nicotiana hybrid that could be used for the incidence of gene flow. Nicotiana tabacum 'TN 90' and Nicotiana glauca were successfully transformed via Agrobacterium tumefaciens to express the orange fluorescent protein gene, tdTomato-ER, in pollen and a green fluorescent protein gene, mgfp5-er, was expressed in vegetative structures of the plant. Hybrids were created that utilized the fluorescent proteins as a research tool for monitoring pollen movement and gene flow. Manual greenhouse crosses were used to assess hybrid sexual compatibility with N. tabacum, resulting in seed formation from hybrid pollination in 2% of crosses, which yielded non-viable seed. Pollen transfer to the hybrid formed seed in 19% of crosses and 10 out of 12 viable progeny showed GFP expression. The orange fluorescent protein is visible when expressed in the pollen of N. glauca, N. tabacum, and the Nicotiana hybrid, although hybrid pollen did not appear as bright as the parent lines. The hybrid plants, which show limited ability to outcross, could provide bioconfinement with the benefit of detectable pollen using this system. Fluorescent protein-tagging could be a valuable tool for breeding and in vivo ecological monitoring.
Systematic gene tagging using CRISPR/Cas9 in human stem cells to illuminate cell organization.

PubMed

Roberts, Brock; Haupt, Amanda; Tucker, Andrew; Grancharova, Tanya; Arakaki, Joy; Fuqua, Margaret A; Nelson, Angelique; Hookway, Caroline; Ludmann, Susan A; Mueller, Irina A; Yang, Ruian; Horwitz, Rick; Rafelski, Susanne M; Gunawardane, Ruwanthi N

2017-10-15

We present a CRISPR/Cas9 genome-editing strategy to systematically tag endogenous proteins with fluorescent tags in human induced pluripotent stem cells (hiPSC). To date, we have generated multiple hiPSC lines with monoallelic green fluorescent protein tags labeling 10 proteins representing major cellular structures. The tagged proteins include alpha tubulin, beta actin, desmoplakin, fibrillarin, nuclear lamin B1, nonmuscle myosin heavy chain IIB, paxillin, Sec61 beta, tight junction protein ZO1, and Tom20. Our genome-editing methodology using Cas9/crRNA ribonuclear protein and donor plasmid coelectroporation, followed by fluorescence-based enrichment of edited cells, typically resulted in <0.1-4% homology-directed repair (HDR). Twenty-five percent of clones generated from each edited population were precisely edited. Furthermore, 92% (36/39) of expanded clonal lines displayed robust morphology, genomic stability, expression and localization of the tagged protein to the appropriate subcellular structure, pluripotency-marker expression, and multilineage differentiation. It is our conclusion that, if cell lines are confirmed to harbor an appropriate gene edit, pluripotency, differentiation potential, and genomic stability are typically maintained during the clonal line-generation process. The data described here reveal general trends that emerged from this systematic gene-tagging approach. Final clonal lines corresponding to each of the 10 cellular structures are now available to the research community. © 2017 Roberts, Haupt, et al. This article is distributed by The American Society for Cell Biology under license from the author(s). Two months after publication it is available to the public under an Attribution–Noncommercial–Share Alike 3.0 Unported Creative Commons License (http://creativecommons.org/licenses/by-nc-sa/3.0).
Vectors for fluorescent protein tagging in Phytophthora: tools for functional genomics and cell biology.

PubMed

Ah-Fong, Audrey M V; Judelson, Howard S

2011-09-01

Fluorescent tagging has become the strategy of choice for examining the subcellular localisation of proteins. To develop a versatile community resource for this method in oomycetes, plasmids were constructed that allow the expression of either of four spectrally distinct proteins [cyan fluorescent protein (CFP), green fluorescent protein (GFP), yellow fluorescent protein (YFP), and mCherry], alone or fused at their N- or C-termini, to sequences of interest. Equivalent sets of plasmids were made using neomycin or hygromycin phosphotransferases (nptII, hpt) as selectable markers, to facilitate double-labelling and aid work in diverse species. The fluorescent proteins and drug-resistance markers were fused to transcriptional regulatory sequences from the oomycete Bremia lactucae, which are known to function in diverse oomycetes, although the promoter in the fluorescence cassette (ham34) can be replaced easily by a promoter of interest. The function of each plasmid was confirmed in Phytophthora infestans. Moreover, fusion proteins were generated using targeting sequences for the endoplasmic reticulum, Golgi, mitochondria, nuclei, and peroxisomes. Studies of the distribution of the fusions in mycelia and sporangia provided insight into cellular organisation at different stages of development. This toolbox of vectors should advance studies of gene function and cell biology in Phytophthora and other oomycetes. Copyright © 2011 British Mycological Society. Published by Elsevier Ltd. All rights reserved.
Survival, growth, and tag retention in age-0 Chinook Salmon implanted with 8-, 9-, and 12-mm PIT tags

USGS Publications Warehouse

Tiffan, Kenneth F.; Perry, Russell W.; Connor, William P.; Mullins, Frank L.; Rabe, Craig; Nelson, Doug D

2015-01-01

The ability to represent a population of migratory juvenile fish with PIT tags becomes difficult when the minimum tagging size is larger than the average size at which fish begin to move downstream. Tags that are smaller (e.g., 8 and 9 mm) than the commonly used 12-mm PIT tags are currently available, but their effects on survival, growth, and tag retention in small salmonid juveniles have received little study. We evaluated growth, survival, and tag retention in age-0 Chinook Salmon Oncorhynchus tshawytscha of three size-groups: 40–49-mm fish were implanted with 8- and 9-mm tags, and 50– 59-mm and 60–69-mm fish were implanted with 8-, 9-, and 12-mm tags. Survival 28 d after tagging ranged from 97.8% to 100% across all trials, providing no strong evidence for a fish-size-related tagging effect or a tag size effect. No biologically significant effects of tagging on growth in FL (mm/d) or weight (g/d) were observed. Although FL growth in tagged fish was significantly reduced for the 40–49-mm and 50–59-mm groups over the first 7 d, growth rates were not different thereafter, and all fish were similar in size by the end of the trials (day 28). Tag retention across all tests ranged from 93% to 99%. We acknowledge that actual implantation of 8- or 9-mm tags into small fish in the field will pose additional challenges (e.g., capture and handling stress) beyond those observed in our laboratory. However, we conclude that experimental use of the smaller tags for small fish in the field is supported by our findings.
Genetic validation of whole-transcriptome sequencing for mapping expression affected by cis-regulatory variation

PubMed Central

2010-01-01

Background Identifying associations between genotypes and gene expression levels using microarrays has enabled systematic interrogation of regulatory variation underlying complex phenotypes. This approach has vast potential for functional characterization of disease states, but its prohibitive cost, given hundreds to thousands of individual samples from populations have to be genotyped and expression profiled, has limited its widespread application. Results Here we demonstrate that genomic regions with allele-specific expression (ASE) detected by sequencing cDNA are highly enriched for cis-acting expression quantitative trait loci (cis-eQTL) identified by profiling of 500 animals in parallel, with up to 90% agreement on the allele that is preferentially expressed. We also observed widespread noncoding and antisense ASE and identified several allele-specific alternative splicing variants. Conclusion Monitoring ASE by sequencing cDNA from as little as one sample is a practical alternative to expression genetics for mapping cis-acting variation that regulates RNA transcription and processing. PMID:20707912
How did nature engineer the highest surface lipid accumulation among plants? Exceptional expression of acyl-lipid-associated genes for the assembly of extracellular triacylglycerol by Bayberry (Myrica pensylvanica) fruits.

PubMed

Simpson, Jeffrey P; Thrower, Nicholas; Ohlrogge, John B

2016-09-01

Bayberry (Myrica pensylvanica) fruits are covered with a remarkably thick layer of crystalline wax consisting of triacylglycerol (TAG) and diacylglycerol (DAG) esterified exclusively with saturated fatty acids. As the only plant known to accumulate soluble glycerolipids as a major component of surface waxes, Bayberry represents a novel system to investigate neutral lipid biosynthesis and lipid secretion by vegetative plant cells. The assembly of Bayberry wax is distinct from conventional TAG and other surface waxes, and instead proceeds through a pathway related to cutin synthesis (Simpson and Ohlrogge, 2016). In this study, microscopic examination revealed that the fruit tissue that produces and secretes wax (Bayberry knobs) is fully developed before wax accumulates and that wax is secreted to the surface without cell disruption. Comparison of transcript expression to genetically related tissues (Bayberry leaves, M. rubra fruits), cutin-rich tomato and cherry fruit epidermis, and to oil-rich mesocarp and seeds, revealed exceptionally high expression of 13 transcripts for acyl-lipid metabolism together with down-regulation of fatty acid oxidases and desaturases. The predicted protein sequences of the most highly expressed lipid-related enzyme-encoding transcripts in Bayberry knobs are 100% identical to the sequences from Bayberry leaves, which do not produce surface DAG or TAG. Together, these results indicate that TAG biosynthesis and secretion in Bayberry is achieved by both up and down-regulation of a small subset of genes related to the biosynthesis of cutin and saturated fatty acids, and also implies that modifications in gene expression, rather than evolution of new gene functions, was the major mechanism by which Bayberry evolved its specialized lipid metabolism. This article is part of a Special Issue entitled: Plant Lipid Biology edited by Kent D. Chapman and Ivo Feussner. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
Comparing the hierarchy of author given tags and repository given tags in a large document archive

NASA Astrophysics Data System (ADS)

Tibély, Gergely; Pollner, Péter; Palla, Gergely

2016-10-01

Folksonomies - large databases arising from collaborative tagging of items by independent users - are becoming an increasingly important way of categorizing information. In these systems users can tag items with free words, resulting in a tripartite item-tag-user network. Although there are no prescribed relations between tags, the way users think about the different categories presumably has some built in hierarchy, in which more special concepts are descendants of some more general categories. Several applications would benefit from the knowledge of this hierarchy. Here we apply a recent method to check the differences and similarities of hierarchies resulting from tags given by independent individuals and from tags given by a centrally managed repository system. The results from our method showed substantial differences between the lower part of the hierarchies, and in contrast, a relatively high similarity at the top of the hierarchies.
Construction and heterologous expression of a truncated Haemagglutinin (HA) protein from the avian influenza virus H5N1 in Escherichia coli.

PubMed

Chee Wei, T; Nurul Wahida, A G; Shaharum, S

2014-12-01

Malaysia first reported H5N1 poultry case in 2004 and subsequently outbreak in poultry population in 2007. Here, a recombinant gene encoding of peptide epitopes, consisting fragments of HA1, HA2 and a polybasic cleavage site of H5N1 strain Malaysia, was amplified and cloned into pET-47b(+) bacterial expression vector. DNA sequencing and alignment analysis confirmed that the gene had no alteration and in-frame to the vector. Then, His-tagged truncated HA protein was expressed in Escherichia coli BL21 (DE3) under 1 mM IPTG induction. The protein expression was optimized under a time-course induction study and further purified using Ni-NTA agarose under reducing condition. Migration size of protein was detected at 15 kDa by Western blot using anti-His tag monoclonal antibody and demonstrated no discrepancy compared to its calculated molecular weight.
Structure-Related Roles for the Conservation of the HIV-1 Fusion Peptide Sequence Revealed by Nuclear Magnetic Resonance.

PubMed

Serrano, Soraya; Huarte, Nerea; Rujas, Edurne; Andreu, David; Nieva, José L; Jiménez, María Angeles

2017-10-17

Despite extensive characterization of the human immunodeficiency virus type 1 (HIV-1) hydrophobic fusion peptide (FP), the structure-function relationships underlying its extraordinary degree of conservation remain poorly understood. Specifically, the fact that the tandem repeat of the FLGFLG tripeptide is absolutely conserved suggests that high hydrophobicity may not suffice to unleash FP function. Here, we have compared the nuclear magnetic resonance (NMR) structures adopted in nonpolar media by two FP surrogates, wtFP-tag and scrFP-tag, which had equal hydrophobicity but contained wild-type and scrambled core sequences LFLGFLG and FGLLGFL, respectively. In addition, these peptides were tagged at their C-termini with an epitope sequence that folded independently, thereby allowing Western blot detection without interfering with FP structure. We observed similar α-helical FP conformations for both specimens dissolved in the low-polarity medium 25% (v/v) 1,1,1,3,3,3-hexafluoro-2-propanol (HFIP), but important differences in contact with micelles of the membrane mimetic dodecylphosphocholine (DPC). Thus, whereas wtFP-tag preserved a helix displaying a Gly-rich ridge, the scrambled sequence lost in great part the helical structure upon being solubilized in DPC. Western blot analyses further revealed the capacity of wtFP-tag to assemble trimers in membranes, whereas membrane oligomers were not observed in the case of the scrFP-tag sequence. We conclude that, beyond hydrophobicity, preserving sequence order is an important feature for defining the secondary structures and oligomeric states adopted by the HIV FP in membranes.

A prokaryotic viral sequence is expressed and conserved in mammalian brain.

PubMed

Yeh, Yang-Hui; Gunasekharan, Vignesh; Manuelidis, Laura

2017-07-03

A natural and permanent transfer of prokaryotic viral sequences to mammals has not been reported by others. Circular "SPHINX" DNAs <5 kb were previously isolated from nuclease-protected cytoplasmic particles in rodent neuronal cell lines and brain. Two of these DNAs were sequenced after Φ29 polymerase amplification, and they revealed significant but imperfect homology to segments of commensal Acinetobacter phage viruses. These findings were surprising because the brain is isolated from environmental microorganisms. The 1.76-kb DNA sequence (SPHINX 1.8), with an iteron before its ORF, was evaluated here for its expression in neural cells and brain. A rabbit affinity purified antibody generated against a peptide without homology to mammalian sequences labeled a nonglycosylated ∼41-kDa protein (spx1) on Western blots, and the signal was efficiently blocked by the competing peptide. Spx1 was resistant to limited proteinase K digestion, but was unrelated to the expression of host prion protein or its pathologic amyloid form. Remarkably, spx1 concentrated in selected brain synapses, such as those on anterior motor horn neurons that integrate many complex neural inputs. SPHINX 1.8 appears to be involved in tissue-specific differentiation, including essential functions that preserve its propagation during mammalian evolution, possibly via maternal inheritance. The data here indicate that mammals can share and exchange a larger world of prokaryotic viruses than previously envisioned.
Leveraging algal omics to reveal potential targets for augmenting TAG accumulation.

PubMed

Arora, Neha; Pienkos, Philip T; Pruthi, Vikas; Poluri, Krishna Mohan; Guarnieri, Michael T

2018-04-18

Ongoing global efforts to commercialize microalgal biofuels have expedited the use of multi-omics techniques to gain insights into lipid biosynthetic pathways. Functional genomics analyses have recently been employed to complement existing sequence-level omics studies, shedding light on the dynamics of lipid synthesis and its interplay with other cellular metabolic pathways, thus revealing possible targets for metabolic engineering. Here, we review the current status of algal omics studies to reveal potential targets to augment TAG accumulation in various microalgae. This review specifically aims to examine and catalog systems level data related to stress-induced TAG accumulation in oleaginous microalgae and inform future metabolic engineering strategies to develop strains with enhanced bioproductivity, which could pave a path for sustainable green energy. Copyright © 2018. Published by Elsevier Inc.
Tissue Distribution of Kir7.1 Inwardly Rectifying K+ Channel Probed in a Knock-in Mouse Expressing a Haemagglutinin-Tagged Protein.

PubMed

Cornejo, Isabel; Villanueva, Sandra; Burgos, Johanna; López-Cayuqueo, Karen I; Chambrey, Régine; Julio-Kalajzić, Francisca; Buelvas, Neudo; Niemeyer, María I; Figueiras-Fierro, Dulce; Brown, Peter D; Sepúlveda, Francisco V; Cid, L P

2018-01-01

Kir7.1 encoded by the Kcnj13 gene in the mouse is an inwardly rectifying K + channel present in epithelia where it shares membrane localization with the Na + /K + -pump. Further investigations of the localisation and function of Kir7.1 would benefit from the availability of a knockout mouse, but perinatal mortality attributed to cleft palate in the neonate has thwarted this research. To facilitate localisation studies we now use CRISPR/Cas9 technology to generate a knock-in mouse, the Kir7.1-HA that expresses the channel tagged with a haemagglutinin (HA) epitope. The availability of antibodies for the HA epitope allows for application of western blot and immunolocalisation methods using widely available anti-HA antibodies with WT tissues providing unambiguous negative control. We demonstrate that Kir7.1-HA cloned from the choroid plexus of the knock-in mouse has the electrophysiological properties of the native channel, including characteristically large Rb + currents. These large Kir7.1-mediated currents are accompanied by abundant apical membrane Kir7.1-HA immunoreactivity. WT-controlled western blots demonstrate the presence of Kir7.1-HA in the eye and the choroid plexus, trachea and lung, and intestinal epithelium but exclusively in the ileum. In the kidney, and at variance with previous reports in the rat and guinea-pig, Kir7.1-HA is expressed in the inner medulla but not in the cortex or outer medulla. In isolated tubules immunoreactivity was associated with inner medulla collecting ducts but not thin limbs of the loop of Henle. Kir7.1-HA shows basolateral expression in the respiratory tract epithelium from trachea to bronchioli. The channel also appears basolateral in the epithelium of the nasal cavity and nasopharynx in newborn animals. We show that HA-tagged Kir7.1 channel introduced in the mouse by a knock-in procedure has functional properties similar to the native protein and the animal thus generated has clear advantages in localisation studies. It
Expression and purification of membrane protein diacylglycerol acyltransferase

USDA-ARS?s Scientific Manuscript database

Diacylglycerol acyltransferases (DGATs) catalyze the last and rate-limiting step of triacylglycerol (TAG) biosynthesis in eukaryotic organisms. Plants and animals deficient in DGATs accumulate less TAG. Over-expression of DGATs increases TAG in seeds and other tissues. DGAT knockout mice are resista...
Cloning, over-expression and purification of Pseudomonas aeruginosa murC encoding uridine diphosphate N-acetylmuramate: L-alanine ligase.

PubMed

El Zoeiby, A; Sanschagrin, F; Lamoureux, J; Darveau, A; Levesque, R C

2000-02-15

We cloned and sequenced the murC gene from Pseudomonas aeruginosa encoding a protein of 53 kDa. Multiple alignments with 20 MurC peptide sequences from different bacteria confirmed the presence of highly conserved regions having sequence identities ranging from 22-97% including conserved motifs for ATP-binding and the active site of the enzyme. Genetic complementation was done in Escherichia coli (murCts) suppressing the lethal phenotype. The murC gene was subcloned into the expression vector pET30a and overexpressed in E. coli BL21(lambdaDE3). Three PCR cloning strategies were used to obtain the three recombinant plasmids for expression of the native MurC, MurC His-tagged at N-terminal and at C-terminal, respectively. MurC His-tagged at C-terminal was chosen for large scale production and protein purification in the soluble form. The purification was done in a single chromatographic step on an affinity nickel column and obtained in mg quantities at 95% homogeneity. MurC protein was used to produce monoclonal antibodies for epitope mapping and for assay development in high throughput screenings. Detailed studies of MurC and other genes of the bacterial cell cycle will provide the reagents and strain constructs for high throughput screening and for design of novel antibacterials.
DNA sequencing using fluorescence background electroblotting membrane

DOEpatents

Caldwell, Karin D.; Chu, Tun-Jen; Pitt, William G.

1992-01-01

A method for the multiplex sequencing on DNA is disclosed which comprises the electroblotting or specific base terminated DNA fragments, which have been resolved by gel electrophoresis, onto the surface of a neutral non-aromatic polymeric microporous membrane exhibiting low background fluorescence which has been surface modified to contain amino groups. Polypropylene membranes are preferably and the introduction of amino groups is accomplished by subjecting the membrane to radio or microwave frequency plasma discharge in the presence of an aminating agent, preferably ammonia. The membrane, containing physically adsorbed DNA fragments on its surface after the electroblotting, is then treated with crosslinking means such as UV radiation or a glutaraldehyde spray to chemically bind the DNA fragments to the membrane through said smino groups contained on the surface thereof. The DNA fragments chemically bound to the membrane are subjected to hybridization probing with a tagged probe specific to the sequence of the DNA fragments. The tagging may be by either fluorophores or radioisotopes. The tagged probes hybridized to said target DNA fragments are detected and read by laser induced fluorescence detection or autoradiograms. The use of aminated low fluorescent background membranes allows the use of fluorescent detection and reading even when the available amount of DNA to be sequenced is small. The DNA bound to the membrances may be reprobed numerous times.
DNA sequencing using fluorescence background electroblotting membrane

DOEpatents

Caldwell, K.D.; Chu, T.J.; Pitt, W.G.

1992-05-12

A method for the multiplex sequencing on DNA is disclosed which comprises the electroblotting or specific base terminated DNA fragments, which have been resolved by gel electrophoresis, onto the surface of a neutral non-aromatic polymeric microporous membrane exhibiting low background fluorescence which has been surface modified to contain amino groups. Polypropylene membranes are preferably and the introduction of amino groups is accomplished by subjecting the membrane to radio or microwave frequency plasma discharge in the presence of an aminating agent, preferably ammonia. The membrane, containing physically adsorbed DNA fragments on its surface after the electroblotting, is then treated with crosslinking means such as UV radiation or a glutaraldehyde spray to chemically bind the DNA fragments to the membrane through amino groups contained on the surface. The DNA fragments chemically bound to the membrane are subjected to hybridization probing with a tagged probe specific to the sequence of the DNA fragments. The tagging may be by either fluorophores or radioisotopes. The tagged probes hybridized to the target DNA fragments are detected and read by laser induced fluorescence detection or autoradiograms. The use of aminated low fluorescent background membranes allows the use of fluorescent detection and reading even when the available amount of DNA to be sequenced is small. The DNA bound to the membranes may be reprobed numerous times. No Drawings
An efficient and scalable pipeline for epitope tagging in mammalian stem cells using Cas9 ribonucleoprotein

PubMed Central

Dewari, Pooran Singh; Southgate, Benjamin; Mccarten, Katrina; Monogarov, German; O'Duibhir, Eoghan; Quinn, Niall; Tyrer, Ashley; Leitner, Marie-Christin; Plumb, Colin; Kalantzaki, Maria; Blin, Carla; Finch, Rebecca; Bressan, Raul Bardini; Morrison, Gillian; Jacobi, Ashley M; Behlke, Mark A; von Kriegsheim, Alex; Tomlinson, Simon; Krijgsveld, Jeroen

2018-01-01

CRISPR/Cas9 can be used for precise genetic knock-in of epitope tags into endogenous genes, simplifying experimental analysis of protein function. However, Cas9-assisted epitope tagging in primary mammalian cell cultures is often inefficient and reliant on plasmid-based selection strategies. Here, we demonstrate improved knock-in efficiencies of diverse tags (V5, 3XFLAG, Myc, HA) using co-delivery of Cas9 protein pre-complexed with two-part synthetic modified RNAs (annealed crRNA:tracrRNA) and single-stranded oligodeoxynucleotide (ssODN) repair templates. Knock-in efficiencies of ~5–30%, were achieved without selection in embryonic stem (ES) cells, neural stem (NS) cells, and brain-tumor-derived stem cells. Biallelic-tagged clonal lines were readily derived and used to define Olig2 chromatin-bound interacting partners. Using our novel web-based design tool, we established a 96-well format pipeline that enabled V5-tagging of 60 different transcription factors. This efficient, selection-free and scalable epitope tagging pipeline enables systematic surveys of protein expression levels, subcellular localization, and interactors across diverse mammalian stem cells. PMID:29638216
Gene discovery in Eimeria tenella by immunoscreening cDNA expression libraries of sporozoites and schizonts with chicken intestinal antibodies.

PubMed

Réfega, Susana; Girard-Misguich, Fabienne; Bourdieu, Christiane; Péry, Pierre; Labbé, Marie

2003-04-02

Specific antibodies were produced ex vivo from intestinal culture of Eimeria tenella infected chickens. The specificity of these intestinal antibodies was tested against different parasite stages. These antibodies were used to immunoscreen first generation schizont and sporozoite cDNA libraries permitting the identification of new E. tenella antigens. We obtained a total of 119 cDNA clones which were subjected to sequence analysis. The sequences coding for the proteins inducing local immune responses were compared with nucleotide or protein databases and with expressed sequence tags (ESTs) databases. We identified new Eimeria genes coding for heat shock proteins, a ribosomal protein, a pyruvate kinase and a pyridoxine kinase. Specific features of other sequences are discussed.
Characterisation of a DNA sequence element that directs Dictyostelium stalk cell-specific gene expression.

PubMed

Ceccarelli, A; Zhukovskaya, N; Kawata, T; Bozzaro, S; Williams, J

2000-12-01

The ecmB gene of Dictyostelium is expressed at culmination both in the prestalk cells that enter the stalk tube and in ancillary stalk cell structures such as the basal disc. Stalk tube-specific expression is regulated by sequence elements within the cap-site proximal part of the promoter, the stalk tube (ST) promoter region. Dd-STATa, a member of the STAT transcription factor family, binds to elements present in the ST promoter-region and represses transcription prior to entry into the stalk tube. We have characterised an activatory DNA sequence element, that lies distal to the repressor elements and that is both necessary and sufficient for expression within the stalk tube. We have mapped this activator to a 28 nucleotide region (the 28-mer) within which we have identified a GA-containing sequence element that is required for efficient gene transcription. The Dd-STATa protein binds to the 28-mer in an in vitro binding assay, and binding is dependent upon the GA-containing sequence. However, the ecmB gene is expressed in a Dd-STATa null mutant, therefore Dd-STATa cannot be responsible for activating the 28-mer in vivo. Instead, we identified a distinct 28-mer binding activity in nuclear extracts from the Dd-STATa null mutant, the activity of this GA binding activity being largely masked in wild type extracts by the high affinity binding of the Dd-STATa protein. We suggest, that in addition to the long range repression exerted by binding to the two known repressor sites, Dd-STATa inhibits transcription by direct competition with this putative activator for binding to the GA sequence.
49 CFR 234.239 - Tagging of wires and interference of wires or tags with signal apparatus.

Code of Federal Regulations, 2011 CFR

2011-10-01

... with signal apparatus. 234.239 Section 234.239 Transportation Other Regulations Relating to... Tagging of wires and interference of wires or tags with signal apparatus. Each wire shall be tagged or... of the apparatus. This requirement applies to each wire at each terminal in all housings including...
49 CFR 234.239 - Tagging of wires and interference of wires or tags with signal apparatus.

Code of Federal Regulations, 2010 CFR

2010-10-01

... with signal apparatus. 234.239 Section 234.239 Transportation Other Regulations Relating to... Tagging of wires and interference of wires or tags with signal apparatus. Each wire shall be tagged or... of the apparatus. This requirement applies to each wire at each terminal in all housings including...
Sequencing-based gene network analysis provides a core set of gene resource for understanding thermal adaptation in Zhikong scallop Chlamys farreri.

PubMed

Fu, X; Sun, Y; Wang, J; Xing, Q; Zou, J; Li, R; Wang, Z; Wang, S; Hu, X; Zhang, L; Bao, Z

2014-01-01

Marine organisms are commonly exposed to variable environmental conditions, and many of them are under threat from increased sea temperatures caused by global climate change. Generating transcriptomic resources under different stress conditions are crucial for understanding molecular mechanisms underlying thermal adaptation. In this study, we conducted transcriptome-wide gene expression profiling of the scallop Chlamys farreri challenged by acute and chronic heat stress. Of the 13 953 unique tags, more than 850 were significantly differentially expressed at each time point after acute heat stress, which was more than the number of tags differentially expressed (320-350) under chronic heat stress. To obtain a systemic view of gene expression alterations during thermal stress, a weighted gene coexpression network was constructed. Six modules were identified as acute heat stress-responsive modules. Among them, four modules involved in apoptosis regulation, mRNA binding, mitochondrial envelope formation and oxidation reduction were downregulated. The remaining two modules were upregulated. One was enriched with chaperone and the other with microsatellite sequences, whose coexpression may originate from a transcription factor binding site. These results indicated that C. farreri triggered several cellular processes to acclimate to elevated temperature. No modules responded to chronic heat stress, suggesting that the scallops might have acclimated to elevated temperature within 3 days. This study represents the first sequencing-based gene network analysis in a nonmodel aquatic species and provides valuable gene resources for the study of thermal adaptation, which should assist in the development of heat-tolerant scallop lines for aquaculture. © 2013 John Wiley & Sons Ltd.
Nucleic and amino acid sequences relating to a novel transketolase, and methods for the expression thereof

DOEpatents

Croteau, Rodney Bruce; Wildung, Mark Raymond; Lange, Bernd Markus; McCaskill, David G.

2001-01-01

cDNAs encoding 1-deoxyxylulose-5-phosphate synthase from peppermint (Mentha piperita) have been isolated and sequenced, and the corresponding amino acid sequences have been determined. Accordingly, isolated DNA sequences (SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7) are provided which code for the expression of 1-deoxyxylulose-5-phosphate synthase from plants. In another aspect the present invention provides for isolated, recombinant DXPS proteins, such as the proteins having the sequences set forth in SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8. In other aspects, replicable recombinant cloning vehicles are provided which code for plant 1-deoxyxylulose-5-phosphate synthases, or for a base sequence sufficiently complementary to at least a portion of 1-deoxyxylulose-5-phosphate synthase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding a plant 1-deoxyxylulose-5-phosphate synthase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant 1-deoxyxylulose-5-phosphate synthase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant 1-deoxyxylulose-5-phosphate synthase may be used to obtain expression or enhanced expression of 1-deoxyxylulose-5-phosphate synthase in plants in order to enhance the production of 1-deoxyxylulose-5-phosphate, or its derivatives such as isopentenyl diphosphate (BP), or may be otherwise employed for the regulation or expression of 1-deoxyxylulose-5-phosphate synthase, or the production of its products.
Preparative SDS PAGE as an Alternative to His-Tag Purification of Recombinant Amelogenin

PubMed Central

Gabe, Claire M.; Brookes, Steven J.; Kirkham, Jennifer

2017-01-01

Recombinant protein technology provides an invaluable source of proteins for use in structure-function studies, as immunogens, and in the development of therapeutics. Recombinant proteins are typically engineered with “tags” that allow the protein to be purified from crude host cell extracts using affinity based chromatography techniques. Amelogenin is the principal component of the developing enamel matrix and a frequent focus for biomineralization researchers. Several groups have reported the successful production of recombinant amelogenins but the production of recombinant amelogenin free of any tags, and at single band purity on silver stained SDS PAGE is technically challenging. This is important, as rigorous structure-function research frequently demands a high degree of protein purity and fidelity of protein sequence. Our aim was to generate His-tagged recombinant amelogenin at single band purity on silver stained SDS PAGE for use in functionality studies after His-tag cleavage. An acetic acid extraction technique (previously reported to produce recombinant amelogenin at 95% purity directly from E. coli) followed by repeated rounds of nickel column affinity chromatography, failed to generate recombinant amelogenin at single band purity. This was because following an initial round of nickel column affinity chromatography, subsequent cleavage of the His-tag was not 100% efficient. A second round of nickel column affinity chromatography, used in attempts to separate the cleaved His-tag free recombinant from uncleaved His-tagged contaminants, was still unsatisfactory as cleaved recombinant amelogenin exhibited significant affinity for the nickel column. To solve this problem, we used preparative SDS PAGE to successfully purify cleaved recombinant amelogenins to single band purity on silver stained SDS PAGE. The resolving power of preparative SDS PAGE was such that His-tag based purification of recombinant amelogenin becomes redundant. We suggest that acetic
Soldier Data Tag Study Effort.

DTIC Science & Technology

1985-06-10

interested in protecting it. The tag itself is difficult--though not impossible--to counterfeit . Also, it (’• iii 71 -, potentially improves the data...attacks during the design, manufacture, and distribution processes, counterfeiting , unauthorized access/alteration of tag data, and use of the tag to...45 3.3.2 Hijacking of SOT System Shipments, or Large- Scale Counterfeit of SOT Systems ....................... 46 3.3.3 Unauthorized Alteration
Uncertainty of exploitation estimates made from tag returns

USGS Publications Warehouse

Miranda, L.E.; Brock, R.E.; Dorr, B.S.

2002-01-01

Over 6,000 crappies Pomoxis spp. were tagged in five water bodies to estimate exploitation rates by anglers. Exploitation rates were computed as the percentage of tags returned after adjustment for three sources of uncertainty: postrelease mortality due to the tagging process, tag loss, and the reporting rate of tagged fish. Confidence intervals around exploitation rates were estimated by resampling from the probability distributions of tagging mortality, tag loss, and reporting rate. Estimates of exploitation rates ranged from 17% to 54% among the five study systems. Uncertainty around estimates of tagging mortality, tag loss, and reporting resulted in 90% confidence intervals around the median exploitation rate as narrow as 15 percentage points and as broad as 46 percentage points. The greatest source of estimation error was uncertainty about tag reporting. Because the large investments required by tagging and reward operations produce imprecise estimates of the exploitation rate, it may be worth considering other approaches to estimating it or simply circumventing the exploitation question altogether.
RAC-tagging: Recombineering And Cas9-assisted targeting for protein tagging and conditional analyses

PubMed Central

Baker, Oliver; Gupta, Ashish; Obst, Mandy; Zhang, Youming; Anastassiadis, Konstantinos; Fu, Jun; Stewart, A. Francis

2016-01-01

A fluent method for gene targeting to establish protein tagged and ligand inducible conditional loss-of-function alleles is described. We couple new recombineering applications for one-step cloning of gRNA oligonucleotides and rapid generation of short-arm (~1 kb) targeting constructs with the power of Cas9-assisted targeting to establish protein tagged alleles in embryonic stem cells at high efficiency. RAC (Recombineering And Cas9)-tagging with Venus, BirM, APEX2 and the auxin degron is facilitated by a recombineering-ready plasmid series that permits the reuse of gene-specific reagents to insert different tags. Here we focus on protein tagging with the auxin degron because it is a ligand-regulated loss-of-function strategy that is rapid and reversible. Furthermore it includes the additional challenge of biallelic targeting. Despite high frequencies of monoallelic RAC-targeting, we found that simultaneous biallelic targeting benefits from long-arm (>4 kb) targeting constructs. Consequently an updated recombineering pipeline for fluent generation of long arm targeting constructs is also presented. PMID:27216209
De novo characterization of Larix gmelinii (Rupr.) Rupr. transcriptome and analysis of its gene expression induced by jasmonates.

PubMed

Men, Lina; Yan, Shanchun; Liu, Guanjun

2013-08-13

Larix gmelinii is a dominant tree species in China's boreal forests and plays an important role in the coniferous ecosystem. It is also one of the most economically important tree species in the Chinese timber industry due to excellent water resistance and anti-corrosion of its wood products. Unfortunately, in Northeast China, L. gmelinii often suffers from serious attacks by diseases and insects. The application of exogenous volatile semiochemicals may induce and enhance its resistance against insect or disease attacks; however, little is known regarding the genes and molecular mechanisms related to induced resistance. We performed de novo sequencing and assembly of the L. gmelinii transcriptome using a short read sequencing technology (Illumina). Chemical defenses of L. gmelinii seedlings were induced with jasmonic acid (JA) or methyl jasmonate (MeJA) for 6 hours. Transcriptomes were compared between seedlings induced by JA, MeJA and untreated controls using a tag-based digital gene expression profiling system. In a single run, 25,977,782 short reads were produced and 51,157 unigenes were obtained with a mean length of 517 nt. We sequenced 3 digital gene expression libraries and generated between 3.5 and 5.9 million raw tags, and obtained 52,040 reliable reference genes after removing redundancy. The expression of disease/insect-resistance genes (e.g., phenylalanine ammonialyase, coumarate 3-hydroxylase, lipoxygenase, allene oxide synthase and allene oxide cyclase) was up-regulated. The expression profiles of some abundant genes under different elicitor treatment were studied by using real-time qRT-PCR.The results showed that the expression levels of disease/insect-resistance genes in the seedling samples induced by JA and MeJA were higher than those in the control group. The seedlings induced with MeJA elicited the strongest increases in disease/insect-resistance genes. Both JA and MeJA induced seedlings of L. gmelinii showed significantly increased expression
Interferon-gamma of the giant panda (Ailuropoda melanoleuca): complementary DNA cloning, expression, and phylogenetic analysis.

PubMed

Tao, Yaqiong; Zeng, Bo; Xu, Liu; Yue, Bisong; Yang, Dong; Zou, Fangdong

2010-01-01

Interferon-gamma (IFN-gamma) is the only member of type II IFN and is vital in the regulation of immune and inflammatory responses. Herein we report the cloning, expression, and sequence analysis of IFN-gamma from the giant panda (Ailuropoda melanoleuca). The open reading frame of this gene is 501 base pair in length and encodes a polypeptide consisting of 166 amino acids. All conserved N-linked glycosylation sites and cysteine residues among carnivores were found in the predicted amino acid sequence of the giant panda. Recombinant giant panda IFN-gamma with a V5 epitope and polyhistidine tag was expressed in HEK293 host cells and confirmed by Western blotting. Phylogenetic analysis of mammalian IFN-gamma-coding sequences indicated that the giant panda IFN-gamma was closest to that of carnivores, then to ungulates and dolphin, and shared a distant relationship with mouse and human. These results represent a first step into the study of IFN-gamma in giant panda.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.